Recently Published

DBSCAN Algorithm — Financial Transaction Fraud Detection
Complete unsupervised financial fraud detection pipeline on the synthetic PaySim1 dataset. From 11 original variables, 49 derived variables are built through feature engineering, expanding the representation space to 60 columns. Structural cleaning and eight comparative feature selectors are applied to this space, retaining 8 final variables with which DBSCAN reaches a Silhouette score of 0.766, versus 0.422 without selection. The document covers the mathematical foundations of the algorithm, hyperparameter calibration, t-SNE visualization, validation metrics, and operational actionability rules applied to new data.
Statistics for Data Science (229711) - Chapter 1: Descriptive Statistics
This document serves as the introductory chapter for the graduate-level Statistics for Data Science course. It covers the fundamental principles of Exploratory Data Analysis (EDA), shifting the emphasis from simple computation to critical statistical interpretation. Topics covered: Measures of Central Tendency; Measures of Dispersion; Measures of Shape (Skewness and Kurtosis); Data Visualization for Descriptive Statistics; Multivariate Descriptive Statistics; Chapter Lab Activity: Exploring the mtcars Dataset.
Probabilistic Model of the Operational Status of Oil Wells in Brazil
Statistical analysis of the operational states of oil wells in Brazil
E-commerce Sales Data Analysis using R
This project performs exploratory data analysis (EDA) on an e-commerce dataset using R. It includes data cleaning, revenue analysis, customer behavior insights, product performance evaluation, time-based trend analysis, and regression modeling. Key insights include top-performing countries, high-value customers, sales trends over time, and the relationship between quantity and revenue. Visualizations and statistical techniques are used to support data-driven conclusions.
Milestone Report: SwiftKey
This report provides an exploratory data analysis of the HC Corpora dataset (Twitter, Blogs, and News).
Document
EM Algorithm
Application of the EM algorithm to determining the probabilities of two tossed coins
Project ca 3
..