Recently Published
Global Development Patterns via PCA and Clustering
This report applies unsupervised learning methods to World Development Indicators (WDI) data to explore latent global development structures. Principal Component Analysis (PCA) is used to reduce dimensionality and identify interpretable development dimensions, followed by hierarchical clustering in the reduced space to derive stable country groups. The analysis emphasizes methodological justification, validation, and interpretability.
Reporting Flexdashboard by Prof. Dr. Solym Manou-Abi
Reporting Flexdashboard – Analyse des vols NYC 2013
Reporting Flexdashboard – Analyse des vols NYC 2013
L'ensemble de données flights (associé au package nycflights13 en R ou utilisé dans des tutoriels Python/Pandas) est l'un des jeux de données les plus célèbres pour apprendre la science des données. Merci à notre best Professor Dr. Solym Manou-Abi.
Statistical model for newborn weight prediction
This project builds a statistical model to predict newborn birth weight using clinical data from 2,500 cases across three hospitals. Key predictors include gestational age, maternal smoking, infant sex, and biometric measures.
The final regression model shows good performance (R² ≈ 0.79), addressing heteroscedasticity with log-transformation and incorporating interactions and non-linear effects. Diagnostic tests support model reliability.
The model aids early identification of at-risk newborns, informs prenatal care (notably smoking cessation), and helps optimize neonatal resource planning. Limitations in extreme value predictions are noted, suggesting future validation and richer longitudinal data integration.