Recently Published
Identifying Player Importance Profiles in Professional Basketball - Clustering and Principal Component Analysis of ACB Player Performance
This project applies unsupervised learning to identify distinct and interpretable player importance profiles in the Spanish ACB league in the 2024-25 season, using engineered performance features that go beyond traditional box-score evaluations. By combining clustering and principal component analysis, the study examines whether dimensionality reduction clarifies and stabilizes the structure of player importance without altering the underlying groupings.
rastercontourntif_3rPlot
rastercontourntif_3rPlot
The Social DNA of Cinema (Clustering & Dimension Reduction)
USL Clustering and Dimension Reduction Project
Health_association_rules
This report explores the use of association rule mining techniques: Apriori, FP-Growth style pattern mining, and ECLAT to identify co-occurring health risk factors across countries. Using 2015 data from the WHO Global Health Observatory, the analysis treats each country as a transaction and examines patterns involving BMI, cholesterol, depression, and alcohol consumption.
rasterpolygonotif2_rPlot
rasterpolygonotif4_shp <- st_read("rasterxpolygonoRasterT_tif4.shp")
plot(st_geometry(rasterpolygonotif4_shp), axes=TRUE)
rasterpolygonotif4_shp <- st_transform(rasterpolygonotif4_shp, crs = 4324)
Market Basket Analysis (Association Rules)
This report performs market basket analysis on the Online Retail dataset using:
*Transaction construction from invoices
*Apriori association rules (with redundancy and significance filtering)
*Targeted rules with a fixed RHS item
*Cross-table exploration for top items
*Jaccard-based similarity and hierarchical clustering (transactions and items)
*Category aggregation and rule mining
*CBA-style classification workflow (prepared features)
This analysis was prepared as part of the course Unsupervised Machine Learning at the University of Warsaw.
Image Compression Using PCA
This report explores Principal Component Analysis (PCA), a method introduced during the course, as a simple and transparent approach to image compression. The idea is to represent an image with fewer degrees of freedom while keeping the reconstruction visually acceptable.
Two settings are considered: - PCA applied to a grayscale version of the image, - PCA applied separately to each RGB channel.
The report focuses on the practical trade-off between compression strength and visual quality, as observed both numerically and visually. Reconstruction quality is evaluated using MSE and PSNR, and the results are compared against standard JPEG compression at different quality settings.