Recently Published

Exploratory Data Analysis and Modeling Strategy: SwiftKey Milestone Report
This report provides a comprehensive exploratory analysis of the HC Corpora dataset (Twitter, Blogs, and News). It details the data cleaning process, provides summary statistics for the corpora, and visualizes word frequencies (N-grams). Finally, it outlines the roadmap for building a Katz Back-off prediction algorithm and a final Shiny application.
Plot
Plot
Radar Plot for Clusters
Plot
Heatmap for cluster
Plot
Plot
Plot
Air Quality Assessment and Temperature Relationship Using R Studio: A Case Study of Major Cities in Pakistan
This study analyzes the relationship between air pollution (PM2.5 levels) and temperature across major cities of Pakistan using R Studio. A simulated dataset was created for five cities — Karachi, Lahore, Islamabad, Peshawar, and Quetta — to demonstrate the process of environmental data analysis. Statistical summaries, correlation, and graphical visualization techniques were applied to explore the variation of PM2.5 concentrations with temperature. The results indicate that temperature has a moderate correlation with PM2.5 values, suggesting potential climatic influence on air quality. Visualization through scatter plots, regression lines, and boxplots enhances understanding of pollution trends. This approach demonstrates how R Studio can be effectively used as a tool for environmental monitoring, data visualization, and statistical interpretation in developing countries.
Monumental Trees - Madonie
Survey forms, November 2025
Exempt Airbnb Listings in Lisbon
A short analysis of data from Inside Airbnb about listings in Lisbon, Portugal. It focuses on listings that claim to be exempt from licensing laws.
Document