Recently Published

Tugas Week 10
Plot
Var3
Var2
Var1
ARIMA
Tugas Topik dalam Statistika I
Document
A Spatial Analysis of Income on Prevalence of Airbnb Listings in Boston
GEOG 588 - Final Project: This project assesses the relationship between Airbnb listing distribution and mean household income across the Boston, Massachusetts neighborhoods. A rarely assessed relationship, this project closes the gap in understanding by combining US Census income data with that of Airbnb listings. This produced a process for assessing other variable relationships to Airbnb listing distribution. Utilizing exploratory analysis and linear regression, mean household income and Airbnb listing distribution was found to have no correlation. Utilizing the methodologies employed in this study, future analysis can be performed to expand the variables assessed to include greater demographic characteristics within each neighborhood.
Minggu 12 - Tugas ARIMA
Statistics for Data Science (229711) - Chapter 8: Data Clustering
This chapter introduces the concept of Unsupervised Learning through the lens of Data Clustering. Students will learn how to find "hidden structures" in data without predefined labels, mastering the techniques used to group similar observations together. From identifying customer segments to discovering natural patterns in biology, this chapter provides the tools to make sense of unlabeled datasets by letting the data speak for itself. Core Topics covered: Introduction to Clustering K-Means Clustering Hierarchical Clustering DBSCAN Cluster Validation Gaussian Mixture Models Practical Clustering Workflow Chapter Lab Activity: Customer Segmentation with wholesales Data
Wine Prediction
This project builds a count regression model to predict the number of wine cases purchased by distributors based on chemical and marketing properties of roughly 12,000 commercially available wines. We walk through exploratory data analysis, data preparation including missing value flags and median imputation, and model building across Poisson, Negative Binomial, and Linear regression families. The key finding is that expert star ratings and label appeal dominate the prediction, while chemical variables add little beyond those two. The parsimonious Poisson model is selected as the final model based on AIC and interpretability.