rafajones

Carlos Mercado

Recently Published

BIG DATA NLP - Predictive Text Engine
A predictive text engine built on 4,270,000 combined tweets, blog posts, and news articles. It uses a custom Katz backoff model to rapidly search text and return predicted words with their probabilities.
Text Prediction Slide Deck
Capstone in the JHU Data Science Specialization
Case Study Report
Report (with code) on the relationship between inflation and unemployment.
Case Study Slideshow
Slideshow on the relationship between inflation and unemployment.
Healthcare.gov NLP
Slideshow of healthcare.gov comment data analysis
World Bank - Poverty and Inequality Visualizations
Using the World Bank's Poverty and Inequality data from 1975 to 2015, I analyzed the dataset, trimmed it down as carefully as possible, and made a few key visualizations.
Work in Progress - NLP Prediction Model
Reproducible download and some exploration; does not yet include model building.
NBA Compare Presentation
Why should you use my NBACompare Shiny app?
Predicting Incorrect Exercise
Feature Selection, Data Cleaning, Model comparisons, parallel computing.
Ensemble Model Guide
Using techniques from the JHU Coursera Machine Learning W4 video on combining predictors. I followed along but made a reproducible version with corrections for updates to predict(). Weirdly, the GAM ensemble model didn't beat the basic GLM model on the validation set.
PlotlyMTcars
Attempting plot_ly() with the mtcars data
Transmission doesn't Affect Miles Per Gallon
Final project for the Regression Modeling Coursera course.
Smallest Cities in America
Using population data, coordinates, and leaflet() to make an interactive map of the smallest cities in the USA
Resume
I made this for fun and to test combinations of ggplot2 and aesthetic themes.
NBA - Why Westbrook (or Isaiah Thomas) is the Statistical MVP
Designing four categories, then testing regular-season box scores from basketball-reference. Westbrook and Isaiah Thomas tied for MVP. I'm personally picking Westbrook to win, but it didn't feel right to design a tie-breaker after having seen the data.
Orange Juice Grows Guinea Pigs Teeth Better
Orange juice vs. ascorbic acid as a catalyst for cellular tooth growth in guinea pigs. At low doses of vitamin C, the supplement used has a major effect on tooth growth, but beyond 2 mg/day the supplement stops mattering.
Exploring R's Exponential Distribution
Used rexp() to simulate data and explore means and variances around lambda = 0.2
Tornadoes most harm people, floods most harm property.
An analysis of NOAA data to identify the deadliest and most damaging weather events.