RPubs

by RStudio

sconnin

Sean Connin

Recently Published

Machine Learning Classification & Principal Components Analysis

The work consists of two separate sections. The first part compares classification outcomes for selected Machine Learning algorithms relative to a baseline model (logistic regression). Model building and evaluation are conducted using Tidymodels. The second part is a principal components analysis of water chemistry from streams across New York State. The package Factoextra is used for calculations and data visualizations. The results demonstrate the use of this unsupervised learning technique to cluster waters based on their chemical composition.

about 3 years ago

Tree Regression Models

Based on Chapter 8 of Applied Predictive Modeling (Kuhn and Johnson). Includes application of Tidymodels.

about 3 years ago

Nonlinear Predictive Models

Based on Chapter 7 of Applied Predictive Modeling (Kuhn and Johnson). Includes application of Tidymodels.

about 3 years ago

STEM Completion: SVM Regression

An initial SVM model for the average percentage of women completing undergraduate STEM programs at public and private 4-year+ institutions in the United States.

about 3 years ago

Regularization

Based on Chapter 6 of Applied Predictive Modeling (Kuhn and Johnson).

about 3 years ago

STEM Completion Models

Initial tree and ensemble models for the average percentage of women completing undergraduate STEM programs at public and private 4-year+ institutions in the United States.

about 3 years ago

Forecasting Project

624 - Project 1. Forecasting models for ATM cash withdrawals and residential power consumption.

over 3 years ago

ARIMA

624 - Forecasting: Principles and Practice. Chapter 9.

over 3 years ago

622-1 Classification Models

Comparison of classifiers on synthetic dataset(s).

over 3 years ago

Exponential Smoothing

624 - Forecasting: Principles and Practice. Chapter 8.

over 3 years ago

Data Processing and Overfitting

Based on Chapter 3 of Kuhn and Johnson's Applied Predictive Modeling.

over 3 years ago

Forecaster's Toolbox

624 - Forecasting: Principles and Practice. Chapter 5.

over 3 years ago

Time Series Decomposition

624 - Forecasting: Principles and Practice. Chapter 3.

over 3 years ago

Time Series Graphics

624 - Forecasting: Principles and Practice. Chapter 2.

over 3 years ago

Insurance Classification and Prediction

The purpose of this work is to estimate the probability that an individual seeking auto insurance will be in an accident, and then to forecast the potential cost of an ensuing claim. A synthetic data set of ~8,000 observations is used to construct predictive models for this purpose. The use of synthetic data permits model development absent proprietary information, while also providing a heuristic for quantitative reasoning.

over 3 years ago

Javascript Snippets

501C3 Study

2016 Campaign Speeches

API Query - NY Times

File Structures - XML, JSON, HTML

New Orleans - Multiple Data Sources

Tidy Data

Character Manipulation and Date Processing

Congressional Redistricting in Texas
