gravatar

jhnortham

Jamie Heather Northam

Recently Published

A Comparison of Three Sampling Distributions to Reduce a Diabetes Dataset with High Dimensionality
random sampling, stratified sampling, and systematic sampling are compared with the UCI 130 Hospitals US from 1999-2008 dataset
EDA of a Bankruptcy Dataset
This EDA explores the data from the american_bankruptcy dataset from Kaggle, including data visualization, descriptive statistics, data transformations, and inclusion of a k-means cluster model.
Assignment 8
not a finished product but saving here for reference as I work on it
A Demonstration of Principles of Exploratory Data Analysis for Data Science Using the Cleveland Heart Dataset: Part 7
In Part 7 of this EDA assignment, I practiced feature generation techniques
A Demonstration of Principles of Exploratory Data Analysis for Data Science Using the Cleveland Heart Dataset: Part 6
In Part 6 of this EDA of the Cleveland Heart dataset, I address transformation and scaling.
A Demonstration of Principles of Exploratory Data Analysis for Data Science Using the Cleveland Heart Dataset: Part 4
This is part 4 of my EDA of the Cleveland Heart dataset, focused on multivariate data visualization.
A Demonstration of Principles of Exploratory Data Analysis for Data Science Using the Cleveland Heart Dataset: Part 3
The 3rd assignment in my current data science course on EDA. I'm posting for my own records, since the frequent error messages in the HTML document are cumbersome but valuable to me as a student. As someone new to coding, I'm keeping this in my profile for reference.
A Demonstration of Principles of Exploratory Data Analysis for Data Science Using the Cleveland Heart Dataset: Part 2
In Part 2 of this EDA, missingness is explored using Amelia - II. This particular dataset did not contain missing values for imputation. A discussion regarding multiple imputation methods, such as MICE is included at the end of the notebook.
Preliminary Steps of EDA using Cleveland Heart dataset
In this notebook, I refined the formatting and style of my notebook while completing a class assignment designed to teach one aspect of data preprocessing - reclassification!
Exploratory Data Analysis Report - Part II
Assignment 7
Exploratory Data Analysis (EDA)
This is an EDA assignment using a provided data set. This assignment focuses on obtaining meaningful insights using data visualizations.
Heather_Northam_Cleaning your Data
Assignment 4: class assignment: DDS 8500 v1 National University
my_personalized_data
practice knit of code to HTML