Recently Published
Publish Document
Google Data Analytics Capstone: A Cyclistic Bike-Share Analysis
Wiesbaden, Marktplatz
Beispiel für Geocoding
Coursera: Prediction Assignment Writeup
The report explore die relationship between some variables of data from acelerometers on the belt, forearm, and dumbell of 6 participants. The goal is to predict the "classe" variable. For the prediction of the 20 predefined test data sets the method Support Vector Machine is used and delivers good results.
Analysis MPG difference between transmissions
The report explores the relationship between some variables of data mtcars and the reponse variable "miles per gallon (MPG)". There are particurlarly interested in the questions. First, is an automatic or manual transmission better for MPG? Second, how different is the MPG between automatic manual transmission? The exploration and analysis of the data mtcars provide that among all variables "Weight" and "Number of cylinders" have significat impact in quantifying the difference on "MPG" between automatic an transmission cars.
Excercise: Simulations of Exponential Distributation
The report is written as Peer Assignment in the course "Statistical Inference" at Johns Hopkins University".
The report investigate the exponential distribution and illustrate via simulation the properties of the distribution of the mean and variance.
* the sample mean and compare it to the theoretical mean of exponential
* the sample variance and compare it to the theoretical variance
* the distribution of mean is approximately normal
Presentation of the WebApp 'My Next Word'
Many people all over the world use a smartphone to communicate with other people via social media, sms or e-mail.
Typing can be painful, so a suggestion list with matching words would be helpful. The web app 'My_Next_Word' tries to support the user with a list of possible words and their probabilities to capture the next word.
Capstone Project - Milestone - Data Preparation and Exploratory Analysis
Capstone Project - Milestone - Data Preparation and Exploratory Analysis
The goal of the project “Data Science Capstone” is to build an application that can predict the next word based on an existing word sequence (e.g. two, three or four words).
For this purpose, a data source “Coursera-SwiftKey.zip” will be made available within the framework of the project, which provides three resources of blogs, news and Twitter in different languages (US English, German, Russian, Finnish).
URL “https://d396qusza40orc.cloudfront.net/dsscapstone/dataset/Coursera-SwiftKey.zip
The considerations in the further project are limited to the data sources in English.
The aim of this milestone report is to provide the data, the data cleaning, an exploratory data analysis and an outlook to the planned prediction algorithm of the next word.