Recently Published
Caso de Práctica 1: Colombina - Juan Jose Zuluaga Suarez
Caso de Práctica 1: Colombina - Juan Jose Zuluaga Suarez
NYC 2013 Flights Data Cleaning
The main objective of this cleaning process is to transform the raw, segmented temporal data of the NYC 2013 flights dataset into a continuous and usable timeline. By converting military-style integers into formal time objects and reconciling scheduled times with their respective delays, the script aims to create accurate, high-fidelity datetime columns. A critical component of this objective is the implementation of logical corrections for overnight flights, ensuring that arrivals occurring after midnight are correctly attributed to the following calendar day, thereby maintaining temporal integrity for subsequent analysis.
Bavanam Poojitha
In this analysis, I examined whether the intervention reduced participants’ stress levels. The same participants were measured before the intervention (Stress_Pre) and after the intervention (Stress_Post), making this a dependent (paired) design.
My research question was: Is there a significant difference in stress levels before and after the intervention?
The null hypothesis stated that there is no difference in stress scores before and after the intervention. The alternative hypothesis stated that there is a significant difference in stress scores.
Before conducting the inferential test, I examined the assumption of normality using a histogram, boxplot, and Shapiro-Wilk test on the difference scores (After − Before). Although the histogram appeared roughly symmetrical and moderately bell-shaped, the Shapiro-Wilk test was statistically significant (p < .05), indicating that the data were not normally distributed. Additionally, while there were a couple of outliers in the boxplot, they were not far from the whiskers and were not considered severe. Because the normality assumption was violated, I proceeded with a Wilcoxon Signed-Rank test instead of a paired t-test.
The results showed a statistically significant difference between stress levels before and after the intervention, V = 620, p < .001. The median stress score decreased from 47.24 before the intervention to 40.85 after the intervention. The effect size was large (r₍rb₎ = .84), indicating a strong reduction in stress levels following the intervention.