Recently Published
Final Project
Data101 Professor Raunak Singh Chhabra Final Project Seung Hwan Oh
Analysis of Intrahepatic Cholangiocarcinoma ICC gene expression study GSE316921 for EBV association and Machine Learning by class
In this short project, we do a quick exploratory data analysis without plotting just reading the series text information and then use fold change data of the already supplied data set on intrahepatic cholangiocarcinoma or ICC that used two groups to compare hypoxia in ICC wild type and in ICC with a knockout inhibition of the SLC2A1 gene with short hairpin version or shSLC2A1. We then separated the groups after getting fold change values for top genes stimulated and inhbited without infinites, NaNs or 0s, and used a quick random forest classifier to tell if the genes would be good in predicting the class of the group. And 100% accuracy in both groups for training and testing. Small set of samples, we then looked at the EBV genes of recent studies in our projects to compare LMP1, LMP2, PDL1 or CD274, and PDL2 or PDCD1LG2. Only the PDL1 and PDL2 genes were included and both inhibited in both groups.