Set up an “overall survival” i.e. time for each patient in the cohort
13
For event/strata information for each patient, fetch gene expression data from GDC portal
14
Build query using GDCquery
15
Download data using GDCdownload
16
Extract counts using GDCprepare
17
Perform Variance Stabilization Transformation vst on counts before further analysis
18
Wrangle data to get the relevant data and data in the right shape
19
Approaches to divide cohort into 2 groups based on expression
20
Bifurcating patients into low and high TP53 expression groups
21
Define strata for each patient
22
Compute a survival curve using survfit and creating a Kaplan-Meier curve using ggsruvplot
23
survfit vs survdiff
Description:
Learn how to perform survival analysis using TCGA data in R, focusing on creating Kaplan-Meier curves. Explore the concept of survival analysis, its applications, and essential components like censoring. Discover methods for survival analysis and how to interpret Kaplan-Meier curves. Follow step-by-step instructions to download clinical data from the GDC portal, prepare gene expression data, and perform variance stabilization transformation. Master techniques for dividing cohorts based on expression levels, defining strata, and computing survival curves using survfit() and ggsruvplot(). Gain practical skills in utilizing the survival and survminer packages to conduct comprehensive survival analysis in R.
Survival Analysis with TCGA Data in R - Create Kaplan-Meier Curves