Cancer Genomics / TCGA Analysis

TCGA analysis starts with: what cancer type? what data type? Build your cohort FIRST (GDC filters), then analyze. Don't query mutations without defining the cohort — pan-cancer counts from GDC_get_mutation_frequency are uninformative without cancer-type context. A mutation frequency of 10% in one cancer type may be 0.5% in another; always specify project_id. Survival analysis (Kaplan-Meier) is hypothesis-generating in retrospective TCGA data — always report sample size and p-value, and note that TCGA cohorts are not treatment-stratified.

LOOK UP DON'T GUESS: never assume TCGA project IDs, NCIt codes, or gene coordinates — use GDC_list_projects to confirm project IDs and Progenetix_list_filtering_terms for NCIt codes.

Systematic TCGA/GDC analysis: define cohorts, retrieve clinical data, profile somatic mutations, query copy number variations, run survival analysis, and interpret variants with OncoKB.

When to Use

"What is the mutation frequency of TP53 in TCGA-BRCA?"
"Get survival data for TCGA-LUAD patients"
"Find clinical data for breast cancer cases in GDC"
"Which TCGA projects have KRAS G12C mutations?"
"Show CNV amplifications of EGFR in glioblastoma"
"Annotate BRAF V600E for clinical significance in melanoma"

tooluniverse-cancer-genomics-tcga

Cancer Genomics / TCGA Analysis

When to Use