Golden Helix · Clinical Genomics Guide
Somatic Variant Analysis
Tumor Profiling and Precision Oncology
A clinical guide for oncology labs. Tumor profiling workflows, AMP/ASCO/CAP classification, TMB and MSI and HRD biomarkers, liquid biopsy, and the analytical differences from germline NGS.
Introduction
The genetic profile of a tumor is
as clinical as its histology.
Cancer is, at its molecular core, a disease of accumulated somatic mutations. Every tumor carries a unique set of genetic alterations in oncogenes, tumor suppressors, DNA repair genes, and signaling pathways that drove its initiation and continue to shape its behavior, its response to treatment, and its potential for resistance. Identifying those alterations precisely, and translating them into clinical decisions, is the work of somatic oncology.
NGS made this practical at clinical scale. Where the first molecular oncology tests identified single variants in single genes (BRAF V600E by PCR, EGFR by Sanger), modern panels profile hundreds of cancer-relevant genes simultaneously, detect all major variant types, and quantify TMB and MSI from the same sequencing data. One test now provides more clinically actionable information than weeks of sequential single-gene work.
Foundation
Somatic vs Germline: A Fundamental Distinction
Understanding somatic analysis begins with understanding what makes somatic variants different from germline variants, biologically, analytically, and clinically.
The biology
Germline variants are present in every cell of the body from conception. They are inherited or arise de novo during early embryonic development, present at 50% in heterozygous carriers, and cause hereditary conditions including inherited cancer predisposition syndromes.
Somatic variants arise in individual cells after birth, through replication errors, mutagen exposure, or failed DNA repair. They are present only in cells descended from the original mutated cell. The clinical consequence: somatic variants are not at 50% VAF. They appear at the VAF of the tumor clone, which may be 80% in a pure tumor, 20% with normal contamination, or 0.1% in liquid biopsy where only a fraction of cfDNA is tumor-derived. Detecting low-VAF somatic variants with clinical confidence is the central technical challenge of somatic NGS.
| Dimension | Germline Testing | Somatic Testing |
|---|---|---|
| Clinical question | Does this patient have an inherited disease-causing variant? | Does this tumor have actionable molecular alterations? |
| Variant origin | Inherited or de novo at conception | Acquired in tumor cells after birth |
| Expected VAF | ~50% (het) or ~100% (hom) | Variable: 1–100% depending on tumor purity |
| Classification framework | ACMG/AMP 5-tier (pathogenicity) | AMP/ASCO/CAP 4-tier (actionability) |
| Primary databases | ClinVar, gnomAD, OMIM | Golden Helix CancerKB, CIViC, COSMIC, FDA labels |
| Sample type | Normal tissue (blood, saliva) | Tumor tissue or liquid biopsy |
| Inheritance modeling | Essential | Not applicable |
| Clinical output | Diagnosis of inherited condition | Therapy selection, prognosis, trial eligibility |
What to Detect
Variant Types in Somatic Analysis
Comprehensive tumor profiling must detect every variant class with therapeutic or prognostic relevance. Each type requires a specific detection approach.
Most common
Single Nucleotide Variants (SNVs)
BRAF V600E (50% of melanomas), EGFR L858R and exon 19 deletions (NSCLC), KRAS G12C (lung, colorectal), PIK3CA (~30% of breast cancers). The most therapeutically targeted variant class.
Loss of function
Insertions & Deletions
Frameshift indels in tumor suppressors drive loss of function. BRCA1/2 frameshifts predict PARP inhibitor sensitivity. EGFR exon 20 insertions need different inhibitors than exon 19 deletions: variant-level precision matters.
Amplification / loss
Copy Number Variants
ERBB2 (HER2) amplification in 15–20% of breast cancers drives HER2-targeted therapy. MET amplification, CDKN2A deletion, RB1 loss all carry treatment implications. CNV calling from targeted panels is technically harder than WGS.
Rearrangements
Gene Fusions
Among the most targetable alterations in oncology. BCR::ABL1 (CML), EML4::ALK (NSCLC), RET fusions (NSCLC and thyroid), NTRK1/2/3 (tumor-agnostic FDA approval). Detection needs dedicated fusion callers, ideally with RNA evidence.
Splicing
MET Exon 14 Skipping
Splice-site mutations that eliminate the exon 14 degron, preventing MET degradation and driving constitutive signaling. Present in 3–4% of NSCLC. Missed by DNA-only panels that do not evaluate splice junction evidence.
Genome-wide pattern
Mutational Signatures
COSMIC signatures encode the mutagenic processes that shaped the tumor: APOBEC, tobacco, UV, mismatch repair deficiency, BRCA-related HRD. Needs WGS or large panels. Increasingly used to predict immunotherapy and PARP inhibitor response.
Sample Reality
Tissue, FFPE & Liquid Biopsy
Sample quality and type profoundly affect what somatic analysis can detect and how reliably it can detect it.
- 01
Fresh frozen tissue
The gold standard. High DNA quality, high tumor content, all variant types detectable with high sensitivity. Rarely available outside research settings because pathology workflows are built around FFPE.
- 02
FFPE tissue (the clinical reality)
The standard clinical sample. Fixation causes DNA fragmentation and cytosine deamination artifacts (C→T and G→A) that look like real low-VAF variants. Clinical pipelines must filter known FFPE artifact patterns, apply VAF thresholds appropriate for FFPE quality, and use OxoG markers. Blocks older than 5–10 years often yield insufficient DNA.
- 03
Liquid biopsy (cfDNA / ctDNA)
Plasma sequencing of cell-free DNA. Tumor fraction can be <0.1% of total cfDNA, so detection requires >5,000x coverage, unique molecular identifiers for error correction, and dedicated ultra-low VAF callers. Applications: therapy monitoring, MRD, resistance mechanism identification, primary profiling when tissue is inaccessible.
Design Decision
Tumor-Normal vs Tumor-Only
One of the most consequential analytical decisions in somatic NGS is whether to sequence matched normal tissue alongside the tumor. The choice shapes specificity, germline inference, and what kinds of findings the report can and cannot make.
- 01
Tumor-Normal (paired) analysis
Sequence matched normal tissue (typically blood) alongside the tumor and directly subtract germline variants. Maximizes specificity, eliminates VAF-based germline inference, enables LOH detection, provides a reliable CNV baseline, and surfaces potentially pathogenic germline variants in the matched normal. The preferred approach for comprehensive genomic profiling.
- 02
Tumor-Only analysis
No matched normal. Use population databases (gnomAD, ExAC, dbSNP) and somatic databases (COSMIC hotspots) to distinguish somatic from germline computationally. Studies consistently show 4–12% variant misclassification compared to paired analysis. Appropriate when the panel is small and focused on well-characterized hotspots, when sample volume prevents normal collection, or when turnaround time precludes paired analysis. The choice must be explicit and disclosed in the report.
Analytical Pipeline
The Somatic Variant Calling Pipeline
Somatic calling differs from germline calling in sensitivity requirements, artifact management, and the databases used for filtering. Standard germline callers are tuned for 50% VAF heterozygous calls. Somatic callers must reliably operate well below that.
VAF sensitivity thresholds
- Solid tumor, FFPE, high purity: typically 5–20% minimum VAF
- Solid tumor, low purity, necrotic, or decellularized: down to 2–5% VAF
- Liquid biopsy: 0.1% VAF or lower for MRD and resistance monitoring
Sensitivity thresholds should be established during analytical validation for each specific assay. Not adopted from tool defaults.
Somatic-specific callers
- Mutect2 (GATK): the most widely used paired caller, designed for matched germline filtering.
- Strelka2: high-performance somatic SNV and indel caller tuned for clinical accuracy.
- VarScan2: supports paired and unpaired calling, common for tumor-only applications.
- DRAGEN Somatic: Illumina's hardware-accelerated pipeline, widely used at clinical scale.
No single caller handles every variant type. Gene fusion detection needs dedicated tools (STAR-Fusion, Arriba, FusionCatcher for RNA-based; Delly, Manta for DNA). CNV detection needs separate algorithms. A comprehensive pipeline integrates several specialized callers and reconciles their output.
Artifact filtering
At low VAF, artifact rates can rival true signal. Essential steps include strand-bias filtering (true variants appear on both strands; FFPE C→T artifacts are often strand-specific), population database filtering (gnomAD >0.1% is almost always germline), lab-specific blacklists of recurrent artifact positions, and minimum-depth thresholds appropriate for the assay.
Classification
AMP/ASCO/CAP in Depth
The 2017 AMP/ASCO/CAP guidelines (Li et al., Journal of Molecular Diagnostics) define the standard framework for interpreting and reporting somatic variants. Unlike ACMG germline classification, which focuses on pathogenicity, AMP focuses on actionability: the degree to which a variant influences treatment decisions.
Evidence levels feed tier assignment
- Level A: FDA-approved or standard-of-care guidelines for this tumor type. Examples: BRAF V600E in melanoma, EGFR L858R in NSCLC, HER2 amplification in breast cancer.
- Level B: Strong evidence or expert consensus from large clinical studies, but FDA approval in this tumor type may be absent.
- Level C: Clinical significance in a different tumor type. Relevant for off-label and basket trial eligibility.
- Level D: Preclinical data or limited clinical observations. Biological plausibility, not yet clinical actionability.
- 01
Tier I, Strong clinical significance
Supported by Level A or B evidence. Directly informs treatment decisions in routine clinical practice. The oncologist should act on Tier I findings. Examples: BRAF V600E (Tier I in melanoma, Tier II in colorectal due to different clinical significance), EGFR exon 19 deletion (Tier I in NSCLC).
- 02
Tier II, Potential clinical significance
Supported by Level C or D evidence, or by Level A/B evidence in a different tumor type. May inform off-label treatment, clinical trial referral, or molecular tumor board discussion. The same variant can be Tier I in one tumor type and Tier II in another. Context is everything.
- 03
Tier III, Unknown clinical significance
Rare variants with plausible functional impact but no established clinical association. Similar to VUS in germline classification. Reported with appropriate uncertainty. As evidence accumulates, Tier III variants may be reclassified.
- 04
Tier IV, Benign or likely benign
Common variants ruled out by population frequency or established as functionally silent. Not reported in the clinical context.
Immune Biomarkers
TMB, MSI & HRD
The most transformative development in precision oncology over the past decade has been immune checkpoint inhibitor therapy. Three genomic biomarkers now play a central role in identifying which patients are most likely to respond.
Mutation count
TMB (Tumor Mutational Burden)
Somatic mutations per megabase of sequenced genome. High TMB generates more neoantigens, making tumors more responsive to PD-1/PD-L1 inhibitors. The FDA granted tumor-agnostic pembrolizumab approval for solid tumors with TMB ≥10 mut/Mb in 2020, the first tumor-agnostic approval based on a quantitative genomic threshold. Calculation methodology varies between panels; cross-calibration matters.
Repair deficiency
MSI (Microsatellite Instability)
Defective mismatch repair (MLH1, MSH2, MSH6, PMS2) drives frameshift mutations at microsatellite loci. MSI-high tumors are among the most responsive to checkpoint immunotherapy across tumor types. The FDA approved pembrolizumab pan-tumor for MSI-H/dMMR in 2017, the first tumor-agnostic cancer drug approval. Calculable directly from NGS data.
HR pathway
HRD (Homologous Recombination Deficiency)
Caused by BRCA1/2 pathogenic variants or BRCA1 epigenetic silencing. HRD tumors cannot repair double-strand breaks, making them hypersensitive to PARP inhibitors (olaparib, rucaparib, niraparib, talazoparib) and platinum chemotherapy. HRD scores combine LOH, telomeric allelic imbalance, and large-scale state transitions. Most relevant in breast, ovarian, prostate, and pancreatic cancers.
Cross-Boundary Finding
Germline Variants Found During Tumor Profiling
As tumor-based somatic NGS expands, a growing issue has emerged: a meaningful fraction of variants identified during tumor profiling are actually germline in origin, and some are clinically significant cancer predisposition variants.
Studies estimate that 4 to 12% of patients undergoing tumor-based molecular profiling carry pathogenic or likely pathogenic germline variants relevant to their cancer, in genes including BRCA1, BRCA2, MLH1, MSH2, TP53, and CDH1. These are often detectable in tumor data because the germline variant is present in all cells (including tumor cells) at ~50% VAF.
When a likely germline pathogenic variant is identified in tumor profiling:
- Inform the ordering oncologist with a clear explanation of the somatic vs germline distinction.
- Offer genetic counseling referral: the patient may have hereditary predisposition with implications for surveillance, risk-reducing interventions, and family member testing.
- Recognize treatment implications: germline BRCA1/2 mutations predict PARP inhibitor sensitivity and may qualify the patient for trials designed for hereditary cancer predisposition.
- Perform a separate germline confirmation test on normal tissue before clinical action is taken on a variant identified in tumor tissue.
Output
The Somatic Clinical Report
A CAP-compliant somatic NGS report must communicate complex molecular findings clearly enough for an oncologist to act on them, accurately enough to support therapeutic decisions, and transparently enough to disclose analytical limitations.
Required content
- Specimen information: tumor type, biopsy site, sample date, tumor purity, DNA yield, sequencing quality
- Methodology: panel name and version, genes covered, sequencing platform, pipeline version, minimum coverage thresholds, reference build, variant callers, tumor-normal or tumor-only
- Tier I and II variants with HGVS, VAF, depth, tier classification, evidence level, matched therapies or trials
- Biomarkers: TMB (mut/Mb), MSI status (stable, indeterminate, high), HRD score where applicable
- Tier III variants reported with uncertainty disclosure
- Negative findings: which genes were analyzed and found negative, coverage gaps or low-depth regions
- Limitations: FFPE artifacts, tumor-only constraints, variant types not detected by this assay
- Therapeutic matching: FDA-approved therapies for Tier I, relevant clinical trials and off-label options for Tier II
- Laboratory director signature and contact
Clinical trial matching
A report identifying actionable variants should actively facilitate trial access. Include relevant ClinicalTrials.gov entries matching the patient's tumor type and identified alterations, so the oncologist can discuss eligibility at the same appointment where molecular results are reviewed.
Common Questions
Frequently Asked Questions
What is somatic analysis?
What does somatic testing mean?
How does somatic analysis differ from germline analysis?
What is tumor mutational burden and why does it matter?
What is the difference between tumor-only and tumor-normal sequencing?
What is MSI and how is it calculated from NGS?
Can liquid biopsy replace tissue biopsy?
Keep Reading
Related Resources
Somatic analysis intersects germline, tertiary, and infrastructure topics across the Golden Helix guide series.
Build a Clinical Somatic Program
Golden Helix's precision oncology software provides AMP/ASCO/CAP-guided somatic interpretation, TMB and MSI and HRD biomarker calculation, therapeutic matching, and clinical trial recommendations in a single validated platform for oncology labs.