Golden Helix · Clinical Genomics Guide

Somatic Variant Analysis
Tumor Profiling and Precision Oncology

A clinical guide for oncology labs. Tumor profiling workflows, AMP/ASCO/CAP classification, TMB and MSI and HRD biomarkers, liquid biopsy, and the analytical differences from germline NGS.

Tumor ProfilingAMP ClassificationTMB · MSI · HRDLiquid BiopsyTumor-Normal vs Tumor-Only

Introduction

The genetic profile of a tumor is
as clinical as its histology.

Cancer is, at its molecular core, a disease of accumulated somatic mutations. Every tumor carries a unique set of genetic alterations in oncogenes, tumor suppressors, DNA repair genes, and signaling pathways that drove its initiation and continue to shape its behavior, its response to treatment, and its potential for resistance. Identifying those alterations precisely, and translating them into clinical decisions, is the work of somatic oncology.

NGS made this practical at clinical scale. Where the first molecular oncology tests identified single variants in single genes (BRAF V600E by PCR, EGFR by Sanger), modern panels profile hundreds of cancer-relevant genes simultaneously, detect all major variant types, and quantify TMB and MSI from the same sequencing data. One test now provides more clinically actionable information than weeks of sequential single-gene work.

4
AMP/ASCO/CAP classification tiers (I, II, III, IV)
≥10
Mut/Mb TMB threshold for tumor-agnostic immunotherapy approval
4–12%
Tumor profiles harboring a germline cancer predisposition variant
0.1%
Minimum ctDNA VAF for liquid biopsy MRD detection
~50%
Melanomas that carry BRAF V600E
2017
Year AMP/ASCO/CAP somatic framework was published

Foundation

Somatic vs Germline: A Fundamental Distinction

Understanding somatic analysis begins with understanding what makes somatic variants different from germline variants, biologically, analytically, and clinically.

The biology

Germline variants are present in every cell of the body from conception. They are inherited or arise de novo during early embryonic development, present at 50% in heterozygous carriers, and cause hereditary conditions including inherited cancer predisposition syndromes.

Somatic variants arise in individual cells after birth, through replication errors, mutagen exposure, or failed DNA repair. They are present only in cells descended from the original mutated cell. The clinical consequence: somatic variants are not at 50% VAF. They appear at the VAF of the tumor clone, which may be 80% in a pure tumor, 20% with normal contamination, or 0.1% in liquid biopsy where only a fraction of cfDNA is tumor-derived. Detecting low-VAF somatic variants with clinical confidence is the central technical challenge of somatic NGS.

DimensionGermline TestingSomatic Testing
Clinical questionDoes this patient have an inherited disease-causing variant?Does this tumor have actionable molecular alterations?
Variant originInherited or de novo at conceptionAcquired in tumor cells after birth
Expected VAF~50% (het) or ~100% (hom)Variable: 1–100% depending on tumor purity
Classification frameworkACMG/AMP 5-tier (pathogenicity)AMP/ASCO/CAP 4-tier (actionability)
Primary databasesClinVar, gnomAD, OMIMGolden Helix CancerKB, CIViC, COSMIC, FDA labels
Sample typeNormal tissue (blood, saliva)Tumor tissue or liquid biopsy
Inheritance modelingEssentialNot applicable
Clinical outputDiagnosis of inherited conditionTherapy selection, prognosis, trial eligibility

What to Detect

Variant Types in Somatic Analysis

Comprehensive tumor profiling must detect every variant class with therapeutic or prognostic relevance. Each type requires a specific detection approach.

Most common

Single Nucleotide Variants (SNVs)

BRAF V600E (50% of melanomas), EGFR L858R and exon 19 deletions (NSCLC), KRAS G12C (lung, colorectal), PIK3CA (~30% of breast cancers). The most therapeutically targeted variant class.

Loss of function

Insertions & Deletions

Frameshift indels in tumor suppressors drive loss of function. BRCA1/2 frameshifts predict PARP inhibitor sensitivity. EGFR exon 20 insertions need different inhibitors than exon 19 deletions: variant-level precision matters.

Amplification / loss

Copy Number Variants

ERBB2 (HER2) amplification in 15–20% of breast cancers drives HER2-targeted therapy. MET amplification, CDKN2A deletion, RB1 loss all carry treatment implications. CNV calling from targeted panels is technically harder than WGS.

Rearrangements

Gene Fusions

Among the most targetable alterations in oncology. BCR::ABL1 (CML), EML4::ALK (NSCLC), RET fusions (NSCLC and thyroid), NTRK1/2/3 (tumor-agnostic FDA approval). Detection needs dedicated fusion callers, ideally with RNA evidence.

Splicing

MET Exon 14 Skipping

Splice-site mutations that eliminate the exon 14 degron, preventing MET degradation and driving constitutive signaling. Present in 3–4% of NSCLC. Missed by DNA-only panels that do not evaluate splice junction evidence.

Genome-wide pattern

Mutational Signatures

COSMIC signatures encode the mutagenic processes that shaped the tumor: APOBEC, tobacco, UV, mismatch repair deficiency, BRCA-related HRD. Needs WGS or large panels. Increasingly used to predict immunotherapy and PARP inhibitor response.

Sample Reality

Tissue, FFPE & Liquid Biopsy

Sample quality and type profoundly affect what somatic analysis can detect and how reliably it can detect it.

  • 01

    Fresh frozen tissue

    The gold standard. High DNA quality, high tumor content, all variant types detectable with high sensitivity. Rarely available outside research settings because pathology workflows are built around FFPE.

  • 02

    FFPE tissue (the clinical reality)

    The standard clinical sample. Fixation causes DNA fragmentation and cytosine deamination artifacts (C→T and G→A) that look like real low-VAF variants. Clinical pipelines must filter known FFPE artifact patterns, apply VAF thresholds appropriate for FFPE quality, and use OxoG markers. Blocks older than 5–10 years often yield insufficient DNA.

  • 03

    Liquid biopsy (cfDNA / ctDNA)

    Plasma sequencing of cell-free DNA. Tumor fraction can be <0.1% of total cfDNA, so detection requires >5,000x coverage, unique molecular identifiers for error correction, and dedicated ultra-low VAF callers. Applications: therapy monitoring, MRD, resistance mechanism identification, primary profiling when tissue is inaccessible.

Design Decision

Tumor-Normal vs Tumor-Only

One of the most consequential analytical decisions in somatic NGS is whether to sequence matched normal tissue alongside the tumor. The choice shapes specificity, germline inference, and what kinds of findings the report can and cannot make.

  • 01

    Tumor-Normal (paired) analysis

    Sequence matched normal tissue (typically blood) alongside the tumor and directly subtract germline variants. Maximizes specificity, eliminates VAF-based germline inference, enables LOH detection, provides a reliable CNV baseline, and surfaces potentially pathogenic germline variants in the matched normal. The preferred approach for comprehensive genomic profiling.

  • 02

    Tumor-Only analysis

    No matched normal. Use population databases (gnomAD, ExAC, dbSNP) and somatic databases (COSMIC hotspots) to distinguish somatic from germline computationally. Studies consistently show 4–12% variant misclassification compared to paired analysis. Appropriate when the panel is small and focused on well-characterized hotspots, when sample volume prevents normal collection, or when turnaround time precludes paired analysis. The choice must be explicit and disclosed in the report.

Analytical Pipeline

The Somatic Variant Calling Pipeline

Somatic calling differs from germline calling in sensitivity requirements, artifact management, and the databases used for filtering. Standard germline callers are tuned for 50% VAF heterozygous calls. Somatic callers must reliably operate well below that.

VAF sensitivity thresholds

  • Solid tumor, FFPE, high purity: typically 5–20% minimum VAF
  • Solid tumor, low purity, necrotic, or decellularized: down to 2–5% VAF
  • Liquid biopsy: 0.1% VAF or lower for MRD and resistance monitoring

Sensitivity thresholds should be established during analytical validation for each specific assay. Not adopted from tool defaults.

Somatic-specific callers

  • Mutect2 (GATK): the most widely used paired caller, designed for matched germline filtering.
  • Strelka2: high-performance somatic SNV and indel caller tuned for clinical accuracy.
  • VarScan2: supports paired and unpaired calling, common for tumor-only applications.
  • DRAGEN Somatic: Illumina's hardware-accelerated pipeline, widely used at clinical scale.

No single caller handles every variant type. Gene fusion detection needs dedicated tools (STAR-Fusion, Arriba, FusionCatcher for RNA-based; Delly, Manta for DNA). CNV detection needs separate algorithms. A comprehensive pipeline integrates several specialized callers and reconciles their output.

Artifact filtering

At low VAF, artifact rates can rival true signal. Essential steps include strand-bias filtering (true variants appear on both strands; FFPE C→T artifacts are often strand-specific), population database filtering (gnomAD >0.1% is almost always germline), lab-specific blacklists of recurrent artifact positions, and minimum-depth thresholds appropriate for the assay.

Classification

AMP/ASCO/CAP in Depth

The 2017 AMP/ASCO/CAP guidelines (Li et al., Journal of Molecular Diagnostics) define the standard framework for interpreting and reporting somatic variants. Unlike ACMG germline classification, which focuses on pathogenicity, AMP focuses on actionability: the degree to which a variant influences treatment decisions.

Evidence levels feed tier assignment

  • Level A: FDA-approved or standard-of-care guidelines for this tumor type. Examples: BRAF V600E in melanoma, EGFR L858R in NSCLC, HER2 amplification in breast cancer.
  • Level B: Strong evidence or expert consensus from large clinical studies, but FDA approval in this tumor type may be absent.
  • Level C: Clinical significance in a different tumor type. Relevant for off-label and basket trial eligibility.
  • Level D: Preclinical data or limited clinical observations. Biological plausibility, not yet clinical actionability.
  • 01

    Tier I, Strong clinical significance

    Supported by Level A or B evidence. Directly informs treatment decisions in routine clinical practice. The oncologist should act on Tier I findings. Examples: BRAF V600E (Tier I in melanoma, Tier II in colorectal due to different clinical significance), EGFR exon 19 deletion (Tier I in NSCLC).

  • 02

    Tier II, Potential clinical significance

    Supported by Level C or D evidence, or by Level A/B evidence in a different tumor type. May inform off-label treatment, clinical trial referral, or molecular tumor board discussion. The same variant can be Tier I in one tumor type and Tier II in another. Context is everything.

  • 03

    Tier III, Unknown clinical significance

    Rare variants with plausible functional impact but no established clinical association. Similar to VUS in germline classification. Reported with appropriate uncertainty. As evidence accumulates, Tier III variants may be reclassified.

  • 04

    Tier IV, Benign or likely benign

    Common variants ruled out by population frequency or established as functionally silent. Not reported in the clinical context.

Immune Biomarkers

TMB, MSI & HRD

The most transformative development in precision oncology over the past decade has been immune checkpoint inhibitor therapy. Three genomic biomarkers now play a central role in identifying which patients are most likely to respond.

Mutation count

TMB (Tumor Mutational Burden)

Somatic mutations per megabase of sequenced genome. High TMB generates more neoantigens, making tumors more responsive to PD-1/PD-L1 inhibitors. The FDA granted tumor-agnostic pembrolizumab approval for solid tumors with TMB ≥10 mut/Mb in 2020, the first tumor-agnostic approval based on a quantitative genomic threshold. Calculation methodology varies between panels; cross-calibration matters.

Repair deficiency

MSI (Microsatellite Instability)

Defective mismatch repair (MLH1, MSH2, MSH6, PMS2) drives frameshift mutations at microsatellite loci. MSI-high tumors are among the most responsive to checkpoint immunotherapy across tumor types. The FDA approved pembrolizumab pan-tumor for MSI-H/dMMR in 2017, the first tumor-agnostic cancer drug approval. Calculable directly from NGS data.

HR pathway

HRD (Homologous Recombination Deficiency)

Caused by BRCA1/2 pathogenic variants or BRCA1 epigenetic silencing. HRD tumors cannot repair double-strand breaks, making them hypersensitive to PARP inhibitors (olaparib, rucaparib, niraparib, talazoparib) and platinum chemotherapy. HRD scores combine LOH, telomeric allelic imbalance, and large-scale state transitions. Most relevant in breast, ovarian, prostate, and pancreatic cancers.

Cross-Boundary Finding

Germline Variants Found During Tumor Profiling

As tumor-based somatic NGS expands, a growing issue has emerged: a meaningful fraction of variants identified during tumor profiling are actually germline in origin, and some are clinically significant cancer predisposition variants.

Studies estimate that 4 to 12% of patients undergoing tumor-based molecular profiling carry pathogenic or likely pathogenic germline variants relevant to their cancer, in genes including BRCA1, BRCA2, MLH1, MSH2, TP53, and CDH1. These are often detectable in tumor data because the germline variant is present in all cells (including tumor cells) at ~50% VAF.

When a likely germline pathogenic variant is identified in tumor profiling:

  • Inform the ordering oncologist with a clear explanation of the somatic vs germline distinction.
  • Offer genetic counseling referral: the patient may have hereditary predisposition with implications for surveillance, risk-reducing interventions, and family member testing.
  • Recognize treatment implications: germline BRCA1/2 mutations predict PARP inhibitor sensitivity and may qualify the patient for trials designed for hereditary cancer predisposition.
  • Perform a separate germline confirmation test on normal tissue before clinical action is taken on a variant identified in tumor tissue.

Output

The Somatic Clinical Report

A CAP-compliant somatic NGS report must communicate complex molecular findings clearly enough for an oncologist to act on them, accurately enough to support therapeutic decisions, and transparently enough to disclose analytical limitations.

Required content

  • Specimen information: tumor type, biopsy site, sample date, tumor purity, DNA yield, sequencing quality
  • Methodology: panel name and version, genes covered, sequencing platform, pipeline version, minimum coverage thresholds, reference build, variant callers, tumor-normal or tumor-only
  • Tier I and II variants with HGVS, VAF, depth, tier classification, evidence level, matched therapies or trials
  • Biomarkers: TMB (mut/Mb), MSI status (stable, indeterminate, high), HRD score where applicable
  • Tier III variants reported with uncertainty disclosure
  • Negative findings: which genes were analyzed and found negative, coverage gaps or low-depth regions
  • Limitations: FFPE artifacts, tumor-only constraints, variant types not detected by this assay
  • Therapeutic matching: FDA-approved therapies for Tier I, relevant clinical trials and off-label options for Tier II
  • Laboratory director signature and contact

Clinical trial matching

A report identifying actionable variants should actively facilitate trial access. Include relevant ClinicalTrials.gov entries matching the patient's tumor type and identified alterations, so the oncologist can discuss eligibility at the same appointment where molecular results are reviewed.

Common Questions

Frequently Asked Questions

What is somatic analysis?
Somatic variant analysis is the process of identifying and interpreting genetic variants in tumor DNA that have been acquired during the development of cancer, rather than inherited. It involves sequencing tumor tissue or liquid biopsy, calling variants at potentially very low allele frequencies, filtering germline background, and classifying detected variants according to clinical actionability using the AMP/ASCO/CAP framework. The output is a clinical report identifying which molecular alterations in the patient's tumor are actionable for treatment selection, prognosis, or clinical trial enrollment.
What does somatic testing mean?
Somatic testing means ordering a molecular profiling test on a patient's tumor (rather than on normal tissue) to identify the somatic mutations driving that specific cancer. In clinical practice, somatic testing is ordered to determine whether a patient's tumor carries variants that predict response to targeted therapies (such as BRAF inhibitors, EGFR inhibitors, or PARP inhibitors), to assess immunotherapy biomarkers (TMB, MSI, HRD), or to identify mutations associated with clinical trials. Somatic testing is distinct from germline genetic testing, which identifies inherited variants.
How does somatic analysis differ from germline analysis?
The fundamental difference is what question is being asked. Germline analysis asks whether a patient inherited a disease-causing variant: it focuses on pathogenicity and is relevant for diagnosing inherited conditions. Somatic analysis asks whether a tumor has acquired molecular alterations that affect treatment decisions: it focuses on actionability and is specific to cancer. The two analyses use different sample types, different variant callers, different classification frameworks, different databases, and produce different clinical outputs. Many cancer patients need both. Increasingly, tumor profiling identifies variants of both somatic and germline origin simultaneously.
What is tumor mutational burden and why does it matter?
Tumor mutational burden (TMB) is the total number of somatic mutations per megabase of sequenced genome in a tumor sample. High TMB indicates a tumor has accumulated many mutations (often due to defective DNA repair or high mutagen exposure), generating numerous neoantigens that the immune system can recognize. This makes high-TMB tumors more responsive to immune checkpoint inhibitors. The FDA approved pembrolizumab for any advanced solid tumor with TMB ≥10 mutations per megabase in 2020, the first tumor-agnostic approval based on a quantitative genomic threshold. TMB is now a standard component of comprehensive tumor profiling.
What is the difference between tumor-only and tumor-normal sequencing?
Tumor-only sequencing analyzes DNA extracted from tumor tissue alone, using population databases and computational methods to distinguish somatic from germline variants. Tumor-normal (paired) sequencing sequences both tumor tissue and matched normal tissue (typically blood), enabling direct subtraction of germline variants from the somatic call set. Tumor-normal analysis provides higher specificity for somatic variant detection and directly identifies germline variants that may indicate hereditary cancer predisposition. It is the preferred approach for comprehensive genomic profiling. Tumor-only analysis is used when matched normal tissue is unavailable, but laboratories should validate their tumor-only filtering pipeline and clearly disclose this limitation in clinical reports.
What is MSI and how is it calculated from NGS?
Microsatellite instability (MSI) results from defective DNA mismatch repair: failure of the MMR protein complex (MLH1, MSH2, MSH6, PMS2) to correct DNA replication errors at short repetitive sequences called microsatellites. Accumulation of frameshift mutations at microsatellites generates abundant neoantigens, making MSI-high tumors among the most responsive to checkpoint immunotherapy across tumor types. MSI status can be calculated computationally from NGS data by measuring the instability of microsatellite loci within the sequenced regions, correlating well with traditional PCR-based MSI testing and IHC for MMR protein expression. Most comprehensive genomic profiling panels now report MSI status directly.
Can liquid biopsy replace tissue biopsy?
Liquid biopsy complements tissue biopsy rather than replacing it. ctDNA-based testing is the preferred approach for therapy monitoring, minimal residual disease detection, resistance mechanism identification at progression, and primary profiling when safe tissue biopsy is not possible. For initial diagnosis and comprehensive profiling at the start of treatment, tissue remains the standard where it is accessible because tumor content is higher and the full spectrum of variant types is more reliably detectable. The two approaches are increasingly used together throughout a cancer treatment journey.

Build a Clinical Somatic Program

Golden Helix's precision oncology software provides AMP/ASCO/CAP-guided somatic interpretation, TMB and MSI and HRD biomarker calculation, therapeutic matching, and clinical trial recommendations in a single validated platform for oncology labs.