Golden Helix · Clinical Genomics Guide

Genome Interpretation
How Genomic Data Becomes a Clinical Diagnosis

Sequencing a genome is now routine. Interpreting one is not. This guide covers the concepts, frameworks, and evidence types that underpin rigorous genome interpretation, from the four types of genetic conditions through the ACMG criteria to what clinicians and patients should actually do with their results.

ACMG FrameworkEvidence HierarchyVUS StrategyGenotype-PhenotypeReading Reports

Introduction

Sequencing a genome is routine.
Interpreting one is not.

A whole genome sequencing run takes a day. It produces gigabytes of data and millions of variant calls. But none of those variants, on their own, tell a clinician whether a patient has an inherited disease, which therapy to prescribe, or whether a family member should be tested. That requires genome interpretation: the complex, knowledge-intensive process of determining which genetic variants are clinically significant and what to do about them.

Genome interpretation sits at the intersection of molecular biology, population genetics, clinical medicine, and computational science. It requires understanding not just what a variant does to a protein, but whether it has been seen before in affected patients, how common it is in the general population, whether it fits the inheritance pattern, and whether the current evidence is sufficient to act on it. This is why genome interpretation is considered the most difficult stage of the clinical genomics workflow, not technically but intellectually, and why interpretation quality varies more across labs than any other pipeline stage.

4–5M
Variants per individual relative to the reference
28
ACMG/AMP classification criteria
5
Pathogenicity tiers: P, LP, VUS, LB, B
30–50%
Typical VUS rate in WES rare disease
17%
Variants Pathogenic at one lab but VUS at another (ClinVar)
10–30%
VUS reclassification rate over 3–5 years

Definition

What Is Genome Interpretation?

Genome interpretation, also called variant interpretation, is the process of evaluating genetic variants identified during sequencing to determine their clinical significance. It begins where computational analysis ends, at a list of variants in a VCF file, each annotated with biological context, and ends at a defensible clinical conclusion about which variants cause or contribute to the patient's condition.

The four questions for every candidate variant

  • Is the variant real? Quality assessment: is this a true variant or a sequencing artifact?
  • Is it rare enough to be disease-causing? Population frequency in the relevant ancestral group.
  • Does it affect gene function? Effect on protein, transcript, or expression.
  • Does it fit the clinical picture? Gene-disease relationship matches the phenotype, inheritance fits the family history.

Every variant that reaches clinical reporting has been evaluated against all four. The answers, weighted by strength of evidence, determine the final classification. Genome interpretation sits within the broader NGS pipeline as the clinical component of tertiary analysis. For the upstream steps (annotation, filtering, prioritization) that precede formal classification, see the tertiary analysis guide.

Context

The Four Types of Genetic Conditions

The clinical context shapes interpretation. The most important contextual factor is what type of genetic condition is being investigated. The four types differ in inheritance, genetic architecture, and penetrance, and therefore in how variants are interpreted and weighted.

1 · Mendelian

Single-Gene Conditions

Caused by pathogenic variants in a single gene following defined inheritance (AD, AR, X-linked, mitochondrial). 7,000+ identified, caused by variants in 4,000+ genes. The primary focus of germline variant interpretation. ACMG/AMP was developed for this. Examples: cystic fibrosis (CFTR, AR), Huntington (HTT, AD), Duchenne (DMD, X-linked), BRCA1/2 (AD, incomplete penetrance).

2 · Chromosomal

Chromosomal Conditions

Abnormalities in chromosome number or large-scale structure: gains, losses, duplications, deletions, inversions, translocations affecting megabases to entire chromosomes. Affect hundreds to thousands of genes simultaneously. Mostly de novo. Detected by karyotype, chromosomal microarray, and CNV calling from WGS. Examples: Down syndrome, 22q11.2 deletion, Turner syndrome, Klinefelter, Angelman/Prader-Willi.

3 · Multifactorial

Complex Conditions

Combined effect of many variants across many genes interacting with environment. No single variant is necessary or sufficient. GWAS-identified variants have small effects individually (OR 1.1–1.5) and are clinically actionable only as polygenic risk scores. ACMG/AMP does not apply. Examples: type 2 diabetes, coronary artery disease, schizophrenia, age-related macular degeneration, most common cancers.

4 · Mitochondrial

Mitochondrial Conditions

Pathogenic variants in mtDNA (maternally inherited) or in nuclear genes encoding mitochondrial proteins (Mendelian). Heteroplasmy is the defining challenge: each cell contains hundreds to thousands of mitochondria, and the proportion of mutant mtDNA varies by tissue and individual. 10% heteroplasmy may be silent; 80% causes severe disease. Examples: MELAS, Leigh syndrome, Leber hereditary optic neuropathy.

The Inputs

The Evidence Hierarchy

Genome interpretation is fundamentally an evidence-weighing exercise. No single piece is sufficient. Pathogenicity conclusions are built by integrating multiple independent lines of evidence, each contributing different information about biological and clinical significance. The ACMG/AMP framework organizes evidence into the following categories, each with a defined weight.

  • 01

    Population frequency

    How common is this variant in the general population? Rare variants are more likely to be disease-causing; common variants are almost always benign. Key databases: gnomAD (800,000+ individuals across diverse populations). Frequency thresholds are condition-specific. ACMG criteria: BA1 (very high frequency, stand-alone benign), BS1 (high frequency), PM2 (absent or very low).

  • 02

    Functional evidence

    Has the variant been shown experimentally to disrupt protein activity, localization, stability, or interaction? Sources include biochemical assays, cell-based studies, animal models, RNA studies, and protein structural analysis. Among the strongest available evidence, but most variants have not been characterized. ACMG criteria: PS3 (functional studies show damaging effect), BS3 (no damaging effect).

  • 03

    Computational (in silico)

    Predicted impact based on evolutionary conservation, protein structure, and machine learning. Tools: SIFT, PolyPhen-2, CADD, REVEL, BayesDel, SpliceAI, AlphaMissense. Supporting evidence only. Never primary. The 2023 ClinGen SVI calibration recommends tool-specific score thresholds rather than generic "tools agree" calls. ACMG criteria: PP3 (damaging predicted), BP4 (benign predicted).

  • 04

    Segregation evidence

    Does the variant co-segregate with disease in the family? Affected relatives carry it; unaffected relatives do not. Strength depends on the number of informative meioses. Small families provide limited information; large multi-generation pedigrees with multiple affected and unaffected members are highly informative. ACMG criteria: PP1 (co-segregation), BS4 (lack of segregation).

  • 05

    De novo evidence

    Did this variant arise newly in the patient (absent from both biological parents)? High evidential weight for severe early-onset dominant conditions. Why trio sequencing substantially improves diagnostic yield: without parental data, de novo variants look like any other rare heterozygous variant. ACMG criteria: PS2 (de novo with confirmed parentage), PM6 (assumed de novo without confirmed parentage).

  • 06

    Literature and database evidence

    Has this exact variant, or one at the same position with the same amino acid change, been reported in affected patients before? Sources: ClinVar (star-rated submitter classifications), HGMD, Mastermind. A variant with multiple independent Pathogenic submissions carries strong weight; the same variant reported by a single lab without supporting evidence carries much less. ACMG criteria: PS1, PM5, PP5.

  • 07

    Phenotype specificity

    Does the patient's presentation match what is known about the gene's disease spectrum? A variant in a gene associated with a highly specific syndrome (e.g., Marfan features and an FBN1 variant) carries more weight than the same variant in a gene with a broad, non-specific phenotype. Operationalized through HPO term matching. ACMG criteria: PP4.

The Framework

ACMG Classification: 28 Criteria, 5 Tiers

The 2015 ACMG/AMP variant classification framework (Richards et al., Genetics in Medicine) is the global standard for germline variant interpretation. It defines 28 criteria organized by evidence type and strength, and rules for combining them to reach a 5-tier classification.

The five tiers

  • Pathogenic (P): the variant causes disease. Clinical management should be guided by this finding. Typically requires either PVS1 plus supporting evidence, or multiple Strong criteria.
  • Likely Pathogenic (LP): probability of disease-causing exceeds 90%. Clinically managed the same as Pathogenic in most contexts, with appropriate disclosure of remaining uncertainty.
  • Variant of Uncertain Significance (VUS): insufficient evidence to classify. Cannot guide clinical management. See the dedicated VUS section below.
  • Likely Benign (LB): probability of being benign exceeds 90%. Not reported in most clinical contexts but retained in the lab database for future reclassification monitoring.
  • Benign (B): the variant does not cause disease. Established by very high population frequency (BA1) or multiple independent lines of benign evidence.

Key criteria explained

  • 01

    PVS1 · Very Strong

    The single most powerful pathogenicity criterion. Null variant (nonsense, frameshift, canonical splice ±1/±2, initiation codon, exon deletion) in a gene where loss-of-function is an established disease mechanism. PVS1 alone plus modest supporting evidence typically achieves Likely Pathogenic. Requires confirmation that LoF is the disease mechanism for that specific gene.

  • 02

    PS1 / PS3 / PS2 · Strong

    PS1: same amino acid change as an established pathogenic variant (inherits much of that variant's evidence). PS3: well-characterized functional studies showing disruption consistent with disease mechanism. PS2: confirmed de novo with maternity/paternity testing in a patient with disease consistent with the gene.

  • 03

    PM2 / PM5 / PM6 · Moderate

    PM2: absent or extremely rare in gnomAD, consistent with rare disease allele status; threshold depends on prevalence and inheritance pattern. PM5: different amino acid change at a known pathogenic position. PM6: assumed de novo without confirmed parentage (weaker than PS2; can be upgraded when parental DNA is tested).

  • 04

    PP3 / PP4 · Supporting

    PP3: multiple well-calibrated computational tools predict damaging effect (per 2023 ClinGen SVI guidance, use tool-specific thresholds). PP4: patient's phenotype or family history is highly specific for a disease with a single genetic etiology.

  • 05

    BA1 · Benign Stand-Alone

    Allele frequency greater than 5% in any general population in gnomAD. Almost certainly benign. The stand-alone designation means this criterion alone is sufficient for Benign classification.

  • 06

    BS1 / BS3 / BS4 · Benign Strong

    BS1: allele frequency greater than expected for the disorder. BS3: functional studies show no damaging effect. BS4: lack of segregation in affected members of a family with the disease.

Combining criteria

The framework provides explicit combination rules. Pathogenic classification requires combinations such as: 1 Very Strong + 1 Strong; 1 Very Strong + 2 Moderate; 2 Strong; 1 Strong + 3 Supporting; 2 Moderate + 2 Supporting (among others). Likely Pathogenic requires lower combinations. These rules ensure that classifications are reproducible across labs when criteria are applied consistently.

ClinGen Expert Panel refinements

For many clinically important genes, ClinGen Expert Panels have published gene-specific specifications that refine how ACMG criteria are applied: gene-specific allele frequency thresholds for PM2 and BS1, which functional assays meet PS3 strength, phenotype specificity thresholds for PP4, PVS1 application adjustments for genes with complex isoform structures. Gene-specific rule sets exist for BRCA1/2, MLH1/MSH2/MSH6/PMS2, SCN5A and KCNQ1, MYH7 and MYBPC3, and dozens of others. Using gene-specific specifications substantially reduces inter-laboratory variability and VUS rates.

Output

Reading a Genomic Report

Whether you are a clinician receiving a genomic report for a patient or an individual reviewing your own results, understanding the structure of a clinical genomics report is essential for translating the laboratory's findings into appropriate action.

Standard report structure

  • Specimen and test information: patient ID, sample type, test name, assay methodology, sequencing platform, reference genome build, coverage metrics.
  • Summary of findings: brief clinical summary of the most significant result.
  • Variant section: each reported variant with HGVS nomenclature (genomic, coding, protein), gene, classification, zygosity, and supporting evidence summary.
  • Clinical interpretation: how the identified variant(s) explain or relate to the presentation. For P or LP findings, connects the molecular result to phenotype and provides management context.
  • Limitations: what the test did not cover, variant types not detected, known coverage gaps, caveats about negative results.
  • Recommendations: confirmatory testing, specialist referral, cascade family testing, surveillance, therapeutic implications.

What each tier means for management

  • Pathogenic: clinically significant. Management (surveillance, risk reduction, therapy selection, family testing) should be guided by this result in conjunction with the patient's full clinical picture and specialist consultation.
  • Likely Pathogenic: treated clinically the same as Pathogenic in most contexts. Probability exceeds 90%. Acknowledge residual uncertainty when disclosing to patients.
  • VUS: cannot guide clinical management. Management should be based on the clinical presentation, not the VUS finding. See dedicated VUS section.
  • Likely Benign / Benign: not clinically actionable. The variant is very likely or definitively not causing disease in the gene where it was found.

The Hard Problem

The VUS Problem

Variants of uncertain significance are the most common result type in clinical genomic sequencing for many patient populations, and the most misunderstood. VUS rates of 30 to 50% in WES for unselected rare disease populations are typical: not because the technology is failing, but because genomic knowledge is genuinely incomplete for a large fraction of rare variants.

Why VUS rates are high

The human genome contains ~4 to 5 million variants relative to the reference in any individual. The vast majority have never been observed in a clinical context. Even for variants in disease-associated genes, most specific nucleotide changes have never been seen before in an affected patient. They are novel variants with no prior clinical report. Without prior clinical evidence, pathogenicity determination depends entirely on functional evidence, computational predictions, and population frequency. None alone is sufficient for a definitive classification. The combination of weak positive signals and no prior clinical reports produces a VUS.

What a VUS is not

A VUS is not a positive result. It should not trigger the same clinical response as a Pathogenic finding. Many clinicians and patients misinterpret a VUS as evidence of disease, leading to unnecessary surveillance, prophylactic procedures, and psychological distress in patients who may not carry a disease-causing variant at all.

A VUS is also not a negative result. The variant may eventually be reclassified as Pathogenic as evidence accumulates. The appropriate response is watchful waiting with systematic reclassification monitoring.

What labs and patients should do

  • Labs: implement proactive reclassification monitoring. A systematic process for identifying VUS findings in the lab's patient cohort that have been reclassified in external databases and proactively notifying ordering clinicians. This is a patient safety function, not just an operational nicety.
  • Clinicians: do not act on VUS findings clinically. Counsel patients about what a VUS means, document it, and establish a re-contact plan for when reclassification occurs.
  • Patients: understand that a VUS does not provide an answer and is not the same as a positive result. Stay connected with the ordering lab so reclassification notifications can reach you.

Published studies consistently document VUS reclassification rates of 10 to 30% over 3 to 5 years, with the majority of reclassifications moving toward Benign or Likely Benign rather than Pathogenic. Important context for patients: most VUS findings eventually resolve to benign classifications.

Clinical Context

Genotype-Phenotype Correlation

Genome interpretation does not occur in a clinical vacuum. The same variant can have different clinical significance depending on the patient's phenotype, inheritance context, and family history. This is why variant interpretation is not purely a laboratory function. It requires genuine integration of molecular and clinical information.

Why phenotype drives interpretation

  • Prior probability. A patient with confirmed Marfan features presenting for molecular confirmation has a very high prior probability that a variant in FBN1 is causative. The same variant in a patient with no connective tissue features has a much lower prior probability.
  • Gene relevance. A candidate variant in a gene with no established association to the presentation cannot support a diagnosis regardless of functional impact. Conversely, a moderate-evidence variant in a gene with perfect phenotypic match may be clinically significant even without strong functional data.
  • Inheritance pattern confirmation. A heterozygous variant in an autosomal recessive gene is not a diagnosis. It makes the patient a carrier. If the presentation is consistent with a recessive condition and no second variant is found, the interpretation must address whether a second hit has been missed (poor coverage, intronic variant, deletion) rather than incorrectly attributing the condition to the single heterozygous finding.

HPO terms and computational phenotype matching

Human Phenotype Ontology (HPO) terms provide a standardized vocabulary for clinical features that enables computational phenotype-genotype matching. When a patient's HPO terms are entered into an interpretation platform, gene-disease relationships can be ranked by phenotypic similarity, surfacing candidates in genes with established associations to the patient's specific feature combination. This is why thorough phenotypic documentation is a prerequisite for efficient interpretation. A report listing "intellectual disability" provides less information than one listing the specific features (dysmorphic facial features, absent speech, seizure onset at 6 months, hypotonia, brain MRI findings) that together create a distinctive phenotypic fingerprint pointing toward specific diagnoses.

Open Problems

Challenges in Genome Interpretation

Genome interpretation is improving rapidly, but several structural challenges affect clinical quality and consistency across labs. Each is being actively addressed, but each remains a real source of variability in clinical results today.

  • 01

    Inter-laboratory variability

    17% of variants classified Pathogenic by one lab are classified VUS by another (2018 ClinVar submitter survey). Caused by inconsistent application of ACMG criteria: different thresholds for PM2, different standards for PP3/BP4, different judgments on functional evidence quality. ClinGen's gene-specific rule specifications are the most systematic fix.

  • 02

    Database lag and evolving evidence

    ClinVar receives thousands of new submissions weekly. Labs running on quarterly annotation snapshots may miss months of new clinical evidence. Monthly database updates with controlled versioning, tied to lab validation procedures, are the minimum standard for CAP/CLIA-compliant interpretation.

  • 03

    Population representation gaps

    gnomAD over-represents individuals of European ancestry. Population frequency thresholds calibrated on European data may not accurately reflect frequency in other populations, leading to false PM2 or BS1 assignments. African, South Asian, East Asian, and Latino populations are underrepresented. Use population-stratified frequencies rather than global frequencies when possible.

  • 04

    Evolving gene-disease relationships

    Some gene-disease associations published in early exome papers were based on insufficient evidence and have since been challenged or refuted. ClinGen's Gene Curation formally evaluates strength of evidence on a Definitive-to-Refuted scale. Interpreting a variant in a gene with Limited or Disputed classification as if the relationship were certain is a classification error with clinical consequences.

At Scale

Automating Interpretation Without Losing Rigor

The volume of genomic testing has grown faster than the workforce of clinical geneticists and molecular pathologists capable of performing manual interpretation. Automation is not optional. It is necessary for clinical genomics to scale. The question is not whether to automate but where automation is appropriate and where human judgment remains essential.

Where automation adds value

Evidence gathering is the most automatable component. Pulling allele frequencies from gnomAD, retrieving ClinVar classifications, running in silico prediction tools, calculating SpliceAI scores, and searching literature databases: all can be automated reliably and produce the same results as manual database queries, faster and without transcription errors.

ACMG criteria application is partially automatable. PVS1 (null variant assessment), BA1 (high frequency), PM2 (absent from population databases), and PP3/BP4 (computational predictions) can all be applied algorithmically with high accuracy when properly calibrated. Many criteria, including PS3 (functional evidence quality), PP4 (phenotype specificity), and PM3 (trans configuration in recessive disease), require human judgment that cannot be reliably automated.

Clinical interpretation synthesis (translating the molecular finding into a clinical report that accurately characterizes the variant's significance for the specific patient) remains a human responsibility. No algorithm can reliably perform the holistic integration of molecular finding, patient phenotype, family history, and clinical context that a board-certified clinical geneticist or molecular pathologist brings to this step.

Common Questions

Frequently Asked Questions

How do you interpret genomic testing results?
Identify the variant classification tier for each reported finding. Pathogenic and Likely Pathogenic results are clinically significant and should inform management. VUS results should not guide clinical decisions. Likely Benign and Benign results are not actionable. For Pathogenic or LP findings, review the clinical interpretation section to understand how the variant explains the presentation and what management is indicated: specialist referral, surveillance, therapeutic options, or family testing. For VUS, document and establish a re-contact plan for reclassification. For negative results, consult a genetic counselor about whether additional testing is appropriate given what the assay did and did not cover. All complex results should be reviewed with a clinical geneticist or genetic counselor before clinical action.
What are the 4 types of genetic conditions?
(1) Mendelian (single-gene) conditions, caused by pathogenic variants in a single gene following defined inheritance patterns. Examples: cystic fibrosis, Huntington disease. (2) Chromosomal conditions, caused by abnormalities in chromosome number or large-scale structure (Down syndrome, 22q11.2 deletion). (3) Multifactorial (complex) conditions, caused by many variants across many genes interacting with environment (type 2 diabetes, coronary artery disease, most common cancers). (4) Mitochondrial conditions, caused by variants in mtDNA or in nuclear genes encoding mitochondrial proteins, characterized by maternal inheritance and heteroplasmy (MELAS, Leigh syndrome).
What is a variant of uncertain significance (VUS)?
A VUS means the lab identified a variant in a clinically relevant location but cannot currently determine whether it causes disease. Not a positive result: should not drive clinical management changes. Not a negative result: may be reclassified as Pathogenic or Benign as evidence accumulates. VUS rates of 30 to 50% are typical in WES, reflecting genuine incompleteness of current genomic knowledge rather than laboratory error. The appropriate response: document it, do not act on it clinically, stay connected with the laboratory for reclassification updates, and consider whether additional functional or family studies could resolve the uncertainty.
What is the difference between Pathogenic and Likely Pathogenic?
Pathogenic means the evidence strongly and definitively supports that the variant causes disease. Likely Pathogenic means the probability that the variant causes disease exceeds 90%, but some residual uncertainty remains because the full weight of evidence required for Pathogenic classification is not present. In clinical practice both are typically treated the same way: management decisions are guided by both, with appropriate disclosure of the remaining uncertainty in LP cases. The distinction matters most for research and database purposes, where the strength of evidence is relevant for ongoing classification refinement.
Why do different laboratories classify the same variant differently?
Inter-laboratory variability is a documented and ongoing challenge in clinical genomics. Different labs apply the 28 ACMG criteria with different thresholds, use different databases for frequency assessment, and make different judgments about the quality of functional evidence. Studies have found that 17% of variants classified Pathogenic by one lab are classified VUS by another. ClinGen is addressing this through gene-specific rule specifications that define exactly how criteria should be applied for variants in specific genes, reducing the scope for subjective judgment and improving classification consistency.
Can genome interpretation results change over time?
Yes. Variant classifications are not permanent. As more patients are sequenced, more functional studies are published, and more gene-disease relationships are established, classifications change. VUS variants are reclassified as Pathogenic or Benign. Some Likely Pathogenic variants accumulate enough additional evidence to achieve Pathogenic status. Occasionally, previously Pathogenic variants are downgraded when new evidence challenges their disease association. Maintaining a connection to the ordering lab, and systematic reclassification monitoring on the lab side, is clinically important. A result that was uncertain when first reported may have a definitive classification waiting in a database today.
What does a negative genomic result actually rule out?
A negative result means no Pathogenic or Likely Pathogenic variants were identified in the genes tested. It does not rule out a genetic cause. It rules out the causes detectable by the assay performed. A negative hereditary cancer panel does not rule out other cancer predisposition genes not on the panel. A negative WES does not rule out non-coding variants detectable only by WGS. A negative WGS does not rule out all genetic causes; some conditions have causes not yet discovered. Negative results should always be discussed with a genetic counselor.

Scale Interpretation Without Losing Rigor

VarSeq automates evidence gathering, ACMG/AMP criteria scoring, and phenotype-driven prioritization. Reducing per-case interpretation time by up to 80% while preserving the human-in-the-loop clinical rigor CAP/CLIA accreditation requires.