Golden Helix · Clinical Genomics Guide
Genome Interpretation
How Genomic Data Becomes a Clinical Diagnosis
Sequencing a genome is now routine. Interpreting one is not. This guide covers the concepts, frameworks, and evidence types that underpin rigorous genome interpretation, from the four types of genetic conditions through the ACMG criteria to what clinicians and patients should actually do with their results.
Introduction
Sequencing a genome is routine.
Interpreting one is not.
A whole genome sequencing run takes a day. It produces gigabytes of data and millions of variant calls. But none of those variants, on their own, tell a clinician whether a patient has an inherited disease, which therapy to prescribe, or whether a family member should be tested. That requires genome interpretation: the complex, knowledge-intensive process of determining which genetic variants are clinically significant and what to do about them.
Genome interpretation sits at the intersection of molecular biology, population genetics, clinical medicine, and computational science. It requires understanding not just what a variant does to a protein, but whether it has been seen before in affected patients, how common it is in the general population, whether it fits the inheritance pattern, and whether the current evidence is sufficient to act on it. This is why genome interpretation is considered the most difficult stage of the clinical genomics workflow, not technically but intellectually, and why interpretation quality varies more across labs than any other pipeline stage.
Definition
What Is Genome Interpretation?
Genome interpretation, also called variant interpretation, is the process of evaluating genetic variants identified during sequencing to determine their clinical significance. It begins where computational analysis ends, at a list of variants in a VCF file, each annotated with biological context, and ends at a defensible clinical conclusion about which variants cause or contribute to the patient's condition.
The four questions for every candidate variant
- Is the variant real? Quality assessment: is this a true variant or a sequencing artifact?
- Is it rare enough to be disease-causing? Population frequency in the relevant ancestral group.
- Does it affect gene function? Effect on protein, transcript, or expression.
- Does it fit the clinical picture? Gene-disease relationship matches the phenotype, inheritance fits the family history.
Every variant that reaches clinical reporting has been evaluated against all four. The answers, weighted by strength of evidence, determine the final classification. Genome interpretation sits within the broader NGS pipeline as the clinical component of tertiary analysis. For the upstream steps (annotation, filtering, prioritization) that precede formal classification, see the tertiary analysis guide.
Context
The Four Types of Genetic Conditions
The clinical context shapes interpretation. The most important contextual factor is what type of genetic condition is being investigated. The four types differ in inheritance, genetic architecture, and penetrance, and therefore in how variants are interpreted and weighted.
1 · Mendelian
Single-Gene Conditions
Caused by pathogenic variants in a single gene following defined inheritance (AD, AR, X-linked, mitochondrial). 7,000+ identified, caused by variants in 4,000+ genes. The primary focus of germline variant interpretation. ACMG/AMP was developed for this. Examples: cystic fibrosis (CFTR, AR), Huntington (HTT, AD), Duchenne (DMD, X-linked), BRCA1/2 (AD, incomplete penetrance).
2 · Chromosomal
Chromosomal Conditions
Abnormalities in chromosome number or large-scale structure: gains, losses, duplications, deletions, inversions, translocations affecting megabases to entire chromosomes. Affect hundreds to thousands of genes simultaneously. Mostly de novo. Detected by karyotype, chromosomal microarray, and CNV calling from WGS. Examples: Down syndrome, 22q11.2 deletion, Turner syndrome, Klinefelter, Angelman/Prader-Willi.
3 · Multifactorial
Complex Conditions
Combined effect of many variants across many genes interacting with environment. No single variant is necessary or sufficient. GWAS-identified variants have small effects individually (OR 1.1–1.5) and are clinically actionable only as polygenic risk scores. ACMG/AMP does not apply. Examples: type 2 diabetes, coronary artery disease, schizophrenia, age-related macular degeneration, most common cancers.
4 · Mitochondrial
Mitochondrial Conditions
Pathogenic variants in mtDNA (maternally inherited) or in nuclear genes encoding mitochondrial proteins (Mendelian). Heteroplasmy is the defining challenge: each cell contains hundreds to thousands of mitochondria, and the proportion of mutant mtDNA varies by tissue and individual. 10% heteroplasmy may be silent; 80% causes severe disease. Examples: MELAS, Leigh syndrome, Leber hereditary optic neuropathy.
The Inputs
The Evidence Hierarchy
Genome interpretation is fundamentally an evidence-weighing exercise. No single piece is sufficient. Pathogenicity conclusions are built by integrating multiple independent lines of evidence, each contributing different information about biological and clinical significance. The ACMG/AMP framework organizes evidence into the following categories, each with a defined weight.
- 01
Population frequency
How common is this variant in the general population? Rare variants are more likely to be disease-causing; common variants are almost always benign. Key databases: gnomAD (800,000+ individuals across diverse populations). Frequency thresholds are condition-specific. ACMG criteria: BA1 (very high frequency, stand-alone benign), BS1 (high frequency), PM2 (absent or very low).
- 02
Functional evidence
Has the variant been shown experimentally to disrupt protein activity, localization, stability, or interaction? Sources include biochemical assays, cell-based studies, animal models, RNA studies, and protein structural analysis. Among the strongest available evidence, but most variants have not been characterized. ACMG criteria: PS3 (functional studies show damaging effect), BS3 (no damaging effect).
- 03
Computational (in silico)
Predicted impact based on evolutionary conservation, protein structure, and machine learning. Tools: SIFT, PolyPhen-2, CADD, REVEL, BayesDel, SpliceAI, AlphaMissense. Supporting evidence only. Never primary. The 2023 ClinGen SVI calibration recommends tool-specific score thresholds rather than generic "tools agree" calls. ACMG criteria: PP3 (damaging predicted), BP4 (benign predicted).
- 04
Segregation evidence
Does the variant co-segregate with disease in the family? Affected relatives carry it; unaffected relatives do not. Strength depends on the number of informative meioses. Small families provide limited information; large multi-generation pedigrees with multiple affected and unaffected members are highly informative. ACMG criteria: PP1 (co-segregation), BS4 (lack of segregation).
- 05
De novo evidence
Did this variant arise newly in the patient (absent from both biological parents)? High evidential weight for severe early-onset dominant conditions. Why trio sequencing substantially improves diagnostic yield: without parental data, de novo variants look like any other rare heterozygous variant. ACMG criteria: PS2 (de novo with confirmed parentage), PM6 (assumed de novo without confirmed parentage).
- 06
Literature and database evidence
Has this exact variant, or one at the same position with the same amino acid change, been reported in affected patients before? Sources: ClinVar (star-rated submitter classifications), HGMD, Mastermind. A variant with multiple independent Pathogenic submissions carries strong weight; the same variant reported by a single lab without supporting evidence carries much less. ACMG criteria: PS1, PM5, PP5.
- 07
Phenotype specificity
Does the patient's presentation match what is known about the gene's disease spectrum? A variant in a gene associated with a highly specific syndrome (e.g., Marfan features and an FBN1 variant) carries more weight than the same variant in a gene with a broad, non-specific phenotype. Operationalized through HPO term matching. ACMG criteria: PP4.
The Framework
ACMG Classification: 28 Criteria, 5 Tiers
The 2015 ACMG/AMP variant classification framework (Richards et al., Genetics in Medicine) is the global standard for germline variant interpretation. It defines 28 criteria organized by evidence type and strength, and rules for combining them to reach a 5-tier classification.
The five tiers
- Pathogenic (P): the variant causes disease. Clinical management should be guided by this finding. Typically requires either PVS1 plus supporting evidence, or multiple Strong criteria.
- Likely Pathogenic (LP): probability of disease-causing exceeds 90%. Clinically managed the same as Pathogenic in most contexts, with appropriate disclosure of remaining uncertainty.
- Variant of Uncertain Significance (VUS): insufficient evidence to classify. Cannot guide clinical management. See the dedicated VUS section below.
- Likely Benign (LB): probability of being benign exceeds 90%. Not reported in most clinical contexts but retained in the lab database for future reclassification monitoring.
- Benign (B): the variant does not cause disease. Established by very high population frequency (BA1) or multiple independent lines of benign evidence.
Key criteria explained
- 01
PVS1 · Very Strong
The single most powerful pathogenicity criterion. Null variant (nonsense, frameshift, canonical splice ±1/±2, initiation codon, exon deletion) in a gene where loss-of-function is an established disease mechanism. PVS1 alone plus modest supporting evidence typically achieves Likely Pathogenic. Requires confirmation that LoF is the disease mechanism for that specific gene.
- 02
PS1 / PS3 / PS2 · Strong
PS1: same amino acid change as an established pathogenic variant (inherits much of that variant's evidence). PS3: well-characterized functional studies showing disruption consistent with disease mechanism. PS2: confirmed de novo with maternity/paternity testing in a patient with disease consistent with the gene.
- 03
PM2 / PM5 / PM6 · Moderate
PM2: absent or extremely rare in gnomAD, consistent with rare disease allele status; threshold depends on prevalence and inheritance pattern. PM5: different amino acid change at a known pathogenic position. PM6: assumed de novo without confirmed parentage (weaker than PS2; can be upgraded when parental DNA is tested).
- 04
PP3 / PP4 · Supporting
PP3: multiple well-calibrated computational tools predict damaging effect (per 2023 ClinGen SVI guidance, use tool-specific thresholds). PP4: patient's phenotype or family history is highly specific for a disease with a single genetic etiology.
- 05
BA1 · Benign Stand-Alone
Allele frequency greater than 5% in any general population in gnomAD. Almost certainly benign. The stand-alone designation means this criterion alone is sufficient for Benign classification.
- 06
BS1 / BS3 / BS4 · Benign Strong
BS1: allele frequency greater than expected for the disorder. BS3: functional studies show no damaging effect. BS4: lack of segregation in affected members of a family with the disease.
Combining criteria
The framework provides explicit combination rules. Pathogenic classification requires combinations such as: 1 Very Strong + 1 Strong; 1 Very Strong + 2 Moderate; 2 Strong; 1 Strong + 3 Supporting; 2 Moderate + 2 Supporting (among others). Likely Pathogenic requires lower combinations. These rules ensure that classifications are reproducible across labs when criteria are applied consistently.
ClinGen Expert Panel refinements
For many clinically important genes, ClinGen Expert Panels have published gene-specific specifications that refine how ACMG criteria are applied: gene-specific allele frequency thresholds for PM2 and BS1, which functional assays meet PS3 strength, phenotype specificity thresholds for PP4, PVS1 application adjustments for genes with complex isoform structures. Gene-specific rule sets exist for BRCA1/2, MLH1/MSH2/MSH6/PMS2, SCN5A and KCNQ1, MYH7 and MYBPC3, and dozens of others. Using gene-specific specifications substantially reduces inter-laboratory variability and VUS rates.
Output
Reading a Genomic Report
Whether you are a clinician receiving a genomic report for a patient or an individual reviewing your own results, understanding the structure of a clinical genomics report is essential for translating the laboratory's findings into appropriate action.
Standard report structure
- Specimen and test information: patient ID, sample type, test name, assay methodology, sequencing platform, reference genome build, coverage metrics.
- Summary of findings: brief clinical summary of the most significant result.
- Variant section: each reported variant with HGVS nomenclature (genomic, coding, protein), gene, classification, zygosity, and supporting evidence summary.
- Clinical interpretation: how the identified variant(s) explain or relate to the presentation. For P or LP findings, connects the molecular result to phenotype and provides management context.
- Limitations: what the test did not cover, variant types not detected, known coverage gaps, caveats about negative results.
- Recommendations: confirmatory testing, specialist referral, cascade family testing, surveillance, therapeutic implications.
What each tier means for management
- Pathogenic: clinically significant. Management (surveillance, risk reduction, therapy selection, family testing) should be guided by this result in conjunction with the patient's full clinical picture and specialist consultation.
- Likely Pathogenic: treated clinically the same as Pathogenic in most contexts. Probability exceeds 90%. Acknowledge residual uncertainty when disclosing to patients.
- VUS: cannot guide clinical management. Management should be based on the clinical presentation, not the VUS finding. See dedicated VUS section.
- Likely Benign / Benign: not clinically actionable. The variant is very likely or definitively not causing disease in the gene where it was found.
The Hard Problem
The VUS Problem
Variants of uncertain significance are the most common result type in clinical genomic sequencing for many patient populations, and the most misunderstood. VUS rates of 30 to 50% in WES for unselected rare disease populations are typical: not because the technology is failing, but because genomic knowledge is genuinely incomplete for a large fraction of rare variants.
Why VUS rates are high
The human genome contains ~4 to 5 million variants relative to the reference in any individual. The vast majority have never been observed in a clinical context. Even for variants in disease-associated genes, most specific nucleotide changes have never been seen before in an affected patient. They are novel variants with no prior clinical report. Without prior clinical evidence, pathogenicity determination depends entirely on functional evidence, computational predictions, and population frequency. None alone is sufficient for a definitive classification. The combination of weak positive signals and no prior clinical reports produces a VUS.
What a VUS is not
A VUS is not a positive result. It should not trigger the same clinical response as a Pathogenic finding. Many clinicians and patients misinterpret a VUS as evidence of disease, leading to unnecessary surveillance, prophylactic procedures, and psychological distress in patients who may not carry a disease-causing variant at all.
A VUS is also not a negative result. The variant may eventually be reclassified as Pathogenic as evidence accumulates. The appropriate response is watchful waiting with systematic reclassification monitoring.
What labs and patients should do
- Labs: implement proactive reclassification monitoring. A systematic process for identifying VUS findings in the lab's patient cohort that have been reclassified in external databases and proactively notifying ordering clinicians. This is a patient safety function, not just an operational nicety.
- Clinicians: do not act on VUS findings clinically. Counsel patients about what a VUS means, document it, and establish a re-contact plan for when reclassification occurs.
- Patients: understand that a VUS does not provide an answer and is not the same as a positive result. Stay connected with the ordering lab so reclassification notifications can reach you.
Published studies consistently document VUS reclassification rates of 10 to 30% over 3 to 5 years, with the majority of reclassifications moving toward Benign or Likely Benign rather than Pathogenic. Important context for patients: most VUS findings eventually resolve to benign classifications.
Clinical Context
Genotype-Phenotype Correlation
Genome interpretation does not occur in a clinical vacuum. The same variant can have different clinical significance depending on the patient's phenotype, inheritance context, and family history. This is why variant interpretation is not purely a laboratory function. It requires genuine integration of molecular and clinical information.
Why phenotype drives interpretation
- Prior probability. A patient with confirmed Marfan features presenting for molecular confirmation has a very high prior probability that a variant in FBN1 is causative. The same variant in a patient with no connective tissue features has a much lower prior probability.
- Gene relevance. A candidate variant in a gene with no established association to the presentation cannot support a diagnosis regardless of functional impact. Conversely, a moderate-evidence variant in a gene with perfect phenotypic match may be clinically significant even without strong functional data.
- Inheritance pattern confirmation. A heterozygous variant in an autosomal recessive gene is not a diagnosis. It makes the patient a carrier. If the presentation is consistent with a recessive condition and no second variant is found, the interpretation must address whether a second hit has been missed (poor coverage, intronic variant, deletion) rather than incorrectly attributing the condition to the single heterozygous finding.
HPO terms and computational phenotype matching
Human Phenotype Ontology (HPO) terms provide a standardized vocabulary for clinical features that enables computational phenotype-genotype matching. When a patient's HPO terms are entered into an interpretation platform, gene-disease relationships can be ranked by phenotypic similarity, surfacing candidates in genes with established associations to the patient's specific feature combination. This is why thorough phenotypic documentation is a prerequisite for efficient interpretation. A report listing "intellectual disability" provides less information than one listing the specific features (dysmorphic facial features, absent speech, seizure onset at 6 months, hypotonia, brain MRI findings) that together create a distinctive phenotypic fingerprint pointing toward specific diagnoses.
Open Problems
Challenges in Genome Interpretation
Genome interpretation is improving rapidly, but several structural challenges affect clinical quality and consistency across labs. Each is being actively addressed, but each remains a real source of variability in clinical results today.
- 01
Inter-laboratory variability
17% of variants classified Pathogenic by one lab are classified VUS by another (2018 ClinVar submitter survey). Caused by inconsistent application of ACMG criteria: different thresholds for PM2, different standards for PP3/BP4, different judgments on functional evidence quality. ClinGen's gene-specific rule specifications are the most systematic fix.
- 02
Database lag and evolving evidence
ClinVar receives thousands of new submissions weekly. Labs running on quarterly annotation snapshots may miss months of new clinical evidence. Monthly database updates with controlled versioning, tied to lab validation procedures, are the minimum standard for CAP/CLIA-compliant interpretation.
- 03
Population representation gaps
gnomAD over-represents individuals of European ancestry. Population frequency thresholds calibrated on European data may not accurately reflect frequency in other populations, leading to false PM2 or BS1 assignments. African, South Asian, East Asian, and Latino populations are underrepresented. Use population-stratified frequencies rather than global frequencies when possible.
- 04
Evolving gene-disease relationships
Some gene-disease associations published in early exome papers were based on insufficient evidence and have since been challenged or refuted. ClinGen's Gene Curation formally evaluates strength of evidence on a Definitive-to-Refuted scale. Interpreting a variant in a gene with Limited or Disputed classification as if the relationship were certain is a classification error with clinical consequences.
At Scale
Automating Interpretation Without Losing Rigor
The volume of genomic testing has grown faster than the workforce of clinical geneticists and molecular pathologists capable of performing manual interpretation. Automation is not optional. It is necessary for clinical genomics to scale. The question is not whether to automate but where automation is appropriate and where human judgment remains essential.
Where automation adds value
Evidence gathering is the most automatable component. Pulling allele frequencies from gnomAD, retrieving ClinVar classifications, running in silico prediction tools, calculating SpliceAI scores, and searching literature databases: all can be automated reliably and produce the same results as manual database queries, faster and without transcription errors.
ACMG criteria application is partially automatable. PVS1 (null variant assessment), BA1 (high frequency), PM2 (absent from population databases), and PP3/BP4 (computational predictions) can all be applied algorithmically with high accuracy when properly calibrated. Many criteria, including PS3 (functional evidence quality), PP4 (phenotype specificity), and PM3 (trans configuration in recessive disease), require human judgment that cannot be reliably automated.
Clinical interpretation synthesis (translating the molecular finding into a clinical report that accurately characterizes the variant's significance for the specific patient) remains a human responsibility. No algorithm can reliably perform the holistic integration of molecular finding, patient phenotype, family history, and clinical context that a board-certified clinical geneticist or molecular pathologist brings to this step.
Common Questions
Frequently Asked Questions
How do you interpret genomic testing results?
What are the 4 types of genetic conditions?
What is a variant of uncertain significance (VUS)?
What is the difference between Pathogenic and Likely Pathogenic?
Why do different laboratories classify the same variant differently?
Can genome interpretation results change over time?
What does a negative genomic result actually rule out?
Keep Reading
Related Resources
Scale Interpretation Without Losing Rigor
VarSeq automates evidence gathering, ACMG/AMP criteria scoring, and phenotype-driven prioritization. Reducing per-case interpretation time by up to 80% while preserving the human-in-the-loop clinical rigor CAP/CLIA accreditation requires.