HEREAT Human Molecular Genetics and Epigenetics Research Laboratory


Understanding the Mutational Constraint Spectrum: A Deep Dive into Human Genetic Variation



Illustrative image of the role of cytoskeleton integrins in MS

In a study published in Nature, titled "The mutational constraint spectrum quantified from variation in 141,456 humans," Konrad J. Karczewski and colleagues offer an in-depth exploration of how genetic variants that inactivate protein-coding genes, known as predicted loss-of-function (pLoF) variants, shape human biology. This extensive research, leveraging data from the Genome Aggregation Database (gnomAD), presents a nuanced view of human genetic variation and its implications for understanding disease and gene function.

Loss-of-Function Variants
Loss-of-function (LoF) variants, which include stop-gained, essential splice site, and frameshift mutations, have long been recognized as critical for deciphering gene function. By effectively "breaking" genes, these variants can reveal the importance of specific genes in maintaining normal physiological processes. However, such variants are typically rare in human populations due to their deleterious nature, making their study challenging without large datasets.

The study aggregates data from 125,748 exomes and 15,708 genomes, forming one of the most comprehensive databases of human genetic variation to date. This vast collection allowed the researchers to identify 443,769 high-confidence pLoF variants after rigorous filtering to exclude sequencing and annotation errors.

The Loss-of-Function Observed/Expected Upper Bound Fraction (LOEUF)
A key innovation of this study is the development of the LOEUF metric, which quantifies the intolerance of genes to LoF variants. By comparing the observed number of pLoF variants to the number expected under neutral evolution, LOEUF places each gene on a spectrum from highly intolerant (suggesting essential function) to more tolerant (indicating redundancy or non-essentiality). This continuous metric offers a more nuanced understanding compared to previous binary classifications.

Validating LOEUF: Correlations with Biological Relevance
The study validates the LOEUF metric through comparisons with other indicators of gene importance, such as the incidence of structural variants, mouse gene knockout data, and human cell viability assays. Genes with lower LOEUF scores, indicating higher intolerance to LoF variants, were more likely to be essential for survival in model organisms and human cells. These findings reinforce the utility of LOEUF in prioritizing genes for further study in disease contexts.

Implications for Disease Research
One of the most significant implications of this research is its potential to improve our understanding of the genetic basis of both rare and common diseases. The study demonstrates that genes with lower LOEUF scores are more likely to be implicated in severe developmental disorders and intellectual disabilities when disrupted. Moreover, the research highlights that even common variants in these constrained genes can contribute to complex traits and diseases, such as schizophrenia and educational attainment.

Beyond Coding Regions: The Challenge of Regulatory Variants
While the focus of this study is on protein-coding regions, the authors acknowledge the limitations in understanding non-coding regulatory variants. These variants, which may alter gene expression without disrupting the protein sequence, are harder to detect and interpret. The authors suggest that future research will need to explore these regions with even larger datasets and improved functional annotations.

A Vision for the Future: A Human "Knockout" Project
The sheer scale of the gnomAD dataset and the insights gained from it pave the way for an ambitious vision: a systematic effort to identify and study individuals with complete gene knockouts, either in heterozygous or homozygous states. Such a project could revolutionize our understanding of gene function and disease by linking specific genetic disruptions directly to phenotypic outcomes.

Conclusion
The study by Karczewski et al. represents a significant advance in human genetics, offering new tools and insights for exploring the role of genetic variation in health and disease. The LOEUF metric, in particular, provides a powerful framework for prioritizing genes in genetic studies, with far-reaching implications for precision medicine and therapeutic development. As genomic data continues to grow, the approaches and findings from this study will likely become cornerstones of future genetic research.

Reference:
Karczewski, K.J., Francioli, L.C., Tiao, G. et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature 581, 434–443 (2020).