Mutational Constraint Analysis Workflow for Overlapping Short Open Reading Frames and Genomic Neighbours-Reference-Cited by-同舟云学术

Mutational Constraint Analysis Workflow for Overlapping Short Open Reading Frames and Genomic Neighbours

Published:2024-07-10 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Danner Martin^ORCID,Begemann Matthias^ORCID,Kraft Florian^ORCID,Elbracht Miriam^ORCID,Kurth Ingo^ORCID,Krause Jeremias^ORCID

Abstract

Understanding the dark genome is a priority task following the complete sequencing of the human genome. Short open reading frames (sORFs) are a group of largely unexplored elements of the dark genome with the potential for being translated into microproteins. The definitive number of coding and regulatory sORFs is not known, however they could account for up to 1-2% of the human genome. This corresponds to an order of magnitude in the range of canonical coding genes. For a few sORFs a clinical relevance has already been demonstrated, but for the majority of potential sORFs the biological function remains unclear. A major limitation in predicting their disease relevance using large-scale genomic data is the fact that no population-level constraint metrics for genetic variants in sORFs are yet available. To overcome this, we used the recently released gno-mAD 4.0 dataset and analysed the constraint of a consensus set of sORFs and their genomic neighbours. We demonstrate that sORFs are mostly embedded into a moderately constraint genomic context, but within the gencode dataset we identified a subset of highly constrained sORFs comparable to highly constrained canonical genes.

Publisher

Cold Spring Harbor Laboratory

Reference36 articles.

1. Genomic Medicine–Progress, Pitfalls, and Promise

2. Dual function of DNA sequences: protein-coding sequences function as transcriptional enhancers;Perspectives in biology and medicine,2015

3. A novel approach to exploring the dark genome and its application to mapping of the vertebrate virus fossil record;Genome Biology,2024

4. Shedding light on the dark genome: Insights into the genetic, CRISPR-based, and pharmacological dependencies of human cancers and disease aggressiveness

5. Standardized annotation of translated open reading frames