Long noncoding RNAs are rarely translated in two human cell lines-Reference-Cited by-同舟云学术

Long noncoding RNAs are rarely translated in two human cell lines

Published:2012-09 Issue:9 Volume:22 Page:1646-1657
ISSN:1088-9051
Container-title:Genome Research
language:en
Short-container-title:Genome Res.

Author:

Bánfai Balázs,Jia Hui,Khatun Jainab,Wood Emily,Risk Brian,Gundling William E.,Kundaje Anshul,Gunawardena Harsha P.,Yu Yanbao,Xie Ling,Krajewski Krzysztof,Strahl Brian D.,Chen Xian,Bickel Peter,Giddings Morgan C.,Brown James B.,Lipovich Leonard

Abstract

Data from the Encyclopedia of DNA Elements (ENCODE) project show over 9640 human genome loci classified as long noncoding RNAs (lncRNAs), yet only ∼100 have been deeply characterized to determine their role in the cell. To measure the protein-coding output from these RNAs, we jointly analyzed two recent data sets produced in the ENCODE project: tandem mass spectrometry (MS/MS) data mapping expressed peptides to their encoding genomic loci, and RNA-seq data generated by ENCODE in long polyA+ and polyA− fractions in the cell lines K562 and GM12878. We used the machine-learning algorithm RuleFit3 to regress the peptide data against RNA expression data. The most important covariate for predicting translation was, surprisingly, the Cytosol polyA− fraction in both cell lines. LncRNAs are ∼13-fold less likely to produce detectable peptides than similar mRNAs, indicating that ∼92% of GENCODE v7 lncRNAs are not translated in these two ENCODE cell lines. Intersecting 9640 lncRNA loci with 79,333 peptides yielded 85 unique peptides matching 69 lncRNAs. Most cases were due to a coding transcript misannotated as lncRNA. Two exceptions were an unprocessed pseudogene and a bona fide lncRNA gene, both with open reading frames (ORFs) compromised by upstream stop codons. All potentially translatable lncRNA ORFs had only a single peptide match, indicating low protein abundance and/or false-positive peptide matches. We conclude that with very few exceptions, ribosomes are able to distinguish coding from noncoding transcripts and, hence, that ectopic translation and cryptic mRNAs are rare in the human lncRNAome.

Publisher

Cold Spring Harbor Laboratory

Subject

Genetics(clinical),Genetics

Reference47 articles.

1. Post-transcriptional processing generates a diversity of 5′-modified long and short RNAs

2. On "genomenclature": a comprehensive (and respectful) taxonomy for pseudogenes and other "junk DNA".

3. The Transcriptional Landscape of the Mammalian Genome

4. Discovery and revision of Arabidopsis genes by proteogenomics

Cited by 341 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Radiotherapy and breast cancer: finally, an lncRNA perspective on radiosensitivity and radioresistance;Frontiers in Oncology;2024-09-13

2. Multi-Omic Approaches in Cancer-Related Micropeptide Identification;Proteomes;2024-09-13

3. Challenges in LncRNA Biology: Views and Opinions;Non-Coding RNA;2024-08-01

4. Beyond traditional translation: ncRNA derived peptides as modulators of tumor behaviors;Journal of Biomedical Science;2024-06-14

5. Long non-coding RNA LINC00930 targeting miR-6792-3p/ZBTB16 regulates the proliferation and EMT of pancreatic cancer;BMC Cancer;2024-05-24