Prediction of transcript isoforms in 19 chicken tissues by Oxford Nanopore long-read sequencing-Reference-Cited by-同舟云学术

Prediction of transcript isoforms in 19 chicken tissues by Oxford Nanopore long-read sequencing

Published:2022-10-03 Issue: Volume:13 Page:
ISSN:1664-8021
Container-title:Frontiers in Genetics
language:
Short-container-title:Front. Genet.

Author:

Guan Dailu,Halstead Michelle M.,Islas-Trejo Alma D.,Goszczynski Daniel E.,Cheng Hans H.,Ross Pablo J.,Zhou Huaijun

Abstract

To identify and annotate transcript isoforms in the chicken genome, we generated Nanopore long-read sequencing data from 68 samples that encompassed 19 diverse tissues collected from experimental adult male and female White Leghorn chickens. More than 23.8 million reads with mean read length of 790 bases and average quality of 18.2 were generated. The annotation and subsequent filtering resulted in the identification of 55,382 transcripts at 40,547 loci with mean length of 1,700 bases. We predicted 30,967 coding transcripts at 19,461 loci, and 16,495 lncRNA transcripts at 15,512 loci. Compared to existing reference annotations, we found ∼52% of annotated transcripts could be partially or fully matched while ∼47% were novel. Seventy percent of novel transcripts were potentially transcribed from lncRNA loci. Based on our annotation, we quantified transcript expression across tissues and found two brain tissues (i.e., cerebellum and cortex) expressed the highest number of transcripts and loci. Furthermore, ∼22% of the transcripts displayed tissue specificity with the reproductive tissues (i.e., testis and ovary) exhibiting the most tissue-specific transcripts. Despite our wide sampling, ∼20% of Ensembl reference loci were not detected. This suggests that deeper sequencing and additional samples that include different breeds, cell types, developmental stages, and physiological conditions, are needed to fully annotate the chicken genome. The application of Nanopore sequencing in this study demonstrates the usefulness of long-read data in discovering additional novel loci (e.g., lncRNA loci) and resolving complex transcripts (e.g., the longest transcript for the TTN locus).

Publisher

Frontiers Media SA

Subject

Genetics (clinical),Genetics,Molecular Medicine

Reference61 articles.

1. Opportunities and challenges in long-read sequencing data analysis;Amarasinghe;Genome Biol.,2020

2. HTSeq—A Python framework to work with high-throughput sequencing data;Anders;Bioinformatics,2015

3. Coordinated international action to accelerate genome-to-phenome with FAANG, the Functional Annotation of Animal Genomes project;Andersson;Genome Biol.,2015

4. Alternative splicing as a regulator of development and tissue identity;Baralle;Nat. Rev. Mol. Cell Biol.,2017

5. Improved annotation of the domestic pig genome through integration of Iso-Seq and RNA-seq data;Beiki;BMC Genomics,2019

Cited by 10 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Full-length transcriptome sequencing of pepper fruit during development and construction of a transcript variation database;Horticulture Research;2024-07-24

2. Enriched atlas of lncRNA and protein-coding genes for the GRCg7b chicken assembly and its functional annotation across 47 tissues;Scientific Reports;2024-03-19

3. When Livestock Genomes Meet Third-Generation Sequencing Technology: From Opportunities to Applications;Genes;2024-02-15

4. The ChickenGTEx atlas: the genetic regulation of multi-tissue and single-cell transcriptome signatures in chickens;2023-09-26

5. The Abundant and Unique Transcripts and Alternative Splicing of the Artificially Autododecaploid London Plane (Platanus × acerifolia);International Journal of Molecular Sciences;2023-09-23