Exploring the effects of assembly strategies on differential gene expression – A case study in a non-model crustacean species, the wild black tiger prawn (Penaeus monodon)

Published: 2024-08-26

Author:

Nguyen Minh Thanh¹, Tran Minh Nhut¹, Le Thi Hong Tham¹, Vo Thi Bao Chau¹, Nguyen Hoang Khue Tu¹, Tran Thi Hai Yen¹, Nguyen Thanh Luan², Elizur Abigail³, Ventura Tomer³, Nguyen Tuan Viet⁴, Vo Thu Thi Minh¹

Affiliation:

1. International University, Vietnam National University HCM

2. Research Institute for Aquaculture No. 2

3. Centre for BioInnovation, University of the Sunshine Coast

4. Agriculture Victoria, AgriBio, Centre for AgriBiosciences

Abstract

The Penaeus monodon genome has become a subject of extended studies of several aspects of nutrition, growth, and reproduction. In this study, transcriptomes from the hepatopancreas and ovary of wild-caught female broodstock were generated by genome-guided (GG) and de novo (DN) assembly. We compared the effectiveness of these methods in terms of the number of transcripts and their annotations. We analyzed mapping features and differentially expressed genes (DEGs) using three estimation approaches: mapping reads against (i) a genome assembly of P. monodon (reference-based, RB), (ii) the transcriptome generated by GG assembly, and (iii) the transcriptome generated by DN assembly. The DN approach had the highest mapping rate and percentage of annotated aligned reads, leading to 2.09 times more unigenes than GG assembly, with 49% of unigenes returning BLAST hits, compared with 39.66% for GG. Furthermore, 69% of unigenes with BLAST hits from the DN assembly were assigned GO terms, compared with 23.9% for GG. Additionally, the number of DEGs identified between the two tissues by the DN approach (820) surpassed the total number of DEGs identified by the GG (488) and RB (117) approaches. In contrast, the GG approach identified the highest proportion of DEGs among our genes of interest (93.5%), followed by the DN (82.6%) and RB (37.3%) approaches. DN assembly is ideal for transcript reconstruction and DEG recovery, while GG assembly generated an appropriate database for studying specific genes or sets of genes. We therefore recommend using a combination of DN and GG assemblies to improve differential gene expression analysis for non-model organisms with poorly resolved genome annotations.

Publisher

Springer Science and Business Media LLC
