Abstract
Background
Sequencing and annotating genomes of non-model organisms helps to understand genome architecture, the genetic processes underlying species traits, and how the underlying genes have evolved in closely related taxa, among many other biological processes. However, many metazoan groups, such as the extremely diverse molluscs, are still underrepresented among sequenced and annotated genomes. Although sequencing techniques have recently improved in quality and throughput, molluscs are still neglected because standardized protocols for obtaining genomic data are difficult to apply to them.

Results
In this study, we present the chromosome-level genome assembly and annotation of the marine sacoglossan species Elysia timida, known for its ability to store the chloroplasts of its food algae. After optimizing the long-read and chromosome conformation capture library preparations, we assembled the genome from PacBio HiFi and Arima HiC data. The scaffold and contig N50s, at 41.8 Mb and 1.92 Mb, respectively, are 100-fold and 4-fold higher than those of other published sacoglossan genome assemblies. Structural annotation resulted in 19,904 protein-coding genes, which are more contiguous and complete than those in publicly available annotations of Sacoglossa. We detected genes encoding polyketide synthases in E. timida, indicating that polypropionates are produced. HPLC-MS/MS analysis confirmed the presence of a large number of polypropionates, including known and as yet uncharacterised compounds.

Conclusions
We show that our methodological approach helps to obtain a high-quality genome assembly even for a “difficult-to-sequence” organism, which may facilitate genome sequencing in molluscs. By significantly improving the quality of genome assemblies and annotations, this will enable a better understanding of complex biological processes in molluscs, such as functional kleptoplasty in Sacoglossa.
Publisher
Cold Spring Harbor Laboratory