Affiliation:
1. Leibniz-Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben
2. University of Saskatchewan
Abstract
Abstract
Background
As complete and accurate genome sequences are becoming easier to obtain, more researchers wish to get one or more of them to support their research endeavors. Reliable and well-documented sequence assembly workflows find use in reference or pan-genome projects.
Results
We describe modifications to the TRITEX genome assembly workflow motivated by the rise of fast and easy long-read contig assembly of inbred plant genomes and the routine deployment of the toolchains in pan-genome projects. New features include the use as surrogates of or complements to dense genetic maps and the introduction of user-editable tables to make the curation of contig placements easier and more intuitive.
Conclusion
Even maximally contiguous sequence assemblies of the telomere-to-telomere sort, and to a yet greater extent, the fragmented kind require validation, correction, and comparison to reference standards. As pan-genomics is burgeoning, these tasks are bound to become more widespread and TRITEX is one tool to get them done. This technical guide is supported by a step-by-step computational tutorial accessible under https://tritexassembly.bitbucket.io/. The TRITEX source code is hosted under this URL: https://bitbucket.org/tritexassembly.
Publisher
Research Square Platform LLC