Abstract
AbstractWe have used multiple sequencing approaches to sequence the genome of a volunteer from Saudi Arabia. We use the resulting data to generate ade novoassembly of the genome, and use different computational approaches to refine the assembly. As a consequence, we provide a contiguous assembly of the complete genome of an individual from Saudi Arabia for all chromosomes except chromosome Y, and label this assemblyKSA001. We transferred genome annotations from reference genomes and predicted genome features using methods from Artificial Intelligence to fully annotateKSA001, and we make all primary sequencing data, the assembly, and the genome annotations freely available in public databases using the FAIR data principles.
Publisher
Cold Spring Harbor Laboratory
Reference39 articles.
1. The complete sequence of a human genome;In: Science,2022
2. A. V. Zimin et al. “A reference-quality, fully annotated genome from a Puerto Rican individual”. In: Genetics 220.2 (2022), iyab227.
3. W.-W. Liao et al. “A Draft Human Pangenome Reference”. In: bioRxiv (2022).
4. Chasing perfection: validation and polishing strategies for telomere-to-telomere genome assemblies;en. In: Nature Methods,2022
5. Long-read mapping to repetitive reference sequences using Winnowmap2;en. In: Nature Methods,2022