Abstract
AbstractIn plant genomic research, long read sequencing has been widely used to detect structure variations that are not captured by short read sequencing. In this letter, we described an analysis of whole genome re-sequencing of 29 soybean varieties using nanopore long-read sequencing. The compiled germplasm reflects diverse applications, including livestock feeding, soy milk and tofu production, as well as consumption of natto, sprouts, and vegetable soybeans (edamame). We have identified 365,497 structural variations in these newly re-sequenced genomes and found that the newly identified structural variations are associated with important agronomic traits. These traits include seed weight, flowering time, plant height, oleic acid content, methionine content, and trypsin inhibitor content, all of which significantly impact soybean production and quality. Experimental validation supports the roles of predicted candidate genes and structural variant in these biological processes. Our research provides a new source for rapid marker discovery in crop genomes using structural variation and whole genome sequencing.
Publisher
Cold Spring Harbor Laboratory