Abstract
The success of next-generation sequencing has led to a vast build-up of microbial genomes in public repositories. FAIR genome prospecting of this huge genomic potential for biotechnological benefit requires new, efficient, and flexible methods. In this study, Semantic Web technologies are applied to develop a function-based genome mining approach that follows a knowledge discovery in databases (KDD) protocol. Focusing on the industrially important trait of 1,3-propanediol (1,3-PD) production, 187 new candidate species were identified. Furthermore, the genetic architecture of this trait was resolved, and persistent domains were identified.
Publisher
Cold Spring Harbor Laboratory