Affiliation:
1. Southwest Forestry University
Abstract
Abstract
Macroevolution of most organisms is generally the result of synergistic action of multiple key genes in evolutionary biology. Unfortunately, the weights of these key genes in macroevolution are difficult to assess. In this study, we designed various word embedding libraries of natural language processing (NLP) considering the multiple mechanisms of evolutionary genomics. A novel method (IKGM) based on three types of attention mechanisms (domain attention, kmer attention and fused attention) were proposed to calculate the weights of different genes in macroevolution. Taking 34 species of diurnal butterflies and nocturnal moths in Lepidoptera as an example, we identified a few of key genes with high weights, which annotated to the functions of circadian rhythms, sensory organs, as well as behavioral habits etc. This study not only provides a novel method to identify the key genes of macroevolution at the genomic level, but also helps us to understand the microevolution mechanisms of diurnal butterflies and nocturnal moths in Lepidoptera.
Publisher
Research Square Platform LLC
Reference72 articles.
1. Transitions from Drag-based to Lift-based Propulsion in Mammalian Swimming1;FISH FE;American Zoologist,1996
2. Ashley-Ross, M. A., Hsieh, S. T., Gibb, A. C. & Blob, R. W. Vertebrate land invasions-past, present, and future: an introduction to the symposium. Integr Comp Biol 53, 192–196 (2013).
3. Zimmer, C. At the Water’s Edge: Fish with Fingers, Whales with Legs, and How Life Came Ashore but Then Went Back to Sea. (Simon and Schuster, 2014).
4. Chromosomal instability in Afrotheria: fragile sites, evolutionary breakpoints and phylogenetic inference from genome sequence assemblies;Ruiz-Herrera A;BMC Evol Biol,2007
5. Body and limb size dissociation at the origin of birds: uncoupling allometric constraints across a macroevolutionary transition;Dececchi TA;Evolution,2013