Affiliation:
1. School of Biology and Environmental Science, University College Dublin, Belfield, Dublin 4, Ireland
Abstract
Summary
Sensory receptor gene families have undergone extensive expansion and loss across vertebrate evolution, leading to significant variation in receptor counts between species. However, due to their species-specific nature, conventional reference-based annotation tools often underestimate the true number of sensory receptors in a given species. While there has been an exponential increase in the taxonomic diversity of publicly available genome assemblies in recent years, only ∼30% of vertebrate species on the NCBI database are currently annotated. To overcome these limitations, we developed ‘Sensommatic’, an automated and accessible sensory receptor annotation pipeline. Sensommatic implements BLAST and AUGUSTUS to mine and predict sensory receptor genes from whole genome assemblies, adopting a one-to-many gene mapping approach. While designed for vertebrates, Sensommatic can be extended to run on non-vertebrate species by generating customized reference files, making it a scalable and generalizable tool.
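The one-to-many mapping idea can be illustrated with a minimal sketch. It assumes BLAST hits have already been written in the standard tabular format (`-outfmt 6`), then groups hits by reference query and merges overlapping hits on the same scaffold into candidate loci, so that one reference receptor can map to many genomic copies. The function names, the e-value cut-off, and the merging rule are illustrative assumptions and are not taken from the Sensommatic source code.

```python
from collections import defaultdict

# Default column order of BLAST tabular output (-outfmt 6).
FIELDS = ["qseqid", "sseqid", "pident", "length", "mismatch", "gapopen",
          "qstart", "qend", "sstart", "send", "evalue", "bitscore"]


def parse_hits(lines, max_evalue=1e-10):
    """Yield (query, scaffold, start, end) for hits passing the e-value cut-off.

    The cut-off value is a hypothetical default, not Sensommatic's setting.
    """
    for line in lines:
        if not line.strip() or line.startswith("#"):
            continue
        row = dict(zip(FIELDS, line.rstrip("\n").split("\t")))
        if float(row["evalue"]) > max_evalue:
            continue
        # BLAST reports sstart > send for hits on the subject's minus strand,
        # so normalise coordinates to start <= end.
        start, end = sorted((int(row["sstart"]), int(row["send"])))
        yield row["qseqid"], row["sseqid"], start, end


def one_to_many(lines, max_evalue=1e-10, max_gap=0):
    """Map each reference query to a list of distinct candidate loci.

    Hits from the same query on the same scaffold are merged when they
    overlap or lie within max_gap bases of each other (assumed rule).
    """
    by_key = defaultdict(list)
    for query, scaffold, start, end in parse_hits(lines, max_evalue):
        by_key[(query, scaffold)].append((start, end))

    loci = defaultdict(list)
    for (query, scaffold), intervals in sorted(by_key.items()):
        intervals.sort()
        cur_start, cur_end = intervals[0]
        for start, end in intervals[1:]:
            if start <= cur_end + max_gap:
                cur_end = max(cur_end, end)  # extend the current locus
            else:
                loci[query].append((scaffold, cur_start, cur_end))
                cur_start, cur_end = start, end
        loci[query].append((scaffold, cur_start, cur_end))
    return dict(loci)
```

In a fuller pipeline, each candidate locus (typically with some flanking sequence) would then be passed to a gene predictor such as AUGUSTUS. That step is omitted here because its configuration depends on the species and reference files used.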
Availability and implementation
Source code and associated files are available at: https://github.com/GMHughes/Sensommatic
Funder
Science Foundation Ireland
Publisher
Oxford University Press (OUP)