Abstract
AbstractProtein-coding differences between mammals often fail to explain phenotypic diversity, suggesting involvement of enhancers, often rapidly evolving regions that regulate gene expression. Identifying associations between enhancers and phenotypes is challenging because enhancer activity is context-dependent and may be conserved without much sequence conservation. We developed TACIT (Tissue-Aware Conservation Inference Toolkit) to associate open chromatin regions (OCRs) with phenotypes using predictions in hundreds of mammalian genomes from machine learning models trained to learn tissue-specific regulatory codes. Applying TACIT for motor cortex and parvalbumin-positive interneurons to neurological phenotypes revealed dozens of new OCR-phenotype associations. Many associated OCRs were near relevant genes, including brain size-associated OCRs near genes mutated in microcephaly or macrocephaly. Our work creates a forward genomics foundation for identifying candidate enhancers associated with phenotype evolution.One Sentence SummaryA new machine learning-based approach associates enhancers with the evolution of brain size and behavior across mammals.
Publisher
Cold Spring Harbor Laboratory