Affiliation:
1. School of Biological Sciences The University of Queensland St Lucia Queensland Australia
2. Institute of Evolutionary Biology University of Edinburgh Edinburgh UK
3. Biological Data Science Institute The Australian National University Canberra Australian Capital Territory Australia
Abstract
AbstractThis study explores the impact of positive selection on the genetic composition of a Drosophila serrata population in eastern Australia through a comprehensive analysis of 110 whole genome sequences. Utilizing an advanced deep learning algorithm (partialS/HIC) and a range of inferred demographic histories, we identified that approximately 14% of the genome is directly affected by sweeps, with soft sweeps being more prevalent (10.6%) than hard sweeps (2.1%), and partial sweeps being uncommon (1.3%). The algorithm demonstrated robustness to demographic assumptions in classifying complete sweeps but faced challenges in distinguishing neutral regions from partial sweeps and linked regions under demographic misspecification. The findings reveal the indirect influence of sweeps on nearly two‐thirds of the genome through linkage, with an over‐representation of putatively deleterious variants suggesting that positive selection drags deleterious variants to higher frequency due to hitchhiking with beneficial loci. Gene ontology enrichment analysis further supported our confidence in the accuracy of sweep detection as several traits expected to be under positive selection due to evolutionary arms races (e.g. immunity) were detected in hard sweeps. This study provides valuable insights into the direct and indirect contributions of positive selection in shaping genomic variation in natural populations.