Abstract
AbstractUnderstanding the intricate roles of RNA molecules in virulence and host-pathogen interactions can provide valuable insights into combatting infections and improving human health. Although much progress has been achieved in understanding transcriptional regulation during host-pathogen interactions in diverse species, more is needed to know about the structure of pathogen RNAs. This is particularly true for fungal pathogens, including pathogenic yeasts of theCandidagenus, which are the leading cause of hospital-acquired fungal infections. Deciphering the relation between RNA structure and their biology remains a significant gap. Despite advancements in transcriptional regulation studies, especially for fungal pathogens likeCandida, the structural aspects of pathogenic RNAs remain understudied. Our work addresses this gap by employing genome-wide structure probing to comprehensively explore the structural landscape of mRNAs and long non-coding RNAs (lncRNAs) in the four majorCandidapathogens. Specifically focusing on mRNA, we observe a robust correlation between sequence conservation and structural characteristics in orthologous transcripts, significantly when sequence identity exceeds 50%, highlighting structural feature conservation among closely related species. We investigate the impact of single nucleotide polymorphisms (SNPs) on mRNA secondary structure. SNPs within 5’ untranslated regions (UTRs) tend to occur in less structured positions, suggesting structural constraints influencing transcript regulation. Furthermore, we compare the structural properties of coding regions and UTRs, noting that coding regions are generally more structured than UTRs, consistent with similar trends in other species.Additionally, we provide the first experimental characterization of lncRNA structures inCandida species. Most lncRNAs form independent subdomains, similar to human lncRNAs. Notably, we identify hairpin-like structures in lncRNAs, a feature known to be functionally significant. Comparing hairpin prevalence between lncRNAs and protein-coding genes, we find enrichment in lncRNAs acrossCandidaspecies, humans, andArabidopsis thaliana, suggesting a conserved role for these structures.In summary, our study offers valuable insights into the interplay between RNA sequence, structure, and function inCandidapathogens, with implications for gene expression regulation and potential therapeutic strategies againstCandidainfections.
Publisher
Cold Spring Harbor Laboratory