Abstract
AbstractExpanding antiviral treatment options against SARS-CoV-2 remains crucial as the virus evolves rapidly and drug resistant strains have emerged. Broad spectrum host-directed antivirals (HDA) are promising therapeutic options, however the robust identification of relevant host factors by CRISPR/Cas9 or RNA interference screens remains challenging due to low consistency in the resulting hits.To address this issue, we employed machine learning based on experimental data from knockout screens and a drug screen. As gold standard, we assembled perturbed genes reducing virus replication or protecting the host cells. The machines based their predictions on features describing cellular localization, protein domains, annotated gene sets from Gene Ontology, gene and protein sequences, and experimental data from proteomics, phospho-proteomics, protein interaction and transcriptomic profiles of SARS-CoV-2 infected cells.The models reached a remarkable performance with a balanced accuracy of 0.82 (knockout based classifier) and 0.71 (drugs screen based classifier), suggesting patterns of intrinsic data consistency. The predicted host dependency factors were enriched in sets of genes particularly coding for development, morphogenesis, and neural related processes. Focusing on development and morphogenesis-associated gene sets, we found β-catenin to be central and selected PRI-724, a canonical β-catenin/CBP disruptor, as a potential HDA. PRI-724 limited infection with SARS-CoV-2 variants, SARS-CoV-1, MERS-CoV and IAV in different cell line models. We detected a concentration-dependent reduction in CPE development, viral RNA replication, and infectious virus production in SARS-CoV-2 and SARS-CoV-1-infected cells. Independent of virus infection, PRI-724 treatment caused cell cycle deregulation which substantiates its potential as a broad spectrum antiviral. Our proposed machine learning concept may support focusing and accelerating the discovery of host dependency factors and the design of antiviral therapies.Author’s summaryDrug resistance to pathogens is a well-known phenomenon which was also observed for SARS-CoV-2. Given the gradually increasing evolutionary pressure on the virus by herd immunity, we attempted to enlarge the available antiviral repertoire by focusing on host proteins that are usurped by viruses. The identification of such proteins was followed within several high throughput screens in which genes are knocked out individually. But, so far, these efforts led to very different results. Machine learning helps to identify common patterns and normalizes independent studies to their individual designs. With such an approach, we identified genes that are indispensable during embryonic development, i.e., when cells are programmed for their specific destiny. Shortlisting the hits revealed β-catenin, a central player during development, and PRI-724, which inhibits the interaction of β-catenin with cAMP responsive element binding (CREB) binding protein (CBP). In our work, we confirmed that the disruption of this interaction impedes virus replication and production. In A549-AT cells treated with PRI-724, we observed cell cycle deregulation which might contribute to the inhibition of virus infection, however the exact underlying mechanisms needs further investigation.
Publisher
Cold Spring Harbor Laboratory