Garbage in, garbage out: how reliable training data improved a virtual screening approach against SARS-CoV-2 MPro

Author:

Ruatta Santiago M.,Prada Gori Denis N.,Fló Díaz Martín,Lorenzelli Franca,Perelmuter Karen,Alberca Lucas N.,Bellera Carolina L.,Medeiros Andrea,López Gloria V.,Ingold Mariana,Porcal Williams,Dibello Estefanía,Ihnatenko Irina,Kunick Conrad,Incerti Marcelo,Luzardo Martín,Colobbio Maximiliano,Ramos Juan Carlos,Manta Eduardo,Minini Lucía,Lavaggi María Laura,Hernández Paola,Šarlauskas Jonas,Huerta García César Sebastian,Castillo Rafael,Hernández-Campos Alicia,Ribaudo Giovanni,Zagotto Giuseppe,Carlucci Renzo,Medrán Noelia S.,Labadie Guillermo R.,Martinez-Amezaga Maitena,Delpiccolo Carina M. L.,Mata Ernesto G.,Scarone Laura,Posada Laura,Serra Gloria,Calogeropoulou Theodora,Prousis Kyriakos,Detsi Anastasia,Cabrera Mauricio,Alvarez Guzmán,Aicardo Adrián,Araújo Verena,Chavarría Cecilia,Mašič Lucija Peterlin,Gantner Melisa E.,Llanos Manuel A.,Rodríguez Santiago,Gavernet Luciana,Park Soonju,Heo Jinyeong,Lee Honggun,Paul Park Kyu-Ho,Bollati-Fogolín Mariela,Pritsch Otto,Shum David,Talevi Alan,Comini Marcelo A.

Abstract

Introduction: The identification of chemical compounds that interfere with SARS-CoV-2 replication continues to be a priority in several academic and pharmaceutical laboratories. Computational tools and approaches have the power to integrate, process and analyze multiple data in a short time. However, these initiatives may yield unrealistic results if the applied models are not inferred from reliable data and the resulting predictions are not confirmed by experimental evidence.Methods: We undertook a drug discovery campaign against the essential major protease (MPro) from SARS-CoV-2, which relied on an in silico search strategy –performed in a large and diverse chemolibrary– complemented by experimental validation. The computational method comprises a recently reported ligand-based approach developed upon refinement/learning cycles, and structure-based approximations. Search models were applied to both retrospective (in silico) and prospective (experimentally confirmed) screening.Results: The first generation of ligand-based models were fed by data, which to a great extent, had not been published in peer-reviewed articles. The first screening campaign performed with 188 compounds (46 in silico hits and 100 analogues, and 40 unrelated compounds: flavonols and pyrazoles) yielded three hits against MPro (IC50 ≤ 25 μM): two analogues of in silico hits (one glycoside and one benzo-thiazol) and one flavonol. A second generation of ligand-based models was developed based on this negative information and newly published peer-reviewed data for MPro inhibitors. This led to 43 new hit candidates belonging to different chemical families. From 45 compounds (28 in silico hits and 17 related analogues) tested in the second screening campaign, eight inhibited MPro with IC50 = 0.12–20 μM and five of them also impaired the proliferation of SARS-CoV-2 in Vero cells (EC50 7–45 μM).Discussion: Our study provides an example of a virtuous loop between computational and experimental approaches applied to target-focused drug discovery against a major and global pathogen, reaffirming the well-known “garbage in, garbage out” machine learning principle.

Funder

Institut Pasteur

International Center for Genetic Engineering and Biotechnology

National Research Foundation of Korea

Deutsche Forschungsgemeinschaft

Consejo Nacional de Ciencia y Tecnología

Publisher

Frontiers Media SA

Subject

Pharmacology (medical),Pharmacology

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3