Data Base similarity (DBsimilarity) of natural products to aid compound identification on MS and NMR pipelines, similarity networking, and more

Author:

Borges Ricardo M. (1), de Assis Ferreira Gabriela (1), Campos Mariana Martins (1), Teixeira Andrew Magno (1), das Neves Costa Fernanda (1), Chagas Fernanda Oliveira (1)

Affiliation:

1. Instituto de Pesquisas de Produtos Naturais Walter Mors, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil

Abstract

Introduction: We developed Data Base similarity (DBsimilarity), a user‐friendly tool designed to organize structure databases into similarity networks, with the goal of facilitating the visualization of information primarily for natural product chemists who may not have coding experience.

Method: DBsimilarity, written in Jupyter Notebooks, converts Structure Data File (SDF) files into Comma‐Separated Values (CSV) files, adds chemoinformatics data, constructs an MZMine custom database file and an NMRfilter candidate list of compounds for rapid dereplication of MS and 2D NMR data, calculates similarities between compounds, and constructs CSV files formatted into similarity networks for Cytoscape.

Results: The Lotus database was used as a source for Ginkgo biloba compounds, and DBsimilarity was used to create similarity networks including NPClassifier classification to indicate biosynthesis pathways. Subsequently, a database of validated antibiotics from natural products was combined with the G. biloba compounds to identify promising compounds. The presence of 11 compounds in both datasets points to possible antibiotic properties of G. biloba, and 122 compounds similar to these known antibiotics were highlighted. Next, DBsimilarity was used to filter the NPAtlas database (selecting only those with MIBiG reference) to identify potential antibacterial compounds using the ChEMBL database as a reference. It was possible to promptly identify five compounds found in both databases and 167 others worthy of further investigation.

Conclusion: Chemical and biological properties are determined by molecular structures. DBsimilarity enables the creation of interactive similarity networks using Cytoscape. It is also in line with a recent review that highlights poor biological plausibility and unrealistic chromatographic behaviors as significant sources of errors in compound identification.
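The similarity-network step described in the abstract can be illustrated with a minimal sketch. This is not the DBsimilarity code: it assumes each compound is already represented as a set of "on" fingerprint bits (in a real workflow these would come from a chemoinformatics toolkit), it assumes the Tanimoto coefficient as the similarity measure, and the compound names, bit sets, 0.5 threshold, and `source`/`target`/`similarity` column names are all hypothetical. The output is an edge-list CSV of the general kind that Cytoscape can import as a network.

```python
import csv
import io
from itertools import combinations


def tanimoto(a: frozenset, b: frozenset) -> float:
    """Tanimoto coefficient between two sets of 'on' fingerprint bits."""
    if not a and not b:
        return 0.0
    shared = len(a & b)
    return shared / (len(a) + len(b) - shared)


def similarity_edges(fingerprints: dict, threshold: float) -> list:
    """Return (source, target, similarity) for each pair at or above threshold."""
    edges = []
    for name_a, name_b in combinations(sorted(fingerprints), 2):
        score = tanimoto(fingerprints[name_a], fingerprints[name_b])
        if score >= threshold:
            edges.append((name_a, name_b, round(score, 3)))
    return edges


def edges_to_csv(edges: list) -> str:
    """Serialize the edge list as CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["source", "target", "similarity"])
    writer.writerows(edges)
    return buffer.getvalue()


# Hypothetical toy fingerprints; real ones would be derived from structures.
toy_fingerprints = {
    "compound_A": frozenset({1, 2, 3, 4}),
    "compound_B": frozenset({2, 3, 4, 5}),
    "compound_C": frozenset({10, 11}),
}

# Assumed cutoff for illustration only; the abstract does not state one.
edges = similarity_edges(toy_fingerprints, threshold=0.5)
print(edges_to_csv(edges))
```

Thresholding the pairwise scores is what turns a dense similarity matrix into a sparse network: only pairs above the cutoff become edges, so structurally related compounds cluster together while unrelated ones stay disconnected.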

Funder

Conselho Nacional de Desenvolvimento Científico e Tecnológico

Fundação Carlos Chagas Filho de Amparo à Pesquisa do Estado do Rio de Janeiro

Publisher

Wiley

Subject

Complementary and alternative medicine, Drug Discovery, Plant Science, Molecular Medicine, General Medicine, Biochemistry, Food Science, Analytical Chemistry
