Large-scale literature mining to assess the relation between anti-cancer drugs and cancer types-Reference-Cited by-同舟云学术

Large-scale literature mining to assess the relation between anti-cancer drugs and cancer types

Published:2021-06-26 Issue:1 Volume:19 Page:
ISSN:1479-5876
Container-title:Journal of Translational Medicine
language:en
Short-container-title:J Transl Med

Author:

Bauer Chris^ORCID,Herwig Ralf,Lienhard Matthias,Prasse Paul,Scheffer Tobias,Schuchhardt Johannes

Abstract

Abstract Background There is a huge body of scientific literature describing the relation between tumor types and anti-cancer drugs. The vast amount of scientific literature makes it impossible for researchers and physicians to extract all relevant information manually. Methods In order to cope with the large amount of literature we applied an automated text mining approach to assess the relations between 30 most frequent cancer types and 270 anti-cancer drugs. We applied two different approaches, a classical text mining based on named entity recognition and an AI-based approach employing word embeddings. The consistency of literature mining results was validated with 3 independent methods: first, using data from FDA approvals, second, using experimentally measured IC-50 cell line data and third, using clinical patient survival data. Results We demonstrated that the automated text mining was able to successfully assess the relation between cancer types and anti-cancer drugs. All validation methods showed a good correspondence between the results from literature mining and independent confirmatory approaches. The relation between most frequent cancer types and drugs employed for their treatment were visualized in a large heatmap. All results are accessible in an interactive web-based knowledge base using the following link: https://knowledgebase.microdiscovery.de/heatmap. Conclusions Our approach is able to assess the relations between compounds and cancer types in an automated manner. Both, cancer types and compounds could be grouped into different clusters. Researchers can use the interactive knowledge base to inspect the presented results and follow their own research questions, for example the identification of novel indication areas for known drugs.

Funder

Bundesministerium für Bildung und Forschung

Publisher

Springer Science and Business Media LLC

Subject

General Biochemistry, Genetics and Molecular Biology,General Medicine

Link

https://link.springer.com/content/pdf/10.1186/s12967-021-02941-z.pdf

Reference30 articles.

1. Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2018;68(6):394–424. https://doi.org/10.3322/caac.21492.

2. Tay-Teo K, Ilbawi A. Hill SR comparison of sales income and research and development costs for FDA-approved cancer drugs sold by originator drug companies. JAMA Netw Open. 2019;2(1):186875. https://doi.org/10.1001/jamanetworkopen.2018.6875.

3. Simon C, Davidsen K, Hansen C, Seymour E, Barnkob MB, Olsen LR. BioReader: a text mining tool for performing classification of biomedical literature. BMC Bioinform. 2019;19(Suppl 13):57. https://doi.org/10.1186/s12859-019-2607-x.