Abstract
The lit-OTAR framework, developed through a collaboration between Europe PMC and Open Targets, leverages deep learning to revolutionise drug discovery by extracting evidence from scientific literature for drug target identification and validation. This novel framework combines Named Entity Recognition (NER) for identifying genes/proteins, diseases, organisms, and chemicals/drugs within scientific texts, and entity normalisation to map these entities to databases like Ensembl, Experimental Factor Ontology (EFO), and ChEMBL. Continuously operational, it has processed over 39 million abstracts and 4.5 million full-text articles and preprints to date, identifying more than 48.5 million unique associations that significantly help accelerate the drug discovery process and scientific research (>29.9m distinct target-disease, 11.8m distinct target-drug and 8.3m distinct disease-drug relationships). The results are made accessible through the Open Targets Platform (https://platform.opentargets.org/) as well as Europe PMC website (SciLite web app) and annotations API (https://europepmc.org/annotationsapi).
Publisher
Cold Spring Harbor Laboratory
Reference16 articles.
1. The next-generation Open Targets Platform: reimagined, redesigned, rebuilt
2. Europe pmc in 2023;Nucleic Acids Research,2024
3. The DisGeNET knowledge platform for disease genomics: 2019 update
4. Literature evidence in open targets-a target validation platform;Journal of Biomedical Semantics,2017
5. Xiao Yang , Shyamasree Saha , Aravind Venkatesan , Santosh Tirunagari , Vid Vartak , and Johanna McEntyre . Europe pmc annotated full-text corpus for gene/proteins, diseases and organisms. bioRxiv, pages 2023–02, 2023.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献