Author:
Paliwal Saee,de Giorgio Alex,Neil Daniel,Michel Jean-Baptiste,Lacoste Alix MB
Abstract
AbstractIncorrect drug target identification is a major obstacle in drug discovery. Only 15% of drugs advance from Phase II to approval, with ineffective targets accounting for over 50% of these failures1–3. Advances in data fusion and computational modeling have independently progressed towards addressing this issue. Here, we capitalize on both these approaches with Rosalind, a comprehensive gene prioritization method that combines heterogeneous knowledge graph construction with relational inference via tensor factorization to accurately predict disease-gene links. Rosalind demonstrates an increase in performance of 18%-50% over five comparable state-of-the-art algorithms. On historical data, Rosalind prospectively identifies 1 in 4 therapeutic relationships eventually proven true. Beyond efficacy, Rosalind is able to accurately predict clinical trial successes (75% recall at rank 200) and distinguish likely failures (74% recall at rank 200). Lastly, Rosalind predictions were experimentally tested in a patient-derived in-vitro assay for Rheumatoid arthritis (RA), which yielded 5 promising genes, one of which is unexplored in RA.
Publisher
Springer Science and Business Media LLC
Reference87 articles.
1. Thomas, D. W. et al. Clinical development success rates 2006–2015. San Diego: Biomedtracker/Washington, DC: BIO/Bend: Amplion (2016).
2. Harrison, R. K. Phase ii and phase iii failures2013–2015, (2016).
3. Arrowsmith, J. & Miller, P. Trial watch: phase ii and phase iii attrition rates2011–2012, (2013).
4. Hay, M., Thomas, D. W., Craighead, J. L., Economides, C. & Rosenthal, J. Clinical development success rates for investigational drugs. Nat. Biotechnol. 32, 40 (2014).
5. Kinch, M. S., Haynesworth, A., Kinch, S. L. & Hoyer, D. An overview of fda-approved new molecular entities: 1827–2013. Drug Discovery Today 19, 1033–1039 (2014).
Cited by
33 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献