Author:
Khodabakhsh Athar,Loka Tobias P.,Boutin Sébastien,Nurjadi Dennis,Renard Bernhard Y.
Abstract
AbstractMotivationGenome sequencing processes are commonly followed by computational analysis in medical diagnosis. The analyses are generally performed once the sequencing process has finished. However, in time-critical applications, it is crucial to start diagnosis once sufficient evidence has been accumulated. This research aims to define a proof-of-principle for predicting earlier time for decision-making using a machine learning approach. The method is evaluated on Illumina sequencing cycles for pathogen diagnosis.ResultsWe utilized a Long-Short Term Memory (LSTM) approach to make predictions for the early decision-making time in time-critical clinical applications. We modeled the (meta-)information obtained from NGS intermediate cycles to investigate whether there are any changes to expect in the remaining sequencing cycles. We tested our model on different patient datasets, resulting in high accuracy of over 98%, indicating the model is independent of a dataset. Furthermore, we can save several hours of turnaround time by using the early prediction results. We used the SHapley Additive exPlanations (SHAP) framework for the interpretation and assessment of the LSTM classifier.AvailabilityThe source code is available athttps://gitlab.com/dacs-hpi/ngs-biclass.ContactBernhard.Renard@hpi.de
Publisher
Cold Spring Harbor Laboratory
Reference34 articles.
1. Abadi, M. , Agarwal, A. , Barham, P. , Brevdo, E. , Chen, Z. ,… and Zheng, X. (2015). TensorFlow: Large-scale machine learning on heterogeneous systems.
2. PAIPline: pathogen identification in metagenomic and clinical next generation sequencing samples
3. Learning long-term dependencies with gradient descent is difficult
4. Braverman, V. , Ostrovsky, R. , and Zaniolo, C. (2009, June). Optimal sampling from sliding windows. In Proceedings of the twenty-eighth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems (pp. 147–156).
5. Development of radiation therapy optimization;Acta oncologica,2000