Affiliation:
1. Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, No. 4, Section 2, North Jianshe Road, Chengdu 610054, China
Abstract
Abstract
Due to the concerted efforts to utilize the microbial features to improve disease prediction capabilities, automated machine learning (AutoML) systems aiming to get rid of the tediousness in manually performing ML tasks are in great demand. Here we developed mAML, an ML model-building pipeline, which can automatically and rapidly generate optimized and interpretable models for personalized microbiome-based classification tasks in a reproducible way. The pipeline is deployed on a web-based platform, while the server is user-friendly and flexible and has been designed to be scalable according to the specific requirements. This pipeline exhibits high performance for 13 benchmark datasets including both binary and multi-class classification tasks. In addition, to facilitate the application of mAML and expand the human disease-related microbiome learning repository, we developed GMrepo ML repository (GMrepo Microbiome Learning repository) from the GMrepo database. The repository involves 120 microbiome-based classification tasks for 85 human-disease phenotypes referring to 12 429 metagenomic samples and 38 643 amplicon samples. The mAML pipeline and the GMrepo ML repository are expected to be important resources for researches in microbiology and algorithm developments.
Database URL: http://lab.malab.cn/soft/mAML
Funder
National Key Research and Development Program of China
National Natural Science Foundation of China
Publisher
Oxford University Press (OUP)
Subject
General Agricultural and Biological Sciences,General Biochemistry, Genetics and Molecular Biology,Information Systems
Reference23 articles.
1. Scikit-learn: machine learning in python;Pedregosa;J. Mach. Learn. Res.,2011
2. Meta-analysis of fecal metagenomes reveals global microbial signatures that are specific for colorectal cancer;Wirbel;Nat. Med.,2019
3. A metagenome-wide association study of gut microbiota in type 2 diabetes;Qin;Nature,2012
4. Auto-WEKA 2.0: automatic model selection and hyperparameter optimization in WEKA;Kotthoff;J. Mach. Learn. Res.,2017
Cited by
24 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献