Filter and Wrapper Stacking Ensemble (FWSE): a robust approach for reliable biomarker discovery in high-dimensional omics data

Author:

Budhraja Sugam12ORCID,Doborjeh Maryam12ORCID,Singh Balkaran12,Tan Samuel3,Doborjeh Zohreh4ORCID,Lai Edmund12ORCID,Merkin Alexander5,Lee Jimmy36ORCID,Goh Wilson378ORCID,Kasabov Nikola1291011ORCID

Affiliation:

1. Knowledge Engineering and Discovery Research Innovation (KEDRI) , School of Engineering Computer and Mathematical Sciences, , 55 Wellesley Street East, 1010 Auckland , New Zealand

2. Auckland University of Technology , School of Engineering Computer and Mathematical Sciences, , 55 Wellesley Street East, 1010 Auckland , New Zealand

3. Lee Kong Chian School of Medicine, Nanyang Technological University , 50 Nanyang Ave, 639798 , Singapore

4. School of Population Health, The University of Auckland , Grafton, 1023,Auckland , New Zealand

5. National Institute for Stroke and Applied Neuroscience, Auckland University of Technology , 55 Wellesley Street East, 1010 Auckland , New Zealand

6. Institute of Mental Health , 10 Buangkok View, 539747 , Singapore

7. Center for Biomedical Informatics, Nanyang Technological University , 50 Nanyang Ave, 639798 , Singapore

8. School of Biological Sciences, Nanyang Technological University , 50 Nanyang Ave, 639798 , Singapore

9. Intelligent Systems Research Center, Ulster University , Magee Campus, Derry, BT48 7JL, Ulster , United Kingdom

10. Auckland Bioengineering Institute, The University of Auckland , 6/70 Symonds Street, 1010 Auckland , New Zealand

11. Institute of Information and Communication Technologies, Bulgarian Academy of Sciences , Sofia , Bulgaria

Abstract

Abstract Selecting informative features, such as accurate biomarkers for disease diagnosis, prognosis and response to treatment, is an essential task in the field of bioinformatics. Medical data often contain thousands of features and identifying potential biomarkers is challenging due to small number of samples in the data, method dependence and non-reproducibility. This paper proposes a novel ensemble feature selection method, named Filter and Wrapper Stacking Ensemble (FWSE), to identify reproducible biomarkers from high-dimensional omics data. In FWSE, filter feature selection methods are run on numerous subsets of the data to eliminate irrelevant features, and then wrapper feature selection methods are applied to rank the top features. The method was validated on four high-dimensional medical datasets related to mental illnesses and cancer. The results indicate that the features selected by FWSE are stable and statistically more significant than the ones obtained by existing methods while also demonstrating biological relevance. Furthermore, FWSE is a generic method, applicable to various high-dimensional datasets in the fields of machine intelligence and bioinformatics.

Funder

National Research Foundation

National Research Foundation Singapore

National Medical Research Council Translational and Clinical Research Flagship Program

Publisher

Oxford University Press (OUP)

Subject

Molecular Biology,Information Systems

Cited by 2 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3