Machine Learning-Based Ensemble Recursive Feature Selection of Circulating miRNAs for Cancer Tumor Classification-Reference-Cited by-同舟云学术

Machine Learning-Based Ensemble Recursive Feature Selection of Circulating miRNAs for Cancer Tumor Classification

Published:2020-07-03 Issue:7 Volume:12 Page:1785
ISSN:2072-6694
Container-title:Cancers
language:en
Short-container-title:Cancers

Author:

Lopez-Rincon Alejandro,Mendoza-Maldonado Lucero,Martinez-Archundia Marlet^ORCID,Schönhuth Alexander^ORCID,Kraneveld Aletta D.^ORCID,Garssen Johan^ORCID,Tonda Alberto^ORCID

Abstract

Circulating microRNAs (miRNA) are small noncoding RNA molecules that can be detected in bodily fluids without the need for major invasive procedures on patients. miRNAs have shown great promise as biomarkers for tumors to both assess their presence and to predict their type and subtype. Recently, thanks to the availability of miRNAs datasets, machine learning techniques have been successfully applied to tumor classification. The results, however, are difficult to assess and interpret by medical experts because the algorithms exploit information from thousands of miRNAs. In this work, we propose a novel technique that aims at reducing the necessary information to the smallest possible set of circulating miRNAs. The dimensionality reduction achieved reflects a very important first step in a potential, clinically actionable, circulating miRNA-based precision medicine pipeline. While it is currently under discussion whether this first step can be taken, we demonstrate here that it is possible to perform classification tasks by exploiting a recursive feature elimination procedure that integrates a heterogeneous ensemble of high-quality, state-of-the-art classifiers on circulating miRNAs. Heterogeneous ensembles can compensate inherent biases of classifiers by using different classification algorithms. Selecting features then further eliminates biases emerging from using data from different studies or batches, yielding more robust and reliable outcomes. The proposed approach is first tested on a tumor classification problem in order to separate 10 different types of cancer, with samples collected over 10 different clinical trials, and later is assessed on a cancer subtype classification task, with the aim to distinguish triple negative breast cancer from other subtypes of breast cancer. Overall, the presented methodology proves to be effective and compares favorably to other state-of-the-art feature selection methods.

Publisher

MDPI AG

Subject

Cancer Research,Oncology

Link

https://www.mdpi.com/2072-6694/12/7/1785/pdf

Reference104 articles.

1. New Concepts in Cancer Biomarkers: Circulating miRNAs in Liquid Biopsies

2. Current State of Circulating MicroRNAs as Cancer Biomarkers

3. MicroRNA maturation: stepwise processing and subcellular localization

4. MicroRNA biogenesis: coordinated cropping and dicing

5. MicroRNAs in cancer biology and therapy: Current status and perspectives

Cited by 44 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Multi-omics-based Machine Learning for the Subtype Classification of Breast Cancer;Arabian Journal for Science and Engineering;2024-09-10

2. Artificial Intelligence Applications in Oral Cancer and Oral Dysplasia;Tissue Engineering Part A;2024-08-07

3. Identifying miRNA as biomarker for breast cancer subtyping using association rule;Computers in Biology and Medicine;2024-08

4. Machine-Learning Analysis of mRNA: An Application to Inflammatory Bowel Disease;2024 16th International Conference on Human System Interaction (HSI);2024-07-08

5. Effect of feature optimization on performance of machine learning models for predicting traffic incident duration;Engineering Applications of Artificial Intelligence;2024-05