CAEM-GBDT: a cancer subtype identifying method using multi-omics data and convolutional autoencoder network

Published:2024-07-15 Volume:4
ISSN:2673-7647
Container-title:Frontiers in Bioinformatics
Short-container-title:Front. Bioinform.

Author:

Shen Jiquan, Guo Xuanhui, Bai Hanwen, Luo Junwei

Abstract

Identifying cancer subtypes is important in medicine because accurate subtyping supports both cancer treatment and prognosis. Currently, most methods for cancer subtype identification rely on single-omics data, such as gene expression data. Multi-omics data, however, capture complementary characteristics of cancer and can therefore improve the accuracy of subtype identification. How to extract features from multi-omics data for cancer subtype identification is thus the main challenge researchers currently face. In this paper, we propose a cancer subtype identification method named CAEM-GBDT, which takes gene expression data, miRNA expression data, and DNA methylation data as input and uses a convolutional autoencoder network to identify cancer subtypes. A convolutional encoder layer performs feature extraction on the input data. A convolutional self-attention module embedded in this encoder layer captures higher-level representations of the multi-omics data. The high-level representations extracted by the convolutional encoder are then concatenated with the input to the decoder. A Gradient Boosting Decision Tree (GBDT) is used for cancer subtype identification. In the experiments, we compare CAEM-GBDT with existing cancer subtype identification methods, and the results show that CAEM-GBDT outperforms the compared methods. The source code is available on GitHub at https://github.com/gxh-1/CAEM-GBDT.git.
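
The encoder stage described in the abstract can be illustrated with a toy forward pass. This is a minimal sketch and not the authors' implementation (their code is in the GitHub repository above). It assumes that the three omics vectors are simply concatenated per sample, that each kernel acts as one convolution channel, and that self-attention uses identity query/key/value projections. The function names, sizes, and random untrained kernels are all hypothetical. The decoder and the GBDT classifier are omitted; in practice the resulting feature vectors might be passed to a gradient boosting classifier such as scikit-learn's GradientBoostingClassifier.

```python
import math
import random


def conv1d(x, kernel, stride=1):
    """Valid 1D cross-correlation of a list with a kernel, followed by ReLU."""
    k = len(kernel)
    return [
        max(0.0, sum(x[i + j] * kernel[j] for j in range(k)))
        for i in range(0, len(x) - k + 1, stride)
    ]


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(tokens):
    """Scaled dot-product self-attention.

    Assumption: identity Q/K/V projections, so each output token is a
    softmax-weighted average of the input tokens.
    """
    d = len(tokens[0])
    out = []
    for q in tokens:
        weights = softmax(
            [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in tokens]
        )
        out.append([sum(w * v[c] for w, v in zip(weights, tokens)) for c in range(d)])
    return out


def encode(sample, kernels, stride=2):
    """Convolve with each kernel (one channel each), attend across positions, flatten."""
    channels = [conv1d(sample, k, stride) for k in kernels]
    tokens = [list(pos) for pos in zip(*channels)]  # one vector per position
    attended = self_attention(tokens)
    return [v for tok in attended for v in tok]


random.seed(0)
# Hypothetical toy sample: 6 gene expression, 4 miRNA, 5 methylation values.
gene = [random.random() for _ in range(6)]
mirna = [random.random() for _ in range(4)]
methyl = [random.random() for _ in range(5)]
sample = gene + mirna + methyl  # assumed fusion by concatenation: 15 values

# Two untrained kernels of width 3; a real model would learn these.
kernels = [[random.uniform(-1, 1) for _ in range(3)] for _ in range(2)]
features = encode(sample, kernels)
print(len(features))  # 14 = 7 positions x 2 channels
```

With stride 2 and kernel width 3, the 15 concatenated inputs yield 7 positions per channel, so each sample is reduced to a 14-value feature vector.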

Publisher

Frontiers Media SA
