Affiliation:
1. School of Electrical and Computer Engineering, Korea University , 145 Anam-ro, Seongbuk-gu, Seoul, 02841 , Korea
Abstract
Abstract
Recent studies have extensively used deep learning algorithms to analyze gene expression to predict disease diagnosis, treatment effectiveness, and survival outcomes. Survival analysis studies on diseases with high mortality rates, such as cancer, are indispensable. However, deep learning models are plagued by overfitting owing to the limited sample size relative to the large number of genes. Consequently, the latest style-transfer deep generative models have been implemented to generate gene expression data. However, these models are limited in their applicability for clinical purposes because they generate only transcriptomic data. Therefore, this study proposes ctGAN, which enables the combined transformation of gene expression and survival data using a generative adversarial network (GAN). ctGAN improves survival analysis by augmenting data through style transformations between breast cancer and 11 other cancer types. We evaluated the concordance index (C-index) enhancements compared with previous models to demonstrate its superiority. Performance improvements were observed in nine of the 11 cancer types. Moreover, ctGAN outperformed previous models in seven out of the 11 cancer types, with colon adenocarcinoma (COAD) exhibiting the most significant improvement (median C-index increase of ~15.70%). Furthermore, integrating the generated COAD enhanced the log-rank p-value (0.041) compared with using only the real COAD (p-value = 0.797). Based on the data distribution, we demonstrated that the model generated highly plausible data. In clustering evaluation, ctGAN exhibited the highest performance in most cases (89.62%). These findings suggest that ctGAN can be meaningfully utilized to predict disease progression and select personalized treatments in the medical field.
Funder
National Research Foundation of Korea
Publisher
Oxford University Press (OUP)
Reference62 articles.
1. SuperstarGAN: generative adversarial networks for image-to-image translation in large-scale domains;Ko;Neural Netw,2023
2. Controllable generative adversarial network. IEEE;Lee;Access,2019
3. Computer code representation through natural language processing for fMRI data analysis;Kim;2022 International Conference on Artificial Intelligence in Information and Communication (ICAIIC),2022
4. Stock Price prediction through the sentimental analysis of news articles;Kim;2019 Eleventh International Conference on Ubiquitous and Future Networks (ICUFN),2019
5. Deep learning techniques for automatic MRI cardiac multi-structures segmentation and diagnosis: is the problem solved?;Bernard;IEEE Trans Med Imaging,2018