Machine-learning analysis of factors that shape cancer aneuploidy landscapes reveals an important role for negative selection-Reference-Cited by-同舟云学术

Machine-learning analysis of factors that shape cancer aneuploidy landscapes reveals an important role for negative selection

Published:2023-07-05 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Jubran Juman^ORCID,Slutsky Rachel,Rozenblum Nir,Rokach Lior,Ben-David Uri,Yeger-Lotem Esti

Abstract

AbstractAneuploidy, an abnormal number of chromosomes within a cell, is considered a hallmark of cancer. Patterns of aneuploidy differ across cancers, yet are similar in cancers affecting closely-related tissues. The selection pressures underlying aneuploidy patterns are not fully understood, hindering our understanding of cancer development and progression. Here, we applied interpretable machine learning (ML) methods to study tissue-selective aneuploidy patterns. We defined 20 types of features of normal and cancer tissues, and used them to model gains and losses of chromosome-arms in 24 cancer types. In order to reveal the factors that shape the tissue-specific cancer aneuploidy landscapes, we interpreted the ML models by estimating the relative contribution of each feature to the models. While confirming known drivers of positive selection, our quantitative analysis highlighted the importance of negative selection for shaping the aneuploidy landscapes of human cancer. Tumor-suppressor gene density was a better predictor of gain patterns than oncogene density, and vice-versa for loss patterns. We identified the contribution of tissue-selective features and demonstrated them experimentally for chr13q gain in colon cancer. In line with an important role for negative selection in shaping the aneuploidy landscapes, we found compensation by paralogs to be a top predictor of chromosome-arm loss prevalence, and demonstrated this relationship for one such paralog interaction. Similar factors were found to shape aneuploidy patterns in human cancer cell lines, demonstrating their relevance for aneuploidy research. Overall, our quantitative, interpretable ML models improve the understanding of the genomic properties that shape cancer aneuploidy landscapes.

Publisher

Cold Spring Harbor Laboratory

Reference56 articles.

1. Cancer genomes tolerate deleterious coding mutations through somatic copy number amplifications of wild-type regions;Nat Commun,2023

2. Role of duplicate genes in determining the tissue-selectivity of hereditary diseases

3. Differential network analysis of multiple human tissue interactomes highlights tissue-selective processes and genetic disorder genes;Bioinformatics,2020

4. MyProteinNet: build up-to-date protein interaction networks for organisms, tissues and user-defined contexts

5. Context is everything: aneuploidy in cancer;Nat Rev Genet,2020

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Chromosome 7 to the rescue: overcoming chromosome 10 loss in gliomas;2024-01-22