Efficient weighted univariate clustering maps outstanding dysregulated genomic zones in human cancers-Reference-Cited by-同舟云学术

Efficient weighted univariate clustering maps outstanding dysregulated genomic zones in human cancers

Published:2020-07-03 Issue:20 Volume:36 Page:5027-5036
ISSN:1367-4803
Container-title:Bioinformatics
language:en
Short-container-title:

Author:

Song Mingzhou¹²^ORCID,Zhong Hua¹^ORCID

Affiliation:

1. Department of Computer Science

2. Molecular Biology Graduate Program, New Mexico State University, Las Cruces, NM 88003, USA

Abstract

Abstract Motivation Chromosomal patterning of gene expression in cancer can arise from aneuploidy, genome disorganization or abnormal DNA methylation. To map such patterns, we introduce a weighted univariate clustering algorithm to guarantee linear runtime, optimality and reproducibility. Results We present the chromosome clustering method, establish its optimality and runtime and evaluate its performance. It uses dynamic programming enhanced with an algorithm to reduce search-space in-place to decrease runtime overhead. Using the method, we delineated outstanding genomic zones in 17 human cancer types. We identified strong continuity in dysregulation polarity—dominance by either up- or downregulated genes in a zone—along chromosomes in all cancer types. Significantly polarized dysregulation zones specific to cancer types are found, offering potential diagnostic biomarkers. Unreported previously, a total of 109 loci with conserved dysregulation polarity across cancer types give insights into pan-cancer mechanisms. Efficient chromosomal clustering opens a window to characterize molecular patterns in cancer genome and beyond. Availability and implementation Weighted univariate clustering algorithms are implemented within the R package ‘Ckmeans.1d.dp’ (4.0.0 or above), freely available at https://cran.r-project.org/package=Ckmeans.1d.dp. Supplementary information Supplementary data are available at Bioinformatics online.

Funder

National Science Foundation

USDA

National Cancer Institute Partnership for the Advancement of Cancer Research NCI grants

(NMSU)

Publisher

Oxford University Press (OUP)

Subject

Computational Mathematics,Computational Theory and Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Statistics and Probability

Link

http://academic.oup.com/bioinformatics/advance-article-pdf/doi/10.1093/bioinformatics/btaa613/34302768/btaa613.pdf

Reference56 articles.

1. Disruption of the 3D cancer genome blueprint;Achinger-Kawecka;Epigenomics,2017

2. Geometric applications of a matrix-searching algorithm;Aggarwal;Algorithmica,1987

3. A note on cluster analysis and dynamic programming;Bellman;Math. Biosci,1973

4. A computational procedure to identify significant overlap of differentially expressed and genomic imbalanced regions in cancer datasets;Bicciato;Nucleic Acids Res,2009

5. A gene expression map of the Arabidopsis root;Birnbaum;Science,2003

Cited by 30 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. When and where we are: Comparing early criminal careers of organized crime offenders in Italy and the Netherlands across decades;Journal of Criminal Justice;2024-11

2. Fast and explainable clustering based on sorting;Pattern Recognition;2024-06

3. Modeling type 1 diabetes progression using machine learning and single-cell transcriptomic measurements in human islets;Cell Reports Medicine;2024-05

4. Exploring Construct Measures Using Rasch Models and Discretization Methods to Analyze Existing Continuous Data;Measurement: Interdisciplinary Research and Perspectives;2024-01-02

5. Variation in Use of East Asian Late Paleolithic Weapons: a Study of Tip Cross-sectional Area of Stemmed Points from Korea;Journal of Paleolithic Archaeology;2023-11-09