Author:
Yousuff Mohamed,Babu Rajasekhara,Rathinam Anand
Abstract
AbstractSingle-cell multi-omics technology has catalyzed a transformative shift in contemporary cell biology, illuminating the nuanced relationship between genotype and phenotype. This paradigm shift hinges on the understanding that while genomic structures remain uniform across cells within an organism, the expression patterns dictate physiological traits. Leveraging high throughput sequencing, single-cell RNA sequencing (scRNA-seq) has emerged as a powerful tool, enabling comprehensive transcriptomic analysis at unprecedented resolution. This paper navigates through a landscape of dimensionality reduction techniques essential for distilling meaningful insights from the scRNA-seq datasets. Notably, while foundational, Principal Component Analysis may fall short of capturing the intricacies of diverse cell types. In response, nonlinear techniques have garnered traction, offering a more nuanced portrayal of cellular relationships. Among these, Pairwise Controlled Manifold Approximation Projection (PaCMAP) stands out for its capacity to preserve local and global structures. We present an augmented iteration, Compactness Preservation Pairwise Controlled Manifold Approximation Projection (CP-PaCMAP), a novel advancement for scRNA-seq data visualization. Employing benchmark datasets from critical human organs, we demonstrate the superior efficacy of CP-PaCMAP in preserving compactness, offering a pivotal breakthrough for enhanced classification and clustering in scRNA-seq analysis. A comprehensive suite of metrics, including Trustworthiness, Continuity, Mathew Correlation Coefficient, and Mantel test, collectively validate the fidelity and utility of proposed and existing techniques. These metrics provide a multi-dimensional evaluation, elucidating the performance of CP-PaCMAP compared to other dimensionality reduction techniques.
Publisher
Springer Science and Business Media LLC
Reference66 articles.
1. Allen M, Poggiali D, Whitaker K, Marshall TR, van Langen J, Kievit RA. Raincloud plots: a multi-platform tool for robust data visualization. Wellcome Open Res. 2021;4:63.
2. Amid E, Warmuth MK. TriMap: Large-scale Dimensionality Reduction Using Triplets. arXiv Prepr. 2019.
3. Andrew G, Arora R, Bilmes J, Livescu K. Deep Canonical Correlation Analysis. In: Dasgupta S, McAllester D, editors. Proceedings of the 30th International Conference on Machine Learning [Internet]. Atlanta, Georgia, USA: PMLR; 2013. p. 1247–55. (Proceedings of Machine Learning Research; vol. 28). https://proceedings.mlr.press/v28/andrew13.html
4. Babjac A, Royalty T, Steen AD, Emrich SJ. A Comparison of Dimensionality Reduction Methods for Large Biological Data. In: Proceedings of the 13th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics. Association for Computing Machinery; 2022. (BCB ’22).
5. Battenberg K, Kelly ST, Ras RA, Hetherington NA, Hayashi M, Minoda A. A flexible cross-platform single-cell data processing pipeline. Nat Commun. 2022;13(1):6847.