Dimensionality reduction methods for extracting functional networks from large‐scale CRISPR screens
-
Published:2023-09-26
Issue:11
Volume:19
Page:
-
ISSN:1744-4292
-
Container-title:Molecular Systems Biology
-
language:en
-
Short-container-title:Molecular Systems Biology
Author:
Hassan Arshia Zernab1,
Ward Henry N2,
Rahman Mahfuzur1,
Billmann Maximilian13ORCID,
Lee Yoonkyu2,
Myers Chad L12ORCID
Affiliation:
1. Department of Computer Science and Engineering University of Minnesota – Twin Cities Minneapolis MN USA
2. Bioinformatics and Computational Biology Graduate Program University of Minnesota – Twin Cities Minneapolis MN USA
3. Institute of Human Genetics University of Bonn, School of Medicine and University Hospital Bonn Bonn Germany
Abstract
AbstractCRISPR‐Cas9 screens facilitate the discovery of gene functional relationships and phenotype‐specific dependencies. The Cancer Dependency Map (DepMap) is the largest compendium of whole‐genome CRISPR screens aimed at identifying cancer‐specific genetic dependencies across human cell lines. A mitochondria‐associated bias has been previously reported to mask signals for genes involved in other functions, and thus, methods for normalizing this dominant signal to improve co‐essentiality networks are of interest. In this study, we explore three unsupervised dimensionality reduction methods—autoencoders, robust, and classical principal component analyses (PCA)—for normalizing the DepMap to improve functional networks extracted from these data. We propose a novel “onion” normalization technique to combine several normalized data layers into a single network. Benchmarking analyses reveal that robust PCA combined with onion normalization outperforms existing methods for normalizing the DepMap. Our work demonstrates the value of removing low‐dimensional signals from the DepMap before constructing functional gene networks and provides generalizable dimensionality reduction‐based normalization tools.
Funder
Deutsche Forschungsgemeinschaft
National Institutes of Health
National Science Foundation
University of Minnesota
Publisher
Springer Science and Business Media LLC
Subject
Applied Mathematics,Computational Theory and Mathematics,General Agricultural and Biological Sciences,General Immunology and Microbiology,General Biochemistry, Genetics and Molecular Biology,Information Systems
Reference65 articles.
1. 10x Genomics(2019)5k peripheral blood mononuclear cells (PBMCs) from a healthy donor (v3 chemistry) single cell gene expression dataset by Cell Ranger 3.0.2. (https://support.10xgenomics.com/single‐cell‐gene‐expression/datasets/3.0.2/5k_pbmc_v3). [DATASET]
2. ArnoldJB DarocziG WerthB WeitznerB KunstJ AuguieB RudisB WickhamH TalbotJ LondonJ(2021)ggthemes: Extra Themes Scales and Geoms for ‘ggplot2’. R package v4.2.4.
3. AuguieB AntonovA(2017)gridExtra: miscellaneous functions for ‘Grid’ graphics. R package v2.3.
4. Quantitative analysis of fitness and genetic interactions in yeast on a genome scale
5. Prioritization of cancer therapeutic targets using CRISPR–Cas9 screens
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献