Sequential Optimal Experimental Design of Perturbation Screens Guided by Multi-modal Priors-Reference-Cited by-同舟云学术

Sequential Optimal Experimental Design of Perturbation Screens Guided by Multi-modal Priors

Published:2023-12-13 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Huang Kexin,Lopez Romain,Hütter Jan-Christian,Kudo Takamasa,Rios Antonio,Regev Aviv

Abstract

AbstractUnderstanding a cell’s expression response to genetic perturbations helps to address important challenges in biology and medicine, including the function of gene circuits, discovery of therapeutic targets and cell reprogramming and engineering. In recent years, Perturb-seq, pooled genetic screens with single cell RNA-seq (scRNA-seq) readouts, has emerged as a common method to collect such data. However, irrespective of technological advances, because combinations of gene perturbations can have unpredictable, non-additive effects, the number of experimental configurations far exceeds experimental capacity, and for certain cases, the number of available cells. While recent machine learning models, trained on existing Perturb-seq data sets, can predict perturbation outcomes with some degree of accuracy, they are currently limited by sub-optimal training set selection and the small number of cell contexts of training data, leading to poor predictions for unexplored parts of perturbation space. As biologists deploy Perturb-seq across diverse biological systems, there is an enormous need for algorithms to guide iterative experiments while exploring the large space of possible perturbations and their combinations. Here, we propose a sequential approach for designing Perturb-seq experiments that uses the model to strategically select the most informative perturbations at each step for subsequent experiments. This enables a significantly more efficient exploration of the perturbation space, while predicting the effect of the rest of the unseen perturbations with high-fidelity. Analysis of a previous large-scale Perturb-seq experiment reveals that our setting is severely restricted by the number of examples and rounds, falling into a non-conventional active learning regime called “active learning on a budget”. Motivated by this insight, we develop IterPert, a novel active learning method that exploits rich and multi-modal prior knowledge in order to efficiently guide the selection of subsequent perturbations. Using prior knowledge for this task is novel, and crucial for successful active learning on a budget. We validate IterPertusing insilico benchmarking of active learning, constructed from a large-scale CRISPRi Perturb-seq data set. We find that IterPertoutperforms other active learning strategies by reaching comparable accuracy at only a third of the number of perturbations profiled as the next best method. Overall, our results demonstrate the potential of sequentially designing perturbation screens through IterPert.

Publisher

Cold Spring Harbor Laboratory

Reference50 articles.

1. Systems Biology: A Brief Overview

2. Perturb-Seq: Dissecting Molecular Circuits with Scalable Single-Cell RNA Profiling of Pooled Genetic Screens

3. Targeted perturb-seq enables genome-scale genetic screens in single cells;Nature Methods,2020

4. Massively parallel phenotyping of coding variants in cancer with Perturb-seq

5. Gavin R Schnitzler , Helen Kang , Vivian S Lee-Kim , Rosa X Ma , Tony Zeng , Ramcharan S Angom , Shi Fang , Shamsudheen Karuthedath Vellarikkal , Ronghao Zhou , Katherine Guo , et al. Mapping the convergence of genes for coronary artery disease onto endothelial cell programs. bioRxiv, pages 2022–11, 2022.

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Toward a foundation model of causal cell and tissue biology with a Perturbation Cell and Tissue Atlas;Cell;2024-08

2. TDC-2: Multimodal Foundation for Therapeutic Science;2024-06-14