NSF4SL: negative-sample-free contrastive learning for ranking synthetic lethal partner genes in human cancers

Author:

Wang Shike1,Feng Yimiao1,Liu Xin1,Liu Yong2,Wu Min3,Zheng Jie14ORCID

Affiliation:

1. School of Information Science and Technology, ShanghaiTech University , Shanghai 201210, China

2. Joint NTU-UBC Research Centre of Excellence in Active Living for the Elderly, Nanyang Technological University , Singapore 639798, Singapore

3. Institute for Infocomm Research, Agency for Science, Technology and Research (A*STAR) , Singapore 138632, Singapore

4. Shanghai Engineering Research Center of Intelligent Vision and Imaging , Shanghai 201210, China

Abstract

Abstract Motivation Detecting synthetic lethality (SL) is a promising strategy for identifying anti-cancer drug targets. Targeting SL partners of a primary gene mutated in cancer is selectively lethal to cancer cells. Due to high cost of wet-lab experiments and availability of gold standard SL data, supervised machine learning for SL prediction has been popular. However, most of the methods are based on binary classification and thus limited by the lack of reliable negative data. Contrastive learning can train models without any negative sample and is thus promising for finding novel SLs. Results We propose NSF4SL, a negative-sample-free SL prediction model based on a contrastive learning framework. It captures the characteristics of positive SL samples by using two branches of neural networks that interact with each other to learn SL-related gene representations. Moreover, a feature-wise data augmentation strategy is used to mitigate the sparsity of SL data. NSF4SL significantly outperforms all baselines which require negative samples, even in challenging experimental settings. To the best of our knowledge, this is the first time that SL prediction is formulated as a gene ranking problem, which is more practical than the current formulation as binary classification. NSF4SL is the first contrastive learning method for SL prediction and its success points to a new direction of machine-learning methods for identifying novel SLs. Availability and implementation Our source code is available at https://github.com/JieZheng-ShanghaiTech/NSF4SL. Supplementary information Supplementary data are available at Bioinformatics online.

Funder

Startup Grant, ShanghaiTech University

Publisher

Oxford University Press (OUP)

Subject

Computational Mathematics,Computational Theory and Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Statistics and Probability

Reference37 articles.

1. Predicting synthetic lethal interactions using conserved patterns in protein interaction networks;Benstead-Hume;PLoS Comput. Biol,2019

2. Translating embeddings for modeling multi-relational data;Bordes;Adv. Neural Inf. Process. Syst,2013

3. KOBAS-i: intelligent prioritization and exploratory visualization of biological functions for gene enrichment analysis;Bu;Nucleic Acids Res,2021

4. Dual-dropout graph convolutional network for predicting synthetic lethality in human cancers;Cai;Bioinformatics,2020

Cited by 6 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3