Augmented drug combination dataset to improve the performance of machine learning models predicting synergistic anticancer effects-Reference-Cited by-同舟云学术

Augmented drug combination dataset to improve the performance of machine learning models predicting synergistic anticancer effects

Published:2023-10-28 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Liu Mengmeng¹,Srivast Gopal¹,Ramanujam J.¹,Brylinski Michal¹

Affiliation:

1. Louisiana State University

Abstract

Abstract Combination therapy has gained popularity in cancer treatment as it enhances the treatment efficacy and overcomes drug resistance. Although machine learning (ML) techniques have become an indispensable tool for discovering new drug combinations, the data on drug combination therapy currently available may be insufficient to build high-precision models. We developed a data augmentation protocol to unbiasedly scale up the existing anti-cancer drug synergy dataset. Using a new drug similarity metric, we augmented the synergy data by substituting a compound in a drug combination instance with another molecule that exhibits highly similar pharmacological effects. Using this protocol, we were able to upscale the AZ-DREAM Challenges dataset from 8,798 to 6,016,697 drug combinations. Comprehensive performance evaluations show that Random Forest and Gradient Boosting Trees models trained on the augmented data achieve higher accuracy than those trained solely on the original dataset. Our data augmentation protocol provides a systematic and unbiased approach to generating more diverse and larger-scale drug combination datasets, enabling the development of more precise and effective ML models. The protocol presented in this study could serve as a foundation for future research aimed at discovering novel and effective drug combinations for cancer treatment.

Publisher

Research Square Platform LLC

Reference72 articles.

1. Predicting synergistic effects between compounds through their structural similarity and effects on transcriptomes;Liu Y;Bioinformatics,2016

2. Efficacy and safety of trastuzumab as a single agent in first-line treatment of HER2-overexpressing metastatic breast cancer;Vogel CL;J Clin Oncol,2002

3. Combination therapy in combating cancer;Bayat Mokhtari R;Oncotarget,2017

4. Machine learning in the prediction of cancer therapy;Rafique R;Comput Struct Biotechnol J,2021

5. The National Cancer Institute ALMANAC: A Comprehensive Screening Resource for the Detection of Anticancer Drug Pairs with Enhanced Therapeutic Activity;Holbeck SL;Cancer Res,2017