Double shrinking (DOSH), a regression-based algorithm for gene regulatory network inference from co-expression data
Author:
Vladimirovich Bannikov ArtyomORCID
Abstract
Abstract
Next generation sequencing allows obtaining large amounts of gene expression data. Inferring regulatory relations between genes from such data has been a long standing challenge. Current algorithms are based on linear regression or a distance measures, like partial correlation or mutual information. The majority of algorithms are of very broad nature. Their aim is to infer a random normal network without using any additional assumptions. Regularized regression algorithms are an exception, since they assume sparsity. Additional truthful assumptions make inferences easier and more accurate. The proposed algorithm, Double Shrinking (DOSH), is based on regularized regression with assumptions about gene expression data and network structure. Reliability of gene expression values is assumed to depend upon their magnitude; larger values are more reliable. Each gene is assumed to be completely predictable from other genes. The effectiveness of the algorithm is demonstrated by identifying genetic markers of survival in lymphoid leukemia.
Publisher
Research Square Platform LLC
Reference6 articles.
1. Emerging landscape of molecular interaction networks:Opportunities, challenges and prospects;Panditrao G;J Biosci,2022 2. Computational biology approaches for mapping transcriptional regulatory networks;Saint-André V;Comput Struct Biotechnol J. 2021 Aug 3. Murray LM, Thillaiyampalam G, Xi Y, Cristino AS, Upham JW. Whole transcriptome analysis of high and low IFN-α producers reveals differential response patterns following rhinovirus stimulation. Clin Transl Immunology. 2021 Nov 17;10(11):e1356. doi: 10.1002/cti2.1356. PMID: 34868584; PMCID: PMC8599968. 4. Cunningham F, Allen JE, Allen J, Alvarez-Jarreta J, Amode MR, Armean IM, Austine-Orimoloye O, Azov AG, Barnes I, Bennett R, Berry A, Bhai J, Bignell A, Billis K, Boddu S, Brooks L, Charkhchi M, Cummins C, Da Rin Fioretto L, Davidson C, Dodiya K, Donaldson S, El Houdaigui B, El Naboulsi T, Fatima R, Giron CG, Genez T, Martinez JG, Guijarro-Clarke C, Gymer A, Hardy M, Hollis Z, Hourlier T, Hunt T, Juettemann T, Kaikala V, Kay M, Lavidas I, Le T, Lemos D, Marugán JC, Mohanan S, Mushtaq A, Naven M, Ogeh DN, Parker A, Parton A, Perry M, Piližota I, Prosovetskaia I, Sakthivel MP, Salam AIA, Schmitt BM, Schuilenburg H, Sheppard D, Pérez-Silva JG, Stark W, Steed E, Sutinen K, Sukumaran R, Sumathipala D, Suner MM, Szpak M, Thormann A, Tricomi FF, Urbina-Gómez D, Veidenberg A, Walsh TA, Walts B, Willhoft N, Winterbottom A, Wass E, Chakiachvili M, Flint B, Frankish A, Giorgetti S, Haggerty L, Hunt SE, IIsley GR, Loveland JE, Martin FJ, Moore B, Mudge JM, Muffato M, Perry E, Ruffier M, Tate J, Thybert D, Trevanion SJ, Dyer S, Harrison PW, Howe KL, Yates AD, Zerbino DR, Flicek P. Ensembl 2022. Nucleic Acids Res. 2022 Jan 7;50(D1):D988-D995. doi: 10.1093/nar/gkab1049. PMID: 34791404; PMCID: PMC8728283. 5. A.K. Md. E. Saleh, M. Arashi, B.M.G. Kibria “Theory of Ridge Regression Estimation with Applications” (book) 2019.
|
|