Abstract
Abstract
To address the problem that the traditional correlation metric defaults the 0 terms in the adjacency matrix of lncRNA-disease to irrelevant, ignoring the fact that it is only not validated now but maybe validated as relevant in the future, we propose a correlation calculation method that incorporates potential correlation properties, which is verified by comparative experiments to have better performance than the traditional method in the model; meanwhile, with the idea of nearest neighbor, we design the matrix completion model (DMWNN) to reassign values to the 0-terms in the adjacency matrix. The correlation is used instead of the traditional Euclidean distance to screen more valuable neighbors; combined with the linear decay strategy of distance weights, the interference of low correlation data is reduced, and thus the accuracy of reassignment is improved. The AUC value reached 0.9480 in the five-fold cross-validation experiment and 0.9603 in the leave-one-out cross-validation experiment, and the experimental results showed that the DMWNN model can effectively explore the potential association between lncRNA and disease and has strong predictive performance.
Publisher
Research Square Platform LLC
Reference50 articles.
1. Non-coding RNAs as regulators of embryogenesis;Pauli A;Nat Rev Genet,2011
2. Ab initio reconstruction of cell type–specific transcriptomes in mouse reveals the conserved multi-exonic structure of lincRNAs;Guttman M;Nat Biotechnol,2010
3. Pola Ce k N. Non-coding RNAs: hope or hype?[J];Hüttenhofer A;Trends in Genetics,2005
4. Advances in long non-coding RNA research;Chen XM;Advances in Biochemistry and Biophysics
5. Chen, X. Gui,Y, & Yan. Novel human lncRNA-disease association inference based on lncRNA expression profiles. Bioinformatics (Oxford, England), (2013).