Abstract
AbstractMotivationTo capture structural homology in RNAs, predicting RNA structural alignments has been a fundamental framework around RNA science. Learning simultaneous RNA structural alignments in their rich scoring is an undeveloped subject because evaluating them is computationally expensive.ResultsWe developed ConsTrain—a gradient-based machine learning method for rich structural alignment scoring. We also implemented ConsAlign—a simultaneous RNA structural aligner composed of ConsTrain’s learned scoring parameters. To aim for better structural alignment quality, ConsAlign employs (1) transfer learning from well-defined scoring models and (2) the ensemble model between the ConsTrain model and a mature thermodynamic scoring model. Keeping comparable running time, ConsAlign demonstrated competitive alignment prediction quality among current RNA structural aligners.Availability and implementationOur code and our data are freely available at https://github.com/heartsh/consalign.Contactheartsh@heartsh.ioSupplementary informationSupplementary data are available at Bioinformatics
Publisher
Cold Spring Harbor Laboratory