Abstract
Motivation
Predicting the secondary structure of RNA is a fundamental task in bioinformatics. Algorithms that predict secondary structure given only the primary sequence, and a model to evaluate the quality of a structure, are an integral part of this. These algorithms have been updated as our model of RNA thermodynamics changed and expanded. An exception to this has been the treatment of multi-loops. Although more advanced models of multi-loop free energy change have been suggested, a simple, linear model has been used since the 1980s. However, recently, new dynamic programing algorithms for secondary structure prediction that could incorporate these models were presented. Unfortunately, these models appear to have lower accuracy for secondary structure prediction.
Results
We apply linear regression and a new parameter optimization algorithm to find better parameters for the existing linear model and advanced non-linear multi-loop models. These include the Jacobson-Stockmayer and Aalberts & Nandagopal models. We find that the current linear model parameters may be near optimal for the linear model, and that no advanced model performs better than the existing linear model parameters even after parameter optimization.
Availability and implementation
Source code and data is available at https://github.com/maxhwardg/advanced_multiloops.
Original language | English |
---|---|
Pages (from-to) | 4298-4306 |
Number of pages | 9 |
Journal | Bioinformatics |
Volume | 35 |
Issue number | 21 |
DOIs | |
Publication status | Published - 1 Nov 2019 |