Marc Sturm, Sascha Quinten, Christian G Huber, and Oliver Kohlbacher (2007)
A statistical learning approach to the modeling of chromatographic retention of oligonucleotides incorporating sequence and secondary structure data
Nucl. Acids Res., 35(12):4195-4202.
We propose a new model for predicting the retention time of oligonucleotides. The model is based on ν-support vector regression using features derived from the base sequence and the predicted secondary structure of oligonucleotides. Because it incorporates secondary structure information, the model is applicable even at relatively low temperatures, where the secondary structure is not suppressed by thermal denaturation. This makes it possible to predict oligonucleotide retention times at arbitrary temperatures, provided that the target temperature lies within the temperature range of the training data. We describe different ways of calculating features from base sequence and secondary structure, present the results, and compare our model to existing models.
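
To make the idea of combining sequence and secondary structure features concrete, the following is a minimal sketch of one possible encoding. It is an illustration under stated assumptions, not the feature set used in the paper: the choice of base counts, dinucleotide counts, a paired-base fraction computed from dot-bracket notation, and the function names `sequence_features`, `structure_features` and `feature_vector` are all hypothetical. The secondary structure is assumed to have been predicted beforehand by an external tool.

```python
from collections import Counter

BASES = "ACGT"  # assumption: DNA oligonucleotides


def sequence_features(seq):
    """Return 4 base counts followed by 16 dinucleotide counts (hypothetical encoding)."""
    seq = seq.upper()
    base_counts = Counter(seq)
    dinuc_counts = Counter(seq[i:i + 2] for i in range(len(seq) - 1))
    features = [base_counts.get(b, 0) for b in BASES]
    features += [dinuc_counts.get(a + b, 0) for a in BASES for b in BASES]
    return features


def structure_features(dot_bracket):
    """Return the fraction of paired bases in a dot-bracket structure string."""
    n = len(dot_bracket)
    paired = sum(1 for c in dot_bracket if c in "()")
    return [paired / n if n else 0.0]


def feature_vector(seq, dot_bracket, temperature_c):
    """Concatenate sequence features, structure features and the column temperature."""
    if len(seq) != len(dot_bracket):
        raise ValueError("sequence and structure must have the same length")
    return (sequence_features(seq)
            + structure_features(dot_bracket)
            + [float(temperature_c)])


if __name__ == "__main__":
    # Illustrative input only; not data from the paper.
    print(feature_vector("GCATGC", "((..))", 30))
```

Vectors of this kind, paired with measured retention times, could then be passed to any ν-support vector regression implementation, for example `sklearn.svm.NuSVR` in scikit-learn. Including the temperature as a feature is one simple way to let a single model cover a range of temperatures; whether the paper does this is not stated in the abstract.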