Search: onr:"swepub:oai:DiVA.org:kau-89822" >
Building machine-le...
Building machine-learning-based models for retention time and resolution predictions in ion pair chromatography of oligonucleotides
-
- Enmark, Martin, 1984- (author)
- Karlstads universitet,Institutionen för ingenjörs- och kemivetenskaper (from 2013)
-
- Häggstrom, Jakob (author)
- Karlstads universitet,Institutionen för ingenjörs- och kemivetenskaper (from 2013)
-
- Samuelsson, Jörgen, 1971- (author)
- Karlstads universitet,Institutionen för ingenjörs- och kemivetenskaper (from 2013)
-
show more...
-
- Fornstedt, Torgny, 1957- (author)
- Karlstads universitet,Institutionen för ingenjörs- och kemivetenskaper (from 2013)
-
show less...
-
(creator_code:org_t)
- Elsevier, 2022
- 2022
- English.
-
In: Journal of Chromatography A. - : Elsevier. - 0021-9673 .- 1873-3778. ; 1671
- Related links:
-
https://doi.org/10.1...
-
show more...
-
https://kau.diva-por... (primary) (Raw object)
-
https://urn.kb.se/re...
-
https://doi.org/10.1...
-
show less...
Abstract
Subject headings
Close
- Support vector regression models are created and used to predict the retention times of oligonucleotides separated using gradient ion-pair chromatography with high accuracy. The experimental dataset consisted of fully phosphorothioated oligonucleotides. Two models were trained and validated using two pseudo orthogonal gradient modes and three gradient slopes. The results show that the spread in retention time differs between the two gradient modes, which indicated varying degree of sequence dependent separation. Peak widths from the experimental dataset were calculated and correlated with the guanine cytosine content and retention time of the sequence for each gradient slope. This data was used to predict the resolution of the n - 1 impurity among 250 0 0 0 random 12-and 16-mer sequences; showing one of the investigated gradient modes has a much higher probability of exceeding a resolution of 1.5, particularly for the 16-mer sequences. Sequences having a high guanine-cytosine content and a terminal C are more likely to not reach critical resolution. The trained SVR models can both be used to identify characteristics of different separation methods and to assist in the choice of method conditions, i.e. to optimize resolution for arbitrary sequences. The methodology presented in this study can be expected to be applicable to predict retention times of other oligonucleotide synthesis and degradation impurities if provided enough training data.
Subject headings
- NATURVETENSKAP -- Kemi -- Analytisk kemi (hsv//swe)
- NATURAL SCIENCES -- Chemical Sciences -- Analytical Chemistry (hsv//eng)
Keyword
- Machine-learning
- Support vector regression (SVR) model
- Oligonucleotides
- Ion-pair chromatography
- Resolution
- Kemi - analytisk kemi
- Chemistry - Analytical Chemistry
Publication and Content Type
- ref (subject category)
- art (subject category)
Find in a library
To the university's database