Sökning: onr:"swepub:oai:DiVA.org:kau-89822" >
Building machine-le...
Building machine-learning-based models for retention time and resolution predictions in ion pair chromatography of oligonucleotides
-
- Enmark, Martin, 1984- (författare)
- Karlstads universitet,Institutionen för ingenjörs- och kemivetenskaper (from 2013)
-
- Häggstrom, Jakob (författare)
- Karlstads universitet,Institutionen för ingenjörs- och kemivetenskaper (from 2013)
-
- Samuelsson, Jörgen, 1971- (författare)
- Karlstads universitet,Institutionen för ingenjörs- och kemivetenskaper (from 2013)
-
visa fler...
-
- Fornstedt, Torgny, 1957- (författare)
- Karlstads universitet,Institutionen för ingenjörs- och kemivetenskaper (from 2013)
-
visa färre...
-
(creator_code:org_t)
- Elsevier, 2022
- 2022
- Engelska.
-
Ingår i: Journal of Chromatography A. - : Elsevier. - 0021-9673 .- 1873-3778. ; 1671
- Relaterad länk:
-
https://doi.org/10.1...
-
visa fler...
-
https://kau.diva-por... (primary) (Raw object)
-
https://urn.kb.se/re...
-
https://doi.org/10.1...
-
visa färre...
Abstract
Ämnesord
Stäng
- Support vector regression models are created and used to predict the retention times of oligonucleotides separated using gradient ion-pair chromatography with high accuracy. The experimental dataset consisted of fully phosphorothioated oligonucleotides. Two models were trained and validated using two pseudo orthogonal gradient modes and three gradient slopes. The results show that the spread in retention time differs between the two gradient modes, which indicated varying degree of sequence dependent separation. Peak widths from the experimental dataset were calculated and correlated with the guanine cytosine content and retention time of the sequence for each gradient slope. This data was used to predict the resolution of the n - 1 impurity among 250 0 0 0 random 12-and 16-mer sequences; showing one of the investigated gradient modes has a much higher probability of exceeding a resolution of 1.5, particularly for the 16-mer sequences. Sequences having a high guanine-cytosine content and a terminal C are more likely to not reach critical resolution. The trained SVR models can both be used to identify characteristics of different separation methods and to assist in the choice of method conditions, i.e. to optimize resolution for arbitrary sequences. The methodology presented in this study can be expected to be applicable to predict retention times of other oligonucleotide synthesis and degradation impurities if provided enough training data.
Ämnesord
- NATURVETENSKAP -- Kemi -- Analytisk kemi (hsv//swe)
- NATURAL SCIENCES -- Chemical Sciences -- Analytical Chemistry (hsv//eng)
Nyckelord
- Machine-learning
- Support vector regression (SVR) model
- Oligonucleotides
- Ion-pair chromatography
- Resolution
- Kemi - analytisk kemi
- Chemistry - Analytical Chemistry
Publikations- och innehållstyp
- ref (ämneskategori)
- art (ämneskategori)
Hitta via bibliotek
Till lärosätets databas