Large-scale structure prediction by improved contact predictions and model quality assessment
- Michel, Mirco (author)
- Stockholm University, Department of Biochemistry and Biophysics, Science for Life Laboratory (SciLifeLab)
- Menéndez Hurtado, David (author)
- Stockholm University, Department of Biochemistry and Biophysics, Science for Life Laboratory (SciLifeLab)
- Uziela, Karolis (author)
- Stockholm University, Department of Biochemistry and Biophysics, Science for Life Laboratory (SciLifeLab)
- Elofsson, Arne (author)
- Stockholm University, Department of Biochemistry and Biophysics, Science for Life Laboratory (SciLifeLab)
- 2017-07-12
- 2017
- English.
In: Bioinformatics. - : Oxford University Press (OUP). - 1367-4803 .- 1367-4811 .- 1460-2059. ; 33:14, pp. 123-129
- Related links:
https://doi.org/10.1...
https://academic.oup...
https://urn.kb.se/re...
https://doi.org/10.1...
Abstract
- Motivation: Accurate contact predictions can be used to predict the structure of proteins. Until recently, these methods were limited to very large protein families, which decreased their utility. However, recent progress in combining direct coupling analysis with machine learning methods has made it possible to predict accurate contact maps for smaller families. To what extent these predictions can be used to produce accurate models of the families is not known.
- Results: We present the PconsFold2 pipeline, which uses contact predictions from PconsC3, the CONFOLD folding algorithm and model quality estimations to predict the structure of a protein. We show that the model quality estimation significantly increases the number of models that can be reliably identified. Finally, we apply PconsFold2 to 6379 Pfam families of unknown structure and find that PconsFold2 can, with an estimated 90% specificity, predict the structure of up to 558 Pfam families of unknown structure. Out of these, 415 have not been reported before.
- Availability: Datasets as well as models of all the 558 Pfam families are available at http://c3.pcons.net. All programs used here are freely available.
Subject terms
- NATURAL SCIENCES -- Computer and Information Sciences -- Bioinformatics (hsv//eng)
Keywords
- Biochemistry towards Bioinformatics
Publication and content type
- ref (subject category)
- art (subject category)