SwePub
Sök i SwePub databas

  Utökad sökning

Träfflista för sökning "WFRF:(Elofsson Arne) ;pers:(Uziela Karolis)"

Sökning: WFRF:(Elofsson Arne) > Uziela Karolis

  • Resultat 1-7 av 7
Sortera/gruppera träfflistan
   
NumreringReferensOmslagsbildHitta
1.
  • Cheng, Jianlin, et al. (författare)
  • Estimation of model accuracy in CASP13
  • 2019
  • Ingår i: Proteins. - : Wiley. - 0887-3585 .- 1097-0134. ; 87:12, s. 1361-1377
  • Tidskriftsartikel (refereegranskat)abstract
    • Methods to reliably estimate the accuracy of 3D models of proteins are both a fundamental part of most protein folding pipelines and important for reliable identification of the best models when multiple pipelines are used. Here, we describe the progress made from CASP12 to CASP13 in the field of estimation of model accuracy (EMA) as seen from the progress of the most successful methods in CASP13. We show small but clear progress, that is, several methods perform better than the best methods from CASP12 when tested on CASP13 EMA targets. Some progress is driven by applying deep learning and residue‐residue contacts to model accuracy prediction. We show that the best EMA methods select better models than the best servers in CASP13, but that there exists a great potential to improve this further. Also, according to the evaluation criteria based on local similarities, such as lDDT and CAD, it is now clear that single model accuracy methods perform relatively better than consensus‐based methods.
  •  
2.
  • Elofsson, Arne, et al. (författare)
  • Methods for estimation of model accuracy in CASP12
  • 2018
  • Ingår i: Proteins. - : Wiley. - 0887-3585 .- 1097-0134. ; 86:S1, s. 361-373
  • Tidskriftsartikel (refereegranskat)abstract
    • Methods to reliably estimate the quality of 3D models of proteins are essential drivers for the wide adoption and serious acceptance of protein structure predictions by life scientists. In this article, the most successful groups in CASP12 describe their latest methods for estimates of model accuracy (EMA). We show that pure single model accuracy estimation methods have shown clear progress since CASP11; the 3 top methods (MESHI, ProQ3, SVMQA) all perform better than the top method of CASP11 (ProQ2). Although the pure single model accuracy estimation methods outperform quasi-single (ModFOLD6 variations) and consensus methods (Pcons, ModFOLDclust2, Pcomb-domain, and Wallner) in model selection, they are still not as good as those methods in absolute model quality estimation and predictions of local quality. Finally, we show that when using contact-based model quality measures (CAD, lDDT) the single model quality methods perform relatively better.
  •  
3.
  • Menéndez Hurtado, David, 1990-, et al. (författare)
  • A novel training procedure to train deep networks in the assessment of the quality of protein models
  • Annan publikation (övrigt vetenskapligt/konstnärligt)abstract
    • Motivation: Proteins fold into complex structures that are crucial for their biological functions. Experimental determination of protein structures iscostly and therefore limited to a small fraction of all known proteins. Hence,different computational structure prediction methods are necessary for themodelling of the vast majority of all proteins. In most structure predictionpipelines, the last step is to select the best available model and to estimateits accuracy. This model quality estimation problem has been growing inimportance during the last decade, and progress is believed to be importantfor large scale modelling of proteins. Current machine learning models trained to estimate the protein modelquality suffer from biases in the training set: multiple models of only a fewtargets, generated by a few methods.Results: We propose a new methodology to train deep networks that leveragesthe structure of the problem and takes advantage of some of this redundan-cies. We demonstrate its viability by reaching results comparable with anotherstate-of-the-art method, ProQ3D, trained and evaluated on the same datasets,but employing only a small subset of the input features.The proposed training strategy is applicable to other input features anddatasets, and thus can be applied to other programs.Availability: The code is freely available for download at: github.com/ElofssonLab/ProQ4 and runs with minimal requirements: requires only one multiplesequence alignment and a collection of models and depends only on Python3, hdf5, a deep learning framework compatible with Keras, and dssp.Contact: arne@bioinfo.se
  •  
4.
  • Michel, Mirco, et al. (författare)
  • Large-scale structure prediction by improved contact predictions and model quality assessment
  • 2017
  • Ingår i: Bioinformatics. - : Oxford University Press (OUP). - 1367-4803 .- 1367-4811 .- 1460-2059. ; 33:14, s. 123-129
  • Tidskriftsartikel (refereegranskat)abstract
    • Motivation: Accurate contact predictions can be used for predicting the structure of proteins. Until recently these methods were limited to very big protein families, decreasing their utility. However, recent progress by combining direct coupling analysis with machine learning methods has made it possible to predict accurate contact maps for smaller families. To what extent these predictions can be used to produce accurate models of the families is not known. Results: We present the PconsFold2 pipeline that uses contact predictions from PconsC3, the CONFOLD folding algorithm and model quality estimations to predict the structure of a protein. We show that the model quality estimation significantly increases the number of models that reliably can be identified. Finally, we apply PconsFold2 to 6379 Pfam families of unknown structure and find that PconsFold2 can, with an estimated 90% specificity, predict the structure of up to 558 Pfam families of unknown structure. Out of these 415 have not been reported before. Availability: Datasets as well as models of all the 558 Pfam families are available at http://c3.pcons.net. All programs used here are freely available.
  •  
5.
  • Uziela, Karolis, et al. (författare)
  • ProQ3 : Improved model quality assessments using Rosetta energy terms
  • 2016
  • Ingår i: Scientific Reports. - : Springer Science and Business Media LLC. - 2045-2322. ; 6
  • Tidskriftsartikel (refereegranskat)abstract
    • Quality assessment of protein models using no other information than the structure of the model itself has been shown to be useful for structure prediction. Here, we introduce two novel methods, ProQRosFA and ProQRosCen, inspired by the state-of-art method ProQ2, but using a completely different description of a protein model. ProQ2 uses contacts and other features calculated from a model, while the new predictors are based on Rosetta energies: ProQRosFA uses the full-atom energy function that takes into account all atoms, while ProQRosCen uses the coarse-grained centroid energy function. The two new predictors also include residue conservation and terms corresponding to the agreement of a model with predicted secondary structure and surface area, as in ProQ2. We show that the performance of these predictors is on par with ProQ2 and significantly better than all other model quality assessment programs. Furthermore, we show that combining the input features from all three predictors, the resulting predictor ProQ3 performs better than any of the individual methods. ProQ3, ProQRosFA and ProQRosCen are freely available both as a webserver and stand-alone programs at http://proq3.bioinfo.se/.
  •  
6.
  • Uziela, Karolis, 1987-, et al. (författare)
  • ProQ3D : improved model quality assessments using deep learning
  • 2017
  • Ingår i: Bioinformatics. - : Oxford University Press (OUP). - 1367-4803 .- 1367-4811. ; 33:10, s. 1578-1580
  • Tidskriftsartikel (refereegranskat)abstract
    • Protein quality assessment is a long-standing problem in bioinformatics. For more than a decade we have developed state-of-art predictors by carefully selecting and optimising inputs to a machine learning method. The correlation has increased from 0.60 in ProQ to 0.81 in ProQ2 and 0.85 in ProQ3 mainly by adding a large set of carefully tuned descriptions of a protein. Here, we show that a substantial improvement can be obtained using exactly the same inputs as in ProQ2 or ProQ3 but replacing the support vector machine by a deep neural network. This improves the Pearson correlation to 0.90 (0.85 using ProQ2 input features).
  •  
7.
  • Uziela, Karolis, 1987- (författare)
  • Protein Model Quality Assessment : A Machine Learning Approach
  • 2017
  • Doktorsavhandling (övrigt vetenskapligt/konstnärligt)abstract
    • Many protein structure prediction programs exist and they can efficiently generate a number of protein models of a varying quality. One of the problems is that it is difficult to know which model is the best one for a given target sequence. Selecting the best model is one of the major tasks of Model Quality Assessment Programs (MQAPs). These programs are able to predict model accuracy before the native structure is determined. The accuracy estimation can be divided into two parts: global (the whole model accuracy) and local (the accuracy of each residue). ProQ2 is one of the most successful MQAPs for prediction of both local and global model accuracy and is based on a Machine Learning approach.In this thesis, I present my own contribution to Model Quality Assessment (MQA) and the newest developments of ProQ program series. Firstly, I describe a new ProQ2 implementation in the protein modelling software package Rosetta. This new implementation allows use of ProQ2 as a scoring function for conformational sampling inside Rosetta, which was not possible before. Moreover, I present two new methods, ProQ3 and ProQ3D that both outperform their predecessor. ProQ3 introduces new training features that are calculated from Rosetta energy functions and ProQ3D introduces a new machine learning approach based on deep learning. ProQ3 program participated in the 12th Community Wide Experiment on the Critical Assessment of Techniques for Protein Structure Prediction (CASP12) and was one of the best methods in the MQA category. Finally, an important issue in model quality assessment is how to select a target function that the predictor is trying to learn. In the fourth manuscript, I show that MQA results can be improved by selecting a contact-based target function instead of more conventional superposition based functions.
  •  
Skapa referenser, mejla, bekava och länka
  • Resultat 1-7 av 7

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy