Sökning: id:"swepub:oai:DiVA.org:umu-3651" >
Quantitative protei...
Quantitative protein descriptors for secondary structure characterization and protein classification
-
- Lindström, Anton (författare)
- Umeå universitet,Kemiska institutionen
-
- Pettersson, Fredrik (författare)
- The Wellcome Trust Center for Human Genetics, Oxford University, Oxford, UK
-
- Linusson, Anna (författare)
- Umeå universitet,Kemiska institutionen
-
(creator_code:org_t)
- Elsevier BV, 2009
- 2009
- Engelska.
-
Ingår i: Chemometrics and Intelligent Laboratory Systems. - : Elsevier BV. - 0169-7439 .- 1873-3239. ; 95:1, s. 74-85
- Relaterad länk:
-
https://urn.kb.se/re...
-
visa fler...
-
https://doi.org/10.1...
-
visa färre...
Abstract
Ämnesord
Stäng
- In this study protein chains were characterized based on alignment-independent protein descriptors using three types of structural and sequence data; (i) C-α atom Euclidean distances, (ii) protein backbone ψ and φ angles and (iii) amino acid physicochemical properties (zz-scales). The descriptors were analyzed using principal component analysis (PCA) and further elucidated using the multivariate methods partial least-squares projections to latent structures discriminant-analysis (PLS-DA) and hierarchical-PLS-DA. The descriptors were applied to three protein chain datasets: (i) 82 chains classified, according to the structural classification of proteins (SCOP) scheme, as either all-α or all-β; (ii) 96 chains classified as either α + β or α/β and (iii) 6590 chains of all aforementioned classes selected from the PDB-select database. Results showed that the descriptors related to the secondary structure of the chains. The C-α Euclidean distances, and as expected, the protein backbone angles were found to be most important for the characterization and classification of chains. Assignment of SCOP classes using PLS-DA based on all descriptor types was satisfactory for all-α and all-β chains with more than 93% correct classifications of a large external test set, while the protein chains of types α/β and α + β was harder to discriminate between, resulting in 74% and 54% correct classifications, respectively.
Ämnesord
- NATURVETENSKAP -- Kemi (hsv//swe)
- NATURAL SCIENCES -- Chemical Sciences (hsv//eng)
Nyckelord
- Multivariate analysis
- Protein descriptor
- SCOP
- Auto covariance
- Auto cross-covariance
Publikations- och innehållstyp
- ref (ämneskategori)
- art (ämneskategori)
Hitta via bibliotek
Till lärosätets databas