SwePub
Sök i LIBRIS databas

  Utökad sökning

hsv:(MEDICIN OCH HÄLSOVETENSKAP) hsv:(Medicinska och farmaceutiska grundvetenskaper) hsv:(Farmaceutiska vetenskaper)
 

Sökning: hsv:(MEDICIN OCH HÄLSOVETENSKAP) hsv:(Medicinska och farmaceutiska grundvetenskaper) hsv:(Farmaceutiska vetenskaper) > (2000-2009) > (2005) > Moulton Vincent > Unbiased descriptor...

Unbiased descriptor and parameter selection confirms the potential of proteochemometric modelling

Freyhult, Eva (författare)
Uppsala universitet,Signalbehandling
Prusis, Peteris (författare)
Uppsala universitet,Signalbehandling
Lapinsh, Maris (författare)
Uppsala universitet,Signalbehandling
visa fler...
Wikberg, Jarl E S (författare)
Uppsala universitet,Signalbehandling
Moulton, Vincent (författare)
Uppsala universitet,Signalbehandling
Gustafsson, Mats G (författare)
Uppsala universitet,Institutionen för teknikvetenskaper
visa färre...
 (creator_code:org_t)
Springer Science and Business Media LLC, 2005
2005
Engelska.
Ingår i: BMC Bioinformatics. - : Springer Science and Business Media LLC. - 1471-2105. ; 6, s. 50-
  • Tidskriftsartikel (refereegranskat)
Abstract Ämnesord
Stäng  
  • Background Proteochemometrics is a new methodology that allows prediction of protein function directly from real interaction measurement data without the need of 3D structure information. Several reported proteochemometric models of ligand-receptor interactions have already yielded significant insights into various forms of bio-molecular interactions. The proteochemometric models are multivariate regression models that predict binding affinity for a particular combination of features of the ligand and protein. Although proteochemometric models have already offered interesting results in various studies, no detailed statistical evaluation of their average predictive power has been performed. In particular, variable subset selection performed to date has always relied on using all available examples, a situation also encountered in microarray gene expression data analysis. Results A methodology for an unbiased evaluation of the predictive power of proteochemometric models was implemented and results from applying it to two of the largest proteochemometric data sets yet reported are presented. A double cross-validation loop procedure is used to estimate the expected performance of a given design method. The unbiased performance estimates (P2) obtained for the data sets that we consider confirm that properly designed single proteochemometric models have useful predictive power, but that a standard design based on cross validation may yield models with quite limited performance. The results also show that different commercial software packages employed for the design of proteochemometric models may yield very different and therefore misleading performance estimates. In addition, the differences in the models obtained in the double CV loop indicate that detailed chemical interpretation of a single proteochemometric model is uncertain when data sets are small. Conclusion The double CV loop employed offer unbiased performance estimates about a given proteochemometric modelling procedure, making it possible to identify cases where the proteochemometric design does not result in useful predictive models. Chemical interpretations of single proteochemometric models are uncertain and should instead be based on all the models selected in the double CV loop employed here.

Ämnesord

MEDICIN OCH HÄLSOVETENSKAP  -- Medicinska och farmaceutiska grundvetenskaper -- Farmaceutiska vetenskaper (hsv//swe)
MEDICAL AND HEALTH SCIENCES  -- Basic Medicine -- Pharmaceutical Sciences (hsv//eng)

Nyckelord

Algorithms
Animals
Computational Biology/*methods
Computer Simulation
Data Interpretation; Statistical
Humans
Ligands
Models; Biological
Models; Chemical
Models; Molecular
Models; Statistical
Models; Theoretical
Oligonucleotide Array Sequence Analysis/*methods
Predictive Value of Tests
Programming Languages
Protein Binding
Protein Conformation
Rats
Receptors; Adrenergic; alpha-1/chemistry
Receptors; G-Protein-Coupled/chemistry
Regression Analysis
Reproducibility of Results
Research Support; Non-U.S. Gov't
Selection (Genetics)
Software
PHARMACY
FARMACI

Publikations- och innehållstyp

ref (ämneskategori)
art (ämneskategori)

Hitta via bibliotek

Till lärosätets databas

Sök utanför SwePub

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy