SwePub
Sök i LIBRIS databas

  Utökad sökning

L773:1758 2946
 

Sökning: L773:1758 2946 > Towards interoperab...

Towards interoperable and reproducible QSAR analyses : Exchange of data sets

Spjuth, Ola, 1977- (författare)
Uppsala universitet,Institutionen för farmaceutisk biovetenskap
Willighagen, Egon (författare)
Uppsala universitet,Institutionen för farmaceutisk biovetenskap
Guha, Rajarshi (författare)
NIH Chemical Genomics Center
visa fler...
Eklund, Martin, 1978- (författare)
Uppsala universitet,Institutionen för farmaceutisk biovetenskap
Wikberg, Jarl (författare)
Uppsala universitet,Institutionen för farmaceutisk biovetenskap
visa färre...
 (creator_code:org_t)
2010-06-30
2010
Engelska.
Ingår i: Journal of Cheminformatics. - : BioMed Central. - 1758-2946. ; 2
  • Tidskriftsartikel (refereegranskat)
Abstract Ämnesord
Stäng  
  • BACKGROUND: QSAR/QSPR is a widely used method to relate chemical structures and responses based on ex- perimental observations. In QSAR, chemical structures are expressed as descriptors, which are mathematical representations like calculated properties or enumerated fragments. Many existing QSAR data sets are based on a combination of different software tools mixed with in-house developed solutions, with datasets manually assembled in spreadsheets. Currently there exists no agreed-upon definition of descriptors and no standard for exchanging data sets in QSAR, which together with numerous different descriptor implementations makes it a virtually impossible task to reproduce and validate analyses, and significantly hinders collaborations and re-use of data.RESULTS: We present a step towards standardizing QSAR analyses by defining interoperable and reproducible QSAR/QSPR data sets, comprising an open XML format (QSAR-ML) and an open extensible descriptor ontology (Blue Obelisk Descriptor Ontology). The ontology provides an extensible way of uniquely defining descriptors for use in QSAR experiments, and the exchange format supports multiple versioned implementations of these descriptors. Hence, a data set described by QSAR-ML makes its setup completely reproducible. We also provide an implementation as a set of plugins for Bioclipse that simplifies QSAR data set formation, and allows for exporting in QSAR-ML as well as traditional CSV formats. The implementation facilitates addition of new descriptor implementations, from locally installed software and remote Web services; the latter is demonstrated with REST and XMPP Web services.CONCLUSIONS: Standardized QSAR data sets opens up new ways to store, query, and exchange data for subsequent analyses. QSAR-ML supports completely reproducible dataset formation, solving the problems of defining which software components were used, their versions, and the case of multiple names for the same descriptor. This makes is easy to join, extend, combine data sets and also to work collectively. The presented Bioclipse plugins equip scientists with intuitive tools that make QSAR-ML widely available for the community.

Ämnesord

NATURVETENSKAP  -- Biologi -- Bioinformatik och systembiologi (hsv//swe)
NATURAL SCIENCES  -- Biological Sciences -- Bioinformatics and Systems Biology (hsv//eng)

Nyckelord

QSAR
Bioclipse
standard
ontology
life sciences
bioinformatics
cheminformatics
reproducible
Bioinformatics
Bioinformatik
Bioinformatik
Bioinformatics

Publikations- och innehållstyp

ref (ämneskategori)
art (ämneskategori)

Hitta via bibliotek

Till lärosätets databas

Sök utanför SwePub

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy