SwePub
Sök i LIBRIS databas

  Extended search

onr:"swepub:oai:DiVA.org:kth-146956"
 

Search: onr:"swepub:oai:DiVA.org:kth-146956" > Using iterative Map...

  • 1 of 1
  • Previous record
  • Next record
  •    To hitlist

Using iterative MapReduce for parallel virtual screening

Ahmed, Laeeq (author)
KTH,High Performance Computing and Visualization (HPCViz)
Edlund, Åke (author)
KTH,High Performance Computing and Visualization (HPCViz)
Laure, Erwin (author)
KTH,High Performance Computing and Visualization (HPCViz)
show more...
Spjuth, O. (author)
show less...
 (creator_code:org_t)
IEEE Computer Society, 2013
2013
English.
In: 2013 IEEE 5th International Conference on Cloud Computing Technology and Science (CloudCom). - : IEEE Computer Society. - 9780769550954 ; , s. 27-32
  • Conference paper (peer-reviewed)
Abstract Subject headings
Close  
  • Virtual Screening is a technique in chemo informatics used for Drug discovery by searching large libraries of molecule structures. Virtual Screening often uses SVM, a supervised machine learning technique used for regression and classification analysis. Virtual screening using SVM not only involves huge datasets, but it is also compute expensive with a complexity that can grow at least up to O(n2). SVM based applications most commonly use MPI, which becomes complex and impractical with large datasets. As an alternative to MPI, MapReduce, and its different implementations, have been successfully used on commodity clusters for analysis of data for problems with very large datasets. Due to the large libraries of molecule structures in virtual screening, it becomes a good candidate for MapReduce. In this paper we present a MapReduce implementation of SVM based virtual screening, using Spark, an iterative MapReduce programming model. We show that our implementation has a good scaling behaviour and opens up the possibility of using huge public cloud infrastructures efficiently for virtual screening.

Subject headings

NATURVETENSKAP  -- Data- och informationsvetenskap -- Datavetenskap (hsv//swe)
NATURAL SCIENCES  -- Computer and Information Sciences -- Computer Sciences (hsv//eng)

Keyword

Big Data
Chemoinformatics
MapReduce
Parallel SVM
Spark

Publication and Content Type

ref (subject category)
kon (subject category)

Find in a library

To the university's database

  • 1 of 1
  • Previous record
  • Next record
  •    To hitlist

Find more in SwePub

By the author/editor
Ahmed, Laeeq
Edlund, Åke
Laure, Erwin
Spjuth, O.
About the subject
NATURAL SCIENCES
NATURAL SCIENCES
and Computer and Inf ...
and Computer Science ...
Articles in the publication
2013 IEEE 5th In ...
By the university
Royal Institute of Technology

Search outside SwePub

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Close

Copy and save the link in order to return to this view