Search: onr:"swepub:oai:DiVA.org:kth-181254" >
SAASFEE :
SAASFEE : Scalable scientific workflow execution engine
-
- Bux, M. (author)
- Humboldt University of Berlin, Germany
-
- Brandt, J. (author)
- Humboldt University of Berlin, Germany
-
- Lipka, C. (author)
- Humboldt University of Berlin, Germany
-
show more...
-
- Hakimzadeh, Kamal (author)
- KTH,Programvaruteknik och Datorsystem, SCS,KTH Royal Institute of Technology, Sweden
-
- Dowling, Jim (author)
- RISE,KTH,Programvaruteknik och Datorsystem, SCS,SICS,KTH Royal Institute of Technology, Sweden
-
- Leser, U. (author)
- Humboldt University of Berlin, Germany
-
show less...
-
(creator_code:org_t)
- 2015-08
- 2015
- English.
-
In: Proceedings of the VLDB Endowment. - : Association for Computing Machinery (ACM). - 2150-8097. ; 8:12, s. 1892-1895, s. 1892-1903
- Related links:
-
http://www.vldb.org/...
-
show more...
-
https://urn.kb.se/re...
-
https://doi.org/10.1...
-
https://urn.kb.se/re...
-
show less...
Abstract
Subject headings
Close
- Across many fields of science, primary data sets like sensor read-outs, time series, and genomic sequences are analyzed by complex chains of specialized tools and scripts exchanging intermediate results in domain-specific file formats. Scientific work ow management systems (SWfMSs) support the development and execution of these tool chains by providing work ow specification languages, graphical editors, fault-tolerant execution engines, etc. However, many SWfMSs are not prepared to handle large data sets because of inadequate support for distributed computing. On the other hand, most SWfMSs that do support distributed computing only allow static task execution orders. We present SAASFEE, a SWfMS which runs arbitrarily complex work ows on Hadoop YARN. Work ows are specified in Cuneiform, a functional work ow language focusing on parallelization and easy integration of existing software. Cuneiform work ows are executed on Hi-WAY, a higher-level scheduler for running work ows on YARN. Distinct features of SAASFEE are the ability to execute iterative work ows, an adaptive task scheduler, re-executable provenance traces, and compatibility to selected other work ow systems. In the demonstration, we present all components of SAASFEE using real-life work ows from the field of genomics.
Subject headings
- NATURVETENSKAP -- Data- och informationsvetenskap -- Datavetenskap (hsv//swe)
- NATURAL SCIENCES -- Computer and Information Sciences -- Computer Sciences (hsv//eng)
- NATURVETENSKAP -- Data- och informationsvetenskap (hsv//swe)
- NATURAL SCIENCES -- Computer and Information Sciences (hsv//eng)
Keyword
- Chains
- Computational linguistics
- Engines
- Information management
- Specification languages
- Wool
- Yarn
- Execution engine
- Genomic sequence
- Graphical editors
- Intermediate results
- Management systems
- Parallelizations
- Scientific workflows
- Specialized tools
- Distributed computer systems
Publication and Content Type
- ref (subject category)
- art (subject category)
Find in a library
To the university's database