On the Trade-off Between Robustness and Complexity in Data Pipelines
- Munappy, Aiswarya Raj, 1990 (author)
- Chalmers University of Technology
- Bosch, Jan, 1967 (author)
- Chalmers University of Technology
- Olsson, Helena Holmström (author)
- Malmö University, Department of Computer Science and Media Technology (DVMT); Chalmers University of Technology
- 2021-08-25
- 2021
- English.
In: Quality of Information and Communications Technology. - Cham : Springer. - 9783030853464 - 9783030853471 ; 1439 CCIS, pp. 401-415
- Related link:
https://urn.kb.se/re...
https://doi.org/10.1...
https://research.cha...
Abstract
- Data pipelines play an important role throughout the data management process whether these are used for data analytics or machine learning. Data-driven organizations can make use of data pipelines for producing good quality data applications. Moreover, data pipelines ensure end-to-end velocity by automating the processes involved in extracting, transforming, combining, validating, and loading data for further analysis and visualization. However, the robustness of data pipelines is equally important since unhealthy data pipelines can add more noise to the input data. This paper identifies the essential elements for a robust data pipeline and analyses the trade-off between data pipeline robustness and complexity.
Subject headings
- NATURAL SCIENCES -- Computer and Information Sciences -- Computer Sciences (hsv//eng)
- NATURAL SCIENCES -- Computer and Information Sciences -- Other Computer and Information Science (hsv//eng)
- NATURAL SCIENCES -- Computer and Information Sciences -- Bioinformatics (hsv//eng)
- ENGINEERING AND TECHNOLOGY -- Other Engineering and Technologies -- Media Engineering (hsv//eng)
Keywords
- Complexity
- Composite nodes
- Data pipelines
- Data quality
- Robustness
- Trade-off
- Data Analytics
- Data visualization
- Economic and social effects
- Information management
- Metadata
- Data driven
- Essential elements
- Input datas
- Loading data
- Management process
- Quality data
- Robust datum
- Pipelines
Publication and content type
- ref (subject category)
- kon (subject category)