SwePub
Sök i LIBRIS databas

  Extended search

id:"swepub:oai:research.chalmers.se:7a5aeea0-aa1e-441c-b2d3-f3d583c49a03"
 

Search: id:"swepub:oai:research.chalmers.se:7a5aeea0-aa1e-441c-b2d3-f3d583c49a03" > Data Pipeline Manag...

  • 1 of 1
  • Previous record
  • Next record
  •    To hitlist

Data Pipeline Management in Practice: Challenges and Opportunities

Munappy, Aiswarya Raj, 1990 (author)
Chalmers tekniska högskola,Chalmers University of Technology,Department of Computer Science and Engineering, Chalmers University of Technology, Hörselgången 11, 412 96, Gothenburg, Sweden
Bosch, Jan, 1967 (author)
Chalmers tekniska högskola,Chalmers University of Technology,Department of Computer Science and Engineering, Chalmers University of Technology, Hörselgången 11, 412 96, Gothenburg, Sweden
Holmström Olsson, Helena, 1975 (author)
Malmö universitet,Institutionen för datavetenskap och medieteknik (DVMT)
 (creator_code:org_t)
2020-11-21
2020
English.
In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). - Cham : Springer International Publishing. - 1611-3349 .- 0302-9743. ; 12562, s. 168-184, s. 168-184
  • Conference paper (peer-reviewed)
Abstract Subject headings
Close  
  • Data pipelines involve a complex chain of interconnected activities that starts with a data source and ends in a data sink. Data pipelines are important for data-driven organizations since a data pipeline can process data in multiple formats from distributed data sources with minimal human intervention, accelerate data life cycle activities, and enhance productivity in data-driven enterprises. However, there are challenges and opportunities in implementing data pipelines but practical industry experiences are seldom reported. The findings of this study are derived by conducting a qualitative multiple-case study and interviews with the representatives of three companies. The challenges include data quality issues, infrastructure maintenance problems, and organizational barriers. On the other hand, data pipelines are implemented to enable traceability, fault-tolerance, and reduce human errors through maximizing automation thereby producing high-quality data. Based on multiple-case study research with five use cases from three case companies, this paper identifies the key challenges and benefits associated with the implementation and use of data pipelines.

Subject headings

NATURVETENSKAP  -- Data- och informationsvetenskap -- Annan data- och informationsvetenskap (hsv//swe)
NATURAL SCIENCES  -- Computer and Information Sciences -- Other Computer and Information Science (hsv//eng)
TEKNIK OCH TEKNOLOGIER  -- Annan teknik -- Mediateknik (hsv//swe)
ENGINEERING AND TECHNOLOGY  -- Other Engineering and Technologies -- Media Engineering (hsv//eng)
NATURVETENSKAP  -- Biologi -- Bioinformatik och systembiologi (hsv//swe)
NATURAL SCIENCES  -- Biological Sciences -- Bioinformatics and Systems Biology (hsv//eng)

Keyword

Organizational
Opportunities
Data quality
Data pipelines
Challenges
Issues
Infrastructure

Publication and Content Type

kon (subject category)
ref (subject category)

Find in a library

To the university's database

  • 1 of 1
  • Previous record
  • Next record
  •    To hitlist

Search outside SwePub

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Close

Copy and save the link in order to return to this view