SwePub

Result list for search "hsv:(NATURVETENSKAP) hsv:(Data och informationsvetenskap) hsv:(Annan data och informationsvetenskap) ;lar1:(mau)"

Search: hsv:(NATURVETENSKAP) hsv:(Data och informationsvetenskap) hsv:(Annan data och informationsvetenskap) > Malmö universitet

  • Results 1-10 of 37
1.
  • Fredriksson, Teodor, 1992, et al. (authors)
  • Machine learning models for automatic labeling: A systematic literature review
  • 2020
  • In: ICSOFT 2020 - Proceedings of the 15th International Conference on Software Technologies. - : SCITEPRESS - Science and Technology Publications. ; pp. 552-566
  • Conference paper (peer-reviewed)
    • Automatic labeling is a type of classification problem. Classification has been studied with the help of statistical methods for a long time. With the explosion of new and better central processing units (CPUs) and graphics processing units (GPUs), interest in machine learning has grown exponentially, and we can use both statistical learning algorithms and deep neural networks (DNNs) to solve classification tasks. Classification is a supervised machine learning problem, and a large body of methodology exists for performing such tasks. However, it is very rare in industrial applications that data is fully labeled, which is why we need good methodology to obtain error-free labels. The purpose of this paper is to examine the current literature on how to perform labeling using ML; we compare these models in terms of popularity and the data types they are used on. We performed a systematic literature review of empirical studies of machine learning for labeling. We identified 43 primary studies relevant to our search. From these we were able to determine the most common machine learning models for labeling. Lack of labeled instances is a major problem for industry, as supervised learning is the most widely used approach. Obtaining labels is costly in terms of labor and financial costs. Based on our findings in this review, we present alternative ways of labeling data for use in supervised learning tasks.
2.
  • John, Meenu Mary, et al. (authors)
  • Towards an AI-driven business development framework: A multi-case study
  • 2023
  • In: Journal of Software: Evolution and Process. - : Wiley. - 2047-7481 .- 2047-7473. ; 35:6
  • Journal article (peer-reviewed)
    • Artificial intelligence (AI) and the use of machine learning (ML) and deep learning (DL) technologies are becoming increasingly popular in companies. These technologies enable companies to leverage big quantities of data to improve system performance and accelerate business development. However, despite the appeal of ML/DL, there is a lack of systematic and structured methods and processes to help data scientists and other company roles and functions to develop, deploy and evolve models. In this paper, based on multi-case study research in six companies, we explore practices and challenges practitioners experience in developing ML/DL models as part of large software-intensive embedded systems. Based on our empirical findings, we derive a conceptual framework in which we identify three high-level activities that companies perform in parallel with the development, deployment and evolution of models. Within this framework, we outline activities, iterations and triggers that optimize model design as well as roles and company functions. In this way, we provide practitioners with a blueprint for effectively integrating ML/DL model development into the business to achieve better results than other (algorithmic) approaches. In addition, we show how this framework helps companies solve the challenges we have identified and discuss checkpoints for terminating the business case.
3.
  • Munappy, Aiswarya Raj, 1990, et al. (authors)
  • On the Trade-off Between Robustness and Complexity in Data Pipelines
  • 2021
  • In: Quality of Information and Communications Technology. - Cham : Springer. - 9783030853464 - 9783030853471 ; 1439 CCIS, pp. 401-415
  • Conference paper (peer-reviewed)
    • Data pipelines play an important role throughout the data management process whether these are used for data analytics or machine learning. Data-driven organizations can make use of data pipelines for producing good quality data applications. Moreover, data pipelines ensure end-to-end velocity by automating the processes involved in extracting, transforming, combining, validating, and loading data for further analysis and visualization. However, the robustness of data pipelines is equally important since unhealthy data pipelines can add more noise to the input data. This paper identifies the essential elements for a robust data pipeline and analyses the trade-off between data pipeline robustness and complexity.
4.
  • Spalazzese, Romina, et al. (authors)
  • INTERO: An Interoperability Model for Large Systems
  • 2020
  • In: IEEE Software. - : IEEE. - 1937-4194 .- 0740-7459. ; 37:3, pp. 38-45
  • Journal article (peer-reviewed)
    • The INTERO (interoperability) model helps organizations manage and improve interoperability among their large, evolving software systems. They can analyze a specific interoperability problem, conceive strategies to enhance interoperability, and reevaluate the problem to determine whether interoperability has improved.
5.
  • Munappy, Aiswarya Raj, 1990, et al. (authors)
  • Data Management Challenges for Deep Learning
  • 2019
  • In: Proceedings - 45th Euromicro Conference on Software Engineering and Advanced Applications, SEAA 2019. - : IEEE. ; pp. 140-147
  • Conference paper (peer-reviewed)
    • © 2019 IEEE. Deep learning is one of the most exciting and fast-growing techniques in Artificial Intelligence. The unique capacity of deep learning models to automatically learn patterns from the data differentiates it from other machine learning techniques. Deep learning is responsible for a significant number of recent breakthroughs in AI. However, deep learning models are highly dependent on the underlying data, so consistency, accuracy, and completeness of data are essential for a deep learning model. Thus, data management principles and practices need to be adopted throughout the development process of deep learning models. The objective of this study is to identify and categorise data management challenges faced by practitioners in different stages of end-to-end development. In this paper, a case study approach is employed to explore the data management issues faced by practitioners across various domains when they use real-world data for training and deploying deep learning models. Our case study is intended to provide valuable insights to the deep learning community as well as to data scientists, to guide discussion and future research in applied deep learning with real-world data.
6.
  • Lewenhagen, Kenneth, et al. (authors)
  • An Interdisciplinary Web-based Framework for Data-driven Placement Analysis of CCTV Cameras
  • 2021
  • In: Proceedings of the 2021 Swedish Workshop on Data Science, SweDS 2021. - : Institute of Electrical and Electronics Engineers Inc. - 9781665418300
  • Conference paper (peer-reviewed)
    • This paper describes work in progress in an interdisciplinary research project that focuses on the placement and analysis of public closed-circuit television (CCTV) cameras using data-driven analysis of crime data. A novel web-based prototype that acts as a framework for camera placement analysis with regard to historical crime occurrence is presented. The web-based prototype enables various analyses involving public CCTV cameras, e.g., to determine suitable locations for both stationary CCTV cameras and temporary cameras that are moved around after a few months to address crime seasonality. The framework also enables other analyses, e.g., automatically highlighting crimes that are carried out close to at least one camera. The research also investigates to what extent it is possible to estimate the amount of detail captured by a camera given the distance to the crime and the light conditions. The research project includes interdisciplinary competences from various areas such as criminology, computer and data science, as well as the Swedish Police. © 2021 IEEE.
7.
  • Figalist, Iris, et al. (authors)
  • An End-to-End Framework for Productive Use of Machine Learning in Software Analytics and Business Intelligence Solutions
  • 2020
  • In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). - Cham : Springer International Publishing. - 1611-3349 .- 0302-9743. ; 12562 LNCS, pp. 217-233
  • Conference paper (peer-reviewed)
    • Nowadays, machine learning (ML) is an integral component in a wide range of areas, including software analytics (SA) and business intelligence (BI). As a result, the interest in custom ML-based software analytics and business intelligence solutions is rising. In practice, however, such solutions often get stuck in a prototypical stage because setting up an infrastructure for deployment and maintenance is considered complex and time-consuming. For this reason, we aim at structuring the entire process and making it more transparent by deriving an end-to-end framework from existing literature for building and deploying ML-based software analytics and business intelligence solutions. The framework is structured in three iterative cycles representing different stages in a model’s lifecycle: prototyping, deployment, update. As a result, the framework specifically supports the transitions between these stages while also covering all important activities from data collection to retraining deployed ML models. To validate the applicability of the framework in practice, we compare it to and apply it in a real-world ML-based SA/BI solution.
8.
  • Fredriksson, Teodor, 1992, et al. (authors)
  • Classification of Complex-Valued Radar Data using Semi-Supervised Learning: a Case Study
  • 2023
  • In: Proceedings - 2023 49th Euromicro Conference on Software Engineering and Advanced Applications, SEAA 2023. - : Institute of Electrical and Electronics Engineers (IEEE). ; pp. 102-107
  • Conference paper (peer-reviewed)
    • In recent years, the interest in applying machine learning (ML) and deep learning (DL) has been increasing due to their ability to learn to predict and find structure in data. The most common approach of ML and DL is supervised learning. Supervised learning requires the input data to be labeled. However, as reported by many industries, such as the embedded systems domain, fully labeled datasets are difficult to obtain since data labeling is manually intensive. This paper uses a semi-supervised learning approach on real-world Pulse-Doppler data obtained from our industry collaborator Saab to address this challenge. We took inspiration from the FixMatch algorithm. To investigate whether unlabeled data can help improve classification accuracy, we compare FixMatch to a supervised baseline. We use five different settings for the number of available labels per class label to investigate how many labeled instances and how much manual effort is required for optimal accuracy. Bayesian Linear Regression is used to analyze the results. The results show that FixMatch can reach a higher accuracy than the supervised baseline. Furthermore, FixMatch requires more computation time but will help reduce manual effort. In addition, FixMatch will not underfit or overfit. Thanks to this study, practitioners know the benefits of utilizing FixMatch and when it is safe to use to improve a supervised baseline in the industry.
9.
  • Hyrynsalmi, Sami, et al. (authors)
  • Quō vādis, Data Business?: A Study for Understanding Maturity of Embedded System Companies in Data Economy
  • 2022
  • In: Lecture Notes in Business Information Processing. - Cham : Springer International Publishing. - 1865-1356 .- 1865-1348. ; 463 LNBIP, pp. 141-148
  • Conference paper (peer-reviewed)
    • Data has been claimed to be the new oil of the 21st century, as it has been seen both to improve existing products and services and to create new revenue streams, with a secondary customer base, for the company utilizing it. However, while there are active streams of research on developing machine learning and data science methods, considerably less has been done to understand and characterize data business activities in software-intensive companies. This study uses a multiple case study approach in the software-intensive embedded system domain. Four large international embedded system companies were selected as the case study subjects. The objective is to understand how the case companies are developing their activities for successful utilization of data. The study identifies six distinct stages, each with its own challenges. In addition, this study serves as a starting point for further work on supporting software-intensive embedded system companies in starting a data business.
10.
  • Issa Mattos, David, 1990, et al. (authors)
  • Statistical Models for the Analysis of Optimization Algorithms with Benchmark Functions
  • 2021
  • In: IEEE Transactions on Evolutionary Computation. - : IEEE. - 1089-778X .- 1941-0026. ; 25:6, pp. 1163-1177
  • Journal article (peer-reviewed)
    • Frequentist statistical methods, such as hypothesis testing, are standard practices in studies that provide benchmark comparisons. Unfortunately, these methods have often been misused, e.g., without testing for their statistical test assumptions or without controlling for familywise errors in multiple group comparisons, among several other problems. Bayesian data analysis (BDA) addresses many of the previously mentioned shortcomings but its use is not widely spread in the analysis of empirical data in the evolutionary computing community. This article provides three main contributions. First, we motivate the need for utilizing BDA and provide an overview of this topic. Second, we discuss the practical aspects of BDA to ensure that our models are valid and the results are transparent. Finally, we provide five statistical models that can be used to answer multiple research questions. The online Appendix provides a step-by-step guide on how to perform the analysis of the models discussed in this article, including the code for the statistical models, the data transformations, and the discussed tables and figures. 
Publication type
conference paper (26)
journal article (9)
edited collection (editorship) (1)
licentiate thesis (1)
Content type
peer-reviewed (35)
other academic/artistic (2)
Author/editor
Bosch, Jan, 1967 (22)
Olsson, Helena Holms ... (16)
Munappy, Aiswarya Ra ... (8)
Holmström Olsson, He ... (6)
Zhang, Hongyi, 1996 (5)
Davidsson, Paul (3)
Persson, Jan A. (3)
Tegen, Agnes (3)
Issa Mattos, David, ... (3)
Fredriksson, Teodor, ... (3)
Olsson, Carl Magnus (2)
Johnsson, Magnus (2)
Arpteg, Anders (2)
Dahlén, Johan (1)
Boström, Henrik (1)
Jansson, Anders (1)
Jonsson, Håkan (1)
Boldt, Martin (1)
Borg, Anton (1)
Light, Ann (1)
Malekian, Reza, 1983 ... (1)
Svenningsson, Per (1)
Spalazzese, Romina (1)
Pelliccione, Patrizi ... (1)
Gerell, Manne, Docen ... (1)
Boistrup, Lisa Björk ... (1)
Paasch, Jesper M., 1 ... (1)
Selander, Staffan, 1 ... (1)
Eklund, Ulrik, 1967 (1)
Lundsten, Jonas (1)
Brinne, Bjorn (1)
Salvi, Dario (1)
Mûller, Michael (1)
Ymeri, Gent (1)
Wassenburg, Myrthe (1)
Wang, Wenming (1)
Erickson, Ingrid (1)
Lewkowicz, Myriam (1)
Dakkak, Anas (1)
Hyrynsalmi, Sami (1)
Ciolfi, Luigina (1)
Krischkowsky, Alina (1)
Jonasson, Kalle, 197 ... (1)
Kurti, Erdelina (1)
Figalist, Iris (1)
Elsner, Christoph (1)
Kajtazi, Miranda (1)
Thiborg, Jesper (1)
Lewenhagen, Kenneth (1)
Huang, Haiping (1)
University
Chalmers tekniska högskola (23)
Lunds universitet (2)
Linnéuniversitetet (2)
Göteborgs universitet (1)
Högskolan i Halmstad (1)
Stockholms universitet (1)
Högskolan i Gävle (1)
Örebro universitet (1)
Blekinge Tekniska Högskola (1)
Language
English (37)
Research subject (UKÄ/SCB)
Natural sciences (37)
Engineering and technology (15)
Social sciences (7)
Humanities (2)
Medical and health sciences (1)
