SwePub

Result list for search "WFRF:(Fallgren Per)"

Search: WFRF:(Fallgren Per)

  • Results 1-10 of 15
1.
  • Artman, Henrik, 1968-, et al. (author)
  • Effektiv miljötillsyn : slutrapport [Effective environmental supervision: final report]
  • 2013
  • Report (other academic/artistic)
    • The aim has been to develop new knowledge in environmental supervision, thereby achieving more effective supervision, and to bring new scientific perspectives to the field. The report studies inspection methods and the communicative interplay between the inspector and representatives of the inspected operation, examines how the institutional framework for the inspection process functions, and points to possibilities for measuring the effects of inspections and supervision. Naturvårdsverket (the Swedish Environmental Protection Agency) will use the results as a knowledge base in its continued work on supervisory guidance and on developing how supervision and supervisory guidance can be followed up and evaluated.
2.
3.
  • Domeij, Rickard, 1958-, et al. (author)
  • Exploring the archives for textual entry points to speech - Experiences of interdisciplinary collaboration in making cultural heritage accessible for research
  • 2020
  • In: CEUR Workshop Proceedings. - Riga : CEUR-WS, pp. 45-55
  • Conference paper (peer-reviewed)
    • Tilltal (Tillgängligt kulturarv för forskning i tal, 'Accessible cultural heritage for speech research') is a multidisciplinary and methodological project undertaken by the Institute of Language and Folklore, KTH Royal Institute of Technology, and The Swedish National Archives in cooperation with the National Language Bank and SWE-CLARIN [1]. It aims to provide researchers better access to archival audio recordings using methods from language technology. The project comprises three case studies and one activity and usage study. In the case studies, actual research agendas from three different fields (ethnology, sociolinguistics and interaction analysis) serve as a basis for identifying procedures that may be simplified with the aid of digital tools. In the activity and usage study, we are applying an activity-theoretical approach with the aim of involving researchers and investigating how they use - and would like to be able to use - the archival resources at ISOF. Involving researchers in participatory design ensures that digital solutions are suggested and evaluated in relation to the requirements expressed by researchers engaged in specific research tasks [2]. In this paper we focus on one of the case studies, which investigates the process by which personal experience narratives are transformed into cultural heritage [3], and account for our results in exploring how different types of text material from the archives can be used to find relevant sections of the audio recordings. Finally, we discuss what lessons can be learned, and what conclusions can be drawn, from our experiences of interdisciplinary collaboration in the project.
4.
  • Fallgren, Per, et al. (author)
  • Bringing order to chaos : A non-sequential approach for browsing large sets of found audio data
  • 2019
  • In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018). - European Language Resources Association (ELRA), pp. 4307-4311
  • Conference paper (peer-reviewed)
    • We present a novel and general approach for fast and efficient non-sequential browsing of sound in large archives that we know little or nothing about, e.g. so called found data - data not recorded with the specific purpose to be analysed or used as training data. Our main motivation is to address some of the problems speech and speech technology researchers see when they try to capitalise on the huge quantities of speech data that reside in public archives. Our method is a combination of audio browsing through massively multi-object sound environments and a well-known unsupervised dimensionality reduction algorithm (SOM). We test the process chain on four data sets of different nature (speech, speech and music, farm animals, and farm animals mixed with farm sounds). The methods are shown to combine well, resulting in rapid and readily interpretable observations. Finally, our initial results are demonstrated in prototype software which is freely available.
5.
  • Fallgren, Per, et al. (author)
  • Crowdsource-based validation of the audio cocktail as a sound browsing tool
  • 2023
  • In: Interspeech 2023. - International Speech Communication Association, pp. 2178-2182
  • Conference paper (peer-reviewed)
    • We conduct two crowdsourcing experiments designed to examine the usefulness of audio cocktails to quickly find out information on the contents of large audio data. Several thousand crowd workers were engaged to listen to audio cocktails with systematically varied composition. They were then asked to state either which sound out of four categories (Children, Women, Men, Orchestra) they heard the most of, or if they heard anything of a specific category at all. The results show that their responses have high reliability and provide information as to whether a specific task can be performed using audio cocktails. We also propose that the combination of crowd workers and audio cocktails can be used directly as a tool to investigate the contents of large audio data.
6.
  • Fallgren, Per, et al. (author)
  • Edyson: rapid human-in-the-loop browsing, exploration and annotation of large speech and audio data
  • Other publication (other academic/artistic)
    • The audio exploration tool Edyson integrates a variety of techniques to achieve the efficient exploration of otherwise prohibitively large collections of speech and other sounds. A main strength is that this combination of techniques allows us to place a human-in-the-loop in a coherent and operationalised manner. The two most prominent techniques that we incorporate are temporally disassembled audio (TDA) and massively multi-component audio environments (MMAE). The first allows us to decouple input audio from the temporal dimension by segmenting it into sound snippets of short duration, akin to the frames used in signal processing. These snippets are organised and visualised in an interactive interface where the investigator can navigate through the snippets freely while providing labels and judgements that are not tied to the temporal context of the original audio. This, in turn, removes the real-time or near real-time requirement associated with temporally linear audio browsing. We further argue that a human-in-the-loop inclusion, as opposed to fully automated black-box approaches, is valuable and perhaps necessary to understand and fully exploit larger quantities of found speech. We describe in this paper the details of the tool and its underlying methodologies, and provide a summary of results and findings that have come out of our efforts to validate and quantify the characteristics of this new type of audio browsing to date.
7.
  • Fallgren, Per (author)
  • Found speech and humans in the loop : Ways to gain insight into large quantities of speech
  • 2022
  • Doctoral thesis (other academic/artistic)
    • Found data - data used for something other than the purpose for which it was originally collected - holds great value in many regards. It typically reflects high ecological validity, a strong cultural worth, and there are significant quantities at hand. However, it is noisy, hard to search through, and its contents are often largely unknown. This thesis explores ways to gain insight into such data collections, specifically with regard to speech and audio data. In recent years, deep learning approaches have shown unrivaled performance in many speech and language technology tasks. However, in addition to large datasets, many of these methods require vast quantities of high-quality labels, which are costly to produce. Moreover, while there are exceptions, machine learning models are typically trained for solving well-defined, narrow problems and perform inadequately in tasks of more general nature - such as providing a high-level description of the contents in a large audio file. This observation reveals a methodological gap that this thesis aims to fill. An ideal system for tackling these matters would combine humans' flexibility and general intelligence with machines' processing power and pattern-finding capabilities. With this idea in mind, the thesis explores the value of including the human-in-the-loop, specifically in the context of gaining insight into collections of found speech. The aim is to combine techniques from speech technology, machine learning paradigms, and human-in-the-loop approaches, with the overall goal of developing and evaluating novel methods for efficiently exploring large quantities of found speech data. One of the main contributions is Edyson, a tool for fast browsing, exploring, and annotating audio. It uses temporally disassembled audio, a technique that decouples the audio from the temporal dimension, in combination with feature extraction methods, dimensionality reduction algorithms, and a flexible listening function, which allows a user to get an informative overview of the contents. Furthermore, crowdsourcing is explored in the context of large-scale perception studies and speech & language data collection. Prior reports on the usefulness of crowd workers for such tasks show promise and are here corroborated. The thesis contributions suggest that the explored approaches are promising options for utilizing large quantities of found audio data and deserve further consideration in research and applied settings.
8.
  • Fallgren, Per, et al. (author)
  • How to annotate 100 hours in 45 minutes
  • 2019
  • In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. - ISCA, pp. 341-345
  • Conference paper (peer-reviewed)
    • Speech data found in the wild hold many advantages over artificially constructed speech corpora in terms of ecological validity and cultural worth. Perhaps most importantly, there is a lot of it. However, the combination of great quantity, noisiness and variation poses a challenge for its access and processing. Generally speaking, automatic approaches to tackle the problem require good labels for training, while manual approaches require time. In this study, we provide further evidence for a semi-supervised, human-in-the-loop framework that previously has shown promising results for browsing and annotating large quantities of found audio data quickly. The findings of this study show that a 100-hour long subset of the Fearless Steps corpus can be annotated for speech activity in less than 45 minutes, a fraction of the time it would take traditional annotation methods, without a loss in performance.
9.
  • Fallgren, Per, et al. (author)
  • Human-in-the-Loop Efficiency Analysis for Binary Classification in Edyson
  • 2021
  • In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. - ISCA : International Speech Communication Association, pp. 3685-3689
  • Conference paper (peer-reviewed)
    • Edyson is a human-in-the-loop (HITL) tool for browsing and annotating large amounts of audio data quickly. It builds on temporally disassembled audio and massively multi-component audio environments to overcome the cumbersome time constraints that come with linear exploration of large audio data. This study adds the following contributions to Edyson: 1) We add the new use case of HITL binary classification by sample; 2) We explore the new domain of oceanic hydrophone recordings with whale song, along with speech activity detection in noisy audio; 3) We propose a repeatable method of analysing the efficiency of HITL in Edyson for binary classification, specifically designed to measure the return on human time spent in a given domain. We exemplify this method on two domains, and show that for a manageable initial cost in terms of HITL, it does differentiate between suitable and unsuitable domains for our new use case - a valuable insight when working with large collections of audio.
10.
  • Fallgren, Per, et al. (author)
  • The audio cocktail as a sound browsing tool - a crowdsourcing based validation
  • Other publication (other academic/artistic)
    • We conduct two crowdsourcing experiments designed to examine the usefulness of audio cocktails to quickly find out information on the contents of large audio data. Several thousand crowd workers were engaged to listen to audio cocktails with systematically varied composition. They were then asked to state either which sound out of four categories (Children, Women, Men, Orchestra) they heard the most of, or if they heard anything of a specific category at all. The results show that their responses have high reliability and provide information as to whether a specific task can be performed using audio cocktails. We also propose that the combination of crowd workers and audio cocktails can be used directly as a tool to investigate the contents of large audio data.
Type of publication
conference paper (10)
other publication (2)
report (1)
journal article (1)
doctoral thesis (1)
Type of content
peer-reviewed (10)
other academic/artistic (5)
Author/Editor
Fallgren, Per (15)
Edlund, Jens (6)
Malisz, Zofia (5)
Edlund, Jens, Docent ... (5)
Eriksson, Gunnar, 19 ... (2)
Domeij, Rickard, 195 ... (2)
House, David (2)
Nylund Skog, Susanne ... (2)
Öqvist, Jenny, 1969- (2)
Jonell, Patrik (2)
Oertel, Catharine (1)
Forsberg, Lars (1)
Ghilagaber, Gebreneg ... (1)
Skantze, Gabriel, 19 ... (1)
Lopes, J. (1)
Edlund, Lena (1)
Lindström, E. (1)
Lindström, Eva (1)
Berg, Johanna (1)
Artman, Henrik, 1968 ... (1)
Brynielsson, Joel, 1 ... (1)
Lindquist, Sinna (1)
Herzing, Mathias (1)
Jacobsson, Adam (1)
Gustavii, Jonathan (1)
Häckner, Jonas (1)
Jacobsson, Eva-Maria (1)
Källmén, Håkan (1)
Lundström, Anders (1)
Muren, Astri (1)
Sjöberg, Eric (1)
Thuresson, Björn, 19 ... (1)
Tjörnhammar, Edward (1)
Wickström, Hans (1)
Kuhlmann, Marco, 197 ... (1)
Dogan, Fethiye Irmak (1)
Wennberg, Ulme (1)
Magnusson Petzell, E ... (1)
Shore, Todd (1)
Bystedt, Mattias (1)
Mascarenhas, Samuel (1)
Tånnander, Christina (1)
Cummins, Fred, Assoc ... (1)
Segeblad, Jesper (1)
Gustafson, Joakim, p ... (1)
Kontogiorgos, Dimost ... (1)
David Aguas Lopes, J ... (1)
Eran, Raveh (1)
University
Kungliga Tekniska Högskolan (13)
Institutet för språk och folkminnen (2)
Stockholms universitet (1)
Linköpings universitet (1)
Naturvårdsverket (1)
Language
English (14)
Swedish (1)
Research subject (UKÄ/SCB)
Natural sciences (10)
Engineering and Technology (3)
Humanities (3)