SwePub - sökning: WFRF:(Edlund Jens)

Numrering	Referens	Omslagsbild	Hitta
1.	Al Moubayed, Samer, et al. (författare) Analysis of gaze and speech patterns in three-party quiz game interaction 2013 Ingår i: Interspeech 2013. - : The International Speech Communication Association (ISCA). ; , s. 1126-1130 Konferensbidrag (refereegranskat)abstract In order to understand and model the dynamics between interaction phenomena such as gaze and speech in face-to-face multiparty interaction between humans, we need large quantities of reliable, objective data of such interactions. To date, this type of data is in short supply. We present a data collection setup using automated, objective techniques in which we capture the gaze and speech patterns of triads deeply engaged in a high-stakes quiz game. The resulting corpus consists of five one-hour recordings, and is unique in that it makes use of three state-of-the-art gaze trackers (one per subject) in combination with a state-of-theart conical microphone array designed to capture roundtable meetings. Several video channels are also included. In this paper we present the obstacles we encountered and the possibilities afforded by a synchronised, reliable combination of large-scale multi-party speech and gaze data, and an overview of the first analyses of the data. Index Terms: multimodal corpus, multiparty dialogue, gaze patterns, multiparty gaze.
2.	Al Moubayed, Samer, et al. (författare) Animated Faces for Robotic Heads : Gaze and Beyond 2011 Ingår i: Analysis of Verbal and Nonverbal Communication and Enactment. - Berlin, Heidelberg : Springer Berlin/Heidelberg. - 9783642257742 ; , s. 19-35 Konferensbidrag (refereegranskat)abstract We introduce an approach to using animated faces for robotics where a static physical object is used as a projection surface for an animation. The talking head is projected onto a 3D physical head model. In this chapter we discuss the different benefits this approach adds over mechanical heads. After that, we investigate a phenomenon commonly referred to as the Mona Lisa gaze effect. This effect results from the use of 2D surfaces to display 3D images and causes the gaze of a portrait to seemingly follow the observer no matter where it is viewed from. The experiment investigates the perception of gaze direction by observers. The analysis shows that the 3D model eliminates the effect, and provides an accurate perception of gaze direction. We discuss at the end the different requirements of gaze in interactive systems, and explore the different settings these findings give access to.
3.	Al Moubayed, Samer, et al. (författare) Taming Mona Lisa : communicating gaze faithfully in 2D and 3D facial projections 2012 Ingår i: ACM Transactions on Interactive Intelligent Systems. - : Association for Computing Machinery (ACM). - 2160-6455 .- 2160-6463. ; 1:2, s. 25- Tidskriftsartikel (refereegranskat)abstract The perception of gaze plays a crucial role in human-human interaction. Gaze has been shown to matter for a number of aspects of communication and dialogue, especially for managing the flow of the dialogue and participant attention, for deictic referencing, and for the communication of attitude. When developing embodied conversational agents (ECAs) and talking heads, modeling and delivering accurate gaze targets is crucial. Traditionally, systems communicating through talking heads have been displayed to the human conversant using 2D displays, such as flat monitors. This approach introduces severe limitations for an accurate communication of gaze since 2D displays are associated with several powerful effects and illusions, most importantly the Mona Lisa gaze effect, where the gaze of the projected head appears to follow the observer regardless of viewing angle. We describe the Mona Lisa gaze effect and its consequences in the interaction loop, and propose a new approach for displaying talking heads using a 3D projection surface (a physical model of a human head) as an alternative to the traditional flat surface projection. We investigate and compare the accuracy of the perception of gaze direction and the Mona Lisa gaze effect in 2D and 3D projection surfaces in a five subject gaze perception experiment. The experiment confirms that a 3Dprojection surface completely eliminates the Mona Lisa gaze effect and delivers very accurate gaze direction that is independent of the observer's viewing angle. Based on the data collected in this experiment, we rephrase the formulation of the Mona Lisa gaze effect. The data, when reinterpreted, confirms the predictions of the new model for both 2D and 3D projection surfaces. Finally, we discuss the requirements on different spatially interactive systems in terms of gaze direction, and propose new applications and experiments for interaction in a human-ECA and a human-robot settings made possible by this technology.
4.	Albertsson, Ann-Christine, et al. (författare) Design of renewable hydrogel release systems from fiberboard mill wastewater 2010 Ingår i: Biomacromolecules. - : American Chemical Society (ACS). - 1525-7797 .- 1526-4602. ; 11:5, s. 1406-1411 Tidskriftsartikel (refereegranskat)abstract A new route for the design of renewable hydrogels is presented. The soluble waste from masonite production was isolated, fractionized, and upgraded. The resulting hemicellulose rich fraction was alkenyl-functionalized and used in the preparation of covalently cross-linked hydrogels capable of sustained release of incorporated agents. Said hydrogels showed a Fickian diffusion-based release of incorporated bovine serum albumin. Also, a method for the coating of seeds with hydrogel was developed. The sustained release of incorporated growth retardant agents from the hydrogel coating on rape seeds was shown to enable the temporary inhibition of germination.
5.	Andréasson, Maia, 1960, et al. (författare) Swedish CLARIN activities 2009 Ingår i: Proceedings of the Nodalida 2009 workshop on CLARIN activities in the Nordic countries. NEALT Proceedings Series. - 1736-6305. ; 5, s. 1-5 Konferensbidrag (refereegranskat)
6.	Andréasson, Maia, et al. (författare) Swedish CLARIN Activities 2009 Ingår i: Proceedings of the NODALIDA 2009 workshop Nordic Perspectives on the CLARIN Infrastructure of Language Resources. - : Northern European Association for Language Technology (NEALT). ; , s. 1-5 Konferensbidrag (refereegranskat)abstract Although Sweden has yet to allocate funds specifically intended for CLARIN activities, there are some ongoing activities which are directly relevant to CLARIN, and which are explicitly linked to CLARIN. These activities have been funded by the Committee for Research Infrastructures and its subcommittee DISC (Database Infrastructure Committee) of the Swedish Research Council.
7.	Aquino, Jorge B, et al. (författare) In vitro and in vivo differentiation of boundary cap neural crest stem cells into mature Schwann cells. 2006 Ingår i: Experimental Neurology. - : Elsevier BV. - 0014-4886. ; 198:2, s. 438-49 Tidskriftsartikel (refereegranskat)
8.	Berg, Johanna, et al. (författare) Making Archival Speech Recordings Accessible for Research : A Report from the Tilltal Project 2019 Ingår i: Svenska landsmål och svenskt folkliv. - : Kungl. Skogs- och Lantbruksakademien. - 0347-1837. ; 141, s. 171-178 Tidskriftsartikel (övrigt vetenskapligt/konstnärligt)
9.	Beskow, Jonas, et al. (författare) A Model for Multimodal Dialogue System Output Applied to an Animated Talking Head 2005 Ingår i: SPOKEN MULTIMODAL HUMAN-COMPUTER DIALOGUE IN MOBILE ENVIRONMENTS. - Dordrecht : Springer. - 9781402030758 ; , s. 93-113 Bokkapitel (refereegranskat)abstract We present a formalism for specifying verbal and non-verbal output from a multimodal dialogue system. The output specification is XML-based and provides information about communicative functions of the output, without detailing the realisation of these functions. The aim is to let dialogue systems generate the same output for a wide variety of output devices and modalities. The formalism was developed and implemented in the multimodal spoken dialogue system AdApt. We also describe how facial gestures in the 3D-animated talking head used within this system are controlled through the formalism.
10.	Beskow, Jonas, et al. (författare) Face-to-Face Interaction and the KTH Cooking Show 2010 Ingår i: Development of multimodal interfaces. - Berlin, Heidelberg : Springer Berlin Heidelberg. - 9783642123962 ; , s. 157-168 Konferensbidrag (refereegranskat)abstract We share our experiences with integrating motion capture recordings in speech and dialogue research by describing (1) Spontal, a large project collecting 60 hours of video, audio and motion capture spontaneous dialogues, is described with special attention to motion capture and its pitfalls; (2) a tutorial where we use motion capture, speech synthesis and an animated talking head to allow students to create an active listener; and (3) brief preliminary results in the form of visualizations of motion capture data over time in a Spontal dialogue. We hope that given the lack of writings on the use of motion capture for speech research, these accounts will prove inspirational and informative.
11.	Beskow, Jonas, et al. (författare) Innovative interfaces in MonAMI : The Reminder 2008 Ingår i: Perception In Multimodal Dialogue Systems, Proceedings. - Berlin, Heidelberg : Springer Berlin Heidelberg. - 9783540693680 ; , s. 272-275 Konferensbidrag (refereegranskat)abstract This demo paper presents the first version of the Reminder, a prototype ECA developed in the European project MonAMI, which aims at "main-streaming accessibility in consumer goods and services, using advanced technologies to ensure equal access, independent living and participation for all". The Reminder helps users to plan activities and to remember what to do. The prototype merges ECA technology with other, existing technologies: Google Calendar and a digital pen and paper. This innovative combination of modalities allows users to continue using a paper calendar in the manner they are used to, whilst the ECA provides verbal notifications on what has been written in the calendar. Users may also ask questions such as "When was I supposed to meet Sara?" or "What's on my schedule today?"
12.	Beskow, Jonas, et al. (författare) Kinetic Data for Large-Scale Analysis and Modeling of Face-to-Face Conversation 2011 Ingår i: Proceedings of International Conference on Audio-Visual Speech Processing 2011. - Stockholm : KTH Royal Institute of Technology. ; , s. 103-106 Konferensbidrag (refereegranskat)abstract Spoken face to face interaction is a rich and complex form of communication that includes a wide array of phenomena thatare not fully explored or understood. While there has been extensive studies on many aspects in face-to-face interaction, these are traditionally of a qualitative nature, relying on hand annotated corpora, typically rather limited in extent, which is a natural consequence of the labour intensive task of multimodal data annotation. In this paper we present a corpus of 60 hours of unrestricted Swedish face-to-face conversations recorded with audio, video and optical motion capture, and we describe a new project setting out to exploit primarily the kinetic data in this corpus in order to gain quantitative knowledge on humanface-to-face interaction.
13.	Beskow, Jonas, et al. (författare) Modelling humanlike conversational behaviour 2010 Ingår i: SLTC 2010. - Linköping, Sweden. ; , s. 9-10 Konferensbidrag (övrigt vetenskapligt/konstnärligt)abstract We have a visionar y goal: to learn enough about human face-to-face interaction that we are able to create an artificial conversational partner that is humanlike. We take the opportunity here to present four new projects inaugurated in 2010, each adding pieces of the puzzle through a shared research focus: modelling interactional aspects of spoken face-to-face communication.
14.	Beskow, Jonas, et al. (författare) Multimodal Interaction Control 2009 Ingår i: Computers in the Human Interaction Loop. - Berlin/Heidelberg : Springer Berlin/Heidelberg. - 9781848820531 - 9781848820548 ; , s. 143-158 Bokkapitel (refereegranskat)
15.	Beskow, Jonas, et al. (författare) Project presentation: Spontal : multimodal database of spontaneous dialog 2009 Ingår i: Proceedings of Fonetik 2009. - Stockholm : Stockholm University. - 9789163348921 - 9789163348938 ; , s. 190-193 Konferensbidrag (övrigt vetenskapligt/konstnärligt)abstract We describe the ongoing Swedish speech database project Spontal: Multimodal database of spontaneous speech in dialog (VR 2006-7482). The project takes as its point of departure the fact that both vocal signals and gesture involving the face and body are important in every-day, face-to-face communicative interaction, and that there is a great need for data with which we more precisely measure these.
16.	Beskow, Jonas, et al. (författare) Research focus : Interactional aspects of spoken face-to-face communication 2010 Ingår i: Proceedings from Fonetik, Lund, June 2-4, 2010. - Lund, Sweden : Lund University. ; , s. 7-10 Konferensbidrag (övrigt vetenskapligt/konstnärligt)abstract We have a visionary goal: to learn enough about human face-to-face interaction that we are able to create an artificial conversational partner that is human-like. We take the opportunity here to present four new projects inaugurated in 2010, each adding pieces of the puzzle through a shared research focus: interactional aspects of spoken face-to-face communication.
17.	Beskow, Jonas, et al. (författare) Research focus : Interactional aspects of spoken face-to-face communication 2010 Ingår i: Proceedings from Fonetik 2010. - Lund : Lund University. ; , s. 7-10 Konferensbidrag (övrigt vetenskapligt/konstnärligt)
18.	Beskow, Jonas, et al. (författare) Speech technology in the European project MonAMI 2008 Ingår i: Proceedings of FONETIK 2008. - Gothenburg, Sweden : University of Gothenburg. - 9789197719605 ; , s. 33-36 Konferensbidrag (övrigt vetenskapligt/konstnärligt)abstract This paper describes the role of speech and speech technology in the European project MonAMI, which aims at “mainstreaming ac-cessibility in consumer goods and services, us-ing advanced technologies to ensure equal ac-cess, independent living and participation for all”. It presents the Reminder, a prototype em-bodied conversational agent (ECA) which helps users to plan activities and to remember what to do. The prototype merges speech technology with other, existing technologies: Google Cal-endar and a digital pen and paper. The solution allows users to continue using a paper calendar in the manner they are used to, whilst the ECA provides notifications on what has been written in the calendar. Users may also ask questions such as “When was I supposed to meet Sara?” or “What’s on my schedule today?”
19.	Beskow, Jonas, et al. (författare) The MonAMI Reminder : a spoken dialogue system for face-to-face interaction 2009 Ingår i: Proceedings of the 10th Annual Conference of the International Speech Communication Association, INTERSPEECH 2009. - Brighton, U.K. ; , s. 300-303 Konferensbidrag (refereegranskat)abstract We describe the MonAMI Reminder, a multimodal spoken dialogue system which can assist elderly and disabled people in organising and initiating their daily activities. Based on deep interviews with potential users, we have designed a calendar and reminder application which uses an innovative mix of an embodied conversational agent, digital pen and paper, and the web to meet the needs of those users as well as the current constraints of speech technology. We also explore the use of head pose tracking for interaction and attention control in human-computer face-to-face interaction.
20.	Borin, Lars, et al. (författare) European Language Equality : D1.33: Report on the Swedish language 2022 Rapport (övrigt vetenskapligt/konstnärligt)
21.	Borin, Lars, et al. (författare) Language Report Swedish 2022 Ingår i: EuropeanLanguage Equality. - : ELE Consortium. Bokkapitel (övrigt vetenskapligt/konstnärligt)
22.	Borin, Lars, et al. (författare) Language Report Swedish 2023 Ingår i: Cognitive Technologies. - : Springer Nature. ; , s. 219-222 Bokkapitel (övrigt vetenskapligt/konstnärligt)abstract Swedish speech and language technology (LT) research goes back over 70 years. This has paid off: there is a national research infrastructure, as well as significant research projects, and Swedish is well-endowed with language resources (LRs) and tools. However, there are gaps that need to be filled, especially high-quality goldstandard LRs required by the most recent deep-learning methods. In the future, we would like to see closer collaborations and communication between the “traditional” LT research community and the burgeoning AI field, the establishment of dedicated academic LT training programmes, and national funding for LT research.
23.	Borin, Lars, 1957, et al. (författare) Language technology and 3rd wave HCI: Towards phatic communication and situated interaction 2018 Ingår i: New Directions in Third Wave Human-Computer Interaction: Volume 1 - Technologies / edited by Michael Filimowicz, Veronika Tzankova.. - Cham : Springer International Publishing. - 1571-5035 .- 2524-4477. - 9783319733555 ; , s. 251-264 Bokkapitel (refereegranskat)abstract In the field of language technology, researchers are starting to pay more attention to various interactional aspects of language – a development prompted by a confluence of factors, and one which applies equally to the processing of written and spoken language. Notably, the so-called ‘phatic’ aspects of linguistic communication are coming into focus in this work, where linguistic interaction is increasingly recognized as being fundamentally situated. This development resonates well with the concerns of third wave HCI, which involves a shift in focus from stating the requirements on HCI design primarily in terms of “context-free” information flow, to a view where it is recognized that HCI – just like interaction among humans – is indissolubly embedded in complex, shifting contexts. These – together with the different backgrounds and intentions of interaction participants – shape the interaction in ways which are not readily understandable in terms of rational information exchange, but which are nevertheless central aspects of the interaction, and which therefore must be taken into account in HCI design, including its linguistic aspects, forming the focus of this chapter.
24.	Borin, L., et al. (författare) Språkbanken 2018 : Research resources for text, speech, & society 2018 Ingår i: CEUR Workshop Proceedings. - : CEUR-WS. ; , s. 504-506 Konferensbidrag (refereegranskat)abstract We introduce an expanded version of the Swedish research resource Språkbanken (the Swedish Language Bank). In 2018, Språkbanken, which has supported national and international research for over four decades, adds two branches, one focusing on speech and one on societal aspects of language, to its existing organization, which targets text.
25.	Borin, Lars, 1957, et al. (författare) Svenska språket i den digitala tidsåldern : The swedish language in the digital age 2012 Bok (övrigt vetenskapligt/konstnärligt)
26.	Borin, Lars, et al. (författare) Svenska språket i den digitala tidsåldern 2012 Bok (refereegranskat)
27.	Borin, Lars, et al. (författare) The Swedish Language in the Digital Age/Svenska språket i den digitala tidsåldern 2012 Bok (refereegranskat)
28.	Bystedt, Mattias, et al. (författare) New applications of gaze tracking in speech science 2019 Ingår i: CEUR Workshop Proceedings. - : CEUR-WS. ; , s. 73-78 Konferensbidrag (refereegranskat)abstract We present an overview of speech research applications of gaze tracking technology, where gaze behaviours are exploited as a tool for analysis rather than as a primary object of study. The methods presented are all in their infancy, but can greatly assist the analysis of digital audio and video as well as unlock the relationship between writing and other encodings on the one hand, and natural language, such as speech, on the other. We discuss three directions in this type of gaze tracking application: modelling of text that is read aloud, evaluation and annotation with naïve informants, and evaluation and annotation with expert annotators. In each of these areas, we use gaze tracking information to gauge the behaviour of people when working with speech and conversation, rather than when reading text aloud or partaking in conversations, in order to learn something about how the speech may be ana-lysed from a human perspective.
29.	Carlson, Rolf, et al. (författare) Towards human-like behaviour in spoken dialog systems 2006 Ingår i: Proceedings of Swedish Language Technology Conference (SLTC 2006). - Gothenburg, Sweden. Konferensbidrag (övrigt vetenskapligt/konstnärligt)abstract We and others have found it fruitful to assume that users, when interacting with spoken dialogue systems, perceive the systems and their actions metaphorically. Common metaphors include the human metaphor and the interface metaphor (cf. Edlund, Heldner, & Gustafson, 2006). In the interface metaphor, the spoken dialogue system is perceived as a machine interface – often but not always a computer interface. Speech is used to accomplish what would have otherwise been accomplished by some other means of input, such as a keyboard or a mouse. In the human metaphor, on the other hand, the computer is perceived as a creature (or even a person) with humanlike conversational abilities, and speech is not a substitute or one of many alternatives, but rather the primary means of communicating with this creature. We are aware that more “natural ” or human-like behaviour does not automatically make a spoken dialogue system “better ” (i.e. more efficient or more well-liked by its users). Indeed, we are quite convinced that the advantage (or disadvantage) of humanlike behaviour will be highly dependent on the application. However, a dialogue system that is coherent with a human metaphor may profit from a number of characteristics.
30.	Clark, Leigh, et al. (författare) Mapping Theoretical and Methodological Perspectives for Understanding Speech Interface Interactions 2019 Ingår i: CHI EA '19 EXTENDED ABSTRACTS. - New York, NY, USA : ASSOC COMPUTING MACHINERY. Konferensbidrag (refereegranskat)abstract The use of speech as an interaction modality has grown considerably through the integration of Intelligent Personal Assistants (IPAs- e.g. Siri, Google Assistant) into smartphones and voice based devices (e.g. Amazon Echo). However, there remain significant gaps in using theoretical frameworks to understand user behaviours and choices and how they may applied to specific speech interface interactions. This part-day multidisciplinary workshop aims to critically map out and evaluate theoretical frameworks and methodological approaches across a number of disciplines and establish directions for new paradigms in understanding speech interface user behaviour. In doing so, we will bring together participants from HCI and other speech related domains to establish a cohesive, diverse and collaborative community of researchers from academia and industry with interest in exploring theoretical and methodological issues in the field.
31.	Clark, Leigh, et al. (författare) The State of Speech in HCI : Trends, Themes and Challenges 2019 Ingår i: Interacting with computers. - : Oxford University Press. - 0953-5438 .- 1873-7951. ; 31:4, s. 349-371 Tidskriftsartikel (refereegranskat)abstract Speech interfaces are growing in popularity. Through a review of 99 research papers this work maps the trends, themes, findings and methods of empirical research on speech interfaces in the field of human-computer interaction (HCI). We find that studies are usability/theory-focused or explore wider system experiences, evaluating Wizard of Oz, prototypes or developed systems. Measuring task and interaction was common, as was using self-report questionnaires to measure concepts like usability and user attitudes. A thematic analysis of the research found that speech HCI work focuses on nine key topics: system speech production, design insight, modality comparison, experiences with interactive voice response systems, assistive technology and accessibility, user speech production, using speech technology for development, peoples' experiences with intelligent personal assistants and how user memory affects speech interface interaction. From these insights we identify gaps and challenges in speech research, notably taking into account technological advancements, the need to develop theories of speech interface interaction, grow critical mass in this domain, increase design work and expand research from single to multiple user interaction contexts so as to reflect current use contexts. We also highlight the need to improve measure reliability, validity and consistency, in the wild deployment and reduce barriers to building fully functional speech interfaces for research.
32.	Dalmas, T., et al. (författare) Introduction 2014 Ingår i: Proceedings 2014 Workshop on Dialogue in Motion, DM 2014. - : Association for Computational Linguistics (ACL). Konferensbidrag (refereegranskat)
33.	Domeij, Rickard, 1958-, et al. (författare) Exploring the archives for textual entry points to speech - Experiences of interdisciplinary collaboration in making cultural heritage accessible for research 2020 Ingår i: CEUR Workshop Proceedings. - Riga : CEUR-WS. ; , s. 45-55, s. 45-55 Konferensbidrag (refereegranskat)abstract Tilltal (Tillgängligt kulturarv för forskning i tal, 'Accessible cultural heritage for speech research') is a multidisciplinary and methodological project undertaken by the Institute of Language and Folklore, KTH Royal Institute of Technology, and The Swedish National Archives in cooperation with the National Language Bank and SWE-CLARIN [1]. It aims to provide researchers better access to archival audio recordings using methods from language technology. The project comprises three case studies and one activity and usage study. In the case studies, actual research agendas from three different fields (ethnology, sociolinguistics and interaction analysis) serve as a basis for identifying procedures that may be simplified with the aid of digital tools. In the activity and usage study, we are applying an activity-theoretical approach with the aim of involving researchers and investigating how they use - and would like to be able to use - the archival resources at ISOF. Involving researchers in participatory design ensures that digital solutions are suggested and evaluated in relation to the requirements expressed by researchers engaged in specific research tasks [2]. In this paper we focus on one of the case studies, which investigates the process by which personal experience narratives are transformed into cultural heritage [3], and account for our results in exploring how different types of text material from the archives can be used to find relevant sections of the audio recordings. Finally, we discuss what lessons can be learned, and what conclusions can be drawn, from our experiences of interdisciplinary collaboration in the project.
34.	Edlund, Jens, et al. (författare) 3rd party observer gaze as a continuous measure of dialogue flow 2012 Ingår i: Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012. - Istanbul, Turkey : LREC. ; , s. 1354-1358 Konferensbidrag (refereegranskat)abstract We present an attempt at using 3rd party observer gaze to get a measure of how appropriate each segment in a dialogue is for a speaker change. The method is a step away from the current dependency of speaker turns or talkspurts towards a more general view of speaker changes. We show that 3rd party observers do indeed largely look at the same thing (the speaker), and how this can be captured and utilized to provide insights into human communication. In addition, the results also suggest that there might be differences in the distribution of 3rd party observer gaze depending on how information-rich an utterance is.
35.	Edlund, Jens, et al. (författare) 3rd party observer gaze during backchannels 2012 Ingår i: Proc. of the Interspeech 2012 Interdisciplinary Workshop on Feedback Behaviors in Dialog. - Skamania Lodge, WA, USA. Konferensbidrag (refereegranskat)abstract This paper describes a study of how the gazes of 3rd party observers of dialogue move when a speaker is taking the turn and producing a back-channel, respectively. The data is collected and basic processing is complete, but the results section for the paper is not yet in place. It will be in time for the workshop, however, and will be presented there, should this paper outline be accepted..
36.	Edlund, Jens, Docent/Associate Professor, 1967-, et al. (författare) A Multimodal Digital Humanities Study of Terrorism in Swedish Politics: An Interdisciplinary Mixed Methods Project on the Configuration of Terrorism in Parliamentary Debates, Legislation, and Policy Networks 1968–2018 2022 Ingår i: Intelligent Systems and Applications. Proceedings of the 2021 Intelligent Systems Conference, September 2–3, 2021 / Arai K. (eds). - Cham : Springer. - 2367-3370 .- 2367-3389. - 9783030821951 ; , s. 435-449 Konferensbidrag (refereegranskat)abstract This paper presents the design of one of Sweden’s largest digital humanities projects, SweTerror, that through an interdisciplinary multi-modal methodological approach develops an extensive speech-to-text digital HSS resource. SweTerror makes a major contribution to the study of terrorism in Sweden through a comprehensive mixed methods study of the political discourse on terrorism since the late 1960s. Drawing on artificial intelligence in the form of state-of-the-art language and speech technology, it systematically analyses all forms of relevant parliamentary utterances. It explores and curates an exhaustive but understudied multi-modal collection of primary sources of central relevance to Swedish democracy: the audio recordings of the Swedish Parliament’s debates. The project studies the framing of terrorism both as policy discourse and enacted politics, examining semantic and emotive components of the parliamentary discourse on terrorism as well as major actors and social networks involved. It covers political responses to a range of terrorism-related issues as well as factors influencing policy-makers’ engagement, including political affiliations and gender. SweTerror also develops an online research portal, featuring the complete research material and searchable audio made readily accessible for further exploration. Long-term, the project establishes a model for combining extraction technologies (speech recognition and analysis) for audiovisual parliamentary data with text mining and HSS interpretive methods and the portal is designed to serve as a prototype for other similar projects.
37.	Edlund, Jens, et al. (författare) Applications of distributed dialogue systems : the KTH Connector 2005 Ingår i: Proceedings of ISCA Tutorial and Research Workshop on Applied Spoken Language Interaction in Distributed Environments (ASIDE 2005). Konferensbidrag (refereegranskat)abstract We describe a spoken dialogue system domain: that of the personal secretary. This domain allows us to capitalise on the characteristics that make speech a unique interface; characteristics that humans use regularly, implicitly, and with remarkable ease. We present a prototype system - the KTH Connector - and highlight several dialogue research issues arising in the domain.
38.	Edlund, Jens, et al. (författare) Ask the experts : Part II: Analysis 2010 Ingår i: Linguistic Theory and Raw Sound. - Frederiksberg : Samfundslitteratur. - 9788759314791 ; , s. 183-198 Bokkapitel (refereegranskat)abstract We present work fuelled by an urge to understand speech in its original and most fundamental context: in conversation between people. And what better way than to look to the experts? Regarding human conversation, authority lies with the speakers themselves, and asking the experts is a matter of observing and analyzing what speakers do. This is the second part of a two-part discussion which is illustrated with examples mainly from the work at KTH Speech, Music and Hearing. In this part, we discuss methods of extracting useful information from captured data, with a special focus on raw sound.
39.	Edlund, Jens, et al. (författare) Audience response system based annotation of speech 2013 Ingår i: Proceedings of Fonetik 2013. - Linköping : Linköping University. - 9789175195827 - 9789175195797 ; , s. 13-16 Konferensbidrag (övrigt vetenskapligt/konstnärligt)abstract Manual annotators are often used to label speech. The task is associated with high costs and with great time consumption. We suggest to reach an increased throughput while maintaining a high measure of experimental control by borrowing from the Audience Response Systems used in the film and television industries, and demonstrate a cost-efficient setup for rapid, plenary annotation of phenomena occurring in recorded speech together with some results from studies we have undertaken to quantify the temporal precision and reliability of such annotations.
40.	Edlund, Jens, et al. (författare) Audience response system-based assessment for analysis-by-synthesis 2015 Ingår i: Proc. of ICPhS 2015. - : ICPhS. Konferensbidrag (refereegranskat)
41.	Edlund, Jens, et al. (författare) Capturing massively multimodal dialogues : affordable synchronization and visualization 2010 Ingår i: Proc. of Multimodal Corpora. ; , s. 160-161 Konferensbidrag (refereegranskat)abstract In this demo, we show (a) affordable and relatively easy-to-implement means to facilitate synchronization of audio, video and motion capture data in post processing, and (b) a flexible tool for 3D visualization of recorded motion capture data aligned with audio and video sequences. The synchronisation is made possible by the use of two simple and analogues devices: a turntable and an easy to build electronic clapper board. The demo shows examples of how the signals from the turntable and the clapper board are traced over the three modalities, using the 3D visualisation tool. We also demonstrate how the visualisation tool shows head and torso movements captured by the motion capture system.
42.	Edlund, Jens, et al. (författare) Catching wind of multiparty conversation 2014 Ingår i: LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION. - 9782951740884 Konferensbidrag (refereegranskat)abstract The paper describes the design of a novel corpus of respiratory activity in spontaneous multiparty face-to-face conversations in Swedish. The corpus is collected with the primary goal of investigating the role of breathing for interactive control of interaction. Physiological correlates of breathing are captured by means of respiratory belts, which measure changes in cross sectional area of the rib cage and the abdomen. Additionally, auditory and visual cues of breathing are recorded in parallel to the actual conversations. The corpus allows studying respiratory mechanisms underlying organisation of spontaneous communication, especially in connection with turn management. As such, it is a valuable resource both for fundamental research and speech techonology applications.
43.	Edlund, Jens, 1967-, et al. (författare) Catching wind of multiparty conversation 2014 Ingår i: Proceedings of Multimodal Corpora. - Reykjavik, Iceland : European Language Resources Association. ; , s. 35-36 Bokkapitel (övrigt vetenskapligt/konstnärligt)abstract The paper describes the design of a novel multimodal corpus of spontaneous multiparty conversations in Swedish. The corpus is collected with the primary goal of investigating the role of breathing and its perceptual cues for interactive control of interaction. Physiological correlates of breathing are captured by means of respiratory belts, which measure changes in cross sectional area of the rib cage and the abdomen. Additionally, auditory and visual correlates of breathing are recorded in parallel to the actual conversations. The corpus allows studying respiratory mechanisms underlying organisation of spontaneous conversation, especially in connection with turn management. As such, it is a valuable resource both for fundamental research and speech techonology applications.
44.	Edlund, Jens, et al. (författare) Co-present or Not? : Embodiment, Situatedness and the Mona Lisa Gaze Effect 2013 Ingår i: Eye gaze in intelligent user interfaces. - London : Springer London. - 9781447147831 - 9781447147848 ; , s. 185-203 Bokkapitel (refereegranskat)abstract The interest in embodying and situating computer programmes took off in the autonomous agents community in the 90s. Today, researchers and designers of programmes that interact with people on human terms endow their systems with humanoid physiognomies for a variety of reasons. In most cases, attempts at achieving this embodiment and situatedness has taken one of two directions: virtual characters and actual physical robots. In addition, a technique that is far from new is gaining ground rapidly: projection of animated faces on head-shaped 3D surfaces. In this chapter, we provide a history of this technique; an overview of its pros and cons; and an in-depth description of the cause and mechanics of the main drawback of 2D displays of 3D faces (and objects): the Mona Liza gaze effect. We conclude with a description of an experimental paradigm that measures perceived directionality in general and the Mona Lisa gaze effect in particular.
45.	Edlund, Jens, et al. (författare) Cocktail : a demonstration of massively multi-component audio environments for illustration and analysis 2010 Ingår i: SLTC 2010, The Third Swedish Language Technology Conference (SLTC 2010). Konferensbidrag (övrigt vetenskapligt/konstnärligt)abstract We present MMAE – Massively Multi-component Audio Environments – a new concept in auditory presentation, and Cocktail – a demonstrator built on this technology. MMAE creates a dynamic audio environment by playing a large number of sound clips simultaneously at different locations in a virtual 3D space. The technique utilizes standard soundboards and is based in the Snack Sound Toolkit. The result is an efficient 3D audio environment that can be modified dynamically, in real time. Applications range from the creation of canned as well as online audio environments for games and entertainment to the browsing, analyzing and comparing of large quantities of audio data. We also demonstrate the Cocktail implementation of MMAE using several test cases as examples.
46.	Edlund, Jens, et al. (författare) Gesture movement profiles in dialogues from a Swedish multimodal database of spontaneous speech 2012 Ingår i: Prosodic and Visual Resources in Interactional Grammar. - : Walter de Gruyter. Bokkapitel (refereegranskat)
47.	Edlund, Jens, et al. (författare) Hidden resources - Strategies to acquire and exploit potential spoken language resources in national archives 2016 Ingår i: Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016. - : European Language Resources Association (ELRA). - 9782951740891 ; , s. 4531-4534 Konferensbidrag (refereegranskat)abstract In 2014, the Swedish government tasked a Swedish agency, The Swedish Post and Telecom Authority (PTS), with investigating how to best create and populate an infrastructure for spoken language resources (Ref N2014/2840/ITP). As a part of this work, the department of Speech, Music and Hearing at KTH Royal Institute of Technology have taken inventory of existing potential spoken language resources, mainly in Swedish national archives and other governmental or public institutions. In this position paper, key priorities, perspectives, and strategies that may be of general, rather than Swedish, interest are presented. We discuss broad types of potential spoken language resources available; to what extent these resources are free to use; and thirdly the main contribution: strategies to ensure the continuous acquisition of spoken language resources in a manner that facilitates speech and speech technology research.
48.	Edlund, Jens, et al. (författare) Higgins : a spoken dialogue system for investigating error handling techniques 2004 Ingår i: Proceedings of the International Conference on Spoken Language Processing, ICSLP 04. ; , s. 229-231 Konferensbidrag (refereegranskat)abstract In this paper, an overview of the Higgins project and the research within the project is presented. The project incorporates studies of error handling for spoken dialogue systems on several levels, from processing to dialogue level. A domain in which a range of different error types can be studied has been chosen: pedestrian navigation and guiding. Several data collections within Higgins have been analysed along with data from Higgins' predecessor, the AdApt system. The error handling research issues in the project are presented in light of these analyses.
49.	Edlund, Jens (författare) How deeply rooted are the turns we take? 2011 Ingår i: SemDial 2011. ; , s. 196-197 Konferensbidrag (övrigt vetenskapligt/konstnärligt)abstract This poster presents preliminary work investigatingturn-taking in text-based chat with aview to learn something about how deeplyrooted turn-taking is in the human cognition.A connexion is shown between preferred turntakingpatterns and length and type of experiencewith such chats, which supports the ideathat the orderly type of turn-taking found inmost spoken conversations is indeed deeplyrooted, but not more so than that it can beovercome with training in a situation wheresuch turn-taking is not beneficial to the communication.
50.	Edlund, Jens, et al. (författare) Human pause and resume behaviours for unobtrusive humanlike in-car spoken dialogue systems 2014 Ingår i: Proceedings of the of the EACL 2014 Workshop on Dialogue in Motion (DM). - Gothenburg, Sweden. ; , s. 73-77 Konferensbidrag (refereegranskat)abstract This paper presents a first, largely qualitative analysis of a set of human-human dialogues recorded specifically to provide insights in how humans handle pauses and resumptions in situations where the speakers cannot see each other, but have to rely on the acoustic signal alone. The work presented is part of a larger effort to find unobtrusive human dialogue behaviours that can be mimicked and implemented in-car spoken dialogue systems within in the EU project Get Home Safe, a collaboration between KTH, DFKI, Nuance, IBM and Daimler aiming to find ways of driver interaction that minimizes safety issues,. The analysis reveals several human temporal, semantic/pragmatic, and structural behaviours that are good candidates for inclusion in spoken dialogue systems.

Skapa referenser, mejla, bekava och länka

Länka till träfflistan

Träfflista för sökning "WFRF:(Edlund Jens) "

Avgränsa träffmängd

År