SwePub
Sök i SwePub databas

  Utökad sökning

Träfflista för sökning "hsv:(NATURVETENSKAP) hsv:(Data och informationsvetenskap) hsv:(Medieteknik) ;lar1:(kth)"

Sökning: hsv:(NATURVETENSKAP) hsv:(Data och informationsvetenskap) hsv:(Medieteknik) > Kungliga Tekniska Högskolan

  • Resultat 1-10 av 279
Sortera/gruppera träfflistan
   
NumreringReferensOmslagsbildHitta
1.
  • Lu, Zhihan, et al. (författare)
  • Multimodal Hand and Foot Gesture Interaction for Handheld Devices
  • 2014
  • Ingår i: ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP). - : Association for Computing Machinery (ACM). - 1551-6857 .- 1551-6865. ; 11:1
  • Tidskriftsartikel (refereegranskat)abstract
    • We present a hand-and-foot-based multimodal interaction approach for handheld devices. Our method combines input modalities (i.e., hand and foot) and provides a coordinated output to both modalities along with audio and video. Human foot gesture is detected and tracked using contour-based template detection (CTD) and Tracking-Learning-Detection (TLD) algorithm. 3D foot pose is estimated from passive homography matrix of the camera. 3D stereoscopic and vibrotactile are used to enhance the immersive feeling. We developed a multimodal football game based on the multimodal approach as a proof-of-concept. We confirm our systems user satisfaction through a user study.
  •  
2.
  • Frid, Emma, et al. (författare)
  • Perception of Mechanical Sounds Inherent to Expressive Gestures of a NAO Robot - Implications for Movement Sonification of Humanoids
  • 2018
  • Ingår i: Proceedings of the 15th Sound and Music Computing Conference. - Limassol, Cyprus. - 9789963697304
  • Konferensbidrag (refereegranskat)abstract
    • In this paper we present a pilot study carried out within the project SONAO. The SONAO project aims to compen- sate for limitations in robot communicative channels with an increased clarity of Non-Verbal Communication (NVC) through expressive gestures and non-verbal sounds. More specifically, the purpose of the project is to use move- ment sonification of expressive robot gestures to improve Human-Robot Interaction (HRI). The pilot study described in this paper focuses on mechanical robot sounds, i.e. sounds that have not been specifically designed for HRI but are inherent to robot movement. Results indicated a low correspondence between perceptual ratings of mechanical robot sounds and emotions communicated through ges- tures. In general, the mechanical sounds themselves ap- peared not to carry much emotional information compared to video stimuli of expressive gestures. However, some mechanical sounds did communicate certain emotions, e.g. frustration. In general, the sounds appeared to commu- nicate arousal more effectively than valence. We discuss potential issues and possibilities for the sonification of ex- pressive robot gestures and the role of mechanical sounds in such a context. Emphasis is put on the need to mask or alter sounds inherent to robot movement, using for exam- ple blended sonification.
  •  
3.
  • Frid, Emma, 1988-, et al. (författare)
  • Perceptual Evaluation of Blended Sonification of Mechanical Robot Sounds Produced by Emotionally Expressive Gestures : Augmenting Consequential Sounds to Improve Non-verbal Robot Communication
  • 2021
  • Ingår i: International Journal of Social Robotics. - : Springer Nature. - 1875-4791 .- 1875-4805.
  • Tidskriftsartikel (refereegranskat)abstract
    • This paper presents two experiments focusing on perception of mechanical sounds produced by expressive robot movement and blended sonifications thereof. In the first experiment, 31 participants evaluated emotions conveyed by robot sounds through free-form text descriptions. The sounds were inherently produced by the movements of a NAO robot and were not specifically designed for communicative purposes. Results suggested no strong coupling between the emotional expression of gestures and how sounds inherent to these movements were perceived by listeners; joyful gestures did not necessarily result in joyful sounds. A word that reoccurred in text descriptions of all sounds, regardless of the nature of the expressive gesture, was “stress”. In the second experiment, blended sonification was used to enhance and further clarify the emotional expression of the robot sounds evaluated in the first experiment. Analysis of quantitative ratings of 30 participants revealed that the blended sonification successfully contributed to enhancement of the emotional message for sound models designed to convey frustration and joy. Our findings suggest that blended sonification guided by perceptual research on emotion in speech and music can successfully improve communication of emotions through robot sounds in auditory-only conditions.
  •  
4.
  • Latupeirissa, Adrian Benigno, et al. (författare)
  • Exploring emotion perception in sonic HRI
  • 2020
  • Ingår i: 17th Sound and Music Computing Conference. - Torino : Zenodo. ; , s. 434-441
  • Konferensbidrag (refereegranskat)abstract
    • Despite the fact that sounds produced by robots can affect the interaction with humans, sound design is often an overlooked aspect in Human-Robot Interaction (HRI). This paper explores how different sets of sounds designed for expressive robot gestures of a humanoid Pepper robot can influence the perception of emotional intentions. In the pilot study presented in this paper, it has been asked to rate different stimuli in terms of perceived affective states. The stimuli were audio, audio-video and video only and contained either Pepper’s original servomotors noises, sawtooth, or more complex designed sounds. The preliminary results show a preference for the use of more complex sounds, thus confirming the necessity of further exploration in sonic HRI.
  •  
5.
  • Frid, Emma, et al. (författare)
  • An Exploratory Study On The Effect Of Auditory Feedback On Gaze Behavior In a Virtual Throwing Task With and Without Haptic Feedback
  • 2017
  • Ingår i: Proceedings of the 14th Sound and Music Computing Conference. - Espoo, Finland : Aalto University. - 9789526037295 ; , s. 242-249
  • Konferensbidrag (refereegranskat)abstract
    • This paper presents findings from an exploratory study on the effect of auditory feedback on gaze behavior. A total of 20 participants took part in an experiment where the task was to throw a virtual ball into a goal in different conditions: visual only, audiovisual, visuohaptic and audio- visuohaptic. Two different sound models were compared in the audio conditions. Analysis of eye tracking metrics indicated large inter-subject variability; difference between subjects was greater than difference between feedback conditions. No significant effect of condition could be observed, but clusters of similar behaviors were identified. Some of the participants’ gaze behaviors appeared to have been affected by the presence of auditory feedback, but the effect of sound model was not consistent across subjects. We discuss individual behaviors and illustrate gaze behavior through sonification of gaze trajectories. Findings from this study raise intriguing questions that motivate future large-scale studies on the effect of auditory feedback on gaze behavior. 
  •  
6.
  • Elowsson, Anders (författare)
  • Modeling Music : Studies of Music Transcription, Music Perception and Music Production
  • 2018
  • Doktorsavhandling (övrigt vetenskapligt/konstnärligt)abstract
    • This dissertation presents ten studies focusing on three important subfields of music information retrieval (MIR): music transcription (Part A), music perception (Part B), and music production (Part C).In Part A, systems capable of transcribing rhythm and polyphonic pitch are described. The first two publications present methods for tempo estimation and beat tracking. A method is developed for computing the most salient periodicity (the “cepstroid”), and the computed cepstroid is used to guide the machine learning processing. The polyphonic pitch tracking system uses novel pitch-invariant and tone-shift-invariant processing techniques. Furthermore, the neural flux is introduced – a latent feature for onset and offset detection. The transcription systems use a layered learning technique with separate intermediate networks of varying depth.  Important music concepts are used as intermediate targets to create a processing chain with high generalization. State-of-the-art performance is reported for all tasks.Part B is devoted to perceptual features of music, which can be used as intermediate targets or as parameters for exploring fundamental music perception mechanisms. Systems are proposed that can predict the perceived speed and performed dynamics of an audio file with high accuracy, using the average ratings from around 20 listeners as ground truths. In Part C, aspects related to music production are explored. The first paper analyzes long-term average spectrum (LTAS) in popular music. A compact equation is derived to describe the mean LTAS of a large dataset, and the variation is visualized. Further analysis shows that the level of the percussion is an important factor for LTAS. The second paper examines songwriting and composition through the development of an algorithmic composer of popular music. Various factors relevant for writing good compositions are encoded, and a listening test employed that shows the validity of the proposed methods.The dissertation is concluded by Part D - Looking Back and Ahead, which acts as a discussion and provides a road-map for future work. The first paper discusses the deep layered learning (DLL) technique, outlining concepts and pointing out a direction for future MIR implementations. It is suggested that DLL can help generalization by enforcing the validity of intermediate representations, and by letting the inferred representations establish disentangled structures supporting high-level invariant processing. The second paper proposes an architecture for tempo-invariant processing of rhythm with convolutional neural networks. Log-frequency representations of rhythm-related activations are suggested at the main stage of processing. Methods relying on magnitude, relative phase, and raw phase information are described for a wide variety of rhythm processing tasks.
  •  
7.
  • Koniaris, Christos, 1979- (författare)
  • Perceptually motivated speech recognition and mispronunciation detection
  • 2012
  • Doktorsavhandling (övrigt vetenskapligt/konstnärligt)abstract
    • This doctoral thesis is the result of a research effort performed in two fields of speech technology, i.e., speech recognition and mispronunciation detection. Although the two areas are clearly distinguishable, the proposed approaches share a common hypothesis based on psychoacoustic processing of speech signals. The conjecture implies that the human auditory periphery provides a relatively good separation of different sound classes. Hence, it is possible to use recent findings from psychoacoustic perception together with mathematical and computational tools to model the auditory sensitivities to small speech signal changes.The performance of an automatic speech recognition system strongly depends on the representation used for the front-end. If the extracted features do not include all relevant information, the performance of the classification stage is inherently suboptimal. The work described in Papers A, B and C is motivated by the fact that humans perform better at speech recognition than machines, particularly for noisy environments. The goal is to make use of knowledge of human perception in the selection and optimization of speech features for speech recognition. These papers show that maximizing the similarity of the Euclidean geometry of the features to the geometry of the perceptual domain is a powerful tool to select or optimize features. Experiments with a practical speech recognizer confirm the validity of the principle. It is also shown an approach to improve mel frequency cepstrum coefficients (MFCCs) through offline optimization. The method has three advantages: i) it is computationally inexpensive, ii) it does not use the auditory model directly, thus avoiding its computational cost, and iii) importantly, it provides better recognition performance than traditional MFCCs for both clean and noisy conditions.The second task concerns automatic pronunciation error detection. The research, described in Papers D, E and F, is motivated by the observation that almost all native speakers perceive, relatively easily, the acoustic characteristics of their own language when it is produced by speakers of the language. Small variations within a phoneme category, sometimes different for various phonemes, do not change significantly the perception of the language’s own sounds. Several methods are introduced based on similarity measures of the Euclidean space spanned by the acoustic representations of the speech signal and the Euclidean space spanned by an auditory model output, to identify the problematic phonemes for a given speaker. The methods are tested for groups of speakers from different languages and evaluated according to a theoretical linguistic study showing that they can capture many of the problematic phonemes that speakers from each language mispronounce. Finally, a listening test on the same dataset verifies the validity of these methods.
  •  
8.
  • Saqr, Mohammed, et al. (författare)
  • People, Ideas, Milestones : A Scientometric Study of Computational Thinking
  • 2021
  • Ingår i: ACM Transactions on Computing Education. - : Association for Computing Machinery (ACM). - 1946-6226. ; 21:3
  • Tidskriftsartikel (refereegranskat)abstract
    • The momentum around computational thinking (CT) has kindled a rising wave of research initiatives andscholarly contributions seeking to capitalize on the opportunities that CT could bring. A number of literaturereviews have showed a vibrant community of practitioners and a growing number of publications. However,the history and evolution of the emerging research topic, the milestone publications that have shaped itsdirections, and the timeline of the important developments may be better told through a quantitative, scientometric narrative. This article presents a bibliometric analysis of the drivers of the CT topic, as well as itsmain themes of research, international collaborations, influential authors, and seminal publications, and howauthors and publications have influenced one another. The metadata of 1,874 documents were retrieved fromthe Scopus database using the keyword “computational thinking.” The results show that CT research has been US-centric from the start, and continues to be dominated by US researchers both in volume and impact. International collaboration is relatively low, but clusters of joint research are found between, for example, anumber of Nordic countries, lusophone- and hispanophone countries, and central European countries. The results show that CT features the computing’s traditional tripartite disciplinary structure (design, modeling, and theory), a distinct emphasis on programming, and a strong pedagogical and educational backdrop including constructionism, self-efficacy, motivation, and teacher training.
  •  
9.
  •  
10.
  •  
Skapa referenser, mejla, bekava och länka
  • Resultat 1-10 av 279
Typ av publikation
konferensbidrag (164)
tidskriftsartikel (70)
bokkapitel (17)
doktorsavhandling (15)
konstnärligt arbete (5)
rapport (4)
visa fler...
forskningsöversikt (3)
proceedings (redaktörskap) (2)
samlingsverk (redaktörskap) (1)
bok (1)
annan publikation (1)
visa färre...
Typ av innehåll
refereegranskat (248)
övrigt vetenskapligt/konstnärligt (27)
populärvet., debatt m.m. (4)
Författare/redaktör
Bresin, Roberto, 196 ... (51)
Pargman, Daniel (21)
Holzapfel, André, 19 ... (19)
Frid, Emma, 1988- (17)
Elblaus, Ludvig, 198 ... (14)
Holzapfel, Andre (13)
visa fler...
Hedin, Björn, 1970- (12)
Hüttenrauch, Helge (10)
Bresin, Roberto (10)
Räsänen, Minna (10)
Tollmar, Konrad (8)
Hrastinski, Stefan (8)
Parnes, Peter (7)
Friberg, Anders (7)
Frid, Emma (7)
Nilsson, Marcus (6)
Hansen, Kjetil Falke ... (6)
Gullström, Charlie (6)
Panariello, Claudio (6)
Benetos, Emmanouil (5)
Severinson Eklundh, ... (5)
Zapico Lamela, Jorge ... (5)
Jakobsson, Peter (4)
Höök, Kristina, 1964 ... (4)
Green, Anders (4)
Eriksson, Elina, 197 ... (4)
Turpeinen, Marko (4)
Bälter, Olle (4)
Handberg, Leif, 1962 ... (4)
Hedman, Anders (3)
Hansen, Kjetil Falke ... (3)
Enlund, Nils (3)
Romero, Mario, 1973- (3)
Hallberg, Josef (3)
Josefsson, Pernilla (3)
Falkenberg, Kjetil, ... (3)
Núñez-Pacheco, Claud ... (3)
Naeve, Ambjörn (3)
Unander-Scharin, Car ... (3)
Pauletto, Sandra (3)
Enoksson, Fredrik, 1 ... (3)
Synnes, Kåre (3)
Eriksson, Elina (3)
Palmer, Matthias (3)
Ternström, Sten, 195 ... (3)
Zapico, Jorge Luis (3)
Pauletto, Sandra, As ... (3)
Latupeirissa, Adrian ... (3)
Mancini, Maurizio (3)
Jääskeläinen, Petra (3)
visa färre...
Lärosäte
Södertörns högskola (38)
Luleå tekniska universitet (17)
Uppsala universitet (12)
Linnéuniversitetet (10)
Stockholms universitet (9)
visa fler...
Chalmers tekniska högskola (7)
Karlstads universitet (5)
Kungl. Musikhögskolan (5)
Linköpings universitet (4)
RISE (4)
Mittuniversitetet (3)
Umeå universitet (2)
Örebro universitet (2)
Stockholms konstnärliga högskola (2)
Mälardalens universitet (1)
Lunds universitet (1)
Konstfack (1)
Högskolan i Skövde (1)
Högskolan Dalarna (1)
Blekinge Tekniska Högskola (1)
visa färre...
Språk
Engelska (273)
Svenska (6)
Forskningsämne (UKÄ/SCB)
Naturvetenskap (279)
Teknik (59)
Samhällsvetenskap (42)
Humaniora (38)
Medicin och hälsovetenskap (2)

År

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy