↓ Direkt till sidans innehåll
↓ Direkt till sidans sekundära innehåll (sidomenyn)

Träfflista för sökning "hsv:(NATURVETENSKAP) hsv:(Data och informationsvetenskap) hsv:(Medieteknik) ;lar1:(kth)"

Sökning: hsv:(NATURVETENSKAP) hsv:(Data och informationsvetenskap) hsv:(Medieteknik) > Kungliga Tekniska Högskolan

Resultat 1-10 av 279

Sortera/gruppera träfflistan

Sortering: Träffar per sida:

Numrering	Referens	Omslagsbild	Hitta
1.	Lu, Zhihan, et al. (författare) Multimodal Hand and Foot Gesture Interaction for Handheld Devices 2014 Ingår i: ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP). - : Association for Computing Machinery (ACM). - 1551-6857 .- 1551-6865. ; 11:1 Tidskriftsartikel (refereegranskat)abstract We present a hand-and-foot-based multimodal interaction approach for handheld devices. Our method combines input modalities (i.e., hand and foot) and provides a coordinated output to both modalities along with audio and video. Human foot gesture is detected and tracked using contour-based template detection (CTD) and Tracking-Learning-Detection (TLD) algorithm. 3D foot pose is estimated from passive homography matrix of the camera. 3D stereoscopic and vibrotactile are used to enhance the immersive feeling. We developed a multimodal football game based on the multimodal approach as a proof-of-concept. We confirm our systems user satisfaction through a user study.
2.	Frid, Emma, et al. (författare) Perception of Mechanical Sounds Inherent to Expressive Gestures of a NAO Robot - Implications for Movement Sonification of Humanoids 2018 Ingår i: Proceedings of the 15th Sound and Music Computing Conference. - Limassol, Cyprus. - 9789963697304 Konferensbidrag (refereegranskat)abstract In this paper we present a pilot study carried out within the project SONAO. The SONAO project aims to compen- sate for limitations in robot communicative channels with an increased clarity of Non-Verbal Communication (NVC) through expressive gestures and non-verbal sounds. More specifically, the purpose of the project is to use move- ment sonification of expressive robot gestures to improve Human-Robot Interaction (HRI). The pilot study described in this paper focuses on mechanical robot sounds, i.e. sounds that have not been specifically designed for HRI but are inherent to robot movement. Results indicated a low correspondence between perceptual ratings of mechanical robot sounds and emotions communicated through ges- tures. In general, the mechanical sounds themselves ap- peared not to carry much emotional information compared to video stimuli of expressive gestures. However, some mechanical sounds did communicate certain emotions, e.g. frustration. In general, the sounds appeared to commu- nicate arousal more effectively than valence. We discuss potential issues and possibilities for the sonification of ex- pressive robot gestures and the role of mechanical sounds in such a context. Emphasis is put on the need to mask or alter sounds inherent to robot movement, using for exam- ple blended sonification.
3.	Frid, Emma, 1988-, et al. (författare) Perceptual Evaluation of Blended Sonification of Mechanical Robot Sounds Produced by Emotionally Expressive Gestures : Augmenting Consequential Sounds to Improve Non-verbal Robot Communication 2021 Ingår i: International Journal of Social Robotics. - : Springer Nature. - 1875-4791 .- 1875-4805. Tidskriftsartikel (refereegranskat)abstract This paper presents two experiments focusing on perception of mechanical sounds produced by expressive robot movement and blended sonifications thereof. In the first experiment, 31 participants evaluated emotions conveyed by robot sounds through free-form text descriptions. The sounds were inherently produced by the movements of a NAO robot and were not specifically designed for communicative purposes. Results suggested no strong coupling between the emotional expression of gestures and how sounds inherent to these movements were perceived by listeners; joyful gestures did not necessarily result in joyful sounds. A word that reoccurred in text descriptions of all sounds, regardless of the nature of the expressive gesture, was “stress”. In the second experiment, blended sonification was used to enhance and further clarify the emotional expression of the robot sounds evaluated in the first experiment. Analysis of quantitative ratings of 30 participants revealed that the blended sonification successfully contributed to enhancement of the emotional message for sound models designed to convey frustration and joy. Our findings suggest that blended sonification guided by perceptual research on emotion in speech and music can successfully improve communication of emotions through robot sounds in auditory-only conditions.
4.	Latupeirissa, Adrian Benigno, et al. (författare) Exploring emotion perception in sonic HRI 2020 Ingår i: 17th Sound and Music Computing Conference. - Torino : Zenodo. ; , s. 434-441 Konferensbidrag (refereegranskat)abstract Despite the fact that sounds produced by robots can affect the interaction with humans, sound design is often an overlooked aspect in Human-Robot Interaction (HRI). This paper explores how different sets of sounds designed for expressive robot gestures of a humanoid Pepper robot can influence the perception of emotional intentions. In the pilot study presented in this paper, it has been asked to rate different stimuli in terms of perceived affective states. The stimuli were audio, audio-video and video only and contained either Pepper’s original servomotors noises, sawtooth, or more complex designed sounds. The preliminary results show a preference for the use of more complex sounds, thus confirming the necessity of further exploration in sonic HRI.
5.	Frid, Emma, et al. (författare) An Exploratory Study On The Effect Of Auditory Feedback On Gaze Behavior In a Virtual Throwing Task With and Without Haptic Feedback 2017 Ingår i: Proceedings of the 14th Sound and Music Computing Conference. - Espoo, Finland : Aalto University. - 9789526037295 ; , s. 242-249 Konferensbidrag (refereegranskat)abstract This paper presents findings from an exploratory study on the effect of auditory feedback on gaze behavior. A total of 20 participants took part in an experiment where the task was to throw a virtual ball into a goal in different conditions: visual only, audiovisual, visuohaptic and audio- visuohaptic. Two different sound models were compared in the audio conditions. Analysis of eye tracking metrics indicated large inter-subject variability; difference between subjects was greater than difference between feedback conditions. No significant effect of condition could be observed, but clusters of similar behaviors were identified. Some of the participants’ gaze behaviors appeared to have been affected by the presence of auditory feedback, but the effect of sound model was not consistent across subjects. We discuss individual behaviors and illustrate gaze behavior through sonification of gaze trajectories. Findings from this study raise intriguing questions that motivate future large-scale studies on the effect of auditory feedback on gaze behavior.
6.	Elowsson, Anders (författare) Modeling Music : Studies of Music Transcription, Music Perception and Music Production 2018 Doktorsavhandling (övrigt vetenskapligt/konstnärligt)abstract This dissertation presents ten studies focusing on three important subfields of music information retrieval (MIR): music transcription (Part A), music perception (Part B), and music production (Part C).In Part A, systems capable of transcribing rhythm and polyphonic pitch are described. The first two publications present methods for tempo estimation and beat tracking. A method is developed for computing the most salient periodicity (the “cepstroid”), and the computed cepstroid is used to guide the machine learning processing. The polyphonic pitch tracking system uses novel pitch-invariant and tone-shift-invariant processing techniques. Furthermore, the neural flux is introduced – a latent feature for onset and offset detection. The transcription systems use a layered learning technique with separate intermediate networks of varying depth. Important music concepts are used as intermediate targets to create a processing chain with high generalization. State-of-the-art performance is reported for all tasks.Part B is devoted to perceptual features of music, which can be used as intermediate targets or as parameters for exploring fundamental music perception mechanisms. Systems are proposed that can predict the perceived speed and performed dynamics of an audio file with high accuracy, using the average ratings from around 20 listeners as ground truths. In Part C, aspects related to music production are explored. The first paper analyzes long-term average spectrum (LTAS) in popular music. A compact equation is derived to describe the mean LTAS of a large dataset, and the variation is visualized. Further analysis shows that the level of the percussion is an important factor for LTAS. The second paper examines songwriting and composition through the development of an algorithmic composer of popular music. Various factors relevant for writing good compositions are encoded, and a listening test employed that shows the validity of the proposed methods.The dissertation is concluded by Part D - Looking Back and Ahead, which acts as a discussion and provides a road-map for future work. The first paper discusses the deep layered learning (DLL) technique, outlining concepts and pointing out a direction for future MIR implementations. It is suggested that DLL can help generalization by enforcing the validity of intermediate representations, and by letting the inferred representations establish disentangled structures supporting high-level invariant processing. The second paper proposes an architecture for tempo-invariant processing of rhythm with convolutional neural networks. Log-frequency representations of rhythm-related activations are suggested at the main stage of processing. Methods relying on magnitude, relative phase, and raw phase information are described for a wide variety of rhythm processing tasks.
7.	Koniaris, Christos, 1979- (författare) Perceptually motivated speech recognition and mispronunciation detection 2012 Doktorsavhandling (övrigt vetenskapligt/konstnärligt)abstract This doctoral thesis is the result of a research effort performed in two fields of speech technology, i.e., speech recognition and mispronunciation detection. Although the two areas are clearly distinguishable, the proposed approaches share a common hypothesis based on psychoacoustic processing of speech signals. The conjecture implies that the human auditory periphery provides a relatively good separation of different sound classes. Hence, it is possible to use recent findings from psychoacoustic perception together with mathematical and computational tools to model the auditory sensitivities to small speech signal changes.The performance of an automatic speech recognition system strongly depends on the representation used for the front-end. If the extracted features do not include all relevant information, the performance of the classification stage is inherently suboptimal. The work described in Papers A, B and C is motivated by the fact that humans perform better at speech recognition than machines, particularly for noisy environments. The goal is to make use of knowledge of human perception in the selection and optimization of speech features for speech recognition. These papers show that maximizing the similarity of the Euclidean geometry of the features to the geometry of the perceptual domain is a powerful tool to select or optimize features. Experiments with a practical speech recognizer confirm the validity of the principle. It is also shown an approach to improve mel frequency cepstrum coefficients (MFCCs) through offline optimization. The method has three advantages: i) it is computationally inexpensive, ii) it does not use the auditory model directly, thus avoiding its computational cost, and iii) importantly, it provides better recognition performance than traditional MFCCs for both clean and noisy conditions.The second task concerns automatic pronunciation error detection. The research, described in Papers D, E and F, is motivated by the observation that almost all native speakers perceive, relatively easily, the acoustic characteristics of their own language when it is produced by speakers of the language. Small variations within a phoneme category, sometimes different for various phonemes, do not change significantly the perception of the language’s own sounds. Several methods are introduced based on similarity measures of the Euclidean space spanned by the acoustic representations of the speech signal and the Euclidean space spanned by an auditory model output, to identify the problematic phonemes for a given speaker. The methods are tested for groups of speakers from different languages and evaluated according to a theoretical linguistic study showing that they can capture many of the problematic phonemes that speakers from each language mispronounce. Finally, a listening test on the same dataset verifies the validity of these methods.
8.	Saqr, Mohammed, et al. (författare) People, Ideas, Milestones : A Scientometric Study of Computational Thinking 2021 Ingår i: ACM Transactions on Computing Education. - : Association for Computing Machinery (ACM). - 1946-6226. ; 21:3 Tidskriftsartikel (refereegranskat)abstract The momentum around computational thinking (CT) has kindled a rising wave of research initiatives andscholarly contributions seeking to capitalize on the opportunities that CT could bring. A number of literaturereviews have showed a vibrant community of practitioners and a growing number of publications. However,the history and evolution of the emerging research topic, the milestone publications that have shaped itsdirections, and the timeline of the important developments may be better told through a quantitative, scientometric narrative. This article presents a bibliometric analysis of the drivers of the CT topic, as well as itsmain themes of research, international collaborations, influential authors, and seminal publications, and howauthors and publications have influenced one another. The metadata of 1,874 documents were retrieved fromthe Scopus database using the keyword “computational thinking.” The results show that CT research has been US-centric from the start, and continues to be dominated by US researchers both in volume and impact. International collaboration is relatively low, but clusters of joint research are found between, for example, anumber of Nordic countries, lusophone- and hispanophone countries, and central European countries. The results show that CT features the computing’s traditional tripartite disciplinary structure (design, modeling, and theory), a distinct emphasis on programming, and a strong pedagogical and educational backdrop including constructionism, self-efficacy, motivation, and teacher training.
9.	Battel, G U, et al. (författare) Analysis by synthesis in piano performance - A study on the theme of the Brahms’ "Variations on a Theme of Paganini€", op. 35 1993 Ingår i: Proceedings of SMAC 93 (Stockholm Music Acoustic Conference). - Stockholm : KTH Royal Institute of Technology. ; , s. 69-73 Konferensbidrag (refereegranskat)
10.	Battel, G U, et al. (författare) Automatic performance of musical scores by mean of neural nerworks : evaluation with listening tests 1993 Ingår i: X CIM Colloquium on Musical Informatics. ; , s. 97-101 Konferensbidrag (refereegranskat)

Skapa referenser, mejla, bekava och länka

Länka till träfflistan

Resultat 1-10 av 279

Avgränsa träffmängd

Typ av publikation: konferensbidrag (164); tidskriftsartikel (70); bokkapitel (17); doktorsavhandling (15); konstnärligt arbete (5); rapport (4); visa fler...; forskningsöversikt (3); proceedings (redaktörskap) (2); samlingsverk (redaktörskap) (1); bok (1); annan publikation (1); visa färre...

Typ av innehåll: refereegranskat (248); övrigt vetenskapligt/konstnärligt (27); populärvet., debatt m.m. (4)

Författare/redaktör: Bresin, Roberto, 196 ... (51); Pargman, Daniel (21); Holzapfel, André, 19 ... (19); Frid, Emma, 1988- (17); Elblaus, Ludvig, 198 ... (14); Holzapfel, Andre (13); visa fler...; Hedin, Björn, 1970- (12); Hüttenrauch, Helge (10); Bresin, Roberto (10); Räsänen, Minna (10); Tollmar, Konrad (8); Hrastinski, Stefan (8); Parnes, Peter (7); Friberg, Anders (7); Frid, Emma (7); Nilsson, Marcus (6); Hansen, Kjetil Falke ... (6); Gullström, Charlie (6); Panariello, Claudio (6); Benetos, Emmanouil (5); Severinson Eklundh, ... (5); Zapico Lamela, Jorge ... (5); Jakobsson, Peter (4); Höök, Kristina, 1964 ... (4); Green, Anders (4); Eriksson, Elina, 197 ... (4); Turpeinen, Marko (4); Bälter, Olle (4); Handberg, Leif, 1962 ... (4); Hedman, Anders (3); Hansen, Kjetil Falke ... (3); Enlund, Nils (3); Romero, Mario, 1973- (3); Hallberg, Josef (3); Josefsson, Pernilla (3); Falkenberg, Kjetil, ... (3); Núñez-Pacheco, Claud ... (3); Naeve, Ambjörn (3); Unander-Scharin, Car ... (3); Pauletto, Sandra (3); Enoksson, Fredrik, 1 ... (3); Synnes, Kåre (3); Eriksson, Elina (3); Palmer, Matthias (3); Ternström, Sten, 195 ... (3); Zapico, Jorge Luis (3); Pauletto, Sandra, As ... (3); Latupeirissa, Adrian ... (3); Mancini, Maurizio (3); Jääskeläinen, Petra (3); visa färre...

Lärosäte: Kungliga Tekniska Högskolan (279)Ta bort avgränsningen; Södertörns högskola (38); Luleå tekniska universitet (17); Uppsala universitet (12); Linnéuniversitetet (10); Stockholms universitet (9); visa fler...; Chalmers tekniska högskola (7); Karlstads universitet (5); Kungl. Musikhögskolan (5); Linköpings universitet (4); RISE (4); Mittuniversitetet (3); Umeå universitet (2); Örebro universitet (2); Stockholms konstnärliga högskola (2); Mälardalens universitet (1); Lunds universitet (1); Konstfack (1); Högskolan i Skövde (1); Högskolan Dalarna (1); Blekinge Tekniska Högskola (1); visa färre...

Språk: Engelska (273); Svenska (6)

Forskningsämne (UKÄ/SCB): Naturvetenskap (279); Teknik (59); Samhällsvetenskap (42); Humaniora (38); Medicin och hälsovetenskap (2)

År

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

Copyright © LIBRIS - Nationella bibliotekssystem
LIBRIS.kb.se

pil uppåt

Stäng

Kopiera och spara länken för att återkomma till aktuell vy