SwePub
id:"swepub:oai:DiVA.org:kth-279231"
 


Style-Controllable Speech-Driven Gesture Synthesis Using Normalising Flows

Alexanderson, Simon (author)
KTH, Speech, Music and Hearing, TMH
Henter, Gustav Eje, Assistant Professor (author)
KTH, Speech, Music and Hearing, TMH
Kucherenko, Taras, 1994- (author)
KTH, Robotics, Perception and Learning, RPL
Beskow, Jonas (author)
KTH, Speech, Music and Hearing, TMH
2020-07-13
2020
English.
In: Computer Graphics Forum (Print). Wiley. ISSN 0167-7055, E-ISSN 1467-8659. Vol. 39, No. 2, pp. 487-496.
Journal article (peer-reviewed)
Abstract
Automatic synthesis of realistic gestures promises to transform the fields of animation, avatars and communicative agents. In off-line applications, novel tools can alter the role of an animator to that of a director, who provides only high-level input for the desired animation; a learned network then translates these instructions into an appropriate sequence of body poses. In interactive scenarios, systems for generating natural animations on the fly are key to achieving believable and relatable characters. In this paper we address some of the core issues towards these ends. By adapting a deep learning-based motion synthesis method called MoGlow, we propose a new generative model for generating state-of-the-art realistic speech-driven gesticulation. Owing to the probabilistic nature of the approach, our model can produce a battery of different, yet plausible, gestures given the same input speech signal. Just like humans, this gives a rich natural variation of motion. We additionally demonstrate the ability to exert directorial control over the output style, such as gesture level, speed, symmetry and spatial extent. Such control can be leveraged to convey a desired character personality or mood. We achieve all this without any manual annotation of the data. User studies evaluating upper-body gesticulation confirm that the generated motions are natural and well match the input speech. Our method scores above all prior systems and baselines on these measures, and comes close to the ratings of the original recorded motions. We furthermore find that we can accurately control gesticulation styles without unnecessarily compromising perceived naturalness. Finally, we also demonstrate an application of the same method to full-body gesticulation, including the synthesis of stepping motion and stance.
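The abstract describes a conditional normalising-flow model: invertible transforms map Gaussian noise to pose data, conditioned on speech features and a style control vector, so that different noise samples yield different yet plausible gestures for the same speech. The sketch below is a minimal illustration of that conditional-sampling idea in PyTorch, not the authors' MoGlow implementation; the class name, feature dimensions and the style vector are hypothetical choices made here for illustration.

import torch
import torch.nn as nn

class ConditionalAffineCoupling(nn.Module):
    # One invertible affine-coupling step: half of the pose dimensions are
    # rescaled and shifted by a small network that sees the other half plus
    # the conditioning signal (speech features + style controls).
    def __init__(self, pose_dim, cond_dim, hidden=128):
        super().__init__()
        self.half = pose_dim // 2
        self.net = nn.Sequential(
            nn.Linear(self.half + cond_dim, hidden),
            nn.ReLU(),
            nn.Linear(hidden, 2 * (pose_dim - self.half)),
        )

    def inverse(self, z, cond):
        # Map a latent sample z back to pose space (the direction used at
        # synthesis time); the forward pass and log-determinant needed for
        # maximum-likelihood training are omitted from this sketch.
        z_a, z_b = z[:, :self.half], z[:, self.half:]
        log_scale, shift = self.net(torch.cat([z_a, cond], dim=-1)).chunk(2, dim=-1)
        x_b = (z_b - shift) * torch.exp(-log_scale)
        return torch.cat([z_a, x_b], dim=-1)

pose_dim, speech_dim, style_dim = 45, 27, 3      # hypothetical sizes
flow = ConditionalAffineCoupling(pose_dim, speech_dim + style_dim)

speech = torch.randn(1, speech_dim)              # stand-in per-frame speech features
style = torch.tensor([[1.0, 0.0, 0.5]])          # e.g. gesture level / symmetry / extent
cond = torch.cat([speech, style], dim=-1)

# Different noise samples give different, yet plausible, poses for the same speech.
for _ in range(3):
    z = torch.randn(1, pose_dim)
    print(flow.inverse(z, cond).shape)           # torch.Size([1, 45])

In the paper's system the flow is far deeper, models pose sequences rather than single frames, and the style controls correspond to properties such as gesture level, speed, symmetry and spatial extent; the sketch above only shows the mechanism by which sampling different latents under fixed conditioning produces varied output.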

Subject terms

NATURAL SCIENCES -- Computer and Information Sciences -- Computer Sciences (hsv//eng)
NATURAL SCIENCES -- Computer and Information Sciences -- Human Computer Interaction (hsv//eng)
NATURAL SCIENCES -- Computer and Information Sciences -- Language Technology (hsv//eng)

Keywords

CCS Concepts
Computing methodologies
Motion capture
Animation
Neural networks
Gestures
Motion capture
Data-driven animation
Character control
Probabilistic models
WASP_publications
Human-computer Interaction
Computer Science

Publication and content type

ref (peer-reviewed)
art (journal article)
