Towards Sonification in Multimodal and User-Friendly Explainable Artificial Intelligence

↓ Direkt till sidans innehåll
↓ Direkt till sidans sekundära innehåll (sidomenyn)

Sökning: id:"swepub:oai:DiVA.org:his-22301" > Towards Sonificatio...

1 av 1
Föregående post
Nästa post
Till träfflistan

Towards Sonification in Multimodal and User-Friendly Explainable Artificial Intelligence

Schuller, Björn W. (författare): Chair of Embedded Intelligence for Health Care & Wellbeing, University of Augsburg, Augsburg, Germany

Virtanen, Tuomas (författare): Audio Research Group, Tampere University, Finland

Riveiro, Maria, 1978- (författare): Jönköping University,Jönköping AI Lab (JAIL),epartment of Computing, Jönköping University, Jönköping AI Lab (JAIL), Sweden

visa fler...

Rizos, Georgios (författare): GLAM – the Group on Language, Audio, & Music, Imperial College London, United Kingdom

Han, Jing (författare): Department of Computer Science and Technology, University of Cambridge, United Kingdom

Mesaros, Annamaria (författare): Audio Research Group, Tampere University, Finland

Drossos, Konstantinos (författare): Audio Research Group, Tampere University, Finland

visa färre...

(creator_code:org_t)

2021-10-18
2021
Engelska.
Ingår i: ICMI '21. - New York, NY, USA : Association for Computing Machinery (ACM). - 9781450384810 ; , s. 788-792

Relaterad länk:: https://opus.bibliot...; visa fler...; https://urn.kb.se/re...; https://doi.org/10.1...; https://urn.kb.se/re...; visa färre...

Konferensbidrag (refereegranskat)

Abstract Ämnesord

Stäng

We are largely used to hearing explanations. For example, if someone thinks you are sad today, they might reply to your “why?” with “because you were so Hmmmmm-mmm-mmm”. Today’s Artificial Intelligence (AI), however, is – if at all – largely providing explanations of decisions in a visual or textual manner. While such approaches are good for communication via visual media such as in research papers or screens of intelligent devices, they may not always be the best way to explain; especially when the end user is not an expert. In particular, when the AI’s task is about Audio Intelligence, visual explanations appear less intuitive than audible, sonified ones. Sonification has also great potential for explainable AI (XAI) in systems that deal with non-audio data – for example, because it does not require visual contact or active attention of a user. Hence, sonified explanations of AI decisions face a challenging, yet highly promising and pioneering task. That involves incorporating innovative XAI algorithms to allow pointing back at the learning data responsible for decisions made by an AI, and to include decomposition of the data to identify salient aspects. It further aims to identify the components of the preprocessing, feature representation, and learnt attention patterns that are responsible for the decisions. Finally, it targets decision-making at the model-level, to provide a holistic explanation of the chain of processing in typical pattern recognition problems from end-to-end. Sonified AI explanations will need to unite methods for sonification of the identified aspects that benefit decisions, decomposition and recomposition of audio to sonify which parts in the audio were responsible for the decision, and rendering attention patterns and salient feature representations audible. Benchmarking sonified XAI is challenging, as it will require a comparison against a backdrop of existing, state-of-the-art visual and textual alternatives, as well as synergistic complementation of all modalities in user evaluations. Sonified AI explanations will need to target different user groups to allow personalisation of the sonification experience for different user needs, to lead to a major breakthrough in comprehensibility of AI via hearing how decisions are made, hence supporting tomorrow’s humane AI’s trustability. Here, we introduce and motivate the general idea, and provide accompanying considerations including milestones of realisation of sonifed XAI and foreseeable risks.

Ämnesord

NATURVETENSKAP -- Data- och informationsvetenskap -- Datavetenskap (hsv//swe)
NATURAL SCIENCES -- Computer and Information Sciences -- Computer Sciences (hsv//eng)
TEKNIK OCH TEKNOLOGIER -- Annan teknik -- Mediateknik (hsv//swe)
ENGINEERING AND TECHNOLOGY -- Other Engineering and Technologies -- Media Engineering (hsv//eng)
NATURVETENSKAP -- Data- och informationsvetenskap -- Människa-datorinteraktion (hsv//swe)
NATURAL SCIENCES -- Computer and Information Sciences -- Human Computer Interaction (hsv//eng)

Nyckelord

Explainable artificial intelligence
sonification
trustworthy artificial intelligence
human computer interaction
multimodality

Publikations- och innehållstyp

ref (ämneskategori)
kon (ämneskategori)

Hitta via bibliotek

ICMI '21 (Sök värdpublikationen i LIBRIS)

Till lärosätets databas

1 av 1
Föregående post
Nästa post
Till träfflistan

Hitta mer i SwePub

Av författaren/redakt...: Schuller, Björn ...; Virtanen, Tuomas; Riveiro, Maria, ...; Rizos, Georgios; Han, Jing; Mesaros, Annamar ...; visa fler...; Drossos, Konstan ...; visa färre...

Om ämnet

NATURVETENSKAP: NATURVETENSKAP; och Data och informa ...; och Datavetenskap

TEKNIK OCH TEKNOLOGIER: TEKNIK OCH TEKNO ...; och Annan teknik; och Mediateknik

NATURVETENSKAP: NATURVETENSKAP; och Data och informa ...; och Människa datorin ...

Artiklar i publikationen: ICMI '21

Av lärosätet: Högskolan i Skövde; Jönköping University

Sök utanför SwePub

Sök vidare i:: Google; Google Book Search; Google Scholar

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

LIBRIS.kb.se