Sökning: id:"swepub:oai:DiVA.org:kth-337872" >
An Analysis of Good...
An Analysis of Goodness of Pronunciation for Child Speech
-
- Cao, Xinwei (författare)
- Department of Electronic Systems, NTNU, Norway
-
- Fan, Zijian (författare)
- Department of Electronic Systems, NTNU, Norway
-
- Svendsen, Torbjørn (författare)
- Department of Electronic Systems, NTNU, Norway
-
visa fler...
-
- Salvi, Giampiero (författare)
- KTH,Tal, musik och hörsel, TMH,Department of Electronic Systems, NTNU, Norway
-
visa färre...
-
(creator_code:org_t)
- International Speech Communication Association, 2023
- 2023
- Engelska.
-
Ingår i: Interspeech 2023. - : International Speech Communication Association. ; , s. 4613-4617
- Relaterad länk:
-
https://urn.kb.se/re...
-
visa fler...
-
https://doi.org/10.2...
-
visa färre...
Abstract
Ämnesord
Stäng
- In this paper, we study the use of goodness of pronunciation (GOP) on child speech. We first compare the distributions of GOP scores on several open datasets representing various dimensions of speech variability. We show that the GOP distribution over CMU Kids, corresponding to young age, has larger spread than those on datasets representing other dimensions, i.e., accent, dialect, spontaneity and environmental conditions. We hypothesize that the increased variability of pronunciation in young age may impair the use of traditional mispronunciation detection methods for children. To support this hypothesis, we perform simulated mispronunciation experiments both for children and adults using different variants of the GOP algorithm. We also compare the results to real-case mispronunciations for native children showing that GOP is less effective for child speech than for adult speech.
Ämnesord
- NATURVETENSKAP -- Data- och informationsvetenskap -- Datavetenskap (hsv//swe)
- NATURAL SCIENCES -- Computer and Information Sciences -- Computer Sciences (hsv//eng)
- NATURVETENSKAP -- Data- och informationsvetenskap -- Språkteknologi (hsv//swe)
- NATURAL SCIENCES -- Computer and Information Sciences -- Language Technology (hsv//eng)
Nyckelord
- ASR
- child speech
- data scarcity
- GOP
- mispronunciation detection and diagnosis
- speech assessment
Publikations- och innehållstyp
- ref (ämneskategori)
- kon (ämneskategori)