Sökning: onr:"swepub:oai:DiVA.org:kth-53571" >
Using accent inform...
-
Salvi, GiampieroKTH,Tal, musik och hörsel
(författare)
Using accent information in ASR models for Swedish
- Artikel/kapitelEngelska2003
Förlag, utgivningsår, omfång ...
Nummerbeteckningar
-
LIBRIS-ID:oai:DiVA.org:kth-53571
-
https://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-53571URI
Kompletterande språkuppgifter
-
Språk:engelska
-
Sammanfattning på:engelska
Ingår i deldatabas
Klassifikation
-
Ämneskategori:ref swepub-contenttype
-
Ämneskategori:kon swepub-publicationtype
Anmärkningar
-
QC 20111230
-
In this study accent information is used in an attempt to improve acoustic models for automatic speech recognition (ASR). First, accent dependent Gaussian models were trained independently. The Bhattacharyya distance was then used in conjunction with agglomerative hierarchical clustering to define optimal strategies for merging those models. The resulting allophonic classes were analyzed and compared with the phonetic literature. Finally, accent "aware" models were built, in which the parametric complexity for each phoneme corresponds to the degree of variability across accent areas and to the amount of training data available for it. The models were compared to models with the same, but evenly spread, overall complexity showing in some cases a slight improvement in recognition accuracy.
Ämnesord och genrebeteckningar
Biuppslag (personer, institutioner, konferenser, titlar ...)
-
KTHTal, musik och hörsel
(creator_code:org_t)
Sammanhörande titlar
-
Ingår i:Proceedings of INTERSPEECH'2003, s. 2677-2680
Internetlänk