Search: onr:"swepub:oai:DiVA.org:kth-53571" >
Using accent inform...
-
Salvi, GiampieroKTH,Tal, musik och hörsel
(author)
Using accent information in ASR models for Swedish
- Article/chapterEnglish2003
Publisher, publication year, extent ...
Numbers
-
LIBRIS-ID:oai:DiVA.org:kth-53571
-
https://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-53571URI
Supplementary language notes
-
Language:English
-
Summary in:English
Part of subdatabase
Classification
-
Subject category:ref swepub-contenttype
-
Subject category:kon swepub-publicationtype
Notes
-
QC 20111230
-
In this study accent information is used in an attempt to improve acoustic models for automatic speech recognition (ASR). First, accent dependent Gaussian models were trained independently. The Bhattacharyya distance was then used in conjunction with agglomerative hierarchical clustering to define optimal strategies for merging those models. The resulting allophonic classes were analyzed and compared with the phonetic literature. Finally, accent "aware" models were built, in which the parametric complexity for each phoneme corresponds to the degree of variability across accent areas and to the amount of training data available for it. The models were compared to models with the same, but evenly spread, overall complexity showing in some cases a slight improvement in recognition accuracy.
Subject headings and genre
Added entries (persons, corporate bodies, meetings, titles ...)
-
KTHTal, musik och hörsel
(creator_code:org_t)
Related titles
-
In:Proceedings of INTERSPEECH'2003, s. 2677-2680
Internet link
To the university's database