SwePub
Sök i SwePub databas

  Utökad sökning

Träfflista för sökning "L773:0167 6393 OR L773:1872 7182 "

Sökning: L773:0167 6393 OR L773:1872 7182

  • Resultat 1-10 av 41
Sortera/gruppera träfflistan
   
NumreringReferensOmslagsbildHitta
1.
  • Wood, Sidney A J, et al. (författare)
  • A cinefluorographic study of the temporal organization of articulator gestures: Examples from Greenlandic
  • 1997
  • Ingår i: Speech Communication (Special Issue: Speech Production: Models and Data ). - 1872-7182 .- 0167-6393. ; 22:2-3, s. 207-225
  • Konferensbidrag (refereegranskat)abstract
    • Movement data on articulator gestures in West Greenlandic are presented in order to elucidate principles of articulator coordination, especially the domain of coarticulation (as distinct from the domain of assimilation), the handling of conflicting demands on articulators, and the relation of vowels to consonants. The present data are consistent with results previously obtained from Swedish and Bulgarian. The Greenlandic informant varied his domain of coarticulation by up to two phonemes either side of the current phoneme; potential gesture conflicts were resolved in accordance with the model of Kozhevnikov and Chistovich, oncoming gestures being delayed when they were antagonistic to ongoing gestures; finally, articulator gestures were organized according to the same principles for both vowels and consonants.
  •  
2.
  • Beaugendre, F., et al. (författare)
  • Accentuation boundaries in Dutch, French and Swedish
  • 2001
  • Ingår i: Speech Communication. - 0167-6393 .- 1872-7182. ; 33:4, s. 305-318
  • Tidskriftsartikel (refereegranskat)abstract
    • This paper presents a comparative study investigating the relation between the timing of a rising or falling pitch movement and the temporal structure of the syllable it accentuates for three languages: Dutch, French and Swedish. In a perception experiment, the five-syllable utterances /mamamamama/ and /?a?a?a?a?a/ were provided with a relatively fast rising or falling pitch movement. The timing of the movement was systematically varied so that it accented the third or the fourth syllable, subjects were asked to indicate which syllable they perceived as accented. The accentuation boundary (AB) between the third and the fourth syllable was then defined as the moment before which more than half of the subjects indicated the third syllable as accented and after which more than half of the subjects indicated the fourth syllable. The results show that there are significant differences between the three languages as to the location of the AB. In general, for the rises, well-defined ABs were found. They were located in the middle of the vowel of the third syllable for French subjects, and later in that vowel for Dutch and swedish subjects. For the falls, a clear AB was obtained only for the Dutch and the Swedish listeners. This was located at the end of the third syllable. For the French listeners, the fall did not yield a clear AB, This corroborates the absence of accentuation by means of falls in French. By varying the duration of the pitch movement it could be shown that, in all cases in which a clear AB was found. the cue for accentuation was located at the beginning of the pitch movement.
  •  
3.
  • Bimbot, F, et al. (författare)
  • An overwiev of the CAVE project research activities in speaker verification
  • 2000
  • Ingår i: Speech Communication. - 0167-6393 .- 1872-7182. ; 31:2-3, s. 155-180
  • Tidskriftsartikel (refereegranskat)abstract
    • This article presents an overview of the research activities carried out in the European CAVE project, which focused on text-dependent speaker verification on the telephone network using whole word Hidden Markov Models. It documents in detail various aspects of the technology and the methodology used within the project. In particular, it addresses the issue of model estimation in the context of limited enrollment data and the problem of a posteriori decision threshold setting. Experiments are carried out on the realistic telephone speech database SESP. State-of-the-art performance levels are obtained, which validates the technical approaches developed and assessed during the project as well as the working infrastructure which facilitated cooperation between the partners.
  •  
4.
  • Botinis, A., et al. (författare)
  • Developments and paradigms in intonation research
  • 2001
  • Ingår i: Speech Communication. - 0167-6393 .- 1872-7182. ; 33:4, s. 263-296
  • Forskningsöversikt (refereegranskat)abstract
    • The present tutorial paper is addressed to a wide audience with different discipline backgrounds as well as variable expertise on intonation. The paper is structured into five sections. In Section 1, Introduction, basic concepts of intonation and prosody are summarised and cornerstones of intonation research are highlighted. In Section 2, Functions and forms of intonation, a wide range of functions from morpholexical and phrase levels to discourse and dialogue levels are discussed and forms of intonation with examples from different languages are presented. In Section 3, Modelling and labelling of intonation, established models of intonation as well as labelling systems are presented. In Section 4, Applications of intonation the most widespread applications of intonation and especially technological ones are presented and methodological issues are discussed. In Section 5, Research perspective research avenues and ultimate goals as well as the significance and benefits of intonation research in the upcoming years are outlined.
  •  
5.
  • Eklund, Robert, 1962-, et al. (författare)
  • Xenophones : An investigation of phone set expansion in Swedish and implications for speech recognition and speech synthesis
  • 2001
  • Ingår i: Speech Communication. - : Elsevier. - 0167-6393 .- 1872-7182. ; 35:1-2, s. 81-102
  • Tidskriftsartikel (refereegranskat)abstract
    • In recent years, both automatic speech recognition (ASR) and text-to-speech (TTS) conversion systems have attained quality levels that allow inclusion in everyday applications. One remaining problem to be solved in both these types of applications is that alleged phone inventories of specific languages are commonly expanded with phones from other languages, a problem that becomes more acute in an increasingly internationalized world where multilingual automatic speech-based services are a desideratum. This paper investigates the nature of phone set expansion in Swedish. The status of these phones is discussed, and since such added phones do not have a phonemic (or allophonic) function, the term 'xenophones' is suggested. The analysis is based on a production study involving 491 subjects, and the observed xenophonic expansion is described in terms of three categories along the "awareness" and the "fidelity" dimensions. The results show that very few subjects resort to full rephonematization and that xenophonic expansion is the rule, although there is an uneven distribution depending on particular phones, spanning from phones produced by most subjects, to phones produced by almost no subjects. Of the possible explanatory factors analyzed - regional background, gender, age and educational level - the latter is by far the most important. © 2001 Elsevier Science B.V.
  •  
6.
  • Engwall, Olov (författare)
  • Combining MRI, EMA and EPG measurements in a three-dimensional tongue model
  • 2003
  • Ingår i: Speech Communication. - 0167-6393 .- 1872-7182. ; 41:2-3, s. 303-329
  • Tidskriftsartikel (refereegranskat)abstract
    • A three-dimensional (3D) tongue model has been developed using MR images of a reference subject producing 44 artificially sustained Swedish articulations. Based on the difference in tongue shape between the articulations and a reference, the six linear parameters jaw height, tongue body, tongue dorsum, tongue tip, tongue advance and tongue width were determined using an ordered linear factor analysis controlled by articulatory measures. The first five factors explained 88% of the tongue data variance in the midsagittal plane and 78% in the 3D analysis. The six-parameter model is able to reconstruct the modelled articulations with an overall mean reconstruction error of 0.13 cm, and it specifically handles lateral differences and asymmetries in tongue shape. In order to correct articulations that were hyperarticulated due to the artificial sustaining in the magnetic resonance imaging (MRI) acquisition, the parameter values in the tongue model were readjusted based on a comparison of virtual and natural linguopalatal contact patterns, collected with electropalatography (EPG). Electromagnetic articulography (EMA) data was collected to control the kinematics of the tongue model for vowel-fricative sequences and an algorithm to handle surface contacts has been implemented, preventing the tongue from protruding through the palate and teeth.
  •  
7.
  • Karlsson, Inger A., et al. (författare)
  • Speaker verification with elicited speaking styles in the VeriVox project
  • 2000
  • Ingår i: Speech Communication. - 0167-6393 .- 1872-7182. ; 31:03-feb, s. 121-129
  • Tidskriftsartikel (refereegranskat)abstract
    • Some experiments have been carried out to study and compensate for within-speaker variations in speaker verification. To induce speaker variation, a speaking behaviour elicitation software package has been developed. A 50-speaker database with voluntary and involuntary speech variation has been recorded using this software. The database has been used for acoustic analysis as well as for automatic speaker verification (ASV) tests. The voluntary speech variations are used to form an enrolment set for the ASV system. This set is called structured training and is compared to neutral training where only normal speech is used. Both sets contain the same number of utterances. It is found that the ASV system improves its performance when testing on a mixed speaking style test without decreasing the performance of the tests with normal speech.
  •  
8.
  • Ambrazaitis, Gilbert, 1979-, et al. (författare)
  • Multimodal prominences : Exploring the patterning and usage of focal pitch accents, head beats and eyebrow beats in Swedish television news readings
  • 2017
  • Ingår i: Speech Communication. - : Elsevier B.V.. - 0167-6393 .- 1872-7182. ; 95, s. 100-113
  • Tidskriftsartikel (refereegranskat)abstract
    • Facial beat gestures align with pitch accents in speech, functioning as visual prominence markers. However, it is not yet well understood whether and how gestures and pitch accents might be combined to create different types of multimodal prominence, and how specifically visual prominence cues are used in spoken communication. In this study, we explore the use and possible interaction of eyebrow (EB) and head (HB) beats with so-called focal pitch accents (FA) in a corpus of 31 brief news readings from Swedish television (four news anchors, 986 words in total), focusing on effects of position in text, information structure as well as speaker expressivity. Results reveal an inventory of four primary (combinations of) prominence markers in the corpus: FA+HB+EB, FA+HB, FA only (i.e., no gesture), and HB only, implying that eyebrow beats tend to occur only in combination with the other two markers. In addition, head beats occur significantly more frequently in the second than in the first part of a news reading. A functional analysis of the data suggests that the distribution of head beats might to some degree be governed by information structure, as the text-initial clause often defines a common ground or presents the theme of the news story. In the rheme part of the news story, FA, HB, and FA+HB are all common prominence markers. The choice between them is subject to variation which we suggest might represent a degree of freedom for the speaker to use the markers expressively. A second main observation concerns eyebrow beats, which seem to be used mainly as a kind of intensification marker for highlighting not only contrast, but also value, magnitude, or emotionally loaded words; it is applicable in any position in a text. We thus observe largely different patterns of occurrence and usage of head beats on the one hand and eyebrow beats on the other, suggesting that the two represent two separate modalities of visual prominence cuing.
  •  
9.
  • Ananthakrishnan, Gopal, et al. (författare)
  • Mapping between acoustic and articulatory gestures
  • 2011
  • Ingår i: Speech Communication. - : Elsevier BV. - 0167-6393 .- 1872-7182. ; 53:4, s. 567-589
  • Tidskriftsartikel (refereegranskat)abstract
    • This paper proposes a definition for articulatory as well as acoustic gestures along with a method to segment the measured articulatory trajectories and acoustic waveforms into gestures. Using a simultaneously recorded acoustic-articulatory database, the gestures are detected based on finding critical points in the utterance, both in the acoustic and articulatory representations. The acoustic gestures are parameterized using 2-D cepstral coefficients. The articulatory trajectories arc essentially the horizontal and vertical movements of Electromagnetic Articulography (EMA) coils placed on the tongue, jaw and lips along the midsagittal plane. The articulatory movements are parameterized using 2D-DCT using the same transformation that is applied on the acoustics. The relationship between the detected acoustic and articulatory gestures in terms of the timing as well as the shape is studied. In order to study this relationship further, acoustic-to-articulatory inversion is performed using GMM-based regression. The accuracy of predicting the articulatory trajectories from the acoustic waveforms are at par with state-of-the-art frame-based methods with dynamical constraints (with an average error of 1.45-1.55 mm for the two speakers in the database). In order to evaluate the acoustic-to-articulatory inversion in a more intuitive manner, a method based on the error in estimated critical points is suggested. Using this method, it was noted that the estimated articulatory trajectories using the acoustic-to-articulatory inversion methods were still not accurate enough to be within the perceptual tolerance of audio-visual asynchrony.
  •  
10.
  •  
Skapa referenser, mejla, bekava och länka
  • Resultat 1-10 av 41
Typ av publikation
tidskriftsartikel (39)
konferensbidrag (1)
forskningsöversikt (1)
Typ av innehåll
refereegranskat (40)
övrigt vetenskapligt/konstnärligt (1)
Författare/redaktör
House, David (6)
Salvi, Giampiero (4)
Granström, Björn (4)
Hjalmarsson, Anna (4)
Engwall, Olov (4)
Gustafson, Joakim (3)
visa fler...
Lindberg, J (2)
Edlund, Jens (2)
Skantze, Gabriel (2)
Blomberg, Mats (2)
Traunmüller, Hartmut (2)
Carlson, Rolf (2)
Botinis, A (1)
Ahmed, Zeeshan (1)
Oertel, Catharine (1)
Székely, Eva (1)
Carson-Berndsen, Jul ... (1)
Kjellström, Hedvig (1)
Eklund, Robert, 1962 ... (1)
Ananthakrishnan, Gop ... (1)
Kleijn, W. Bastiaan (1)
Koniaris, Christos, ... (1)
Verikas, Antanas, 19 ... (1)
Bacauskiene, Marija (1)
Gelzinis, Adas (1)
Ambrazaitis, Gilbert ... (1)
Wik, Preben (1)
Zlotea, Claudia (1)
Lyberg Åhlander, Viv ... (1)
Sahlén, Birgitta (1)
Elenius, Kjell (1)
Lindholm, Torun, 196 ... (1)
Themistocleous, Char ... (1)
Nirme, Jens (1)
Haake, Magnus (1)
Melin, H (1)
Boye, Johan (1)
Uloza, Virgilijus (1)
Beaugendre, F. (1)
Hermes, D. J. (1)
Wirén, Mats, 1954- (1)
McAllister, Anita (1)
Nordstrand, Magnus (1)
Svanfeldt, Gunilla (1)
Heldner, Mattias (1)
Laukka, Petri (1)
Hirschberg, J. (1)
Bimbot, F. (1)
Hutter, H. -P (1)
Jaboulet, C. (1)
visa färre...
Lärosäte
Kungliga Tekniska Högskolan (31)
Stockholms universitet (5)
Göteborgs universitet (3)
Lunds universitet (3)
Uppsala universitet (1)
Högskolan i Halmstad (1)
visa fler...
Linköpings universitet (1)
Linnéuniversitetet (1)
Karolinska Institutet (1)
visa färre...
Språk
Engelska (41)
Forskningsämne (UKÄ/SCB)
Naturvetenskap (18)
Humaniora (12)
Teknik (4)
Samhällsvetenskap (3)
Medicin och hälsovetenskap (1)

År

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy