SwePub
Tyck till om SwePub Sök här!
Sök i SwePub databas

  Utökad sökning

Träfflista för sökning "hsv:(NATURVETENSKAP) hsv:(Data och informationsvetenskap) hsv:(Språkteknologi) srt2:(2005-2009)"

Sökning: hsv:(NATURVETENSKAP) hsv:(Data och informationsvetenskap) hsv:(Språkteknologi) > (2005-2009)

  • Resultat 1-10 av 631
Sortera/gruppera träfflistan
   
NumreringReferensOmslagsbildHitta
1.
  • Neiberg, Daniel, et al. (författare)
  • Emotion Recognition in Spontaneous Speech Using GMMs
  • 2006
  • Ingår i: INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING. - BAIXAS : ISCA-INST SPEECH COMMUNICATION ASSOC. ; , s. 809-812, s. 101-104
  • Konferensbidrag (refereegranskat)abstract
    • Automatic detection of emotions has been evaluated using standard Mel-frequency Cepstral Coefficients, MFCCs, and a variant, MFCC-low, calculated between 20 and 300 Hz, in order to model pitch. Also plain pitch features have been used. These acoustic features have all been modeled by Gaussian mixture models, GMMs, on the frame level. The method has been tested on two different corpora and languages; Swedish voice controlled telephone services and English meetings. The results indicate that using GMMs on the frame level is a feasible technique for emotion classification. The two MFCC methods have similar performance, and MFCC-low outperforms the pitch features. Combining the three classifiers significantly improves performance.
  •  
2.
  • Larsson, Staffan, 1969, et al. (författare)
  • Corrective feedback and concept updates
  • 2008
  • Ingår i: Carlson et al (eds.): Proceedings of The second Swedish Language Technology Conference (SLTC-08).
  • Tidskriftsartikel (refereegranskat)
  •  
3.
  • Kokkinakis, Dimitrios, 1965 (författare)
  • Shallow Features for Differentiating Disease-Treatment Relations using Supervised Learning, a pilot study
  • 2009
  • Ingår i: Proceedings of the 12th International Conference TSD (Text, Speech and Dialogue). Springer Verlag, LNCS/LNAI series.. ; 5729, s. 395-402
  • Konferensbidrag (övrigt vetenskapligt/konstnärligt)abstract
    • Clinical narratives provide an information rich, nearly unexplored corpus of evidential knowledge that is considered as a challenge for practitioners in the language technology field, particularly because of the nature of the texts (excessive use of terminology, abbreviations, orthographic term variation), the significant opportunities for clinical research that such material can provide and the potentially broad impact that clinical findings may have in every day life. It is therefore recognized that the capability to automatically extract key concepts and their relationships from such data will allow systems to properly understand the content and knowledge embedded in the free text which can be of great value for applications such as information extraction and question & answering. This paper gives a brief presentation of such textual data and its semantic annotation, and discuss the set of semantic relations that can be observed between diseases and treatments in the sample. The problem is then designed as a machine learning task in which the relations are tried to be learned in a supervised fashion, using pre-annotated data. The challenges designing the problem and empirical results are presented.
  •  
4.
  •  
5.
  • Olsson, Fredrik, 1971 (författare)
  • Bootstrapping Named Entity Annotation by Means of Active Machine Learning
  • 2008
  • Doktorsavhandling (övrigt vetenskapligt/konstnärligt)abstract
    • This thesis describes the development and in-depth empirical investigation of a method, called BootMark, for bootstrapping the marking up of named entities in textual documents. The reason for working with documents, as opposed to for instance sentences or phrases, is that the BootMark method is concerned with the creation of corpora. The claim made in the thesis is that BootMark requires a human annotator to manually annotate fewer documents in order to produce a named entity recognizer with a given performance, than would be needed if the documents forming the basis for the recognizer were randomly drawn from the same corpus. The intention is then to use the created named entity recognizer as a pre-tagger and thus eventually turn the manual annotation process into one in which the annotator reviews system-suggested annotations rather than creating new ones from scratch. The BootMark method consists of three phases: (1) Manual annotation of a set of documents; (2) Bootstrapping -- active machine learning for the purpose of selecting which document to annotate next; (3) The remaining unannotated documents of the original corpus are marked up using pre-tagging with revision. Five emerging issues are identified, described and empirically investigated in the thesis. Their common denominator is that they all depend on the realization of the named entity recognition task, and as such, require the context of a practical setting in order to be properly addressed. The emerging issues are related to: (1) the characteristics of the named entity recognition task and the base learners used in conjunction with it; (2) the constitution of the set of documents annotated by the human annotator in phase one in order to start the bootstrapping process; (3) the active selection of the documents to annotate in phase two; (4) the monitoring and termination of the active learning carried out in phase two, including a new intrinsic stopping criterion for committee-based active learning; and (5) the applicability of the named entity recognizer created during phase two as a pre-tagger in phase three. The outcomes of the empirical investigations concerning the emerging issues support the claim made in the thesis. The results also suggest that while the recognizer produced in phases one and two is as useful for pre-tagging as a recognizer created from randomly selected documents, the applicability of the recognizer as a pre-tagger is best investigated by conducting a user study involving real annotators working on a real named entity recognition task.
  •  
6.
  • Argaw, Atelach Alemu, et al. (författare)
  • Dictionary-based Amharic-French information retrieval
  • 2006
  • Ingår i: Accessing Multilingual Information Repositories. - Berlin, Heidelberg : Springer Berlin Heidelberg. - 354045697X ; , s. 83-92, s. 83-92
  • Konferensbidrag (refereegranskat)abstract
    • We present four approaches to the Amharic - French bilingual track at CLEF 2005. All experiments use a dictionary based approach to translate the Amharic queries into French Bags-of-words, but while one approach uses word sense discrimination on the translated side of the queries, the other one includes all senses of a translated word in the query for searching. We used two search engines: The SICS experimental engine and Lucene, hence four runs with the two approaches. Non-content bearing words were removed both before and after the dictionary lookup. TF/IDF values supplemented by a heuristic function was used to remove the stop words from the Amharic queries and two French stopwords lists were used to remove them from the French translations. In our experiments, we found that the SICS search engine performs better than Lucene and that using the word sense discriminated keywords produce a slightly better result than the full set of non discriminated keywords.
  •  
7.
  • Boye, Johan, et al. (författare)
  • Robust parsing and spoken negotiative dialogue with databases
  • 2008
  • Ingår i: Natural Language Engineering. - : Cambridge University Press. - 1351-3249 .- 1469-8110. ; 14:3, s. 289-312
  • Tidskriftsartikel (refereegranskat)abstract
    • This paper presents a robust parsing algorithm and semantic formalism for the interpretation of utterances in spoken negotiative dialogue with databases. The algorithm works in two passes: a domain-specific pattern-matching phase and a domain-independent semantic analysis phase. Robustness is achieved by limiting the set of representable utterance types to an empirically motivated subclass which is more expressive than propositional slot–value lists, but much less expressive than first-order logic. Our evaluation shows that in actual practice the vast majority of utterances that occur can be handled, and that the parsing algorithm is highly efficient and accurate.
  •  
8.
  • Gey, Frederic, et al. (författare)
  • Information access in a multilingual world: transitioning from research to real-world applications
  • 2009. - 5
  • Ingår i: SIGIR Forum. - Kista, Sweden : Swedish Institute of Computer Science. - 0163-5840 .- 1558-0229. ; 43, s. 24-28
  • Rapport (övrigt vetenskapligt/konstnärligt)abstract
    • This report constitutes the proceedings of the workshop on Information Access in a Multilingual World: Transitioning from Research to Real-World Applications}, held at SIGIR 2009 in Boston, July 23, 2009. Multilingual Information Access (MLIA) is at a turning point wherein substantial real-world applications are being introduced after fifteen years of research into cross-language information retrieval, question answering, statistical machine translation and named entity recognition. Previous workshops on this topic have focused on research and small-scale applications. The focus of this workshop was on technology transfer from research to applications and on what future research needs to be done which facilitates MLIA in an increasingly connected multilingual world.
  •  
9.
  • Rosell, Magnus, et al. (författare)
  • Global Evaluation of Random Indexing through Swedish Word Clustering Compared to the People’s Dictionary of Synonyms
  • 2009
  • Ingår i: Proceedings of the International Conference RANLP-2009. ; , s. 376-380
  • Konferensbidrag (refereegranskat)abstract
    • Evaluation of word space models is usually local in the sense that it only considers words that are deemed very similar by the model. We propose a global evaluation scheme based on clustering of the words. A clustering of high quality in an external evaluation against a semantic resource, such as a dictionary of synonyms, indicates a word space model of high quality. We use Random Indexing to create several different models and compare them by clustering evaluation against the People's Dictionary of Synonyms, a list of Swedish synonyms that are graded by the public. Most notably we get better results for models based on syntagmatic information (words that appear together) than for models based on paradigmatic information (words that appear in similar contexts). This is quite contrary to previous results that have been presented for local evaluation. Clusterings to ten clusters result in a recall of 83% for a syntagmatic model, compared to 34% for a comparable paradigmatic model, and 10% for a random partition.
  •  
10.
  • Sahlgren, Magnus, et al. (författare)
  • Automatic Bilingual Lexicon Acquisition Using Random Indexing of Parallel Corpora
  • 2005
  • Ingår i: Natural Language Engineering. - 1351-3249 .- 1469-8110. ; 11:3, s. 327-341
  • Tidskriftsartikel (refereegranskat)abstract
    • This paper presents a very simple and effective approach to using parallel corpora for automatic bilingual lexicon acquisition. The approach, which uses the Random Indexing vector space methodology, is based on finding correlations between terms based on their distributional characteristics. The approach requires a minimum of preprocessing and linguistic knowledge, and is efficient, fast and scalable. In this paper, we explain how our approach differs from traditional cooccurrence-based word alignment algorithms, and we demonstrate how to extract bilingual lexica using the Random Indexing approach applied to aligned parallel data. The acquired lexica are evaluated by comparing them to manually compiled gold standards, and we report overlap of around 60%. We also discuss methodological problems with evaluating lexical resources of this kind.
  •  
Skapa referenser, mejla, bekava och länka
  • Resultat 1-10 av 631
Typ av publikation
konferensbidrag (400)
tidskriftsartikel (83)
bokkapitel (73)
rapport (35)
doktorsavhandling (14)
licentiatavhandling (11)
visa fler...
samlingsverk (redaktörskap) (6)
bok (6)
annan publikation (2)
recension (1)
visa färre...
Typ av innehåll
refereegranskat (411)
övrigt vetenskapligt/konstnärligt (206)
populärvet., debatt m.m. (14)
Författare/redaktör
Larsson, Staffan, 19 ... (41)
Tiedemann, Jörg (35)
Edlund, Jens (31)
Nivre, Joakim (25)
House, David (25)
Borin, Lars, 1957 (23)
visa fler...
Ljunglöf, Peter, 197 ... (22)
Nivre, Joakim, 1962- (21)
Beskow, Jonas (20)
Granström, Björn (20)
Shaw, Philip (20)
Johansson Kokkinakis ... (20)
Kokkinakis, Dimitrio ... (19)
Cooper, Robin, 1947 (19)
Heldner, Mattias (19)
Gustafson, Joakim (17)
Megyesi, Beata (17)
Wik, Preben (16)
Carlson, Rolf (16)
Sundberg, Johan (15)
Hjalmarsson, Anna (15)
Skantze, Gabriel (14)
Engwall, Olov (13)
Bringert, Björn, 197 ... (13)
Plas, Lonneke van de ... (13)
Ranta, Aarne, 1963 (11)
Hall, Johan (11)
Al Moubayed, Samer (10)
Toporowska Gronostaj ... (10)
Askenfelt, Anders (10)
Perez, Guillermo (10)
Villing, Jessica, 19 ... (10)
Amores, Gabriel (10)
Manchon, Pilar (10)
Dannélls, Dana, 1976 (9)
Blomberg, Mats (9)
Kann, Viggo (9)
Nilsson, Jens (9)
Elenius, Kjell (9)
Jonson, Rebecca, 197 ... (9)
Mur, Jori (9)
Schoonderwaldt, Erwi ... (9)
Lindberg, Inger, 194 ... (9)
Neiberg, Daniel (8)
Forsbom, Eva (8)
Hansen, Kjetil Falke ... (8)
Bouma, Gosse (8)
Noord, Gertjan van (8)
Hincks, Rebecca (8)
Laskowski, Kornel (8)
visa färre...
Lärosäte
Kungliga Tekniska Högskolan (254)
Göteborgs universitet (189)
Uppsala universitet (133)
Stockholms universitet (47)
Linnéuniversitetet (23)
Linköpings universitet (15)
visa fler...
Chalmers tekniska högskola (14)
Umeå universitet (10)
Mittuniversitetet (8)
Örebro universitet (7)
RISE (7)
Högskolan i Halmstad (5)
Lunds universitet (5)
Högskolan i Skövde (2)
Mälardalens universitet (1)
Jönköping University (1)
Malmö universitet (1)
Högskolan i Borås (1)
visa färre...
Språk
Engelska (583)
Svenska (38)
Spanska (3)
Tyska (2)
Franska (1)
Danska (1)
visa fler...
Odefinierat språk (1)
Portugisiska (1)
Nygrekiska (1)
visa färre...
Forskningsämne (UKÄ/SCB)
Naturvetenskap (631)
Humaniora (147)
Samhällsvetenskap (23)
Teknik (7)
Medicin och hälsovetenskap (2)
Lantbruksvetenskap (1)

År

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy