SwePub - sökning: WFRF:(Wirén Mats)

Numrering	Referens	Omslagsbild	Hitta
1.	Akerstrom, Ulf, et al. (författare) Comparison of Moisturizing Creams for the Prevention of Atopic Dermatitis Relapse: A Randomized Double-blind Controlled Multicentre Clinical Trial 2015 Ingår i: Acta Dermato-Venereologica. - : Medical Journals Sweden AB. - 1651-2057 .- 0001-5555. ; 95:5, s. 587-592 Tidskriftsartikel (refereegranskat)abstract Atopic dermatitis (AD) affects adults and children and has a negative impact on quality of life. The present multicentre randomized double-blind controlled trial showed a barrier-improving cream (5% urea) to be superior to a reference cream in preventing eczema relapse in patients with AD (hazard ratio 0.634, p = 0.011). The risk of eczema relapse was reduced by 37% (95% confidence interval (95% CI) 10-55%). Median time to relapse in the test cream group and in the reference cream group was 22 days and 15 days, respectively (p = 0.013). At 6 months 26% of the patients in the test cream group were still eczema free, compared with 10% in the reference cream group. Thus, the barrier-improving cream significantly prolonged the eczema-free time compared with the reference cream and decreased the risk of eczema relapse. The test cream was well tolerated in patients with AD.
2.	Becket, Ralph, et al. (författare) Spoken Language Translator: Phase Two Report 1997 Rapport (övrigt vetenskapligt/konstnärligt)abstract Spoken Language Translator (SLT) is a project whose long-term goal is the construction of practically useful systems capable of translating human speech from one language into another. The current SLT prototype, described in detail in this report, is capable of speech-to-speech translation between English and Swedish in either direction within the domain of airline flight inquiries, using a vocabulary of about 1500 words. Translation from English and Swedish into French is also possible, with slightly poorer performance.A good English-language speech recognizer existed before the start of the project, and has since been improved in several ways. During the project, we have constructed a Swedish-language recognizer, arguably the best system of its kind so far built. This has involved among other things collection of a large amount of Swedish training data. The recognizer is essentially domain-independent, but has been tuned to give high performance in the air travel inquiry domain.The main version of the Swedish recognizer is trained on the Stockholm dialect of Swedish, and achieves near-real-time performance with a word error rate of about 7%. Techniques developed partly under this project make it possible to port the recognizer to other Swedish dialects using only modest quantities of training data.On the language-processing side, we had at the start of the project a substantial domain-independent language-processing system for English, a preliminary Swedish version, and a sketchy set of rules to permit English to Swedish translation. We now have good versions of the language-processing system for English, Swedish and French, and fair to good support for translation in five of the six possible language- pairs. Translation is carried out using a novel robust architecture developed under the project. In essence, this translates as much of the input utterance as possible using a sophisticated grammar-based method, and then employs a much simpler set of word- to-word translation rules to fill in the gaps.The language-processing modules are all generic in nature, are based on large, linguistically motivated grammars, and can fairly easily be tuned to give good performance in new domains. Much of the work involved in the domain adaptation process can be carried out by non-experts using tools developed under the project.Formal comparisons are problematic, in view of the different domains and languages used and the lack of accepted evaluation criteria. None the less, the evidence at our disposal suggests that the current SLT prototype is no worse than the German Verbmobil demonstrator, in spite of a difference in project budget of more than an order of magnitude.
3.	Bell, Linda, et al. (författare) Modality Convergence in a Multimodal Dialogue System 2000 Ingår i: Proceedings of Götalog. ; , s. 29-34 Konferensbidrag (refereegranskat)abstract When designing multimodal dialogue systems allowing speech as well as graphical operations, it is important to understand not only how people make use of the different modalities in their utterances, but also how the system might influence a user’s choice of modality by its own behavior. This paper describes an experiment in which subjects interacted with two versions of a simulated multimodal dialogue system. One version used predominantly graphical means when referring to specific objects; the other used predominantly verbal referential expressions. The purpose of the study was to find out what effect, if any, the system’s referential strategy had on the user’s behavior. The results provided limited support for the hypothesis that the system can influence users to adopt another modality for the purpose of referring
4.	Bell, Linda, et al. (författare) The Swedish NICE Corpus – Spoken dialogues between children and embodied characters in a computer game scenario 2005 Ingår i: Proceedings Interspeech 2005 - Eurospeech. - Lisbon, Portugal : ISCA. ; , s. 2765-2768, s. 2765-2768 Konferensbidrag (refereegranskat)abstract This article describes the collection and analysis of a Swedish database of spontaneous and unconstrained children-machine dialogues. The Swedish NICE corpus consists of spoken dialogues between children aged 8 to 15 and embodied fairytale characters in a computer game scenario. Compared to previously collected corpora of children's computer-directed speech, the Swedish NICE corpus contains extended interactions, including three-party conversation, in which the young users used spoken dialogue as the primary means of progression in the game.
5.	Borin, Lars, et al. (författare) Swe-Clarin : Language Resources and Technology for Digital Humanities 2016 Ingår i: <em>Extended Papers of the International Symposium on Digital Humanities</em>. - : CEUR. ; , s. 29-51, s. 29-51 Konferensbidrag (refereegranskat)abstract CLARIN is a European Research Infrastructure Consortium (ERIC), which aims at (a) making extensive language-based materials available as primary research data to the humanities and social sciences (HSS); and (b) offering state-of-the-art language technology (LT) as an eresearch tool for this purpose, positioning CLARIN centrally in what is often referred to as the digital humanities (DH). The Swedish CLARIN node Swe-Clarin was established in 2015 with funding from the Swedish Research Council.In this paper, we describe the composition and activities of Swe-Clarin, aiming at meeting the requirements of all HSS and other researchers whose research involves using text and speech as primary research data, and spreading the awareness of what Swe-Clarin can offer these research communities. We focus on one of the central means for doing this: pilot projects conducted in collaboration between HSS researchers and Swe-Clarin, together formulating a research question, the addressing of which requires working with large language-based materials. Four such pilot projects are described in more detail, illustrating research on rhetorical history, second-language acquisition, literature, and political science. A common thread to these projects is an aspiration to meet the challenge of conducting research on the basis of very large amounts of textual data in a consistent way without losing sight of the individual cases making up the mass of data, i.e., to be able to move between Moretti’s “distant” and “close reading” modes.While the pilot projects clearly make substantial contributions to DH, they also reveal some needs for more development, and in particular a need for document-level access to the text materials. As a consequence of this, work has now been initiated in Swe-Clarin to meet this need, so that Swe-Clarin together with HSS scholars investigating intricate research questions can take on the methodological challenges of big-data language-based digital humanities.
6.	Borin, Lars, 1957, et al. (författare) Swe-Clarin: Language resources and technology for Digital Humanities 2017 Ingår i: Digital Humanities 2016. Extended Papers of the International Symposium on Digital Humanities (DH 2016) Växjö, Sweden, November, 7-8, 2016. Edited by Koraljka Golub, Marcelo Milra. Vol-2021. - Aachen : M. Jeusfeld c/o Redaktion Sun SITE, Informatik V, RWTH Aachen.. - 1613-0073. Konferensbidrag (refereegranskat)abstract CLARIN is a European Research Infrastructure Consortium (ERIC), which aims at (a) making extensive language-based materials available as primary research data to the humanities and social sciences (HSS); and (b) offering state-of-the-art language technology (LT) as an e-research tool for this purpose, positioning CLARIN centrally in what is often referred to as the digital humanities (DH). The Swedish CLARIN node Swe-Clarin was established in 2015 with funding from the Swedish Research Council. In this paper, we describe the composition and activities of Swe-Clarin, aiming at meeting the requirements of all HSS and other researchers whose research involves using text and speech as primary research data, and spreading the awareness of what Swe-Clarin can offer these research communities. We focus on one of the central means for doing this: pilot projects conducted in collaboration between HSS researchers and Swe-Clarin, together formulating a research question, the addressing of which requires working with large language-based materials. Four such pilot projects are described in more detail, illustrating research on rhetorical history, second-language acquisition, literature, and political science. A common thread to these projects is an aspiration to meet the challenge of conducting research on the basis of very large amounts of textual data in a consistent way without losing sight of the individual cases making up the mass of data, i.e., to be able to move between Moretti’s “distant” and “close reading” modes. While the pilot projects clearly make substantial contributions to DH, they also reveal some needs for more development, and in particular a need for document-level access to the text materials. As a consequence of this, work has now been initiated in Swe-Clarin to meet this need, so that Swe-Clarin together with HSS scholars investigating intricate research questions can take on the methodological challenges of big-data language-based digital humanities.
7.	Borin, Lars, 1957, et al. (författare) Swe-Clarin: Language resources and technology for digital humanities 2016 Ingår i: CEUR Workshop Proceedings. - 1613-0073. ; 2021, s. 29-51 Konferensbidrag (refereegranskat)abstract CLARIN is a European Research Infrastructure Consortium (ERIC), which aims at (a) making extensive language-based materials available as primary research data to the humanities and social sciences (HSS); and (b) offering state-of-the-art language technology (LT) as an e-research tool for this purpose, positioning CLARIN centrally in what is often referred to as the digital humanities (DH). The Swedish CLARIN node Swe-Clarin was established in 2015 with funding from the Swedish Research Council. In this paper, we describe the composition and activities of Swe-Clarin, aiming at meeting the requirements of all HSS and other researchers whose research involves using text and speech as primary research data, and spreading the awareness of what Swe-Clarin can offer these research communities. We focus on one of the central means for doing this: pilot projects conducted in collaboration between HSS researchers and Swe-Clarin, together formulating a research question, the addressing of which requires working with large language-based materials. Four such pilot projects are described in more detail, illustrating research on rhetorical history, second-language acquisition, literature, and political science. A common thread to these projects is an aspiration to meet the challenge of conducting research on the basis of very large amounts of textual data in a consistent way without losing sight of the individual cases making up the mass of data, i.e., to be able to move between Moretti’s “distant” and “close reading” modes.
8.	Boye, Johan, et al. (författare) Contextual reasoning in multimodal dialogue systems : two case studies 2004 Ingår i: Proceedings of The 8th Workshop on the Semantics and Pragmatics of Dialogue Catalogue'04. - Barcelona. ; , s. 19-21 Konferensbidrag (refereegranskat)abstract This paper describes an approach to contextual reasoning for interpretation ofspoken multimodal dialogue. The approach is based on combining recencybased search for antecedents with an object-oriented domain representation insuch a way that the search is highly constrained by the type information of theantecedents. By furthermore representingcandidate antecedents from the dialoguehistory and visual context in a uniformway, a single machinery (based on -reduction in lambda calculus) can be usedfor resolving many kinds of underspecified utterances. The approach has beenimplemented in two highly different domains.
9.	Boye, Johan, et al. (författare) Language-Processing Strategies and Mixed-Initiative Dialogues 1999 Rapport (övrigt vetenskapligt/konstnärligt)abstract We describe an implemented spoken-language dialogue system for a travel-planning domain, which accesses a commercially available travel-information web-server and supports a flexible mixed-initiative dialogue strategy. We argue, based on data from initial Wizard-of-Oz experiments, that mixed-initiative strategies are appropriate for many types of user, but require more sophisticated architectures for processing of language and dialogue; we then use these observations to motivate an architecture which combines parallel deep and shallow natural language analysis engines and an agenda-driven dialogue manager. We outline the top-level processing strategy used by the dialogue manager, and also a novel formalism, which we call Flat Utterance Description, that allows us to reduce the output of the deep and shallow language-processing engines to a common representation.
10.	Boye, Johan, et al. (författare) Multi-slot semantics for natural-language call routing systems 2007 Ingår i: Proceedings of Bridging the Gap. ; , s. 68-75, s. 68-75 Konferensbidrag (refereegranskat)abstract Statistical classification techniques for natural-language call routing systems have matured to the point where it is possible to distinguish between several hundreds of semantic categories with an accuracy that is sufficient for commercial deployments. For category sets of this size, the problem of maintaining consistency among manually tagged utterances becomes limiting, as lack of consistency in the training data will degrade performance of the classifier. It is thus essential that the set of categories be structured in a way that alleviates this problem, and enables consistency to be preserved as the domain keeps changing. In this paper, we describe our experiences of using a two-level multi-slot semantics as a way of meeting this problem. Furthermore, we explore the ramifications of the approach with respect to classification, evaluation and dialogue design for call routing systems.
11.	Boye, Johan, et al. (författare) Negotiative Spoken-Dialogue Interfaces to Databases 2003 Ingår i: Proceedings of Diabruck, Wallerfangen, Germany. Konferensbidrag (refereegranskat)abstract The aim of this paper is to develop a principled and empirically motivated approach to robust, negotiative spoken dialogue with databases. Robustness is achieved by limiting the set of representable utterance types. Still, the vast majority of utterances that occur in practice can be handled.
12.	Boye, Johan, et al. (författare) Robust parsing and spoken negotiative dialogue with databases 2008 Ingår i: Natural Language Engineering. - : Cambridge University Press. - 1351-3249 .- 1469-8110. ; 14:3, s. 289-312 Tidskriftsartikel (refereegranskat)abstract This paper presents a robust parsing algorithm and semantic formalism for the interpretation of utterances in spoken negotiative dialogue with databases. The algorithm works in two passes: a domain-specific pattern-matching phase and a domain-independent semantic analysis phase. Robustness is achieved by limiting the set of representable utterance types to an empirically motivated subclass which is more expressive than propositional slot–value lists, but much less expressive than first-order logic. Our evaluation shows that in actual practice the vast majority of utterances that occur can be handled, and that the parsing algorithm is highly efficient and accurate.
13.	Boye, Johan, et al. (författare) Robust Parsing of Utterances in Negotiative Dialogue 2003 Ingår i: Proceedings 8th European Conference on Speech Communication and Technology (Eurospeech), Geneva, Switzerland. Konferensbidrag (refereegranskat)abstract This paper presents an algorithm for domain-dependent parsing of utterances in negotiative dialogue. To represent such utterances, the algorithm outputs semantic expressions that are more expressive than propositional slot-filler structures. It is very fast and robust, yet precise and capable of correctly combining information from different utterance fragments.
14.	Boye, Johan, et al. (författare) Robust spoken language understanding in a computer game 2006 Ingår i: Speech Communication. - : Elsevier. - 0167-6393 .- 1872-7182. ; 48:3-4, s. 335-353 Tidskriftsartikel (refereegranskat)abstract We present and evaluate a robust method for the interpretation of spoken input to a conversational computer game. The scenario of the game is that of a player interacting with embodied fairy-tale characters in a 3D world via spoken dialogue (supplemented by graphical pointing actions) to solve various problems. The player himself cannot directly perform actions in the world, but interacts with the fairy-tale characters to have them perform various tasks, and to get information about the world and the problems to solve. Hence the role of spoken dialogue as the primary means of control is obvious and natural to the player. Naturally, this means that robust spoken language understanding becomes a critical component. To this end, the paper describes a semantic representation formalism and an accompanying parsing algorithm which works off the output of the speech recogniser's statistical language model. The evaluation shows that the parser is robust in the sense of considerably improving on the noisy output of the speech recogniser.
15.	Börstell, Carl, et al. (författare) Towards an Annotation of Syntactic Structure in the Swedish Sign Language Corpus 2016 Ingår i: Workshop Proceedings. - Paris : ELRA. ; , s. 19-24 Konferensbidrag (refereegranskat)abstract This paper describes on-going work on extending the annotation of the Swedish Sign Language Corpus (SSLC) with a level of syntactic structure. The basic annotation of SSLC in ELAN consists of six tiers: four for sign glosses (two tiers for each signer; one for each of a signer’s hands), and two for written Swedish translations (one for each signer). In an additional step by Östling et al. (2015), all ¨ glosses of the corpus have been further annotated for parts of speech. Building on the previous steps, we are now developing annotation of clause structure for the corpus, based on meaning and form. We define a clause as a unit in which a predicate asserts something about one or more elements (the arguments). The predicate can be a (possibly serial) verbal or nominal. In addition to predicates and their arguments, criteria for delineating clauses include non-manual features such as body posture, head movement and eye gaze. The goal of this work is to arrive at two additional annotation tier types in the SSLC: one in which the sign language texts are segmented into clauses, and the other in which the individual signs are annotated for their argument types.
16.	Cap, Fabienne, et al. (författare) SWORD : Towards Cutting-Edge Swedish Word Processing 2016 Ingår i: Proceedings of SLTC 2016. Konferensbidrag (refereegranskat)abstract Despite many years of research on Swedish language technology, there is still no well-documented standard for Swedish word processing covering the whole spectrum from low-level tokenization to morphological analysis and disambiguation. SWORD is a new initiative within the SWE-CLARIN consortium aiming to develop documented standards for Swedish word processing. In this paper, we report on a pilot study of Swedish tokenization, where we compare the output of six different tokenizers on four different text types. For one text type (Wikipedia articles), we also compare to the tokenization produced by six manual annotators.
17.	Cap, Fabienne, et al. (författare) SWORD: Towards Cutting-Edge Swedish Word Processing 2016 Konferensbidrag (övrigt vetenskapligt/konstnärligt)
18.	Carter, David, et al. (författare) Common speech/language issues 2000 Ingår i: The spoken language translator. - Cambridge : Cambridge University Press. - 0521770777 - 9780521038829 ; , s. 284-294 Bokkapitel (övrigt vetenskapligt/konstnärligt)
19.	Carter, David, et al. (författare) Common Speech–Language Issues 2000 Ingår i: The Spoken Language Translator. - Cambridge : Cambridge University Press. - 0521770777 - 9780521038829 ; , s. 284-294 Bokkapitel (refereegranskat)abstract This book presents a detailed description of Spoken Language Translator (SLT), one of the first major projects in the area of automatic speech translation. The SLT system can translate between English, French, and Swedish in the domain of air travel planning, using a vocabulary of about 1500 words, and with an accuracy of about 75%. The greater part of the book describes the language processing components, which are largely built on top of the SRI Core Language Engine, using a combination of general grammars and techniques that allow them to be rapidly customized to specific domains. Speech recognition is based on Hidden Markov Mode technology, and uses versions of the SRI DECIPHER system. This account of the Spoken Language Translator should be an essential resource both for those who wish to know what is achievable in spoken-language translation today, and for those who wish to understand how to achieve it.
20.	Carter, David, et al. (författare) Evaluation 2000 Ingår i: The Spoken Language Translator. - Cambridge : Cambridge University Press. - 0521770777 - 9780521770774 ; , s. 297-312 Bokkapitel (övrigt vetenskapligt/konstnärligt)abstract This book presents a detailed description of Spoken Language Translator (SLT), one of the first major projects in the area of automatic speech translation. The SLT system can translate between English, French, and Swedish in the domain of air travel planning, using a vocabulary of about 1500 words, and with an accuracy of about 75%. The greater part of the book describes the language processing components, which are largely built on top of the SRI Core Language Engine, using a combination of general grammars and techniques that allow them to be rapidly customized to specific domains. Speech recognition is based on Hidden Markov Mode technology, and uses versions of the SRI DECIPHER system. This account of the Spoken Language Translator should be an essential resource both for those who wish to know what is achievable in spoken-language translation today, and for those who wish to understand how to achieve it.
21.	Carter, David, et al. (författare) Evaluation 2000 Ingår i: The spoken language translator. - Cambridge : Cambridge University Press. - 0521770777 - 9780521038829 ; , s. 297-312 Bokkapitel (övrigt vetenskapligt/konstnärligt)
22.	Carter, David, et al. (författare) Translation Methodology in the Spoken Language Translator: An Evaluation 1997 Ingår i: Proceedings of ACL/EACL workshop Spoken Language Translation. ; , s. 73-81 Konferensbidrag (refereegranskat)abstract In this paper we describe how the translation methodology adopted fro the Spoken Language Translator (SLT) addresses the characteristics of the speech translation task in a context where it is essential to achieve easy customization to new languages and new domains. We then discuss the issues that arise in any attempt to evaluate a speech translator, and present results of such an evaluation carried out on SLT for several language pairs.
23.	Ek, Adam, 1990-, et al. (författare) Distinguishing Narration and Speech in Prose Fiction Dialogues 2019 Ingår i: Proceedings of the Digital Humanities in the Nordic Countries 4th Conference. - : CEUR-WS.org. ; , s. 124-132 Konferensbidrag (refereegranskat)abstract This paper presents a supervised method for a novel task, namely, detecting elements of narration in passages of dialogue in prose fiction. The method achieves an F1-score of 80.8%, exceeding the best baseline by almost 33 percentage points. The purpose of the method is to enable a more fine-grained analysis of fictional dialogue than has previously been possible, and to provide a component for the further analysis of narrative structure in general.
24.	Ek, Adam, 1990-, et al. (författare) Identifying Speakers and Addressees in Dialogues Extracted from Literary Fiction 2018 Ingår i: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC-2018). - : European Language Resources Association. - 9791095546009 ; , s. 817-824 Konferensbidrag (refereegranskat)abstract This paper describes an approach to identifying speakers and addressees in dialogues extracted from literary fiction, along with a dataset annotated for speaker and addressee. The overall purpose of this is to provide annotation of dialogue interaction between characters in literary corpora in order to allow for enriched search facilities and construction of social networks from the corpora. To predict speakers and addressees in a dialogue, we use a sequence labeling approach applied to a given set of characters. We use features relating to the current dialogue, the preceding narrative, and the complete preceding context. The results indicate that even with a small amount of training data, it is possible to build a fairly accurate classifier for speaker and addressee identification across different authors, though the identification of addressees is the more difficult task.
25.	Eklund, Robert, 1962-, et al. (författare) Effects of open and directed prompts on filled pauses and utterance production 2010 Ingår i: Proceedings from Fonetik 2010, Lund, June 2–4, 2010. - Lund : Mediatryck. ; , s. 23-28 Konferensbidrag (övrigt vetenskapligt/konstnärligt)abstract This paper describes an experiment where open and directed prompts were alternated when collecting speech data for the deployment of a call-routing application. The experiment tested whether open and directed prompts resulted in any differences with respect to the filled pauses exhibited by the callers, which is interesting in the light of the “many-options” hypothesis of filled pause production. The experiment also investigated the effects of the prompts on utterance form and meaning of the callers.
26.	Eklund, Robert, 1962-, et al. (författare) ”Njutandes av en Monte Christo no 5 och en iskall Mojito” : Observationer om användning av s-particip 2006 Ingår i: <em>Svenskans beskrivning 28, Förhandlingar vid Tjugoåttonde sammankomsten för svenskans beskrivning</em>. - 9176685144 ; , s. 97-108 Konferensbidrag (refereegranskat)
27.	Eklund, Robert, 1962-, et al. (författare) Rational Re-Use of Linguistic Data 2000 Ingår i: The Spoken Language Translator. - Cambridge : Cambridge University Press. - 0521770777 - 9780521770774 ; , s. 212-228 Bokkapitel (övrigt vetenskapligt/konstnärligt)abstract This book presents a detailed description of Spoken Language Translator (SLT), one of the first major projects in the area of automatic speech translation. The SLT system can translate between English, French, and Swedish in the domain of air travel planning, using a vocabulary of about 1500 words, and with an accuracy of about 75%. The greater part of the book describes the language processing components, which are largely built on top of the SRI Core Language Engine, using a combination of general grammars and techniques that allow them to be rapidly customized to specific domains. Speech recognition is based on Hidden Markov Mode technology, and uses versions of the SRI DECIPHER system. This account of the Spoken Language Translator should be an essential resource both for those who wish to know what is achievable in spoken-language translation today, and for those who wish to understand how to achieve it.
28.	Grigonyté, Gintaré, et al. (författare) Improving Readability of Swedish Electronic Health Records through Lexical Simplification : First Results 2014 Ingår i: Proceedings of the 3rd Workshop on Predicting and Improving Text Readability for Target Reader Populations (PITR). - Stroudsburg, USA : Association for Computational Linguistics. - 9781937284916 ; , s. 74-83 Konferensbidrag (refereegranskat)abstract This paper describes part of an ongoing effort to improve the readability of Swedish electronic health records (EHRs). An EHR contains systematic documentation of a single patient’s medical history across time, entered by healthcare professionals with the purpose of enabling safe and informed care. Linguistically, medical records exemplify a highly specialised domain, which can be superficially characterised as having telegraphic sentences involving displaced or missing words, abundant abbreviations, spelling variations including misspellings, and terminology. We report results on lexical simplification of Swedish EHRs, by which we mean detecting the unknown, out-ofdictionary words and trying to resolve them either as compounded known words, abbreviations or misspellings.
29.	Grigonyté, Gintaré, et al. (författare) Spelling Variation of Latin and Greek words in Swedish Medical Text 2014 Konferensbidrag (refereegranskat)
30.	Grigonyte, Gintare, et al. (författare) Swedification patterns of Latin and Greek affixes in clinical text 2016 Ingår i: Nordic Journal of Linguistics. - 0332-5865 .- 1502-4717. ; 39:1, s. 5-37 Tidskriftsartikel (refereegranskat)abstract Swedish medical language is rich with Latin and Greek terminology which has undergone a Swedification since the 1980s. However, many original expressions are still used by clinical professionals. The goal of this study is to obtain precise quantitative measures of how the foreign terminology is manifested in Swedish clinical text. To this end, we explore the use of Latin and Greek affixes in Swedish medical texts in three genres: clinical text, scientific medical text and online medical information for laypersons. More specifically, we use frequency lists derived from tokenised Swedish medical corpora in the three domains, and extract word pairs belonging to types that display both the original and Swedified spellings. We describe six distinct patterns explaining the variation in the usage of Latin and Greek affixes in clinical text. The results show that to a large extent affixes in clinical text are Swedified and that prefixes are used more conservatively than suffixes.
31.	Gustafson, Joakim, et al. (författare) AdApt — A Multimodal Conversational Dialogue System in an Apartment Domain 2000 Ingår i: Proceedings of the Sixth International Conference on Spoken Language Processing (ICSLP), Beijing, China. ; , s. 134-137 Konferensbidrag (refereegranskat)
32.	Gustafson, Joakim, et al. (författare) Constraint Manipulation and Visualization in a Multimodal Dialogue System 2002 Ingår i: Proceedings of the ISCA Workshop on Multimodal Dialogue in Mobile Environments, Kloster Irsee, Germany.. Konferensbidrag (refereegranskat)abstract When interacting with spoken and multimodal dialogue systems, it is often difficult for users to understand and influence how their input is processed by the system. In this paper, wedescribe how these problems were addressed in the multimodal real-estate dialogue systemAdApt. During the course of a dialogue, the user's contraints are translated into symbolicicons that are visualized on the screen and can be manipulated by drag-and-drop operations.Users are thus given a clear picture of how their utterances are understood, and are given atransparent means of controlling the interaction with the system.
33.	Gustafson, Joakim, et al. (författare) The NICE fairy-tale game system 2004 Ingår i: Proceedings of the 5th SIGdial Workshop on Discourse and Dialogue at HLT-NAACL 2004. - Boston. Konferensbidrag (refereegranskat)abstract This paper presents the NICE fairy-tale game system, in which adults and children can interact with various animated characters in a 3D world. Computer games is an interesting application for spoken and multimodal dialogue systems. Moreover, for the development of future computer games, multimodal dialogue has the potential to greatly enrichen the user's experience. In this paper, we also present some requirements that have to be fulfilled to successfully integrate spoken dialogue technology with a computer game application
34.	Kurfali, Murathan, 1990-, et al. (författare) A Multi-Word Expression Dataset for Swedish 2020 Ingår i: Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020). - Marseille : European Language Resources Association (ELRA). ; , s. 4402-4409 Konferensbidrag (refereegranskat)abstract We present a new set of 96 Swedish multi-word expressions annotated with degree of (non-)compositionality. In contrast to most previous compositionality datasets we also consider syntactically complex constructions and publish a formal specification of each expression. This allows evaluation of computational models beyond word bigrams, which have so far been the norm. Finally, we use the annotations to evaluate a system for automatic compositionality estimation based on distributional semantics. Our analysis of the disagreements between human annotators and the distributional model reveal interesting questions related to the perception of compositionality, and should be informative to future work in the area.
35.	Kurfalı, Murathan, 1990-, et al. (författare) Breaking the Narrative: Scene Segmentation through Sequential Sentence Classification 2021 Konferensbidrag (refereegranskat)abstract In this paper, we describe our submission to the Shared Task on Scene Segmentation (STSS). The shared task requires participants to segment novels into coherent segments, called scenes. We approach this as a sequential sentence classification task and offer a BERT-based solution with a weighted cross-entropy loss. According to the results, the proposed approach performs relatively well on the task as our model ranks first and second, in official in-domain and out-domain evaluations, respectively. However, the overall low performances (0.37 F1-score) suggest that there is still much room for improvement.
36.	Kurfalı, Murathan, 1990- (författare) Contributions to Shallow Discourse Parsing : To English and beyond 2022 Doktorsavhandling (övrigt vetenskapligt/konstnärligt)abstract Discourse is a coherent set of sentences where the sequential reading of the sentences yields a sense of accumulation and readers can easily follow why one sentence follows another. A text that lacks coherence will most certainly fail to communicate its intended message and leave the reader puzzled as to why the sentences are presented together. However, formally accounting for the differences between a coherent and a non-coherent text still remains a challenge. Various theories propose that the semantic links that are inferred between sentences/clauses, known as discourse relations, are the building blocks of the discourse that can be connected to one another in various ways to form the discourse structure. This dissertation focuses on the former problem of discovering such discourse relations without aiming to arrive at any structure, a task known as shallow discourse parsing (SDP). Unfortunately, so far, SDP has been almost exclusively performed on the available gold annotations in English, leading to only limited insight into how the existing models would perform in a low-resource scenario potentially involving any non-English language. The main objective of the current dissertation is to address these shortcomings and help extend SDP to the non-English territory. This aim is pursued through three different threads: (i) investigation of what kind of supervision is minimally required to perform SDP, (ii) construction of multilingual resources annotated at discourse-level, (iii) extension of well-known means to (SDP-wise) low-resource languages. An additional aim is to explore the feasibility of SDP as a probing task to evaluate discourse-level understanding abilities of modern language models is also explored.The dissertation is based on six papers grouped in three themes. The first two papers perform different subtasks of SDP through relatively understudied means. Paper I presents a simplified method to perform explicit discourse relation labeling without any feature-engineering whereas Paper II shows how implicit discourse relation recognition benefits from large amounts of unlabeled text through a novel method for distant supervision. The third and fourth papers describe two novel multilingual discourse resources, TED-MDB (Paper III) and three bilingual discourse connective lexicons (Paper IV). Notably, Ted-MDB is the first parallel corpus annotated for PDTB-style discourse relations covering six non-English languages. Finally, the last two studies directly deal with multilingual discourse parsing where Paper V reports the first results in cross-lingual implicit discourse relation recognition and Paper VI proposes a multilingual benchmark including certain discourse-level tasks that have not been explored in this context before. Overall, the dissertation allows for a more detailed understanding of what is required to extend shallow discourse parsing beyond English. The conventional aspects of traditional supervised approaches are replaced in favor of less knowledge-intensive alternatives which, nevertheless, achieve state-of-the-art performance in their respective settings. Moreover, thanks to the introduction of TED-MDB, cross-lingual SDP is explored in a zero-shot setting for the first time. In sum, the proposed methodologies and the constructed resources are among the earliest steps towards building high-performance multilingual, or non-English monolingual, shallow discourse parsers.
37.	Kurfali, Murathan, 1990-, et al. (författare) Zero-shot cross-lingual identification of direct speech using distant supervision 2020 Ingår i: The 4th Joint SIGHUM Workshopon Computational Linguistics for Cultural Heritage,Social Sciences, Humanities and Literature. - 9781952148347 ; , s. 105-111 Konferensbidrag (refereegranskat)
38.	Ljunglöf, Peter, 1971, et al. (författare) Syntactic parsing 2010 Ingår i: Handbook of Natural Language Processing, 2nd edition. - : CRC Press, Taylor and Francis. - 9781420085921 Bokkapitel (refereegranskat)abstract This chapter presents basic techniques for grammar-driven natural language parsing, that is, analysing a string of words (typically a sentence) to determine its structural description according to a formal grammar. Basic parsing concepts are explained after which a number of well-known parsing techniques are described.
39.	Ljunglöf, Peter, 1971-, et al. (författare) Syntactic parsing 2010. - 2 Ingår i: Handbook of Natural Language Processing. - Boca Raton, Florida : Chapman & Hall/CRC. ; , s. 59-91 Bokkapitel (populärvet., debatt m.m.)abstract This chapter presents basic techniques for grammar-driven natural language parsing, that is, analyzing a string of words (typically a sentence) to determine its structural description according to a formal grammar. In most circumstances, this is not a goal in itself but rather an intermediary step for the purpose of further processing, such as the assignment of a meaning to the sentence. To this end, the desired output of grammar-driven parsing is typically a hierarchical, syntactic structure suitable for semantic interpretation (the topic of Chapter 5). The string of words constituting the input will usually have been processed in separate phases of tokenization (Chapter 2) and lexical analysis (Chapter 3), which is hence not part of parsing proper.
40.	Megyesi, Beáta, 1971-, et al. (författare) Learner Corpus Anonymization in the Age of GDPR: Insights from the Creation of a Learner Corpus of Swedish 2018 Ingår i: Proceedings of the 7th Workshop on NLP for Computer Assisted Language Learning (NLP4CALL 2018) at SLTC, Stockholm, 7th November 2018 / edited by Ildikó Pilán, Elena Volodina, David Alfter and Lars Borin. - Linköpings universitet : Linköping University Electronic Press. - 1650-3686 .- 1650-3740. - 9789176851739 Konferensbidrag (refereegranskat)abstract This paper reports on the status of learner corpus anonymization for the ongoing research infrastructure project SweLL. The main project aim is to deliver and make available for research a well-annotated corpus of essays written by second language (L2) learners of Swedish. As the practice shows, annotation of learner texts is a sensitive process demanding a lot of compromises between ethical and legal demands on the one hand, and research and technical demands, on the other. Below, is a concise description of the current status of pseudonymization of language learner data to ensure anonymity of the learners, with numerous examples of the above-mentioned compromises.
41.	Mollevik, Johan, 1986- (författare) Natural language interfaces over spatial data : investigations in scalability, extensibility and reliability 2013 Licentiatavhandling (övrigt vetenskapligt/konstnärligt)
42.	Nilsson Björkenstam, Kristina, et al. (författare) Disfluency in Child-Directed Speech 2013 Ingår i: Proceedings of Fonetik 2013. - Linköping : Department of Culture a nd Communication, Linköping University, Sweden. - 9789175195827 - 9789175195797 ; , s. 57-60 Konferensbidrag (refereegranskat)abstract We report results from a longitudinal study of the rate and location of disfluencies in child-directed speech, using data for children between 0;6 and 2;9 years. We compare these results to adult-directed speech by the same speakers.
43.	Nilsson Björkenstam, Kristina, et al. (författare) En korpusstudie om multimodal synkroni i tidig ordinlärning 2013 Konferensbidrag (övrigt vetenskapligt/konstnärligt)abstract I denna studie undersöker vi synkroni i tidig multimodal interaktion mellan föräldrar och barn. Med synkroni menas här återkommande mönster eller strukturella regelbundenheter (vad gäller ord, prosodi, blickriktning, gester och handlingar) som kan reducera komplexitet i språkinlärning.Data består av inspelningar av fem longitudinella dyader med två barn (0;7-2;7 år) och deras föräldrar. Inspelningarna transkriberas och annoteras med grundtonsfrekvens, blickriktning, gester och hantering av objekt. Vi undersöker synkroni genom att studera samtliga omnämnanden av två valda objekt (två dockor). För varje omnämnande undersöks grundtonsfrekvens och om omnämnandet kombineras med att den vuxne/barnet tittar på, pekar mot eller rör objektet.Man tänker sig att barnet använder sig av grundläggande perceptuella processer för att ta fasta på mönster och regelbundenheter i interaktionen med den vuxne, både i den akustiska signalen men också i den fysiska omgivningen (Gogate & Hollich, 2010). Den vuxne är dessutom benägen att framhäva den språkliga strukturen i interaktion med barnet, t ex genom att den vuxne talar om ett objekt och samtidigt visar objektet för barnet eller låter barnet känna på objektet. Denna synkroniserade multimodala input blir en hjälp för barnet att strukturera och sortera talsignalen och göra kopplingar mellan ord och objekt. I den här studien vill vi försöka fånga den här typen av multimodal synkroni genom att studera två specifika målord och hur interaktionen ser ut just kring dessa ord. Vi tänker oss att regelbundenheter vad gäller prosodi, blickriktningar och gester kommer att vara mer synkroniserade när barnet är mindre och målorden nya, än när barnen är äldre och målorden bekanta.Studien är del av ett projekt där vi försöker förklara tidig språkinlärning utifrån generella sociala och kognitiva förmågor. Genom att studera tidig förälder-barn-interaktion vill vi undersöka hur språkliga konstruktioner växer fram, vilka funktioner de har och hur de korrelerar med andra stimuli i barnets omgivning.Gogate, L., Hollich, G. 2010. Invariance detection within an interactive system: A perceptual gateway to language development. Psychological Review 117(2), 496-516.
44.	Nilsson Björkenstam, Kristina, et al. (författare) Modelling the informativeness and timing of non-verbal cues in parent–child interaction 2016 Ingår i: The 54th Annual Meeting of the Association for Computational Linguistics. - Stroudsburg, PA, USA : Association for Computational Linguistics. - 9781945626074 ; , s. 82-90 Konferensbidrag (refereegranskat)abstract How do infants learn the meanings of their first words? This study investigates the informativeness and temporal dynamics of non-verbal cues that signal the speaker's referent in a model of early word–referent mapping. To measure the information provided by such cues, a supervised classifier is trained on information extracted from a multimodally annotated corpus of 18 videos of parent–child interaction with three children aged 7 to 33 months. Contradicting previous research, we find that gaze is the single most informative cue, and we show that this finding can be attributed to our fine-grained temporal annotation. We also find that offsetting the timing of the non-verbal cues reduces accuracy, especially if the offset is negative. This is in line with previous research, and suggests that synchrony between verbal and non-verbal cues is important if they are to be perceived as causally related.
45.	Nilsson Björkenstam, Kristina, et al. (författare) Modelling the informativeness of different modalities in parent-child interaction 2015 Ingår i: Workshop on Extensive and Intensive Recordings of Children's Language Environment. Konferensbidrag (refereegranskat)
46.	Nilsson Björkenstam, Kristina, et al. (författare) Multimodal annotation of parent-child interaction in a free-play setting 2013 Ingår i: Multimodal Corpora 2013: Beyond Audio and Video. Konferensbidrag (refereegranskat)abstract This paper describes the verbal, non-verbal, and discourse annotation of a longitudinal corpus of parent-child interaction. The verbal annotation includes transcription of child-directed speech and child vocalizations. The non-verbal annotation describes gestures and objectrelatedactions by both parent and child. The verbal and non-verbal annotation is combined in discourse annotation that distinguishes initial from subsequent mentions, and further categorizes initial mentions depending on initiative.
47.	Nilsson Björkenstam, Kristina, et al. (författare) Multimodal Annotation of Synchrony in Longitudinal Parent–Child Interaction 2014 Ingår i: MMC 2014 Multimodal Corpora: Combining applied and basic research targets. - : European Language Resources Association. Konferensbidrag (refereegranskat)abstract This paper describes the multimodal annotation of speech, gaze and hand movement in a corpus of longitudinal parent–child interaction,and reports results on synchrony, structural regularities which appear to be a key means for parents to facilitate learning of new conceptsto children. The results provide additional support for our previous finding that parents display decreasing synchrony as a function ofthe age of the child.
48.	Nilsson Björkenstam, Kristina, et al. (författare) Reference to Objects in Longitudinal Parent-Child Interaction 2012 Ingår i: Workshop on Language, Action and Perception (APL). Konferensbidrag (refereegranskat)abstract A cognitive model of language learning needs to be dialogue-driven and multimodal to reflect how parent and child interact, using words, eye gaze, and object manipulation.In this paper, we present a scheme for multimodal annotation of parent-child interaction. We use this annotation for studying invariance across modalities. Our basic hypothesis is that perception of invariance (or synchrony) in multimodal patterns in auditory-visual speech is the device primarily used to reduce complexity in language learning.To this end, we have added verbal and non-verbal annotation to a corpus of longitudinal video and sound recordings of parent-child dyads. We use this data to try to determine if the amount of synchrony across modalities of parent-child interaction decreases as the child grows older and learns more language and gestures.
49.	Nilsson Björkenstam, Kristina, et al. (författare) Stockholm University Strindberg Corpus : Contents and possibilities 2012 Konferensbidrag (övrigt vetenskapligt/konstnärligt)abstract The Stockholm University Strindberg Corpus (SUSC) consists of seven novels by August Strindberg annotated for parts-of-speech with morphological analysis and lemmas. The corpus is freely available.SUSC consists of approximately 400 000 tokens annotated for parts-of-speech, including morphological analysis and lemmas, using the Stockholm-Umeå Corpus tag set in PAROLE-format. The annotated texts have been converted to XML which makes the corpus searchable with corpus analysis tools such as Xaira. This allows for e.g., searching for concordances with a specific wordform, part-of-speech and/or lemma, for pattern matching, and collocation extraction.The current version of the corpus includes seven works which can be classified as autobiographical:Tjänstekvinnans son (The son of a servant, 1886-87)Han och hon (He and she, 1919)Inferno (Inferno, 1897)Legender and Jakob brottas (Legends and Jacob wrestles, 1898)Fagervik och Skamsund (Fair haven and Foulstrand, 1902)Ensam (Alone, 1903)We are aware of three other electronic collections of Strindberg’s works: Projekt Runeberg, Litteraturbanken and Språkbanken. While these are valuable resources, SUSC is an important addition because, unlike the first two, it is linguistically annotated, and unlike the third, the data is available for download and thus can be fully inspected and processed using the researcher’s software of choice. Even more importantly, researchers can add their analyses as new layers of annotation of the corpus.
50.	Nilsson Björkenstam, Kristina, et al. (författare) SWE-CLARIN partner presentation : Natural Language Processing Resources from the Department of Linguistics, Stockholm University 2014 Ingår i: The first Swedish national SWE-CLARIN workshop. Konferensbidrag (övrigt vetenskapligt/konstnärligt)abstract The aim of the CLARIN Research Infrastructure and SWE-CLARIN is to facilitate for scholars in the humanities and social sciences to access primary data in the form of natural language, and to provide tools for exploring, annotating and analysing these data. This paper gives an overview of the resources and tools developed at the Department of Linguistics at Stockholm University planned to be made available within the SWE-CLARIN project. The paper also outlines our collaborations with neighbouring areas in the humanities and social sciences where these resources and tools will be put to use.

Skapa referenser, mejla, bekava och länka

Länka till träfflistan

Träfflista för sökning "WFRF:(Wirén Mats) "

Avgränsa träffmängd

År