SwePub
Result list for search "WFRF:(Dannélls Dana 1976)"

  • Results 1-25 of 66
1.
  •  
2.
  • Adesam, Yvonne, 1975, et al. (author)
  • Exploring the Quality of the Digital Historical Newspaper Archive KubHist
  • 2019
  • In: Proceedings of the 4th Conference of The Association Digital Humanities in the Nordic Countries (DHN), Copenhagen, Denmark, March 5-8, 2019 / edited by Costanza Navarretta, Manex Agirrezabal, Bente Maegaard. - Aachen : CEUR Workshop Proceedings. - 1613-0073.
  • Conference paper (peer-reviewed)
    • The KubHist Corpus is a massive corpus of Swedish historical newspapers, digitized by the Royal Swedish library, and available through the Språkbanken corpus infrastructure Korp. This paper contains a first overview of the KubHist corpus, exploring some of the difficulties with the data, such as OCR errors and spelling variation, and discussing possible paths for improving the quality and the searchability.
3.
  •  
4.
  • Berdicevskis, Aleksandrs, 1983, et al. (author)
  • Superlim: A Swedish Language Understanding Evaluation Benchmark
  • 2023
  • In: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, December 6-10, 2023, Singapore / Houda Bouamor, Juan Pino, Kalika Bali (Editors). - Stroudsburg, PA : Association for Computational Linguistics. - 9798891760608
  • Conference paper (peer-reviewed)
5.
  •  
6.
  • Borin, Lars, 1957, et al. (author)
  • Geographic visualization of place names in Swedish literary texts
  • 2014
  • In: Literary & Linguistic Computing. - : Oxford University Press (OUP). - 0268-1145 .- 1477-4615. ; 29:3, pp. 400-404
  • Journal article (peer-reviewed)
    • This article describes the development of a geographical information system (GIS) at Språkbanken as part of a visualization solution to be used in an archive of historical Swedish literary texts. The research problems we are aiming to address concern orthographic and morphological variation, missing place names, and missing place name coordinates. Some of these problems form a central part in the development of methods and tools for the automatic analysis of historical Swedish literary texts at our research unit. We discuss the advantages and challenges of covering large-scale spelling variation in place names from different sources and in generating maps with focus on different time periods.
7.
  • Borin, Lars, 1957, et al. (author)
  • Introduction: Swedish FrameNet++
  • 2021
  • In: The Swedish FrameNet++. Harmonization, integration, method development and practical language technology applications / editor(s): Dana Dannélls, Lars Borin and Karin Friberg Heppin. - Amsterdam / Philadelphia : John Benjamins Publishing Company. - 1567-8202. - 9789027258489 ; , pp. 3-36
  • Book chapter (peer-reviewed)
    • The Swedish FrameNet++ was designed to be several things. As a digital artifact, it is an integrated panchronic lexical macroresource, primarily for Swedish, but including several other languages, intended as a basic infrastructural component in Swedish language technology research and for developing natural language processing applications. As an activity, it is a long-term R&D initiative, initially aimed at bringing about this macroresource, and now at maintaining and extending it, at promoting its use in language technology research and application development, as well as ensuring that the results of this research and development in their turn are incorporated in the macroresource. As a product of research, it reflects both computational and linguistic approaches to lexicology, lexical semantics, and lexical typology.
8.
  • Borin, Lars, 1957, et al. (author)
  • Linguistics vs. language technology in constructicon building and use
  • 2018
  • In: Constructicography: Constructicon development across languages / edited by Benjamin Lyngfelt, Lars Borin, Kyoko Ohara, Tiago Timponi Torrent. - Amsterdam : John Benjamins. - 1573-594X. - 9789027263865 ; , pp. 229-253
  • Book chapter (peer-reviewed)
    • In this chapter, we describe the close interaction of linguists and language technologists in the Swedish constructicon project. This kind of collaboration is not so common today, because of the way that language technology has developed in recent decades, but in our case the collaboration has been very successful, and constituted a genuine instance of cross-fertilization, where an evolving language technology infrastructure and a computational lexical macroresource described in the chapter has formed an integral part of the Swedish constructicon development environment, while at the same time the structured linguistic knowledge described in the constructicon has informed the language technology making up the infrastructure.
9.
  • Borin, Lars, 1957, et al. (author)
  • Representing Swedish Lexical Resources in RDF with lemon
  • 2014
  • In: Proceedings of the ISWC 2014 Posters & Demonstrations Track, a track within the 13th International Semantic Web Conference (ISWC 2014). - 1613-0073. ; 1272, pp. 329-332
  • Conference paper (other academic/artistic)
    • The paper presents an ongoing project which aims to publish Swedish lexical-semantic resources using Semantic Web and Linked Data technologies. In this article, we highlight the practical conversion methods and challenges of converting three of the Swedish language resources in RDF with lemon.
10.
  • Borin, Lars, 1957, et al. (author)
  • Swedish FrameNet++
  • 2010
  • In: Swedish Language Technology Conference 2010.
  • Journal article (peer-reviewed)
11.
  • Borin, Lars, 1957, et al. (author)
  • The past meets the present in Swedish FrameNet++
  • 2010
  • In: 14th EURALEX International Congress. ; , pp. 269-281
  • Conference paper (peer-reviewed)
    • The paper is about a recently initiated project which aims at the development of a Swedish FrameNet as an integral part of a larger lexical resource, hence the name “Swedish FrameNet++” (SweFN++). It focuses on reuse of free electronic resources and their role in the acquisition and population of Swedish frames. After a brief overview of Swedish resources, we reflect on three approaches to recycling the available lexical data in a semi-automatic manner. SweFN++ will be a multi-functional resource supporting research within lexicology and linguistics as well as different applications within computational lexicography and language technology, not to mention e-science.
12.
  • Borin, Lars, 1957, et al. (author)
  • Thinking Green: Toward Swedish FrameNet++
  • 2009
  • In: FrameNet Masterclass and Workshop.
  • Conference paper (other academic/artistic)
    • Access to multi-layered lexical, grammatical and semantic information representing text content is a prerequisite for efficient automatic understanding and generation of natural language. A FrameNet is considered a valuable resource for both linguistics and language technology research that may contribute to the achievement of these goals. Currently, FrameNet-like resources exist for a few languages, including some domain-specific and multilingual initiatives (Dolbey et al., 2006; Boas, 2009; Uematsu et al., 2009; Venturi et al., 2009), but are unavailable for most languages, including Swedish, although there have been some pilot studies exploring the semi-automatic acquisition of Swedish frames (Johansson & Nugues, 2006; Borin et al., 2007). At the University of Gothenburg, we are now embarking on a project to build a Swedish FrameNet-like resource. A novel feature of this project is that the Swedish FrameNet will be an integral part of a larger many-faceted lexical resource. Hence the name Swedish FrameNet++ (SweFN++).
13.
  • Caprotti, Olga, 1964, et al. (author)
  • High-quality translation: Molto tools and applications
  • 2012
  • In: The fourth Swedish Language Technology Conference (SLTC).
  • Conference paper (peer-reviewed)
    • MOLTO (Multilingual On Line Translation, FP7-ICT-247914, www.molto-project.eu) is a European project focusing on translation on the web. MOLTO targets translation that has production quality, that is, usable for quick and reliable dissemination of information. MOLTO’s main focus is to increase the productivity of such translation systems, building on the technology of GF (Grammatical Framework) and its Resource Grammar Library. But MOLTO also develops hybrid methods which increase the quality of Statistical Machine Translation (SMT) by adding linguistic information, or bootstrap grammatical models from statistical models. This paper gives a brief overview of MOLTO’s latest achievements, many of which are more thoroughly described in separate papers and available as web-based demos and as open-source software.
14.
  • Damova, Mariana, et al. (author)
  • Natural Language Interaction with Semantic Web Knowledge Bases and Linked Open Data
  • 2014
  • In: Towards multilingual Semantic Web. - Berlin : Springer. - 9783662435854 ; , pp. 211-226
  • Book chapter (peer-reviewed)
    • Cultural heritage appears to be a very useful use case for Semantic Web technologies. The domain provides with plenty of circumstances where linkages between different knowledge sources are required to ensure access to rich information and respond to the needs of professionals dealing with cultural heritage content. Semantic Web technologies offer the technological backbone to meet the requirement of integrating heterogeneous data easily, but they are still more adapted to be consumed by computers than by humans, especially non-engineers or developers. This chapter is about a technique which allows interaction in natural language with semantic knowledge bases. The proposed technique offers a method that allows querying a semantic repository in natural language and obtaining results from it as a coherent text. This unique solution includes several steps of transition from natural language to SPARQL and from RDF to coherent multilingual descriptions, using the Grammatical Framework, GF. The approach builds on a semantic knowledge infrastructure in RDF, it is based on OWLIM-SE and the data integration method Reason-able View supplied with an ontological reference layer. The latter is connected via formal rules with abstract representations derived from the syntactic trees of natural language input using the GF resource grammar library.
15.
  • Dannélls, Dana, 1976, et al. (author)
  • A Framework for Improved Access to Museum Databases in the Semantic Web
  • 2011
  • In: Language Technologies for Digital Humanities and Cultural Heritage (RANLPDigHum 2011). - 9789544520199 ; , pp. 3-10
  • Conference paper (peer-reviewed)
    • Digital museum databases have extremely heterogeneous data structures which require advanced mapping and vocabulary integration for them to benefit from the interoperability enabled by semantic technologies. In addition to establishing ways of extracting and manipulating digitally encoded cultural material, there exists a need to make this material available and accessible to human users in different forms and languages that are available to them. In this paper we describe a method to manage and access museum data by integrating it within a series of interlinked ontological models. The method allows querying and generation of query results in natural language. We report on the results of applying this method from experiments we have been pursuing.
16.
  •  
17.
  • Dannélls, Dana, 1976, et al. (author)
  • A Supervised Machine Learning Approach for Post-OCR Error Detection for Historical Text
  • 2021
  • In: Linköping Electronic Press Workshop and Conference Collection. Selected contributions from the Eighth Swedish Language Technology Conference (SLTC-2020), 25-27 November, 2020. - Linköping : Linköping Electronic Press. - 2003-6523.
  • Conference paper (peer-reviewed)
    • Training machine learning models with high accuracy requires careful feature engineering, which involves finding the best feature combinations and extracting their values from the data. The task becomes extremely laborious for specific problems such as post Optical Character Recognition (OCR) error detection because of the diversity of errors in the data. In this paper we present a machine learning approach which exploits character n-gram statistics as the only feature for the OCR error detection task. Our method achieves a significant improvement over the baseline reaching state-of-the-art results of 91% and 89% F1 measure on English and Swedish datasets respectively. We report various experiments to select the appropriate machine learning algorithm and to compare our approach to previously reported traditional approaches.
18.
  • Dannélls, Dana, 1976 (author)
  • A System Architecture for Conveying Historical Knowledge to Museum Visitors
  • 2008
  • In: Proceedings of ECDL 2008, Århus, Denmark. - 9789081348911
  • Conference paper (other academic/artistic)
    • One of the requirements posed by cultural organizations is how to accommodate cultural content of formal ontology object descriptions to different user needs. This paper introduces a personal museum guide system architecture that is being developed to exploit linguistic aspects of realization of a domain-specific ontology in relation to the user’s interaction with this ontology.
19.
  • Dannélls, Dana, 1976, et al. (author)
  • A Two-OCR Engine Method for Digitized Swedish Newspapers
  • 2021
  • In: Selected Papers from the CLARIN Annual Conference 2020, Linköping Electronic Conference Proceedings 180. - Linköping : Linköping University Electronic Press. - 1650-3686 .- 1650-3740. - 9789179296094
  • Conference paper (peer-reviewed)
    • In this paper we present a two-OCR engine method that was developed at Kungliga biblioteket (KB), the National Library of Sweden, for improving the correctness of the OCR for mass digitization of Swedish newspapers. To evaluate the method a reference material spanning the years 1818–2018 was prepared and manually transcribed. A quantitative evaluation was then performed against the material. In this first evaluation we experimented with word lists for different time periods. The results show that even though there was no significant overall improvement of the OCR results, some combinations of word lists are successful for certain periods and should therefore be explored further.
20.
  • Dannélls, Dana, 1976 (author)
  • Applying semantic frame theory to automate natural language templates generation from ontology statements
  • 2010
  • In: The 6th International Natural Language Generation Conference.
  • Conference paper (peer-reviewed)
    • Today there exist a growing number of framenet-like resources offering semantic and syntactic phrase specifications that can be exploited by natural language generation systems. In this paper we present on-going work that provides a starting point for exploiting framenet information for multilingual natural language generation. We describe the kind of information offered by modern computational lexical resources and discuss how template-based generation systems can benefit from them.
21.
  • Dannélls, Dana, 1976 (author)
  • Automatic Acronym Recognition
  • 2006
  • In: Proceedings of the 11th conference on European chapter of the Association for Computational Linguistics (EACL). - 1932432590
  • Conference paper (peer-reviewed)
    • This paper deals with the problem of recognizing and extracting acronym-definition pairs in Swedish medical texts. This project applies a rule-based method to solve the acronym recognition task and compares and evaluates the results of different machine learning algorithms on the same task. The method proposed is based on the approach that acronym-definition pairs follow a set of patterns and other regularities that can be usefully applied for the acronym identification task. Supervised machine learning was applied to monitor the performance of the rule-based method, using Memory Based Learning (MBL). The rule-based algorithm was evaluated on a hand tagged acronym corpus and performance was measured using standard measures recall, precision and f-score. The results show that performance could further improve by increasing the training set and modifying the input settings for the machine learning algorithms. An analysis of the errors produced indicates that further improvement of the rule-based method requires the use of syntactic information and textual pre-processing.
22.
  • Dannélls, Dana, 1976, et al. (author)
  • Beyond strings of characters: Resources meet NLP – Again
  • 2022
  • In: Live and learn: Festschrift in honor of Lars Borin / Editors: Elena Volodina, Dana Dannélls, Aleksandrs Berdicevskis, Markus Forsberg, Shafqat Virk. - Göteborg : Institutionen för svenska, flerspråkighet och språkteknologi, Göteborgs universitet. - 1401-5919. - 9789187850837 ; , pp. 29-37
  • Book chapter (peer-reviewed)
    • FrameNet (FN) resources have existed for many languages for over a decade but their adoption in real world applications has been limited. To celebrate the 65th anniversary of Lars Borin, the initiator and leader of Swedish FrameNet, among others, we take a standpoint to motivate why language resources are crucial for moving NLP forward. We present our position on (a) the need for language resources to embrace other dimensions of text and language use, and (b) the need for them to relate to other representations through multimodality.
23.
  • Dannélls, Dana, 1976, et al. (author)
  • Building a Language Technology Infrastructure for Digital Humanities: Challenges, Opportunities and Progress
  • 2020
  • In: Proceedings of the Twin Talks 2 and 3 Workshops at DHN 2020 and DH 2020 Ottawa Canada and Riga Latvia, July 23 and October 20, 2020 / edited by Steven Krauwer, Darja Fišer. - : CEUR-WS.org. - 1613-0073.
  • Conference paper (peer-reviewed)
    • Språkbanken Text, a research unit at the University of Gothenburg, forms part of the National Language Bank of Sweden and is the main coordinating node of Swe-Clarin, the Swedish national CLARIN node. During the past years, Språkbanken Text has been actively engaged in a number of humanities and social sciences related research projects. This engagement has primarily concerned the development of new resources, methods and tools to accurately process large amounts of digitized material, in addition to interfaces for visualizing the materials, making them easily accessible for further analysis. The activities within Swe-Clarin have been essential for the progress and the success of this work. In this paper we present what was required from Språkbanken Text in order to meet the expectations of researchers from the humanities and social sciences. We discuss some of the challenges this work involves and describe the opportunities this field brings with it and how these opportunities could help to progress the work of Språkbanken Text toward building a language technology infrastructure that supports interdisciplinary research.
24.
  • Dannélls, Dana, 1976, et al. (author)
  • Computational representation of FrameNet for multilingual natural language generation
  • 2021
  • In: The Swedish FrameNet++. Harmonization, integration, method development and practical language technology applications. - Amsterdam / Philadelphia : John Benjamins Publishing Company. - 1567-8202. - 9789027258489 ; , pp. 281-301
  • Book chapter (peer-reviewed)
    • Multilingual natural language generation, the process of producing written or spoken utterances in parallel languages from either structured or unstructured representations requires large amounts of syntactic and semantic information to generate an expression that is tailored to the target audience. This information is offered by FrameNet-like resources, which have been developed for a number of languages. In this chapter, we present a computational FrameNet grammar resource for multilingual natural language generation. We compare between English and Swedish framenets to illustrate how these can be unified under a shared computational representation using Grammatical Framework. We demonstrate how the grammar was exploited in two practical multilingual natural language generation applications to facilitate tourist communication and empower museum users with coherent artwork descriptions.
25.
  • Dannélls, Dana, 1976, et al. (author)
  • Controlled Natural Language Generation from a Multilingual FrameNet-based Grammar
  • 2014
  • In: Lecture Notes in Computer Science. - Cham : Springer International Publishing. - 0302-9743 .- 1611-3349. - 9783319102221 ; 8625, pp. 155-166
  • Conference paper (peer-reviewed)
    • This paper presents a currently bilingual but potentially multilingual FrameNet-based grammar library implemented in Grammatical Framework. The contribution of this paper is two-fold. First, it offers a methodological approach to automatically generate the grammar based on semantico-syntactic valence patterns extracted from FrameNet-annotated corpora. Second, it provides a proof of concept for two use cases illustrating how the acquired multilingual grammar can be exploited in different CNL applications in the domains of arts and tourism.
Type of publication
conference paper (48)
book chapter (6)
journal article (3)
report (2)
other publication (2)
collection (editorship) (1)
book (1)
proceedings (editorship) (1)
doctoral thesis (1)
research review (1)
Type of content
peer-reviewed (44)
other academic/artistic (22)
Author/editor
Dannélls, Dana, 1976 (66)
Borin, Lars, 1957 (16)
Forsberg, Markus, 19 ... (9)
Enache, Ramona, 1985 (8)
Ranta, Aarne, 1963 (7)
Gruzitis, Normunds (7)
Friberg Heppin, Kari ... (6)
Virk, Shafqat, 1979 (6)
Damova, Mariana (6)
Toporowska Gronostaj ... (5)
Kokkinakis, Dimitrio ... (5)
Adesam, Yvonne, 1975 (4)
Olsson, Leif-Jöran, ... (4)
Mateva, Maria (4)
Tahmasebi, Nina, 198 ... (3)
Bouma, Gerlof, 1979 (3)
Volodina, Elena, 197 ... (3)
Johansson, Torsten (3)
Björk, Lars (3)
Johansson, Richard, ... (2)
Berdicevskis, Aleksa ... (2)
Morger, Felix (2)
Dobnik, Simon, 1977 (2)
Camilleri, John J., ... (2)
Ahlberg, Malin, 1986 (1)
Angelov, Krasimir, 1 ... (1)
Uppström, Jonatan (1)
Nivre, Joakim (1)
Malmsten, Martin (1)
Megyesi, Beata (1)
Sahlgren, Magnus (1)
Prasad, K V S, 1952 (1)
Brodén, Daniel, 1975 (1)
Lyngfelt, Benjamin, ... (1)
Caprotti, Olga, 1964 (1)
Kurtz, Robin (1)
Öhman, Joey (1)
Isbister, Tim (1)
Lindahl, Anna, 1988 (1)
Rekathati, Faton (1)
Börjeson, Love (1)
Hengchen, Simon, 198 ... (1)
Sheikh, Muhammad Aza ... (1)
Torrent, Tiago Timpo ... (1)
McCrae, John P. (1)
Détrez, Grégoire (1)
Hallgren, Thomas, 19 ... (1)
Ilinykh, Nikolai, 19 ... (1)
Deleger, Louise (1)
Chechev, Milen (1)
University
University of Gothenburg (66)
Chalmers University of Technology (4)
Language
English (66)
Research subject (UKÄ/SCB)
Natural sciences (65)
Humanities (22)
Engineering and technology (1)
