SwePub
Results for search "WFRF:(Ahlberg Malin 1986)"

  • Results 1-16 of 16
1.
  •  
2.
  • Adesam, Yvonne, 1975, et al. (author)
  • Computer-aided Morphology Expansion for Old Swedish
  • 2014
  • In: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), May 26-31, 2014, Reykjavik, Iceland. - 9782951740884 ; pp. 1102-1105
  • Conference paper (peer-reviewed)
    • In this paper we describe and evaluate a tool for paradigm induction and lexicon extraction that has been applied to Old Swedish. The tool is semi-supervised and uses a small seed lexicon and unannotated corpora to derive full inflection tables for input lemmata. In the work presented here, the tool has been modified to deal with the rich spelling variation found in Old Swedish texts. We also present some initial experiments, which are the first steps towards creating a large-scale morphology for Old Swedish.
3.
  • Adesam, Yvonne, 1975, et al. (author)
  • FSvReader – Exploring Old Swedish Cultural Heritage Texts
  • 2018
  • In: CEUR Workshop Proceedings, vol. 2084. Proceedings of the Digital Humanities in the Nordic Countries 3rd Conference, Helsinki, Finland, March 7-9, 2018. Edited by Eetu Mäkelä, Mikko Tolonen, Jouni Tuominen. - Helsinki : University of Helsinki, Faculty of Arts. - 1613-0073.
  • Conference paper (peer-reviewed)
    • This paper describes FSvReader, a tool for easier access to Old Swedish (13th–16th century) texts. Through automatic fuzzy linking of words in a text to a dictionary describing the language of the time, the reader has direct access to dictionary pop-up definitions, in spite of the large amount of morphological and spelling variation. The linked dictionary entries can also be used for simple searches in the text, highlighting possible further instances of the same entry.
4.
  •  
5.
  • Adesam, Yvonne, 1975, et al. (author)
  • Språkteknologi för svenska språket genom tiderna
  • 2016
  • In: Kungliga Skytteanska Samfundets Handlingar. - Umeå : Institutionen för språkstudier, Umeå universitet & Kungl. Skytteanska Samfundet. - 0560-2416. ; 76:Studier i svensk språkhistoria 13, pp. 65-87
  • Journal article (peer-reviewed)
    • Språkbanken, the Swedish Language Bank, is a language technology research unit at the Department of Swedish, University of Gothenburg. We develop language resources – such as corpora, lexical resources, and analytical tools – for all variants of Swedish, from Old Swedish laws to present-day social media. Historical texts offer exciting theoretical and methodological challenges for language technology because they often defy the assumption inherent in most automatic analysis tools that the texts contain a standardized written language. In this article, we describe our ongoing work on the development of annotated historical corpora, as well as our efforts on linking various resources (both corpora and lexical resources). This research advances the state of the art of language technology as well as enables new research for scholars in other disciplines.
6.
  • Ahlberg, Malin, 1986, et al. (author)
  • A best-first anagram hashing filter for approximate string matching with generalized edit distance
  • 2012
  • In: 24th International Conference on Computational Linguistics COLING, 8-15 December 2012, Mumbai, India. Proceedings.
  • Conference paper (peer-reviewed)
    • This paper presents an efficient method for approximate string matching against a lexicon. We define a filter that for each source word selects a small set of target lexical entries, from which the best match is then selected using generalized edit distance, where edit operations can be assigned an arbitrary weight. The filter combines a specialized hash function with best-first search. Our work extends and improves upon a previously proposed hash-based filter, developed for matching with uniform-weight edit distance. We evaluate an approximate matching system implemented with the new best-first filter, by conducting several experiments on a historical corpus and a set of weighted rules taken from the literature. We present running times and discuss how performance varies using different stopping criteria and target lexica. The results show that the filter is suitable for large rule sets and million word corpora, and encourage further development.
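The general idea of an anagram-keyed filter can be illustrated with a minimal sketch. This is not the filter evaluated in the paper: it only covers candidates within one uniform-weight character edit, and the names `anagram_key`, `build_index` and `candidates`, as well as the exponent 5 in the key, are assumptions made for illustration.

```python
from collections import defaultdict


def anagram_key(word: str) -> int:
    """Order-independent key: anagrams of each other get the same value.

    The exponent 5 is an assumption chosen for this sketch.
    """
    return sum(ord(ch) ** 5 for ch in word)


def build_index(lexicon):
    """Map each anagram key to the set of lexicon entries sharing it."""
    index = defaultdict(set)
    for entry in lexicon:
        index[anagram_key(entry)].add(entry)
    return index


def candidates(word, index, alphabet):
    """Return lexicon entries whose characters differ from `word` by at most
    one insertion, deletion or substitution, ignoring character order."""
    key = anagram_key(word)
    alphabet_values = [ord(ch) ** 5 for ch in alphabet]
    word_values = {ord(ch) ** 5 for ch in word}

    deltas = {0}                                  # same multiset of characters
    deltas.update(alphabet_values)                # one character inserted
    for removed in word_values:
        deltas.add(-removed)                      # one character deleted
        for added in alphabet_values:
            deltas.add(added - removed)           # one character substituted

    found = set()
    for delta in deltas:
        found |= index.get(key + delta, set())
    return found


if __name__ == "__main__":
    index = build_index(["konung", "kunung", "land"])
    print(candidates("konong", index, "abcdefghijklmnopqrstuvwxyz"))
```

Because the key ignores character order, the returned set is only a shortlist; a real system would still rank the shortlisted entries with an edit distance, which in the paper is a weighted one.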
7.
  • Ahlberg, Malin, 1986, et al. (author)
  • A case study on supervised classification of Swedish pseudo-coordination
  • 2015
  • In: Proceedings of the 20th Nordic Conference of Computational Linguistics, NODALIDA 2015, May 11-13, 2015, Vilnius, Lithuania. - Linköpings universitet : Linköping University Electronic Press. - 1650-3686 .- 1650-3740. - 9789175190983
  • Conference paper (peer-reviewed)
    • We present a case study on supervised classification of Swedish pseudo-coordination (SPC). The classification is attempted on the type-level with data collected from two data sets: a blog corpus and a fiction corpus. Two small experiments were designed to evaluate the feasibility of this task. The first experiment explored a classifier’s ability to discriminate pseudo-coordinations from ordinary verb coordinations, given a small labeled data set created during the experiment. The second experiment evaluated how well the classifier performed at detecting and ranking SPCs in a set of unlabeled verb coordinations, to investigate if it could be used as a semi-automatic discovery procedure to find new SPCs.
8.
  • Ahlberg, Malin, 1986, et al. (author)
  • A Type-Theoretical Wide-Coverage Computational Grammar for Swedish
  • 2012
  • In: Proceedings of the 15th International Conference, TSD (Text, Speech and Dialogue) 2012, Brno, Czech Republic, September 3-7, 2012, LNCS series "Text, Speech and Dialogue". - 0302-9743. - 9783642327902 ; 7499, pp. 183-190
  • Conference paper (peer-reviewed)
9.
  •  
10.
  • Ahlberg, Malin, 1986, et al. (author)
  • Korp and Karp – a bestiary of language resources: the research infrastructure of Språkbanken
  • 2013
  • In: Proceedings of the 19th Nordic Conference of Computational Linguistics (NODALIDA 2013), May 22–24, 2013, Oslo University, Norway. NEALT Proceedings Series 16. - Linköping : Linköping University Electronic Press. - 1650-3686 .- 1650-3740.
  • Conference paper (peer-reviewed)
    • A central activity in Språkbanken, an R&D unit at the University of Gothenburg, is the systematic construction of a research infrastructure based on interoperability and widely accepted standards for metadata and data. The two main components of this infrastructure deal with text corpora and with lexical resources. For modularity and flexibility, both components have a backend, or server-side part, accessed through an API made up of a set of well-defined web services. This means that there can be any number of different user interfaces to these components, corresponding, e.g., to different research needs. Here, we will demonstrate the standard corpus and lexicon search interfaces, designed primarily for linguistic searches: Korp and Karp.
11.
  • Ahlberg, Malin, 1986, et al. (author)
  • Paradigm classification in supervised learning of morphology
  • 2015
  • In: Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies.
  • Conference paper (peer-reviewed)
    • Supervised morphological paradigm learning by identifying and aligning the longest common subsequence found in inflection tables has recently been proposed as a simple yet competitive way to induce morphological patterns. We combine this non-probabilistic strategy of inflection table generalization with a discriminative classifier to permit the reconstruction of complete inflection tables of unseen words. Our system learns morphological paradigms from labeled examples of inflection patterns (inflection tables) and then produces inflection tables from unseen lemmas or base forms. We evaluate the approach on datasets covering 11 different languages and show that this approach results in consistently higher accuracies vis-a-vis other methods on the same task, thus indicating that the general method is a viable approach to quickly creating high-accuracy morphological resources.
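The longest-common-subsequence step mentioned in this abstract can be sketched as follows. This is a simplified illustration, not the authors' system: it folds a pairwise LCS over the forms of a table, which yields a common subsequence but is not guaranteed to be the longest one for more than two forms. The function names `lcs` and `common_subsequence` and the toy inflection table are assumptions.

```python
from functools import reduce


def lcs(a: str, b: str) -> str:
    """Longest common subsequence of two strings by dynamic programming."""
    # table[i][j] holds an LCS of a[:i] and b[:j]
    table = [[""] * (len(b) + 1) for _ in range(len(a) + 1)]
    for i, char_a in enumerate(a):
        for j, char_b in enumerate(b):
            if char_a == char_b:
                table[i + 1][j + 1] = table[i][j] + char_a
            else:
                table[i + 1][j + 1] = max(table[i][j + 1], table[i + 1][j], key=len)
    return table[-1][-1]


def common_subsequence(forms):
    """Heuristic: fold pairwise LCS over all forms of an inflection table."""
    return reduce(lcs, forms)


if __name__ == "__main__":
    # Toy inflection table (assumed for illustration only)
    print(common_subsequence(["ring", "rang", "rung"]))
```

The characters shared by all forms are the candidates for the variable part of a paradigm, while the remaining characters carry the inflection pattern.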
12.
  • Ahlberg, Malin, 1986, et al. (author)
  • Semi-supervised learning of morphological paradigms and lexicons
  • 2014
  • In: Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, Gothenburg, Sweden 26–30 April 2014. - 9781937284787 ; pp. 569-578
  • Conference paper (peer-reviewed)
    • We present a semi-supervised approach to the problem of paradigm induction from inflection tables. Our system extracts generalizations from inflection tables, representing the resulting paradigms in an abstract form. The process is intended to be language-independent, and to provide human-readable generalizations of paradigms. The tools we provide can be used by linguists for the rapid creation of lexical resources. We evaluate the system through an inflection table reconstruction task using Wiktionary data for German, Spanish, and Finnish. With no additional corpus information available, the evaluation yields per word form accuracy scores on inflecting unseen base forms in different languages ranging from 87.81% (German nouns) to 99.52% (Spanish verbs); with additional unlabeled text corpora available for training the scores range from 91.81% (German nouns) to 99.58% (Spanish verbs). We separately evaluate the system in a simulated task of Swedish lexicon creation, and show that on the basis of a small number of inflection tables, the system can accurately collect from a list of noun forms a lexicon with inflection information ranging from 100.0% correct (collect 100 words), to 96.4% correct (collect 1000 words).
13.
  • Ahlberg, Malin, 1986, et al. (author)
  • Språkbanken’s Open Lexical Infrastructure
  • 2016
  • In: SLTC 2016. The Sixth Swedish Language Technology Conference. Umeå University, 17-18 November, 2016.
  • Conference paper (other academic/artistic)
    • Karp is an open lexical infrastructure and a web based tool for searching, exploring and developing lexical resources. Språkbanken currently hosts a number of lexicons in Karp and on-going work aims at broadening the type of resources that can be developed in the system. This abstract gives a short overview of Karp's basic functionality, and describes some current projects and on-going work.
14.
  •  
15.
  •  
16.
  • Malm, Per, et al. (author)
  • Uneek: a Web Tool for Comparative Analysis of Annotated Texts
  • 2018
  • In: Proceedings of the LREC 2018 Workshop International FrameNet Workshop 2018: Multilingual Framenets and Constructicons, 7-12 May 2018, Miyazaki (Japan) / [ed] Tiago Timponi Torrent, Lars Borin & Collin F. Baker, 2018. - 9791095546047
  • Conference paper (peer-reviewed)
    • In this paper, we present Uneek, a web based linguistic tool that performs set operations on raw or annotated texts. The tool may be used for automatic distributional analysis, and for disambiguating polysemy with a method that we refer to as semi-automatic uniqueness differentiation (SUDi). Uneek outputs the intersection and differences between their listed attributes, e.g. POS, dependencies, word forms, frame elements. This makes it an ideal supplement to methods for lumping or splitting in frame development processes. In order to make some of Uneek’s functions more clear, we employ SUDi on a small data set containing the polysemous verb "bake". As of now, Uneek may only run two files at a time, but there are plans to develop the tool so that it may simultaneously operate on multiple files. Finally, we relate the developmental plans for added functionality, to how such functions may support FrameNet work in the future.