SwePub
Sök i SwePub databas

  Utökad sökning

Träfflista för sökning "swepub ;mspu:(conferencepaper);lar1:(gu);pers:(Borin Lars 1957)"

Sökning: swepub > Konferensbidrag > Göteborgs universitet > Borin Lars 1957

  • Resultat 1-10 av 87
Sortera/gruppera träfflistan
   
NumreringReferensOmslagsbildHitta
1.
  • Virk, Shafqat, 1979, et al. (författare)
  • Exploiting frame semantics and frame-semantic parsing for automatic extraction of typological information from descriptive grammars of natural languages
  • 2019
  • Ingår i: International Conference Recent Advances in Natural Language Processing, RANLP. - Shoumen : Incoma Ltd. - 1313-8502. - 9789544520557 - 9789544520564 ; 2019-September, s. 1247-1256
  • Konferensbidrag (refereegranskat)abstract
    • We describe a novel system for automatic extraction of typological linguistic information from descriptive grammars of natural languages, applying the theory of frame semantics in the form of frame-semantic parsing. The current proof-of-concept system covers a few selected linguistic features, but the methodology is general and can be extended not only to other typological features but also to descriptive grammars written in languages other than English. Such a system is expected to be a useful assistance for automatic curation of typological databases which otherwise are built manually, a very labor and time consuming as well as cognitively taxing enterprise.
  •  
2.
  • Borin, Lars, 1957, et al. (författare)
  • Mining semantics for culturomics: towards a knowledge-based approach
  • 2013
  • Ingår i: 2013 ACM International Workshop on Mining Unstructured Big Data Using Natural Language Processing, UnstructureNLP 2013, Held at 22nd ACM International Conference on Information and Knowledge Management, CIKM 2013; San Francisco, CA; United States; 28 October 2013 through 28 October 2013. - New York, NY, USA : ACM. - 9781450324151 ; , s. 3-10
  • Konferensbidrag (refereegranskat)abstract
    • The massive amounts of text data made available through the Google Books digitization project have inspired a new field of big-data textual research. Named culturomics, this field has attracted the attention of a growing number of scholars over recent years. However, initial studies based on these data have been criticized for not referring to relevant work in linguistics and language technology. This paper provides some ideas, thoughts and first steps towards a new culturomics initiative, based this time on Swedish data, which pursues a more knowledge-based approach than previous work in this emerging field. The amount of new Swedish text produced daily and older texts being digitized in cultural heritage projects grows at an accelerating rate. These volumes of text being available in digital form have grown far beyond the capacity of human readers, leaving automated semantic processing of the texts as the only realistic option for accessing and using the information contained in them. The aim of our recently initiated research program is to advance the state of the art in language technology resources and methods for semantic processing of Big Swedish text and focus on the theoretical and methodological advancement of the state of the art in extracting and correlating information from large volumes of Swedish text using a combination of knowledge-based and statistical methods.
  •  
3.
  • Cap, Fabienne, et al. (författare)
  • SWORD : Towards Cutting-Edge Swedish Word Processing
  • 2016
  • Ingår i: Proceedings of SLTC 2016.
  • Konferensbidrag (refereegranskat)abstract
    • Despite many years of research on Swedish language technology, there is still no well-documented standard for Swedish word processing covering the whole spectrum from low-level tokenization to morphological analysis and disambiguation. SWORD is a new initiative within the SWE-CLARIN consortium aiming to develop documented standards for Swedish word processing. In this paper, we report on a pilot study of Swedish tokenization, where we compare the output of six different tokenizers on four different text types. For one text type (Wikipedia articles), we also compare to the tokenization produced by six manual annotators.
  •  
4.
  • Borin, Lars, 1957, et al. (författare)
  • Language technology for digital linguistics: Turning the Linguistic Survey of India into a rich source of linguistic information
  • 2018
  • Ingår i: Lecture Notes in Computer Science. Computational Linguistics and Intelligent Text Processing, 18th International Conference, CICLing 2017, Budapest, Hungary, April 17–23, 2017. - Cham : Springer. - 0302-9743 .- 1611-3349. ; , s. 550-563
  • Konferensbidrag (refereegranskat)abstract
    • We present our work aiming at turning the linguistic material available in Grierson’s classical Linguistic Survey of India (LSI) from a printed discursive textual description into a formally structured digital language resource, a database suitable for a broad array of linguistic investigations of the languages of South Asia. While doing so, we develop state-of-the-art language technology for automatically extracting the relevant grammatical information from the text of the LSI, and interactive linguistic information visualization tools for better analysis and comparisons of languages based on their structural and functional features.
  •  
5.
  • Bäckström, Linnéa, 1975, et al. (författare)
  • Automatic identification of construction candidates for a Swedish constructicon
  • 2013
  • Ingår i: Proceedings of the workshop on lexical semantic resources for NLP at NODALIDA 2013, May 22-24, 2013, Oslo, Norway. NEALT Proceedings Series 19. - 1650-3686 .- 1650-3740. ; , s. 2-11
  • Konferensbidrag (refereegranskat)abstract
    • We present an experiment designed for extracting construction candidates for a Swedish constructicon from text corpora. We have explored the use of hybrid n-grams with the practical goal to discover previously undescribed partially schematic constructions. The experiment was successful, in that quite a few new constructions were discovered. The precision is low, but as a push-button tool for construction discovery, it has proven a valuable tool for the work on a Swedish constructicon.
  •  
6.
  • Lyngfelt, Benjamin, 1968, et al. (författare)
  • Ett svenskt konstruktikon. Grammatik möter lexikon
  • 2014
  • Ingår i: Svenskans beskrivning : Förhandlingar vid Trettiotredje sammankomsten för svenskans beskrivning. Helsingfors den 15–17 maj 2013. - 1795-4428. - 9789515101204 ; 33, s. 268-279, s. 268-279
  • Konferensbidrag (refereegranskat)
  •  
7.
  • Malm, Per, et al. (författare)
  • LingFN: Towards a framenet for the linguistics domain
  • 2018
  • Ingår i: Proceedings : LREC 2018 Workshop, International FrameNet Workshop 2018. Multilingual Framenets and Constructicons, May 12, 2018, Miyazaki, Japan / Edited by Tiago Timponi Torrent, Lars Borin and Collin F. Baker. - Miyazaki : ELRA. - 9791095546047
  • Konferensbidrag (refereegranskat)abstract
    • Framenets and frame semantics have proved useful for a number of natural language processing (NLP) tasks. However, in this connection framenets have often been criticized for limited coverage. A proposed reasonable-effort solution to this problem is to develop domain-specific (sublanguage) framenets to complement the corresponding general-language framenets for particular NLP tasks, and in the literature we find such initiatives covering, e.g., medicine, soccer, and tourism. In this paper, we report on our experiments and first results on building a framenet to cover the terms and concepts encountered in descriptive linguistic grammars. A contextual statistics based approach is used to judge the polysemous nature of domain-specific terms, and to design new domain-specific frames. The work is part of a more extensive research undertaking where we are developing NLP methodologies for automatic extraction of linguistic information from traditional linguistic descriptions to build typological databases, which otherwise are populated using a labor intensive manual process.
  •  
8.
  • Sköldberg, Emma, 1968, et al. (författare)
  • Between Grammars and Dictionaries: a Swedish Constructicon
  • 2013
  • Ingår i: Kosem, I., Kallas, J., Gantar, P., Krek, S., Langemets, M., Tuulik, M. (eds.) 2013. Electronic lexicography in the 21st century: thinking outside the paper. Proceedings of the eLex 2013 conference, 17-19 October 2013, Tallinn, Estonia. Ljubljana/Tallinn: Trojina, Institute for Applied Slovene Studies/Eesti Keele Instituut.. ; , s. 310-327, s. 310-327
  • Konferensbidrag (övrigt vetenskapligt/konstnärligt)abstract
    • This paper introduces the Swedish Constructicon (SweCxn), a database of Swedish constructions currently under development. We also present a small study of the treatment of constructions in Swedish (paper) dictionaries, thus illustrating the need for a constructionist approach, and discuss three different methods used to identify potential constructions for inclusion in the constructicon. SweCxn is a freely available electronic resource, with a particular focus on semi-general linguistic patterns of the type that are difficult to account for from a purely lexicographic or a purely grammatical perspective, and which therefore have tended to be neglected in both dictionaries and grammars. Far from being a small set of borderline cases, such constructions are both numerous and common. They are also quite problematic for second language acquisition as well as LT applications. Accordingly, various kinds of multi-word units have received more attention in recent years, not least from a lexicographic perspective. The coverage, however, is only partial, and the productivity of many constructions is hard to capture from a lexical viewpoint. To identify constructions for SweCxn, we use a combination of methods, such as working from existing construction descriptions for Swedish and other languages, applying LT tools to discover recurring patterns in texts, and extrapolating constructional information from dictionaries.
  •  
9.
  •  
10.
  • Virk, Shafqat, 1979, et al. (författare)
  • Automatic extraction of typological linguistic features from descriptive grammars
  • 2017
  • Ingår i: Text, Speech, and Dialogue 20th International Conference, TSD 2017, Prague, Czech Republic, August 27-31, 2017, Proceedings / edited by Kamil Ekštein, Václav Matoušek.. - Cham : Springer International Publishing. - 0302-9743 .- 1611-3349. - 9783319642055 ; , s. 111-119
  • Konferensbidrag (refereegranskat)abstract
    • The present paper describes experiments on automatically extracting typological linguistic features of natural languages from traditional written descriptive grammars. The feature-extraction task has high potential value in typological, genealogical, historical, and other related areas of linguistics that make use of databases of structural features of languages. Until now, extraction of such features from grammars has been done manually, which is highly time and labor consuming and becomes prohibitive when extended to the thousands of languages for which linguistic descriptions are available. The system we describe here starts from semantically parsed text over which a set of rules are applied in order to extract feature values. We evaluate the system’s performance on the manually curated Grambank database as the gold standard and report the first measures of precision and recall for this problem.
  •  
Skapa referenser, mejla, bekava och länka
  • Resultat 1-10 av 87
Typ av publikation
Typ av innehåll
refereegranskat (71)
övrigt vetenskapligt/konstnärligt (16)
Författare/redaktör
Forsberg, Markus, 19 ... (34)
Olsson, Leif-Jöran, ... (11)
Tahmasebi, Nina, 198 ... (10)
Kokkinakis, Dimitrio ... (10)
Dannélls, Dana, 1976 (9)
visa fler...
Volodina, Elena, 197 ... (9)
Uppström, Jonatan (8)
Virk, Shafqat, 1979 (8)
Johansson, Richard, ... (7)
Prentice, Julia, 197 ... (5)
Lyngfelt, Benjamin, ... (5)
Sköldberg, Emma, 196 ... (5)
Skadina, Inguna (5)
Saxena, Anju (4)
Adesam, Yvonne, 1975 (4)
Bouma, Gerlof, 1979 (4)
Ahlberg, Malin, 1986 (4)
Olsson, Olof, 1982 (4)
Toporowska Gronostaj ... (4)
Pilán, Ildikó, 1985 (4)
Tingsell, Sofia, 197 ... (4)
Rydstedt, Rudolf, 19 ... (4)
de Smedt, Koenraad (4)
Schumacher, Anne, 19 ... (3)
Saxena, Anju, 1959- (3)
Rama, Taraka, 1986 (3)
Bäckström, Linnéa, 1 ... (3)
Schulz, Stefan (2)
Alfter, David, 1986 (2)
Hammarstedt, Martin (2)
Roxendal, Johan (2)
Friberg Heppin, Kari ... (2)
Calzolari, Nicoletta (2)
Merkel, Magnus (2)
Megyesi, Beata (2)
Zweigenbaum, Pierre (2)
Lindström Tiedemann, ... (2)
Voionmaa, Kaarlo (2)
Viklund, Jon (2)
Brodén, Daniel, 1975 (2)
Ekman, Stefan, 1972 (2)
Jordan, Caspar (2)
Baud, Robert (2)
Wirén, Mats (2)
Lindahl, Anna, 1988 (2)
Fridlund, Mats, 1965 (2)
Miegel, Fredrik (2)
Hammarlin, Mia-Marie (2)
Arnbjörnsdóttir, Bir ... (2)
visa färre...
Lärosäte
Uppsala universitet (5)
Chalmers tekniska högskola (4)
Högskolan i Halmstad (3)
Stockholms universitet (1)
Lunds universitet (1)
visa fler...
Högskolan i Skövde (1)
visa färre...
Språk
Engelska (84)
Svenska (3)
Forskningsämne (UKÄ/SCB)
Naturvetenskap (85)
Humaniora (61)
Samhällsvetenskap (8)

År

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy