↓ Direkt till sidans innehåll
↓ Direkt till sidans sekundära innehåll (sidomenyn)

Träfflista för sökning "WFRF:(Näsman Jesper) "

Sökning: WFRF:(Näsman Jesper)

Resultat 1-6 av 6

Sortera/gruppera träfflistan

Sortering: Träffar per sida:

Numrering	Referens	Omslagsbild	Hitta
1.	Borin, Lars, et al. (författare) Swe-Clarin : Language Resources and Technology for Digital Humanities 2016 Ingår i: <em>Extended Papers of the International Symposium on Digital Humanities</em>. - : CEUR. ; , s. 29-51, s. 29-51 Konferensbidrag (refereegranskat)abstract CLARIN is a European Research Infrastructure Consortium (ERIC), which aims at (a) making extensive language-based materials available as primary research data to the humanities and social sciences (HSS); and (b) offering state-of-the-art language technology (LT) as an eresearch tool for this purpose, positioning CLARIN centrally in what is often referred to as the digital humanities (DH). The Swedish CLARIN node Swe-Clarin was established in 2015 with funding from the Swedish Research Council.In this paper, we describe the composition and activities of Swe-Clarin, aiming at meeting the requirements of all HSS and other researchers whose research involves using text and speech as primary research data, and spreading the awareness of what Swe-Clarin can offer these research communities. We focus on one of the central means for doing this: pilot projects conducted in collaboration between HSS researchers and Swe-Clarin, together formulating a research question, the addressing of which requires working with large language-based materials. Four such pilot projects are described in more detail, illustrating research on rhetorical history, second-language acquisition, literature, and political science. A common thread to these projects is an aspiration to meet the challenge of conducting research on the basis of very large amounts of textual data in a consistent way without losing sight of the individual cases making up the mass of data, i.e., to be able to move between Moretti’s “distant” and “close reading” modes.While the pilot projects clearly make substantial contributions to DH, they also reveal some needs for more development, and in particular a need for document-level access to the text materials. As a consequence of this, work has now been initiated in Swe-Clarin to meet this need, so that Swe-Clarin together with HSS scholars investigating intricate research questions can take on the methodological challenges of big-data language-based digital humanities.
2.	Borin, Lars, 1957, et al. (författare) Swe-Clarin: Language resources and technology for Digital Humanities 2017 Ingår i: Digital Humanities 2016. Extended Papers of the International Symposium on Digital Humanities (DH 2016) Växjö, Sweden, November, 7-8, 2016. Edited by Koraljka Golub, Marcelo Milra. Vol-2021. - Aachen : M. Jeusfeld c/o Redaktion Sun SITE, Informatik V, RWTH Aachen.. - 1613-0073. Konferensbidrag (refereegranskat)abstract CLARIN is a European Research Infrastructure Consortium (ERIC), which aims at (a) making extensive language-based materials available as primary research data to the humanities and social sciences (HSS); and (b) offering state-of-the-art language technology (LT) as an e-research tool for this purpose, positioning CLARIN centrally in what is often referred to as the digital humanities (DH). The Swedish CLARIN node Swe-Clarin was established in 2015 with funding from the Swedish Research Council. In this paper, we describe the composition and activities of Swe-Clarin, aiming at meeting the requirements of all HSS and other researchers whose research involves using text and speech as primary research data, and spreading the awareness of what Swe-Clarin can offer these research communities. We focus on one of the central means for doing this: pilot projects conducted in collaboration between HSS researchers and Swe-Clarin, together formulating a research question, the addressing of which requires working with large language-based materials. Four such pilot projects are described in more detail, illustrating research on rhetorical history, second-language acquisition, literature, and political science. A common thread to these projects is an aspiration to meet the challenge of conducting research on the basis of very large amounts of textual data in a consistent way without losing sight of the individual cases making up the mass of data, i.e., to be able to move between Moretti’s “distant” and “close reading” modes. While the pilot projects clearly make substantial contributions to DH, they also reveal some needs for more development, and in particular a need for document-level access to the text materials. As a consequence of this, work has now been initiated in Swe-Clarin to meet this need, so that Swe-Clarin together with HSS scholars investigating intricate research questions can take on the methodological challenges of big-data language-based digital humanities.
3.	Borin, Lars, 1957, et al. (författare) Swe-Clarin: Language resources and technology for digital humanities 2016 Ingår i: CEUR Workshop Proceedings. - 1613-0073. ; 2021, s. 29-51 Konferensbidrag (refereegranskat)abstract CLARIN is a European Research Infrastructure Consortium (ERIC), which aims at (a) making extensive language-based materials available as primary research data to the humanities and social sciences (HSS); and (b) offering state-of-the-art language technology (LT) as an e-research tool for this purpose, positioning CLARIN centrally in what is often referred to as the digital humanities (DH). The Swedish CLARIN node Swe-Clarin was established in 2015 with funding from the Swedish Research Council. In this paper, we describe the composition and activities of Swe-Clarin, aiming at meeting the requirements of all HSS and other researchers whose research involves using text and speech as primary research data, and spreading the awareness of what Swe-Clarin can offer these research communities. We focus on one of the central means for doing this: pilot projects conducted in collaboration between HSS researchers and Swe-Clarin, together formulating a research question, the addressing of which requires working with large language-based materials. Four such pilot projects are described in more detail, illustrating research on rhetorical history, second-language acquisition, literature, and political science. A common thread to these projects is an aspiration to meet the challenge of conducting research on the basis of very large amounts of textual data in a consistent way without losing sight of the individual cases making up the mass of data, i.e., to be able to move between Moretti’s “distant” and “close reading” modes.
4.	Megyesi, Beáta, 1971-, et al. (författare) SWEGRAM: Annotering och analys av svenska texter 2019 Rapport (övrigt vetenskapligt/konstnärligt)abstract Dokumentet syftar till att beskriva verktyget swegram med vars hjälp du kan genomföra automatisk annotering och lingvistisk analys av svenska och engelska texter eller skapa din egen, lingvistiskt annoterade textsamling, en så kallad korpus. Vi presenterar verktygets beståndsdelar och ger förslag på hur man kan genomföra storskalig, empirisk språklig analys med hjälp av verktyget.
5.	Megyesi, Beata, 1971-, et al. (författare) The Uppsala Corpus of Student Writings : Corpus Creation, Annotation, and Analysis 2016 Ingår i: LREC 2016. - Paris : EUROPEAN LANGUAGE RESOURCES ASSOC-ELRA. - 9782951740891 ; , s. 3192-3199 Konferensbidrag (refereegranskat)abstract The Uppsala Corpus of Student Writings consists of Swedish texts produced as part of a national test of students ranging in age from nine (in year three of primary school) to nineteen (the last year of upper secondary school) who are studying either Swedish or Swedish as a second language. National tests have been collected since 1996. The corpus currently consists of 2,500 texts containing over 1.5 million tokens. Parts of the texts have been annotated on several linguistic levels using existing state-of-the-art natural language processing tools. In order to make the corpus easy to interpret for scholars in the humanities, we chose the CoNLL format instead of an XML-based representation. Since spelling and grammatical errors are common in student writings, the texts are automatically corrected while keeping the original tokens in the corpus. Each token is annotated with part-of-speech and morphological features as well as syntactic structure. The main purpose of the corpus is to facilitate the systematic and quantitative empirical study of the writings of various student groups based on gender, geographic area, age, grade awarded or a combination of these, synchronically or diachronically. The intention is for this to be a monitor corpus, currently under development.
6.	Näsman, Jesper, et al. (författare) SWEGRAM : A Web-Based Tool for Automatic Annotation and Analysis of Swedish Texts 2017 Ingår i: <em>Proceedings of the 21st Nordic Conference on Computational Linguistics</em>, Nodalida 2017.. - Göteborg. - 9789176856017 ; , s. 132-141 Konferensbidrag (refereegranskat)abstract We present SWEGRAM, a web-based tool for the automatic linguistic annotation and quantitative analysis of Swedish text, enabling researchers in the humanities and social sciences to annotate their own text and produce statistics on linguistic and other text-related features on the basis of this annotation. The tool allows users to upload one or several documents, which are automatically fed into a pipeline of tools for tokenization and sentence segmentation, spell checking, part-of-speech tagging and morpho-syntactic analysis as well as dependency parsing for syntactic annotation of sentences. The analyzer provides statistics on the number of tokens, words and sentences, the number of parts of speech (PoS), readability measures, the average length of various units, and frequency lists of tokens, lemmas, PoS, and spelling errors. SWEGRAM allows users to create their own corpus or compare texts on various linguistic levels.

Skapa referenser, mejla, bekava och länka

Länka till träfflistan

Resultat 1-6 av 6

Avgränsa träffmängd

Typ av publikation: konferensbidrag (5); rapport (1)

Typ av innehåll: refereegranskat (5); övrigt vetenskapligt/konstnärligt (1)

Författare/redaktör: Näsman, Jesper (6); Megyesi, Beáta, 1971 ... (4); Wirén, Mats (3); Palmér, Anne, 1961- (3); Grigonyté, Gintaré (3); Palmér, Anne (3); visa fler...; Tahmasebi, Nina, 198 ... (2); Borin, Lars, 1957 (2); Megyesi, Beata (2); Viklund, Jon (2); Jordan, Caspar (2); Ekman, Stefan (2); Volodina, Elena (2); Björkenstam, Kristin ... (2); Gustafson Capková, S ... (2); Kosiński, Tomasz (2); Tahmasebi, Nina (1); Volodina, Elena, 197 ... (1); Viklund, Jon, 1973- (1); Borin, Lars (1); Ekman, Stefan, 1972 (1); Jordan, Caspar, 1984 ... (1); Björkenstam, Kristin ... (1); Capková, Sofia Gusta ... (1); Kosinski, Tomasz, 19 ... (1); visa färre...

Lärosäte: Uppsala universitet (4); Göteborgs universitet (1); Chalmers tekniska högskola (1); Karlstads universitet (1)

Språk: Engelska (5); Svenska (1)

Forskningsämne (UKÄ/SCB): Naturvetenskap (5); Humaniora (4); Samhällsvetenskap (2)

År

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

Copyright © LIBRIS - Nationella bibliotekssystem
LIBRIS.kb.se

pil uppåt

Stäng

Kopiera och spara länken för att återkomma till aktuell vy