Many a little makes a mickle - infrastructure component reuse for a massively multilingual linguistic study
- Borin, Lars, 1957 (author)
- Gothenburg University (Göteborgs universitet), Department of Swedish (Institutionen för svenska språket)
- Virk, Shafqat, 1979 (author)
- Gothenburg University (Göteborgs universitet), Department of Swedish (Institutionen för svenska språket)
- Saxena, Anju (author)
- Linköping : Linköping University Electronic Press, 2018
- English.
- In: Selected papers from the CLARIN Annual Conference 2017, Budapest, 18–20 September 2017. Linköping : Linköping University Electronic Press. ISSN 1650-3686, 1650-3740. ISBN 9789176852736
- Related links:
https://gup.ub.gu.se...
Abstract
- We present ongoing work aiming at turning the linguistic material available in Grierson’s classical Linguistic Survey of India (LSI) into a digital language resource, a database suitable for a broad array of linguistic investigations of the languages of South Asia and studies relating to language typology and contact linguistics. The project has two concrete main aims: (1) to conduct a linguistic investigation of the claim that South Asia constitutes a linguistic area; (2) to develop state-of-the-art language technology for automatically extracting the relevant information from the text of the LSI. In this presentation we focus on how, in the first part of the project, a number of existing research infrastructure components provided by Swe-Clarin, the Swedish CLARIN consortium, have been ‘recycled’ in order to allow the linguists involved in the project to quickly orient themselves in the vast LSI material, and to be able to provide input to the language technologists designing the tools for information extraction from the descriptive grammars.
Subject headings
- HUMANIORA -- Språk och litteratur -- Studier av enskilda språk (hsv//swe)
- HUMANITIES -- Languages and Literature -- Specific Languages (hsv//eng)
- HUMANIORA -- Språk och litteratur -- Jämförande språkvetenskap och allmän lingvistik (hsv//swe)
- HUMANITIES -- Languages and Literature -- General Language Studies and Linguistics (hsv//eng)
- NATURVETENSKAP -- Data- och informationsvetenskap -- Språkteknologi (hsv//swe)
- NATURAL SCIENCES -- Computer and Information Sciences -- Language Technology (hsv//eng)
Keywords
- corpus infrastructure
- lexicon infrastructure
- Swe-Clarin
- large-scale comparative linguistics
- linguistic database
- language typology
- areal linguistics
- genetic linguistics
- South Asian languages
- language technology
Publication and Content Type
- ref (peer-reviewed)
- kon (conference paper)