To drop or not to drop? Predicting the omission of the infinitival marker in a Swedish future construction
- Berdicevskis, Aleksandrs, 1983 (author)
- Gothenburg University (Göteborgs universitet), Språkbanken Text, Department of Swedish, Multilingualism, Language Technology
- Coussé, Evie, 1980 (author)
- Gothenburg University (Göteborgs universitet), Department of Languages and Literatures
- Koplenig, Alexander, 1980 (author)
- Adesam, Yvonne, 1975 (author)
- Gothenburg University (Göteborgs universitet), Språkbanken Text, Department of Swedish, Multilingualism, Language Technology
- 2024
- English.
- In: Corpus Linguistics and Linguistic Theory. ISSN 1613-7027, 1613-7035; 20:1, pp. 219-261
- Related links:
https://gup.ub.gu.se...
https://doi.org/10.1...
Abstract
- We investigate the optional omission of the infinitival marker in a Swedish future tense construction. Over the last two decades, the frequency of omission has increased rapidly, and this process has received considerable attention in the literature. We test whether the accumulated knowledge can yield accurate predictions of language variation and change. We extracted all occurrences of the construction from a very large collection of corpora. The dataset was automatically annotated with language-internal predictors that have previously been shown or hypothesized to affect the variation. We trained several models to make two kinds of predictions: whether the marker will be omitted in a specific utterance, and how large the proportion of omissions will be in a given time period. For most of the approaches we tried, we were unable to achieve better-than-baseline performance. The only exception was predicting the proportion of omissions using autoregressive integrated moving average models for one-step-ahead forecasts, and in this case time was the only predictor that mattered. Our data suggest that most of the language-internal predictors do have some effect on the variation, but the effect is not strong enough to yield reliable predictions.
Subject headings
- NATURVETENSKAP -- Data- och informationsvetenskap -- Språkteknologi (hsv//swe)
- NATURAL SCIENCES -- Computer and Information Sciences -- Language Technology (hsv//eng)
- HUMANIORA -- Språk och litteratur -- Jämförande språkvetenskap och allmän lingvistik (hsv//swe)
- HUMANITIES -- Languages and Literature -- General Language Studies and Linguistics (hsv//eng)
Keywords
- corpus
- language change
- language variation
- logistic regression
- predictive approach
- Swedish
- time-series analysis
Publication and Content Type
- ref (subject category)
- art (subject category)