SwePub

Results list for search "L773:1749 5032 OR L773:1755 1676"


  • Results 1-10 of 15
1.
  • Allwood, Jens, et al. (author)
  • Construction and annotation of a corpus of contemporary Nepali
  • 2008
  • In: Corpora. - : Edinburgh University Press. - 1749-5032 .- 1755-1676. ; 3:2, pp. 213-225
  • Journal article (peer-reviewed)
    • In this paper, we describe the construction of the 14-million-word Nepali National Corpus (NNC). This corpus includes both spoken and written data, the latter incorporating a Nepali match for FLOB and a broader collection of text. Additional resources within the NNC include parallel data (English–Nepali and Nepali–English) and a speech corpus. The NNC is encoded as Unicode text and marked up in CES-compatible XML. The whole corpus is also annotated with part-of-speech tags. We describe the process of devising a tagset and retraining tagger software for the Nepali language, for which there were no existing corpus resources. Finally, we explore some present and future applications of the corpus, including lexicography, NLP, and grammatical research.
2.
  • Fuoli, Matteo, et al. (author)
  • Optimising transparency, reliability and replicability: annotation principles and inter-coder agreement in the quantification of evaluative expressions
  • 2015
  • In: Corpora. - : Edinburgh University Press. - 1755-1676 .- 1749-5032. ; 10:3, pp. 315-349
  • Journal article (peer-reviewed)
    • Manual corpus annotation facilitates exhaustive and detailed corpus-based analyses of evaluation that would not be possible with purely automatic techniques. However, manual annotation is a complex and subjective process. Most studies adopting this approach have paid insufficient attention to the methodological challenges involved in manually annotating evaluation - especially concerning transparency, reliability and replicability. This article illustrates a procedure for annotating evaluative expressions in text that facilitates more transparent, reliable and replicable analyses. The method is demonstrated through a case study analysis of APPRAISAL (Martin and White, 2005) in a small-size specialised corpus of CEO letters published by the British energy company, BP, and four competitors before and after the Deepwater Horizon oil spill of 2010. Drawing on Fuoli and Paradis's (2014) model of trust-repair discourse, we examine how ATTITUDE and ENGAGEMENT resources are strategically deployed by BP's CEO in the attempt to repair stakeholders' trust after the accident.
3.
  • Glynn, Dylan, et al. (author)
  • Cognitive Corpus Linguistics : Five points of debate on current theory and methodology
  • 2010
  • In: Corpora. - : Edinburgh University Press. - 1755-1676 .- 1749-5032. ; 5:1, pp. 1-27
  • Journal article (peer-reviewed)
    • Within cognitive linguistics, there is an increasing awareness that the study of linguistic phenomena needs to be grounded in usage. Ideally, research in cognitive linguistics should be based on authentic language use, its results should be replicable, and its claims falsifiable. Consequently, more and more studies now turn to corpora as a source of data. While corpus-based methodologies have increased in sophistication, the use of corpus data is also associated with a number of unresolved problems. The study of cognition through off-line linguistic data is, arguably, indirect, even if such data fulfils desirable qualities such as being natural, representative and plentiful. Several topics in this context stand out as particularly pressing issues. This discussion note addresses (1) converging evidence from corpora and experimentation, (2) whether corpora mirror psychological reality, (3) the theoretical value of corpus linguistic studies of ‘alternations’, (4) the relation of corpus linguistics and grammaticality judgments, and, lastly, (5) the nature of explanations in cognitive corpus linguistics. We do not claim to resolve these issues nor to cover all possible angles; instead, we strongly encourage reactions and further discussion.
4.
  •  
5.
  • Kaatari, Henrik, PhD, 1982-, et al. (author)
  • Introducing the Swedish Learner English Corpus : a corpus that enables investigations of the impact of extramural activities on L2 writing
  • 2024
  • In: Corpora. - : Edinburgh University Press. - 1749-5032 .- 1755-1676. ; 19:1, pp. 17-30
  • Journal article (peer-reviewed)
    • This paper introduces the Swedish Learner English Corpus (SLEC), which consists of argumentative texts in English that are written by Swedish junior and senior high school students. SLEC includes rich metadata, enabling empirical studies of various extra-linguistic variables. Most noteworthy is the inclusion of detailed information on students’ extramural English activities (EE), such as reading, watching, conversing, gaming and engaging in social media in English. In addition, a sub-set of texts from SLEC have been assessed for proficiency using the Common European Framework of Reference for Languages (CEFR). This paper provides an overview of the corpus compilation process, the metadata, and the available versions of SLEC. Researchers, teachers and students can access this resource to investigate various aspects of second language use and development, such as the impact of extramural language activities on linguistic complexity.
6.
  • Kaatari, Henrik (author)
  • On the syntactic status of I'm sure
  • 2018
  • In: Corpora. - : Edinburgh University Press. - 1749-5032 .- 1755-1676. ; 13:1, pp. 1-25
  • Journal article (peer-reviewed)
    • This study tests whether the syntactic status of the subject-adjective combination I'm/I am sure is similar to the subject-verb combination I think (i.e., whether it exhibits the same signs of grammaticalisation along two different parameters). More specifically, the study is concerned with the ability of I'm/I am sure to (i) occur in clause-medial and clause-final position, and with (ii) its preference for that-omission, by comparing the behaviour of I'm/I am sure with the results reported for I think in previous studies. The results show that I'm/I am sure behaves in a similar way to I think both in terms of its ability to occur in clause-medial and clause-final position, and in terms of its preference for that-omission. However, SURE is both much less frequent than THINK in general, and is also proportionally less dominant among the class of adjectival predicates followed by that-clauses than THINK is among verbal predicates. This makes it difficult to argue that they have developed independently through the same frequency correlation. Instead, I argue that SURE and THINK are part of the same grammaticalised constructional schema, and that the frequency of THINK could be seen to have an impact on the grammatical status of the parallel construction with SURE.
7.
  •  
8.
  • Larsson, Tove, et al. (author)
  • On the status of statistical reporting versus linguistic description in corpus linguistics : A ten-year perspective
  • 2022
  • In: Corpora. - : Edinburgh University Press. - 1749-5032 .- 1755-1676. ; 17:1, pp. 137-157
  • Journal article (peer-reviewed)
    • This study investigates (i) whether there has been a shift towards increased statistical focus in corpus linguistic research articles, and if so, (ii) whether it has had any repercussions for the attention paid to linguistic description. This is done through an analysis of the relative focus on statistical reporting vs. linguistic description in the way the results are reported and discussed in research articles published in four major corpus linguistics journals in 2009 and 2019. The results display a marked change: In 2009, a clear majority of the articles exhibit a preference for linguistic description over statistical reporting; in 2019, the exact opposite is true. The number of different statistical techniques employed has also gone up. While the increased statistical focus may reflect increased methodological sophistication, our results show that it has come at a cost: A diminished focus on linguistic description, evident, for example, through fewer text excerpts and linguistic examples, appears to be symptomatic of increasing distance from the language that is the object of study. We discuss these shifts and suggest some ways of employing sophisticated statistical techniques without sacrificing a focus on language.
9.
  • Montoro, Rocío, et al. (author)
  • Subordination as a potential marker of complexity in serious and popular fiction : A corpus stylistic approach to the testing of literary critical claims
  • 2019
  • In: Corpora. - : Edinburgh University Press. - 1749-5032 .- 1755-1676. ; 14:3, pp. 275-299
  • Journal article (peer-reviewed)
    • In this paper, we use a corpus stylistic methodology to investigate whether serious (i.e., ‘literary’) fiction is syntactically more complex than popular (i.e., ‘genre’) fiction. This is on the basis of literary critical claims that the structural complexity of serious fiction is one of the features that distinguishes it from popular literature (which, by contrast, is seen as easier to read). We compare the serious and popular fiction sections of the Lancaster Speech, Writing and Thought Presentation corpus (see Semino and Short, 2004) against various samples of the British National Corpus available in Wmatrix (Rayson, 2009), focussing particularly (though not exclusively) on the identification of subordinating conjunctions. We find that, on this measure, there is no basis for claiming that serious fiction is any more complex syntactically than popular fiction. We then investigate the issue in relation to a specific genre of popular fiction, Chick Lit. Here we find that while syntactic simplicity exists, this is at a phrasal rather than a clausal level. We argue that by using a corpus stylistic approach we are able to qualify accurately certain literary critical claims about syntactic complexity as a distinguishing feature of serious and popular fiction, and to propose a refined hypothesis which might be used in further studies of the syntactic structures used in these two text types.
10.
  • Norberg, Cathrine (author)
  • Male and female shame : a corpus-based study of emotion
  • 2012
  • In: Corpora. - : Edinburgh University Press. - 1749-5032 .- 1755-1676. ; 7:2, pp. 159-185
  • Journal article (peer-reviewed)
    • In this study, I investigate the representation of the emotion terms shame, ashamed and shameless in relation to women and men in late twentieth-century British English. The study is based on analyses of examples of shame retrieved from the British National Corpus with the specific aim to study in what contexts men and women express shame or are associated with it, and evaluate whether the emotion is represented as negative or positive. I present two general models of shame, where the first model concentrates on a negative connection between shame and pain, exposure and embodiment, and the second model describes shame as a necessary ingredient of social life that makes people recommit to socially sanctioned behaviour and values. Most examples of women's shame in the material correspond to the description given in the first model, whereas the majority of the examples of men's shame correspond with the second. The two models illustrate how shame functions to preserve hierarchical gender structures.