Sökning: onr:"swepub:oai:gup.ub.gu.se/339981" >
Name Biases in Auto...
Name Biases in Automated Essay Assessment
-
- Muñoz Sánchez, Ricardo, 1992 (författare)
- Gothenburg University,Göteborgs universitet,Språkbanken Text, Institutionen för svenska, flerspråkighet och språkteknologi,Institutionen för svenska, flerspråkighet och språkteknologi,Språkbanken Text, Department of Swedish, multilingualism, language technology,Department of Swedish, Multilingualism, Language Technology
-
- Dobnik, Simon, 1977 (författare)
- Gothenburg University,Göteborgs universitet,Institutionen för filosofi, lingvistik och vetenskapsteori,Department of Philosophy, Linguistics and Theory of Science
-
Lindström Tiedemann, Therese, 1976 (författare)
-
visa fler...
-
- Szawerna, Maria Irena (författare)
- Gothenburg University,Göteborgs universitet,Språkbanken Text, Institutionen för svenska, flerspråkighet och språkteknologi,Institutionen för svenska, flerspråkighet och språkteknologi,Språkbanken Text, Department of Swedish, multilingualism, language technology,Department of Swedish, Multilingualism, Language Technology
-
- Volodina, Elena, 1973 (författare)
- Gothenburg University,Göteborgs universitet,Språkbanken Text, Institutionen för svenska, flerspråkighet och språkteknologi,Institutionen för svenska, flerspråkighet och språkteknologi,Språkbanken Text, Department of Swedish, multilingualism, language technology,Department of Swedish, Multilingualism, Language Technology
-
visa färre...
-
(creator_code:org_t)
- 2024
- 2024
- Engelska.
-
Ingår i: The 28th International Congress of Onomastic Sciences (ICOS 28).
- Relaterad länk:
-
https://gup.ub.gu.se...
Abstract
Ämnesord
Stäng
- Artificial intelligence is being deployed in high-stakes situations, such as automated grading of second language essays in proficiency assessment. While they can improve the opportunities students have (education, work opportunities, etc.), such systems often display human-like biases. Aldrin (2017) notes that human graders have a slight bias based on names appearing in essay texts. We aim to identify whether the same pattern holds in automated systems. In this study we aim to answer the following research questions: 1) Does changing given names inside a second language learner essay affect the way the text is graded? 2) How much does this differ between feature-based machine learning and deep learning? For this, we use a de-anonymized (i.e. original) version of the Swell-pilot corpus of second language Swedish learner essays (Volodina 2016), which consists of 502 essays annotated with CEFR levels as our source data. First, we compile four lists of given names inspired by those of Aldrin (2017): traditional Swedish names; modern Swedish names of Anglo-American origin; Finnish names (due to the close sociocultural links between both countries); and names of Arabic origin (the most prominent group of learners in the corpus). Second, we create a diagnostic dataset to identify biases in the classification task. We select SweLL-pilot essays in which a given name appears only once. Then, we generate an essay version for each name on the lists by substituting the name in the original text with one from the list. Third, we fine-tune a BERT (Devlin et al. 2019) model on the original SweLL-pilot data to predict the CEFR level of a given essay and compare it to an existing feature-based model (Pilan 2016). Finally, we test the two models and compare the equality of opportunity between the different given name groups on the diagnostic dataset.
Ämnesord
- NATURVETENSKAP -- Data- och informationsvetenskap -- Språkteknologi (hsv//swe)
- NATURAL SCIENCES -- Computer and Information Sciences -- Language Technology (hsv//eng)
Nyckelord
- bias and fairness
- nlp
- natural language processing
- pseudonymization
- automated essay scoring
- second language assessment
Publikations- och innehållstyp
- vet (ämneskategori)
- kon (ämneskategori)