A Greek Parliament Proceedings Dataset for Computational Linguistics and Political Analysis
- Dritsa, Konstantina (author)
- Athens University of Economics and Business
- Thoma, Kaiti (author)
- Athens University of Economics and Business
- Pavlopoulos, John (author)
- Stockholm University, Department of Computer and Systems Sciences
- Louridas, Panos (author)
- Athens University of Economics and Business
- Neural Information Processing Systems, 2022
- 2022
- English.
- Related links:
https://openreview.n...
https://su.diva-port... (primary) (Raw object)
https://urn.kb.se/re...
Abstract
- Large, diachronic datasets of political discourse are hard to come across, especially for resource-lean languages such as Greek. In this paper, we introduce a curated dataset of the Greek Parliament Proceedings that extends chronologically from 1989 up to 2020. It consists of more than 1 million speeches with extensive metadata, extracted from 5,355 parliamentary record files. We explain how it was constructed and the challenges that we had to overcome. The dataset can be used for both computational linguistics and political analysis, ideally combining the two. We present such an application, showing (i) how the dataset can be used to study the change of word usage through time, (ii) between significant historical events and political parties, (iii) by evaluating and employing algorithms for detecting semantic shifts.
Subject headings
- NATURAL SCIENCES -- Computer and Information Sciences -- Information Systems (hsv//eng)
Keywords
- Computer and Systems Sciences
Publication and content type
- ref (subject category)
- kon (subject category)