2.
- Rennes, Evelina, 1990-
(author)
An Aligned Resource of Swedish Complex-Simple Sentence Pairs
- 2018
- In: Proceedings of the Seventh Swedish Language Technology Conference (SLTC).
- Conference paper (other academic/artistic)
- Abstract:
- We present a method for aligning comparable corpora of simple-complex articles at the sentence level. Three methods were tested: Average Alignment (AA), Maximum Alignment (MA), and Hungarian Alignment (HA). To evaluate the algorithms and find the optimal combination of parameters, a dataset of manually annotated sentences was constructed. The algorithms were evaluated against this dataset, and the best-performing algorithm proved to be MA, which resulted in a corpus comprising 59,513 aligned sentence pairs, of which 17,653 were unique sentences.