Classifying European Court of Human Rights Cases Using Transformer-Based Techniques

↓ Direkt till sidans innehåll
↓ Direkt till sidans sekundära innehåll (sidomenyn)

Sökning: id:"swepub:oai:DiVA.org:lnu-121563" > Classifying Europea...

1 av 1
Föregående post
Nästa post
Till träfflistan

Classifying European Court of Human Rights Cases Using Transformer-Based Techniques

Imran, Ali Shariq (författare): Norwegian University of Science and Technology, Norway

Hodnefjeld, Henrik (författare): Norwegian University of Science and Technology, Norway

Kastrati, Zenun, 1984- (författare): Linnéuniversitetet,Institutionen för informatik (IK),Institutionen för datavetenskap och medieteknik (DM)

visa fler...

Fatima, Noureen (författare): Sukkur IBA University, Pakistan

Daudpota, Sher Muhammad (författare): Sukkur IBA University, Pakistan

Wani, Mudasir Ahmad (författare): Prince Sultan University, Saudi Arabia

visa färre...

(creator_code:org_t)

IEEE, 2023
2023
Engelska.
Ingår i: IEEE Access. - : IEEE. - 2169-3536. ; 11, s. 55664-55676

Relaterad länk:: https://doi.org/10.1...; visa fler...; https://lnu.diva-por... (primary) (Raw object); https://urn.kb.se/re...; https://doi.org/10.1...; visa färre...

Tidskriftsartikel (refereegranskat)

Abstract Ämnesord

Stäng

In the field of text classification, researchers have repeatedly shown the value of transformer-based models such as Bidirectional Encoder Representation from Transformers (BERT) and its variants. Nonetheless, these models are expensive in terms of memory and computational power but have not been utilized to classify long documents of several domains. In addition, transformer models are also often pre-trained on generalized languages, making them less effective in language-specific domains, such as legal documents. In the natural language processing (NLP) domain, there is a growing interest in creating newer models that can handle more complex input sequences and domain-specific languages. Keeping the power of NLP in mind, this study proposes a legal documentation classifier that classifies the legal document by using the sliding window approach to increase the maximum sequence length of the model. We used the ECHR (European Court of Human Rights) publicly available dataset which to a large extent is imbalanced. Therefore, to balance the dataset we have scrapped the case articles from the web and extracted the data. Then, we employed conventional machine learning techniques such as SVM, DT, NB, AdaBoost, and transformer-based neural networks models including BERT, Legal-BERT, RoBERTa, BigBird, ELECTRA, and XLNet for the classification task. The experimental findings show that RoBERTa outperformed all the mentioned BERT versions by obtaining precision, recall, and F1-score of 89.1%, 86.2%, and 86.7%, respectively. While from conventional machine learning techniques, AdaBoost outclasses SVM, DT, and NB by achieving scores of 81.9%, 81.5%, and 81.7% for precision, recall, and F1-score, respectively.

Hitta via bibliotek

IEEE Access (Sök värdpublikationen i LIBRIS)

Till lärosätets databas

1 av 1
Föregående post
Nästa post
Till träfflistan

Hitta mer i SwePub

Av författaren/redakt...: Imran, Ali Shari ...; Hodnefjeld, Henr ...; Kastrati, Zenun, ...; Fatima, Noureen; Daudpota, Sher M ...; Wani, Mudasir Ah ...

Om ämnet

NATURVETENSKAP: NATURVETENSKAP; och Data och informa ...; och Systemvetenskap ...

Artiklar i publikationen: IEEE Access

Av lärosätet: Linnéuniversitetet

Sök utanför SwePub

Sök vidare i:: Google; Google Book Search; Google Scholar

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

LIBRIS.kb.se

Classifying European Court of Human Rights Cases Using Transformer-Based Techniques

Ämnesord

Nyckelord

Publikations- och innehållstyp

Hitta via bibliotek

Till lärosätets databas

Hitta mer i SwePub

Sök utanför SwePub