SwePub
Sök i SwePub databas

  Utökad sökning

Träfflista för sökning "WFRF:(Frid Johan) ;mspu:(publicationother)"

Sökning: WFRF:(Frid Johan) > Annan publikation

  • Resultat 1-7 av 7
Sortera/gruppera träfflistan
   
NumreringReferensOmslagsbildHitta
1.
  • Carling, Gerd, et al. (författare)
  • DiACL : Diachronic Atlas of Comparative Linguistics
  • 2017
  • Annan publikation (övrigt vetenskapligt/konstnärligt)abstract
    • DiACL is an open access database with lexical and typological/morphosyntactic data for historical, comparative and phylogenetic linguistics. It contains data from 500 languages of 18 families, divided into three macro-areas: Eurasia, Pacific, and the Amazon.
  •  
2.
  • Frid, Johan (författare)
  • Prediction of intonation patterns of accented words in a corpus of read Swedish news
  • 2001
  • Annan publikation (övrigt vetenskapligt/konstnärligt)abstract
    • This paper describes an initial attempt at the construction of a data-driven model of Swedish intonation. The study is mainly concerned with model building and prediction of the intonation patterns of accented words in a corpus of read news in Swedish. Extraction of pitch information is achieved by performing a stylization of the pitch contours. The information is used to build a model for the prediction of pitch patterns using linguistic features such as accent type and position of stress. The model is tested against unseen data from the same corpus. The evaluation is done by numerical comparisons. The RMSE between predicted and original contours for the different categories ranges between 3.7 and 31.4 Hz. The results are quite promising for future studies.
  •  
3.
  • Frid, Johan (författare)
  • Swedish word stress in optimality theory
  • 2001
  • Annan publikation (övrigt vetenskapligt/konstnärligt)abstract
    • The purpose of this paper is to give an introduction to how lexical word stress in Swedish can be analysed with modern phonological theories as metrical phonology (Liberman 1975) and optimality theory (Prince & Smolensky 1993). Central concepts and structures within the phonological theories are introduced and discussed, and examples of how the word stress pattern of Swedish can be treated within optimality theory (OT) are given. We will deal both with monomorphemic words, as well as compound words and affixes.
  •  
4.
  • Horne, Merle, et al. (författare)
  • Discourse markers and the segmentation of spontaneous speech - The case of Swedish men 'but/and/so'
  • 1999
  • Annan publikation (övrigt vetenskapligt/konstnärligt)abstract
    • Prosodic and lexical correlates of ‘clause-like’ and ‘paragraph-like’ boundaries associated with the Swedish discourse marker men ‘but/and/so’ are examined. Men-tokens in spontaneous monologues were labelled as to their boundary-status, first using text-only data. The ‘strong’ tokens (labelled identically by all labellers) were subsequently seen to be correlated with clear differences in the prosodic and lexical parameters examined. This tendency was not found for the corresponding ‘weak’ tokens which were subsequently relabelled using both text and speech nor for the data-base as a whole. A test using a neural network trained using strong tokens is seen to be able to correctly categorize 90% of the strong men-tokens as to their associated boundary-type. The results show that discourse markers along with their prosodic and lexical correlates constitute a constellation of important information for understanding how segmentation of speech is produced and understood
  •  
5.
  • Horne, Merle, et al. (författare)
  • Hesitation disfluencies after the clause marker ATT ‘that’ in Swedish
  • 2005
  • Annan publikation (övrigt vetenskapligt/konstnärligt)abstract
    • This study aims att developing a methodology for investigating the relationship between the fluent and disfluent productions of the Swedish conjunction ATT ‘that’ and the complexity of speech fragments following them. A study of the syntactic structure of the speech fragments following ATT and their relation to the pragmatic structure of the discourse, in particular the fragments’ role as regards the topic structure of the discourse, was made using data from one speaker. Syntactic word order patterns reveal that the pragmatic coherence between two clauses decreases with the use of disfluent ATT as compared to fluent ATT. Disfluent ATT tends to signal a new topic rather than topic continuation, and an elaboration rather than clarification, where clarification is more strongly bound to the preceding utterance. It was observed that even emotional factors correlate with to the production of disfluent ATT. Before empathetic quotations – fragments that imply recognition or imagination of other’s emotions – disfluent ATT may signal a change in the deictic centre as compared to the preceding discourse. A number of observations regarding the prosodic correlates of disfluent ATT were also made. Disfluent ATT is almost always followed by a clear prosodic boundary. In all cases but one, this boundary was marked by a silent pause, in some cases including inhalation. It was also observed that the only filled pause that occurred after a disfluent ATT was before a fragment introducing a new topic.
  •  
6.
  • Kazemi Rashed, Salma, et al. (författare)
  • English dictionaries, gold and silver standard corpora for biomedical natural language processing related to SARS-CoV-2 and COVID-19
  • 2020
  • Annan publikation (övrigt vetenskapligt/konstnärligt)abstract
    • Here we present a toolbox for natural language processing tasks related to SARS-CoV-2. It comprises English dictionaries of synonyms for SARS-CoV-2 and COVID-19, a silver standard corpus generated with the dictionaries and a gold standard corpus of 10 Pubmed abstracts manually annotated for disease, virus, symptom and protein/gene terms. This toolbox is freely available on github and can be used for text analytics in a variety of settings related to the COVID-19 crisis. It will be expanded and applied in NLP tasks over the next weeks and the community is invited to contribute.
  •  
7.
  • Kazemi Rashed, Salma, et al. (författare)
  • Files and code for English dictionaries, gold and silver standard corpora for biomedical natural language processing related to SARS-CoV-2 and COVID-19 : Dataset record
  • 2022
  • Annan publikation (övrigt vetenskapligt/konstnärligt)abstract
    • BACKGROUNDAutomated information extraction with natural language processing (NLP) tools is required to gain systematic insights from the large number of COVID-19 publications, reports and social media posts, which far exceed human processing capabilities. A key challenge for NLP is the extensive variation in terminology used to describe medical entities, which was especially pronounced for this newly emergent disease.FINDINGSHere we present an NLP toolbox comprising very large English dictionaries of synonyms for SARS-CoV-2 (including variant names) and COVID-19, which can be used with dictionary-based NLP tools. We also present a silver standard corpus generated with the dictionaries, and a gold standard corpus, consisting of PubMed abstracts manually annotated for disease, virus, symptom, protein/gene, cell type, chemical and species terms, which can be used to train and evaluate COVID-19-related NLP tools. Code for annotation, which can be used to expand the silver standard corpus or for text mining is also included. This toolbox is freely available on Github (on https://github.com/Aitslab/corona) and here.CONCLUSIONSThe toolbox can be used for a variety of text analytics tasks related to the COVID-19 crisis and has already been used to create a COVID-19 knowledge graph, study the variability and evolution of COVID-19-related terminology and develop and benchmark text mining tools.
  •  
Skapa referenser, mejla, bekava och länka
  • Resultat 1-7 av 7

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Stäng

Kopiera och spara länken för att återkomma till aktuell vy