Sökning: id:"swepub:oai:DiVA.org:kth-347151" >
Investigating the V...
Investigating the Viability of Masked Language Modeling for Symbolic Music Generation in abc-notation
-
- Casini, Luca (författare)
- KTH,Tal, musik och hörsel, TMH
-
- Jonason, Nicolas (författare)
- KTH,Tal, musik och hörsel, TMH
-
- Sturm, Bob, 1975- (författare)
- KTH,Tal, musik och hörsel, TMH
-
(creator_code:org_t)
- Springer Nature, 2024
- 2024
- Engelska.
-
Ingår i: ARTIFICIAL INTELLIGENCE IN MUSIC, SOUND, ART AND DESIGN, EVOMUSART 2024. - : Springer Nature. ; , s. 84-96
- Relaterad länk:
-
https://urn.kb.se/re...
-
visa fler...
-
https://doi.org/10.1...
-
visa färre...
Abstract
Ämnesord
Stäng
- The dominating approach for modeling sequences (e.g. text, music) with deep learning is the causal approach, which consists in learning to predict tokens sequentially given those preceding it. Another paradigm is masked language modeling, which consists of learning to predict the masked tokens of a sequence in no specific order, given all non-masked tokens. Both approaches can be used for generation, but the latter is more flexible for editing, e.g. changing the middle of a sequence. This paper investigates the viability of masked language modeling applied to Irish traditional music represented in the text-based format abc-notation. Our model, called abcMLM, enables a user to edit tunes in arbitrary ways while retaining similar generation capabilities to causal models. We find that generation using masked language modeling is more challenging, but leveraging additional information from a dataset, e.g., imputing musical structure, can generate sequences that are on par with previous models.
Ämnesord
- NATURVETENSKAP -- Data- och informationsvetenskap -- Språkteknologi (hsv//swe)
- NATURAL SCIENCES -- Computer and Information Sciences -- Language Technology (hsv//eng)
Nyckelord
- Symbolic Music Generation
- Masked Language Models
- Irish Traditional Music
Publikations- och innehållstyp
- ref (ämneskategori)
- kon (ämneskategori)