SwePub
Sök i LIBRIS databas

  Extended search

id:"swepub:oai:DiVA.org:kth-347155"
 

Search: id:"swepub:oai:DiVA.org:kth-347155" > The Chordinator :

  • 1 of 1
  • Previous record
  • Next record
  •    To hitlist

The Chordinator : Modeling Music Harmony by Implementing Transformer Networks and Token Strategies

Dalmazzo, David (author)
KTH
Deguernel, Ken (author)
Univ Lille, CNRS, UMR 9189, Cent Lille,CRIStAL, F-59000 Lille, France.
Sturm, Bob, 1975- (author)
KTH,Tal, musik och hörsel, TMH
KTH Univ Lille, CNRS, UMR 9189, Cent Lille,CRIStAL, F-59000 Lille, France (creator_code:org_t)
Springer Nature, 2024
2024
English.
In: ARTIFICIAL INTELLIGENCE IN MUSIC, SOUND, ART AND DESIGN, EVOMUSART 2024. - : Springer Nature. ; , s. 52-66
  • Conference paper (peer-reviewed)
Abstract Subject headings
Close  
  • This paper compares two tokenization strategies for modeling chord progressions using the encoder transformer architecture trained with a large dataset of chord progressions in a variety of styles. The first strategy includes a tokenization method treating all different chords as unique elements, which results in a vocabulary of 5202 independent tokens. The second strategy expresses the chords as a dynamic tuple describing root, nature (e.g., major, minor, diminished, etc.), and extensions (e.g., additions or alterations), producing a specific vocabulary of 59 tokens related to chords and 75 tokens for style, bars, form, and format. In the second approach, MIDI embeddings are added into the positional embedding layer of the transformer architecture, with an array of eight values related to the notes forming the chords. We propose a trigram analysis addition to the dataset to compare the generated chord progressions with the training dataset, which reveals common progressions and the extent to which a sequence is duplicated. We analyze progressions generated by the models comparing HITS@k metrics and human evaluation of 10 participants, rating the plausibility of the progressions as potential music compositions from a musical perspective. The second model reported lower validation loss, better metrics, and more musical consistency in the suggested progressions.

Subject headings

NATURVETENSKAP  -- Data- och informationsvetenskap -- Språkteknologi (hsv//swe)
NATURAL SCIENCES  -- Computer and Information Sciences -- Language Technology (hsv//eng)

Keyword

Chord progressions
Transformer Neural Networks
Music Generation

Publication and Content Type

ref (subject category)
kon (subject category)

To the university's database

  • 1 of 1
  • Previous record
  • Next record
  •    To hitlist

Find more in SwePub

By the author/editor
Dalmazzo, David
Deguernel, Ken
Sturm, Bob, 1975 ...
About the subject
NATURAL SCIENCES
NATURAL SCIENCES
and Computer and Inf ...
and Language Technol ...
Articles in the publication
By the university
Royal Institute of Technology

Search outside SwePub

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Close

Copy and save the link in order to return to this view