Skip to Main content Skip to Navigation

Natural language processing for music information retrieval : deep analysis of lyrics structure and content

Michael Fell 1
1 WIMMICS - Web-Instrumented Man-Machine Interactions, Communities and Semantics
CRISAM - Inria Sophia Antipolis - Méditerranée , Laboratoire I3S - SPARKS - Scalable and Pervasive softwARe and Knowledge Systems
Abstract : Applications in Music Information Retrieval and Computational Musicology have traditionally relied on features extracted from the music content in the form of audio, but mostly ignored the song lyrics. More recently, improvements in fields such as music recommendation have been made by taking into account external metadata related to the song. In this thesis, we argue that extracting knowledge from the song lyrics is the next step to improve the user’s experience when interacting with music. To extract knowledge from vast amounts of song lyrics, we show for different textual aspects (their structure, content and perception) how Natural Language Processing methods can be adapted and successfully applied to lyrics. For the structuralaspect of lyrics, we derive a structural description of it by introducing a model that efficiently segments the lyricsinto its characteristic parts (e.g. intro, verse, chorus). In a second stage, we represent the content of lyrics by meansof summarizing the lyrics in a way that respects the characteristic lyrics structure. Finally, on the perception of lyricswe investigate the problem of detecting explicit content in a song text. This task proves to be very hard and we showthat the difficulty partially arises from the subjective nature of perceiving lyrics in one way or another depending onthe context. Furthermore, we touch on another problem of lyrics perception by presenting our preliminary resultson Emotion Recognition. As a result, during the course of this thesis we have created the annotated WASABI SongCorpus, a dataset of two million songs with NLP lyrics annotations on various levels.
Document type :
Complete list of metadata
Contributor : Abes Star :  Contact
Submitted on : Monday, February 8, 2021 - 5:04:28 PM
Last modification on : Friday, January 21, 2022 - 3:11:15 AM


Version validated by the jury (STAR)


  • HAL Id : tel-02587910, version 2



Michael Fell. Natural language processing for music information retrieval : deep analysis of lyrics structure and content. Document and Text Processing. Université Côte d'Azur, 2020. English. ⟨NNT : 2020COAZ4017⟩. ⟨tel-02587910v2⟩



Record views


Files downloads