<div class="csl-bib-body">
<div class="csl-entry">Shashaani, S., & Knees, P. (2025). Towards Playlist Continuation Through Large-Scale Context and Audio-Based Music Representations. In A. Ferraro, L. Porcaro, & C. Bauer (Eds.), <i>Proceedings of the 3rd Music Recommender Systems Workshop (MuRS 2025) co-located with the 19th ACM Conference on Recommender Systems (RecSys 2025)</i>. CEUR-WS.</div>
</div>
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/223057
-
dc.description.abstract
Music recommendation research faces several challenges when modeling the complex relationships between users, items, and the circumstances under which they interact. In spite of access to commercial catalogs and large customer bases, academic research builds its findings on publicly shared datasets. However, these often only contain selected data modalities, limited catalogs, or temporally restricted snapshots of interaction data. Moreover, they might eventually vanish due to licensing issues. A strategy to overcome some of these limitations could consist in learning multimodal representation learning for playlist continuation. For instance, while metadata and interaction data can be used to learn item representations, content-based data can be used to predict representations for tracks where audio is available but interaction data is lacking.
To address this specific case, in this work, as a first pointer into the overall direction, we explore the integration of deep audio features extracted directly from MP3 files for music playlist completion. We first generate track embeddings using a Convolutional Neural Network (CNN) trained on a subset of MP3 files with the Spotify Million Playlist Dataset (MPSD), using pre-learned Word2Vec embeddings as labels. These embeddings serve as item representations in sequential recommender models such as Bidirectional Encoder Representations from Transformers for Sequential Recommendation (BERT4Rec) and Self-Attentive Sequential Recommendation (SASRec). We evaluate four approaches: (1) training the entire recommender model from scratch, (2) incorporating Word2Vec embeddings as item vector in recommenders, (3) incorporating CNN-predicted embeddings only for the last tracks in a playlist while using Word2Vec embeddings for others, and train the remaining model’s parameter, and (4) replacing the CNN with a dilated CNN in the third approach. Our experiments show that audio-based features can enhance playlist continuation, especially in cold-start scenarios, while offering potential for improved explainability over traditional metadata-based methods.
en
dc.description.sponsorship
WWTF Wiener Wissenschafts-, Forschu und Technologiefonds
-
dc.language.iso
en
-
dc.subject
Music Recommendation
en
dc.subject
Representation Learning
en
dc.subject
Item Embeddings
en
dc.subject
Playlist Completion
en
dc.title
Towards Playlist Continuation Through Large-Scale Context and Audio-Based Music Representations
en
dc.type
Inproceedings
en
dc.type
Konferenzbeitrag
de
dc.contributor.editoraffiliation
Pandora/SiriusXM, Barcelona, Spain
-
dc.contributor.editoraffiliation
Department of Computer, Control and Management Engineering - Sapienza – Università di Roma (Rome, IT)
-
dc.contributor.editoraffiliation
Focus area "InterMediation - Music - Effect - Analysis" - Wissenschaft & Kunst (Salzburg, AT)
-
dc.relation.issn
1613-0073
-
dc.relation.grantno
DCDH-001
-
dc.rights.holder
The authors.
-
dc.type.category
Full-Paper Contribution
-
tuw.booktitle
Proceedings of the 3rd Music Recommender Systems Workshop (MuRS 2025) co-located with the 19th ACM Conference on Recommender Systems (RecSys 2025)
-
tuw.container.volume
4045
-
tuw.peerreviewed
true
-
tuw.relation.publisher
CEUR-WS
-
tuw.project.title
Vienna Doctoral College on Digital Humanism
-
tuw.researchTopic.id
I4
-
tuw.researchTopic.name
Information Systems Engineering
-
tuw.researchTopic.value
100
-
tuw.publication.orgunit
E194-04 - Forschungsbereich Data Science
-
tuw.publication.orgunit
E056-23 - Fachbereich Innovative Combinations and Applications of AI and ML (iCAIML)
-
tuw.publication.orgunit
E056-27 - Fachbereich Digital Humanism
-
dc.description.numberOfPages
11
-
tuw.author.orcid
0000-0003-4344-2696
-
tuw.author.orcid
0000-0003-3906-1292
-
tuw.editor.orcid
0000-0003-1236-2503
-
tuw.editor.orcid
0000-0003-0218-5187
-
tuw.editor.orcid
0000-0001-5724-1137
-
tuw.event.name
MuRS 2025: Music Recommender Systems Workshop 2025
en
tuw.event.startdate
22-09-2025
-
tuw.event.enddate
22-09-2025
-
tuw.event.online
On Site
-
tuw.event.type
Event for scientific audience
-
tuw.event.place
Prag
-
tuw.event.country
CZ
-
tuw.event.presenter
Shashaani, Shahrzad
-
tuw.event.track
Single Track
-
wb.sciencebranch
Informatik
-
wb.sciencebranch.oefos
1020
-
wb.sciencebranch.value
100
-
item.grantfulltext
restricted
-
item.languageiso639-1
en
-
item.cerifentitytype
Publications
-
item.openairecristype
http://purl.org/coar/resource_type/c_5794
-
item.fulltext
no Fulltext
-
item.openairetype
conference paper
-
crisitem.author.dept
E194-04 - Forschungsbereich Data Science
-
crisitem.author.dept
E194 - Institut für Information Systems Engineering
-
crisitem.author.orcid
0000-0003-4344-2696
-
crisitem.author.orcid
0000-0003-3906-1292
-
crisitem.author.parentorg
E194 - Institut für Information Systems Engineering
-
crisitem.author.parentorg
E180 - Fakultät für Informatik
-
crisitem.project.funder
WWTF Wiener Wissenschafts-, Forschu und Technologiefonds