Improving music mixability by using rule-based stem modification and contextual information

Sowula, Robert

doi:10.34726/hss.2024.112486

DC Field

Value

Language

dc.contributor.advisor

Knees, Peter

dc.contributor.author

Sowula, Robert

dc.date.accessioned

2024-05-23T10:26:23Z

dc.date.issued

2024

dc.date.submitted

2024-05

dc.identifier.citation

<div class="csl-bib-body"> <div class="csl-entry">Sowula, R. (2024). <i>Improving music mixability by using rule-based stem modification and contextual information</i> [Diploma Thesis, Technische Universität Wien]. reposiTUm. https://doi.org/10.34726/hss.2024.112486</div> </div>

dc.identifier.uri

https://doi.org/10.34726/hss.2024.112486

dc.identifier.uri

http://hdl.handle.net/20.500.12708/197485

dc.description.abstract

This thesis assesses how music source separation (MSS) and contextual information can be used to improve musical similarity measures in the context of automatic music mixing. In particular, we explore how MSS can contribute to the field of music similarity calculation by modifying incompatible stems using a rule-based approach. Additionally, we investigate how audio-based similarity measures can be supplemented by contextual information to capture more aspects of music. In this work, we propose and implement an automatic music mixing system, incorporating a variety of music similarity measures and music information retrieval (MIR) techniques. We also propose a novel approach for tempo detection, outperforming state-of-the-art techniques in low error-tolerance windows. Building upon this system, we implement two additional models, incorporating rule-based stem modification and contextual similarity. To evaluate the performance of our models, we implement a web-based listening survey and performed a listening experiment across our three models and a state-of-the-art model as a baseline. The result of the listening experiment shows that our approach to song selection and automatic music mixing significantly outperforms comparable state-of-the-art. Additionally, we show that our rule-based stem removal approach significantly improves the quality of a mix. Our results do, however, not indicate any improvement in the quality of the mix by including contextual similarity to the music similarity measure. Except for the baseline model, where participants with higher musical knowledge and DJ experience rated the mixes significantly worse, no significant differences in ratings are found for different musical knowledge or DJ experience across our models.

dc.description.abstract

Diese Arbeit evaluiert, wie Music Source Separation (MSS) und kontextuelle Informationen genutzt werden können, um musikalische Ähnlichkeitsmaße für die automatische Mix-Generation zu verbessern. Wir erkunden, wie MSS dem Bereich der musikalischen Ähnlichkeitsberechnung beitragen kann, indem inkompatible Stems mittels eines regelbasierten Ansatzes modifiziert werden. Weiters untersuchen wir, wie audiobasierte Ähnlichkeitsmaße durch kontextuelle Informationen ergänzt werden können, um ein breiteres Spektrum an Aspekten von Musik abzudecken. Im Zuge dieser Arbeit implementieren wir ein System zur automatischen Erstellung von DJ Mixes, welches eine Vielzahl von Musikähnlichkeitsmetriken und Music Information Retrieval (MIR) Techniken integriert. Weiters stellen wir einen neuen Ansatz für die Tempobestimmung von Liedern vor, welcher bei niedriger Fehlertoleranz Ansätze des derzeitigen Standes der Technik übertrifft. Auf dieses System aufbauend, implementieren wir zwei weitere Modelle, welche regelbasierte Stem Modifikation und kontextuelle Informationen integrieren. Um die Leitung unserer Modelle zu evaluieren, implementieren wir eine Webbasierte Audio-Umfrageplattform und führen eine Hörstudie mit unseren drei Modellen und einem weiteren Modell des aktuellen Stands der Technik, welches als Baseline dient, durch. Die Ergebnisse der Hörstudie zeigen, dass unser Ansatz zur Liederauswahl und automatischen Mix Generation den derzeitigen Stand der Technik signifikant übertrifft. Weiters zeigen wir, dass unser regelbasierter Stem Entfernung Ansatz die Qualität des generierten Mixes signifikant erhöht. Durch unsere Ergebnisse kann jedoch keine signifikante Steigerung der Qualität des Mixes durch Ergänzung musikalischer Ähnlichkeitsberechnung durch kontextuelle Informationen nachgewiesen werden. Bis auf das Baseline-Modell, bei dem Studienteilnehmer mit mehr Musikwissen und DJ-Erfahrung den Mix signifikant schlechter bewertet haben, gab es bei unseren Modellen keinen signifikanten Unterschied in den Bewertungen basierend auf dem Musikwissen oder der DJ-Erfahrung der Teilnehmer.

dc.language

English

dc.language.iso

dc.rights.uri

http://rightsstatements.org/vocab/InC/1.0/

dc.subject

Music Similarity

dc.subject

Music Signal Processing

dc.subject

Automatic Music Mixing

dc.subject

Beat-Grid Estimation

dc.subject

Music Source Separation

dc.subject

Contextual Information

dc.subject

Feature Extraction

dc.subject

Music Information Retrieval

dc.title

Improving music mixability by using rule-based stem modification and contextual information

dc.type

Thesis

dc.type

Hochschulschrift

dc.rights.license

In Copyright

dc.rights.license

Urheberrechtsschutz

dc.identifier.doi

10.34726/hss.2024.112486

dc.contributor.affiliation

TU Wien, Österreich

dc.rights.holder

Robert Sowula

dc.publisher.place

Wien

tuw.version

vor

tuw.thesisinformation

Technische Universität Wien

tuw.publication.orgunit

E194 - Institut für Information Systems Engineering

dc.type.qualificationlevel

Diploma

dc.identifier.libraryid

AC17185799

dc.description.numberOfPages

dc.thesistype

Diplomarbeit

dc.thesistype

Diploma Thesis

dc.rights.identifier

In Copyright

dc.rights.identifier

Urheberrechtsschutz

tuw.advisor.staffStatus

staff

tuw.advisor.orcid

0000-0003-3906-1292

item.languageiso639-1

item.openairetype

master thesis

item.openairecristype

http://purl.org/coar/resource_type/c_bdcc

item.grantfulltext

open

item.cerifentitytype

Publications

item.fulltext

with Fulltext

item.mimetype

application/pdf

item.openaccessfulltext

Open Access

crisitem.author.dept

E194 - Institut für Information Systems Engineering

crisitem.author.parentorg

E180 - Fakultät für Informatik

Appears in Collections:

Thesis

Fulltext (Version of Record (published version))

Adobe PDF

(4.17 MB)

In Copyright

Show simple item record

Page view(s)

400

checked on May 23, 2024

Download(s)

516

checked on May 23, 2024

Google Scholar^TM

Check

Page view(s)

Download(s)

Google ScholarTM

Google Scholar^TM