<div class="csl-bib-body">
<div class="csl-entry">Wu, C.-W., Dittmar, C., Southall, C., Vogl, R., Widmer, G., Hockman, J., Müller, M., & Lerch, A. (2018). A Review of Automatic Drum Transcription. <i>IEEE/ACM Transactions on Audio, Speech and Language Processing</i>, <i>26</i>(9), 1457–1483. https://doi.org/10.1109/taslp.2018.2830113</div>
</div>
-
dc.identifier.issn
2329-9290
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/144490
-
dc.description.abstract
In Western popular music, drums and percussion are an important means to emphasize and shape the rhythm, often defining the musical style. If computers were able to analyze the drum part in recorded music, it would enable a variety of rhythm-related music processing tasks. Especially the detection and classification of drum sound events by computational methods is considered to be an important and challenging research problem in the broader field of music information retrieval. Over the last two decades, several authors have attempted to tackle this problem under the umbrella term automatic drum transcription (ADT). This paper presents a comprehensive review of ADT research, including a thorough discussion of the task-specific challenges, categorization of existing techniques, and evaluation of several state-of-the-art systems. To provide more insights on the practice of ADT systems, we focus on two families of ADT techniques, namely methods based on non-negative matrix factorization and recurrent neural networks. We explain the methods' technical details and drum-specific variations and evaluate these approaches on publicly available data sets with a consistent experimental setup. Finally, the open issues and underexplored areas in ADT research are identified and discussed, providing future directions in this field.
en
dc.language.iso
en
-
dc.publisher
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
-
dc.relation.ispartof
IEEE/ACM Transactions on Audio, Speech and Language Processing
-
dc.subject
Electrical and Electronic Engineering
-
dc.subject
Computer Science (miscellaneous)
-
dc.subject
Computational Mathematics
-
dc.subject
Deep Learning
-
dc.subject
Neural Networks
-
dc.subject
Drum Transcription
-
dc.subject
Automatic Music Transcription
-
dc.subject
Acoustics and Ultrasonics
-
dc.subject
Non-Negative Matrix Factorization
-
dc.title
A Review of Automatic Drum Transcription
en
dc.type
Artikel
de
dc.type
Article
en
dc.description.startpage
1457
-
dc.description.endpage
1483
-
dc.type.category
Original Research Article
-
tuw.container.volume
26
-
tuw.container.issue
9
-
tuw.journal.peerreviewed
true
-
tuw.peerreviewed
true
-
wb.publication.intCoWork
International Co-publication
-
tuw.researchTopic.id
I4a
-
tuw.researchTopic.name
Information Systems Engineering
-
tuw.researchTopic.value
100
-
dcterms.isPartOf.title
IEEE/ACM Transactions on Audio, Speech and Language Processing
-
tuw.publication.orgunit
E194-01 - Forschungsbereich Software Engineering
-
tuw.publisher.doi
10.1109/taslp.2018.2830113
-
dc.identifier.eissn
2329-9304
-
dc.description.numberOfPages
27
-
tuw.author.orcid
0000-0002-9019-6515
-
tuw.author.orcid
0000-0002-3220-2446
-
tuw.author.orcid
0000-0002-1860-158X
-
tuw.author.orcid
0000-0003-3531-1282
-
tuw.author.orcid
0000-0001-6319-578X
-
wb.sci
true
-
wb.sciencebranch
Informatik
-
wb.sciencebranch.oefos
1020
-
wb.facultyfocus
Information Systems Engineering (ISE)
de
wb.facultyfocus
Information Systems Engineering (ISE)
en
wb.facultyfocus.faculty
E180
-
item.grantfulltext
none
-
item.openairecristype
http://purl.org/coar/resource_type/c_2df8fbb1
-
item.openairetype
research article
-
item.languageiso639-1
en
-
item.cerifentitytype
Publications
-
item.fulltext
no Fulltext
-
crisitem.author.dept
E194-04 - Forschungsbereich Data Science
-
crisitem.author.dept
E192-02 - Forschungsbereich Databases and Artificial Intelligence
-
crisitem.author.parentorg
E194 - Institut für Information Systems Engineering