<div class="csl-bib-body">
<div class="csl-entry">Brunner, L., & Schörkhuber, D. (2026). A Comparative Analysis of Video- and Pose-based Action Recognition for In-cabin Driver Monitoring. In <i>Electronic Imaging</i>. Electronic Imaging 2026, Burlingame, United States of America (the). https://doi.org/10.2352/EI.2026.38.16.AVM-100</div>
</div>
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/227552
-
dc.description.abstract
We present a comparative study of pose-based vs. video-based Human Action Recognition (HAR) methods for driver monitoring in car cockpits. In this context, comparisons of neural network architectures from the field of deep learning-based video understanding are scarce. However, pose- and video-based HAR has significant potential for advanced driver-assistance systems in semi-autonomous driving on public roads. We compare prediction performance, per-class false-negative rate, model size, computational requirements, and inference latency on the established Drive&Act and the proprietary Driver Action Insight datasets. While the diversity and scale of available datasets make comparisons challenging, results suggest that both approaches benefit from pretraining, but pose- and video-based techniques perform differently for specific action classes, such as those that depend on body motion or the appearance of objects.
en
dc.description.sponsorship
FFG - Österr. Forschungsförderungs- gesellschaft mbH
-
dc.description.sponsorship
FFG - Österr. Forschungsförderungs- gesellschaft mbH
-
dc.language.iso
en
-
dc.relation.ispartofseries
Electronic Imaging
-
dc.subject
Human Action Recognition
en
dc.subject
Driver Monitoring
en
dc.subject
ADAS
en
dc.subject
Autonomous Vehicles
en
dc.subject
Computer Vision
en
dc.title
A Comparative Analysis of Video- and Pose-based Action Recognition for In-cabin Driver Monitoring
en
dc.type
Inproceedings
en
dc.type
Konferenzbeitrag
de
dc.relation.grantno
911019
-
dc.relation.grantno
884336
-
dc.type.category
Full-Paper Contribution
-
tuw.booktitle
Electronic Imaging
-
tuw.container.volume
38
-
tuw.peerreviewed
true
-
tuw.project.title
UNIversal Single Camera-based 3D Sensing for COmprehensive Human PErception
-
tuw.project.title
Simulation von Fahrzeuginnenräumen für die effiziente Entwicklung von Driver/Occupant Monitoring Systemen
-
tuw.researchTopic.id
I5
-
tuw.researchTopic.name
Visual Computing and Human-Centered Technology
-
tuw.researchTopic.value
100
-
tuw.publication.orgunit
E193-01 - Forschungsbereich Computer Vision
-
tuw.publisher.doi
10.2352/EI.2026.38.16.AVM-100
-
dc.description.numberOfPages
7
-
tuw.author.orcid
0009-0000-0652-9771
-
tuw.author.orcid
0000-0003-2015-6507
-
tuw.event.name
Electronic Imaging 2026
en
tuw.event.startdate
01-03-2026
-
tuw.event.enddate
05-03-2026
-
tuw.event.online
On Site
-
tuw.event.type
Event for scientific audience
-
tuw.event.place
Burlingame
-
tuw.event.country
US
-
tuw.event.presenter
Schörkhuber, Dominik
-
wb.sciencebranch
Informatik
-
wb.sciencebranch
Mathematik
-
wb.sciencebranch.oefos
1020
-
wb.sciencebranch.oefos
1010
-
wb.sciencebranch.value
90
-
wb.sciencebranch.value
10
-
item.fulltext
no Fulltext
-
item.languageiso639-1
en
-
item.cerifentitytype
Publications
-
item.grantfulltext
none
-
item.openairecristype
http://purl.org/coar/resource_type/c_5794
-
item.openairetype
conference paper
-
crisitem.project.funder
FFG - Österr. Forschungsförderungs- gesellschaft mbH
-
crisitem.project.funder
FFG - Österr. Forschungsförderungs- gesellschaft mbH
-
crisitem.project.grantno
911019
-
crisitem.project.grantno
884336
-
crisitem.author.dept
E193-01 - Forschungsbereich Computer Vision
-
crisitem.author.dept
E193-01 - Forschungsbereich Computer Vision
-
crisitem.author.orcid
0009-0000-0652-9771
-
crisitem.author.orcid
0000-0003-2015-6507
-
crisitem.author.parentorg
E193 - Institut für Visual Computing and Human-Centered Technology
-
crisitem.author.parentorg
E193 - Institut für Visual Computing and Human-Centered Technology