<div class="csl-bib-body">
<div class="csl-entry">Stippel, C., Heitzinger, T., Sterzinger, R., & Kampel, M. (2024). Closing the Gap in Human Behavior Analysis: A Pipeline for Synthesizing Trimodal Data. In <i>2024 IEEE International Conference on Pervasive Computing and Communications Workshops and other Affiliated Events (PerCom Workshops)</i> (pp. 793–798). https://doi.org/10.1109/PerComWorkshops59983.2024.10503351</div>
</div>
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/203807
-
dc.description.abstract
In pervasive machine learning, especially in Human Behavior Analysis (HBA), RGB has been the primary modality due to its accessibility and richness of information. However, linked with its benefits are challenges, including sensitivity to lighting conditions and privacy concerns. One possibility to overcome these vulnerabilities is to resort to different modalities. For instance, thermal is particularly adept at accentuating human forms, while depth adds crucial contextual layers. Despite their known benefits, only a few HBA-specific datasets that integrate these modalities exist. To address this shortage, our research introduces a novel generative technique for creating trimodal, i.e., RGB, thermal, and depth, human-focused datasets. This technique capitalizes on human segmentation masks derived from RGB images, combined with thermal and depth backgrounds that are sourced automatically. With these two ingredients, we synthesize depth and thermal counterparts from existing RGB data utilizing conditional image-to-image translation. By employing this approach, we generate trimodal data that can be leveraged to train models for settings with limited data, bad lightning conditions, or privacy-sensitive areas.
en
dc.description.sponsorship
FFG - Österr. Forschungsförderungs- gesellschaft mbH
-
dc.language.iso
en
-
dc.subject
action recognition
en
dc.subject
depth sensing
en
dc.subject
human behavior analysis
en
dc.subject
image-to-image translation
en
dc.subject
thermal imagining
en
dc.title
Closing the Gap in Human Behavior Analysis: A Pipeline for Synthesizing Trimodal Data
en
dc.type
Inproceedings
en
dc.type
Konferenzbeitrag
de
dc.relation.isbn
979-8-3503-0436-7
-
dc.description.startpage
793
-
dc.description.endpage
798
-
dc.relation.grantno
FO999886329
-
dc.type.category
Full-Paper Contribution
-
tuw.booktitle
2024 IEEE International Conference on Pervasive Computing and Communications Workshops and other Affiliated Events (PerCom Workshops)
-
tuw.peerreviewed
true
-
tuw.project.title
Künstliche Intelligenz im Strafvollzug
-
tuw.researchTopic.id
I5
-
tuw.researchTopic.name
Visual Computing and Human-Centered Technology
-
tuw.researchTopic.value
100
-
tuw.publication.orgunit
E193-01 - Forschungsbereich Computer Vision
-
tuw.publisher.doi
10.1109/PerComWorkshops59983.2024.10503351
-
dc.description.numberOfPages
6
-
tuw.author.orcid
0000-0002-3129-5054
-
tuw.author.orcid
0009-0001-0029-8463
-
tuw.author.orcid
0000-0002-5217-2854
-
tuw.event.name
2024 IEEE International Conference on Pervasive Computing and Communications Workshops and other Affiliated Events (PerCom Workshops)
en
tuw.event.startdate
11-03-2024
-
tuw.event.enddate
15-03-2024
-
tuw.event.online
On Site
-
tuw.event.type
Event for scientific audience
-
tuw.event.place
Biarritz
-
tuw.event.country
FR
-
tuw.event.presenter
Stippel, Christian
-
wb.sciencebranch
Informatik
-
wb.sciencebranch
Mathematik
-
wb.sciencebranch.oefos
1020
-
wb.sciencebranch.oefos
1010
-
wb.sciencebranch.value
90
-
wb.sciencebranch.value
10
-
item.languageiso639-1
en
-
item.openairetype
conference paper
-
item.grantfulltext
restricted
-
item.fulltext
no Fulltext
-
item.cerifentitytype
Publications
-
item.openairecristype
http://purl.org/coar/resource_type/c_5794
-
crisitem.author.dept
E193-01 - Forschungsbereich Computer Vision
-
crisitem.author.dept
E193-01 - Forschungsbereich Computer Vision
-
crisitem.author.dept
E193-01 - Forschungsbereich Computer Vision
-
crisitem.author.dept
E193-01 - Forschungsbereich Computer Vision
-
crisitem.author.orcid
0000-0002-3129-5054
-
crisitem.author.orcid
0009-0001-0029-8463
-
crisitem.author.orcid
0000-0002-5217-2854
-
crisitem.author.parentorg
E193 - Institut für Visual Computing and Human-Centered Technology
-
crisitem.author.parentorg
E193 - Institut für Visual Computing and Human-Centered Technology
-
crisitem.author.parentorg
E193 - Institut für Visual Computing and Human-Centered Technology
-
crisitem.author.parentorg
E193 - Institut für Visual Computing and Human-Centered Technology
-
crisitem.project.funder
FFG - Österr. Forschungsförderungs- gesellschaft mbH