Real-time person segmentation on mobile phones

Knapp, Jakob

doi:10.34726/hss.2021.78701

DC Field

Value

Language

dc.contributor.advisor

Kampel, Martin

dc.contributor.author

Knapp, Jakob

dc.date.accessioned

2021-03-03T09:55:29Z

dc.date.issued

2021

dc.date.submitted

2021-03

dc.identifier.citation

<div class="csl-bib-body"> <div class="csl-entry">Knapp, J. (2021). <i>Real-time person segmentation on mobile phones</i> [Diploma Thesis, Technische Universität Wien]. reposiTUm. https://doi.org/10.34726/hss.2021.78701</div> </div>

dc.identifier.uri

https://doi.org/10.34726/hss.2021.78701

dc.identifier.uri

http://hdl.handle.net/20.500.12708/16966

dc.description.abstract

Die Erkennung und Segmentierung von Objekten spielt eine essenzielle Rolle im Prozess der Informationsgewinnung aus Videodaten. Von besonderer Relevanz in diversen Anwendungsgebieten des maschinellen Sehens ist in diesem Kontext das Hervorheben menschlicher Silhouetten, etwa in der Videoüberwachung, im autonomen Straßenverkehr, in der Interaktion von Mensch und Maschine oder im Bereich des "Ambient Assisted Living". Im Zuge dieser Diplomarbeit wird die Eignung von "Convolutional Neural Networks" (CNNs) zu diesem Zweck untersucht. Zusätzlich wird evaluiert, wie solch neuronale Netzwerke konstruiert werden können, um die aufeinander folgenden Bildern inhärente zeitliche Information effizient für Segmentierungszwecke zu nutzen und somit die Erkennungsrate zu verbessern. Konkret wird dies anhand der Entwicklung einer Applikation für Mobilgeräte diskutiert, welche die Erkennung menschlicher Umrisse auf einem Live-Video-Stream, aufgezeichnet von der Kamera des Geräts, realisiert. Dadurch bedingt liegt ein weiterer Fokus der Arbeit auf der effizienten Umsetzung neuronaler Netzwerke hinsichtlich der limitierten Ressourcen von Mobilgeräten.

dc.description.abstract

The detection and segmentation of objects plays an essential role in the process of extracting information from video data. The emphasis of human silhouettes is in this context of particular interest in various application fields of computer vision, such as surveillance, autonomous driving, human computer interaction (HCI), or ambient assisted living (AAL). This thesis explores the suitability of convolutional neural networks (CNNs) for this purpose. In addition, it will be evaluated how such neural networks can be constructed to effectively use the temporal information inherent in successive frames for segmentation purposes, thus improving the recognition rate. Specifically, this will be discussed through the development of an application for mobile devices, which realizes the recognition of human silhouettes on a live video stream, recorded by the camera of the device. Therefore, another focus of this thesis lies on the efficient implementation of neural networks with regards to the limited computational resources provided by mobiles.

dc.language

English

dc.language.iso

dc.rights.uri

http://rightsstatements.org/vocab/InC/1.0/

dc.subject

Videosegmentierung

dc.subject

Convolutional Neural Networks & mobile Netzwerke

dc.subject

video segmentation

dc.subject

convolutional neural networks

dc.subject

mobile networks

dc.title

Real-time person segmentation on mobile phones

dc.type

Thesis

dc.type

Hochschulschrift

dc.rights.license

In Copyright

dc.rights.license

Urheberrechtsschutz

dc.identifier.doi

10.34726/hss.2021.78701

dc.contributor.affiliation

TU Wien, Österreich

dc.rights.holder

Jakob Knapp

dc.publisher.place

Wien

tuw.version

vor

tuw.thesisinformation

Technische Universität Wien

tuw.publication.orgunit

E193 - Institut für Visual Computing and Human-Centered Technology

dc.type.qualificationlevel

Diploma

dc.identifier.libraryid

AC16157628

dc.description.numberOfPages

105

dc.thesistype

Diplomarbeit

dc.thesistype

Diploma Thesis

dc.rights.identifier

In Copyright

dc.rights.identifier

Urheberrechtsschutz

tuw.advisor.staffStatus

staff

tuw.advisor.orcid

0000-0002-5217-2854

item.languageiso639-1

item.openairetype

master thesis

item.grantfulltext

open

item.fulltext

with Fulltext

item.cerifentitytype

Publications

item.mimetype

application/pdf

item.openairecristype

http://purl.org/coar/resource_type/c_bdcc

item.openaccessfulltext

Open Access

crisitem.author.dept

E193-01 - Forschungsbereich Computer Vision

crisitem.author.parentorg

E193 - Institut für Visual Computing and Human-Centered Technology

Appears in Collections:

Thesis

Fulltext (Version of Record (published version))

Adobe PDF

(2.57 MB)

In Copyright

Show simple item record

Google Scholar^TM

Check

Google ScholarTM

Google Scholar^TM