<div class="csl-bib-body">
<div class="csl-entry">Potapova, E. (2014). <i>Attention-driven object detection and segmentation for robotics</i> [Dissertation, Technische Universität Wien]. reposiTUm. https://doi.org/10.34726/hss.2014.27520</div>
</div>
-
dc.identifier.uri
https://doi.org/10.34726/hss.2014.27520
-
dc.identifier.uri
http://hdl.handle.net/20.500.12708/6186
-
dc.description
Differing title according to the author's translation
-
dc.description.abstract
Vision is an essential part of any robotic system and plays an important role in typical domestic robotic tasks such as searching for and grasping objects in cluttered scenes. To be efficient, vision systems must provide fast object detection and segmentation mechanisms. In the past, attention mechanisms have been proposed to cope with the complexity of the real world by detecting objects of interest and prioritizing their processing, thereby guiding the search for and segmentation of objects. The goal of this thesis is to create an attention-based visual system for a robot, consisting of attention-based object detection and attention-driven object segmentation. Many models of visual attention have been proposed and proven to be very useful in robotic applications. We address the problem of obtaining meaningful saliency measures based on characteristics such as object height and surface orientation, which appear to be qualitatively better than traditional saliency maps. Moreover, it has recently been shown in the literature that not only single visual features based on color, orientation, or curvature attract attention, but complete objects do as well. Symmetry is a feature of many man-made and also natural objects and has thus been identified as a candidate for attentional operators. However, few techniques exist to date that exploit symmetry-based saliency. In this thesis, a novel symmetry-based saliency operator is presented that works on 3D data and does not assume any object model. We show that the proposed saliency maps are better suited to the task of object detection. Object detection was implemented by extracting fixation points from saliency maps. An evaluation of the quality of fixation points showed that the proposed algorithms outperform current state-of-the-art saliency operators. The quality of attention points was defined in terms of their location within the object and the number of attended objects.
Segmentation of highly cluttered indoor scenes is a challenging task: traditional segmentation methods are often overwhelmed by the complexity of the scene and require a significant amount of processing time. To tackle this problem, we propose attention-driven and incremental segmentation, where attention mechanisms are used to prioritize the parts of the scene to be handled first. In this work, we combined a saliency operator based on 3D symmetry with three segmentation methods. The first clusters locally planar surface patches. The second segments attended objects using an edge map based on color, depth, and curvature within a probabilistic framework. The third, an incremental attention-driven mechanism, outputs object hypotheses composed of parametric surface models. We evaluated our approaches on two publicly available datasets of cluttered indoor scenes containing man-made objects, and showed that the proposed methods outperform existing state-of-the-art attention-driven segmentation algorithms in terms of segmentation quality and computational performance.
en
dc.language
English
-
dc.language.iso
en
-
dc.rights.uri
http://rightsstatements.org/vocab/InC/1.0/
-
dc.subject
Visual attention
de
dc.subject
3D Visual attention
de
dc.subject
attention cues
de
dc.subject
saliency maps
de
dc.subject
attention points
de
dc.subject
fixation points
de
dc.subject
object detection
de
dc.subject
objectness
de
dc.subject
clutter
de
dc.subject
object segmentation
de
dc.subject
attention-driven object segmentation
de
dc.subject
incremental segmentation
de
dc.subject
Visual attention
en
dc.subject
3D Visual attention
en
dc.subject
attention cues
en
dc.subject
saliency maps
en
dc.subject
attention points
en
dc.subject
fixation points
en
dc.subject
object detection
en
dc.subject
objectness
en
dc.subject
clutter
en
dc.subject
object segmentation
en
dc.subject
attention-driven object segmentation
en
dc.subject
incremental segmentation
en
dc.title
Attention-driven object detection and segmentation for robotics
en
dc.title.alternative
Attention-driven Object Detection and Segmentation for Robotics
de
dc.type
Thesis
en
dc.type
Hochschulschrift
de
dc.rights.license
In Copyright
en
dc.rights.license
Urheberrechtsschutz
de
dc.identifier.doi
10.34726/hss.2014.27520
-
dc.contributor.affiliation
TU Wien, Österreich
-
dc.rights.holder
Ekaterina Potapova
-
tuw.version
vor
-
tuw.thesisinformation
Technische Universität Wien
-
tuw.publication.orgunit
E376 - Institut für Automatisierungs- und Regelungstechnik
-
dc.type.qualificationlevel
Doctoral
-
dc.identifier.libraryid
AC12132764
-
dc.description.numberOfPages
130
-
dc.identifier.urn
urn:nbn:at:at-ubtuw:1-67061
-
dc.thesistype
Dissertation
de
dc.thesistype
Dissertation
en
dc.rights.identifier
In Copyright
en
dc.rights.identifier
Urheberrechtsschutz
de
tuw.advisor.staffStatus
staff
-
item.fulltext
with Fulltext
-
item.grantfulltext
open
-
item.cerifentitytype
Publications
-
item.languageiso639-1
en
-
item.openairecristype
http://purl.org/coar/resource_type/c_18cf
-
item.openairetype
Thesis
-
item.openairetype
Hochschulschrift
-
item.openaccessfulltext
Open Access
-
crisitem.author.dept
E376 - Institut für Automatisierungs- und Regelungstechnik
-
crisitem.author.parentorg
E350 - Fakultät für Elektrotechnik und Informationstechnik