CAGT: Sim-to-Real Depth Completion with Interactive Embedding Aggregation and Geometry Awareness for Transparent Objects

Jing, Xingshuo; Qian, Kun; Vincze, Markus

doi:10.1109/TCSVT.2025.3543288

DC Field

Value

Language

dc.contributor.author

Jing, Xingshuo

dc.contributor.author

Qian, Kun

dc.contributor.author

Vincze, Markus

dc.date.accessioned

2025-05-22T13:15:30Z

dc.date.available

2025-05-22T13:15:30Z

dc.date.issued

2025

dc.identifier.citation

<div class="csl-bib-body"> <div class="csl-entry">Jing, X., Qian, K., & Vincze, M. (2025). CAGT: Sim-to-Real Depth Completion with Interactive Embedding Aggregation and Geometry Awareness for Transparent Objects. <i>IEEE Transactions on Circuits and Systems for Video Technology</i>. https://doi.org/10.34726/9499</div> </div>

dc.identifier.issn

1051-8215

dc.identifier.uri

http://hdl.handle.net/20.500.12708/215626

dc.identifier.uri

https://doi.org/10.34726/9499

dc.description.abstract

Robust depth completion of transparent objects would be beneficial for industrial automation such as vision-based robotic grasping and manipulation. However, although some methods try to learn a compact intra-layer feature representation with the boost of the attention mechanism or the vision Transformer, they ignore the neglected corner regions and sparse geometry information that are important for accurate depth completion. To tackle these issues, we propose a novel sim-to-real transferable model, named CAGT, with interactive embedding aggregation and geometry awareness to reconstruct severely sparse depth maps of transparent objects in this paper. We design a Depth-clue Interaction Aggregation Module (DIAM) to enhance the Transformer's ability to extract boundary corner features and thus supplement depth clues. Then, we propose a Geometric Information Augmentation Module (GIAM) to fuse the geometry-aware feature containing shape and surface details. Moreover, we introduce a contrastive learning mechanism to facilitate the sim-to-real generalization of the completion model. Extensive experiment results on two challenging datasets, ClearGrasp and TransCG, demonstrate that our proposed CAGT can obtain superior performance over the state-of-the-art methods. We also demonstrate that CAGT can improve the grasp accuracy of transparent objects by a robotic grasping generalization experiment.

dc.language.iso

dc.publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

dc.relation.ispartof

IEEE Transactions on Circuits and Systems for Video Technology

dc.rights.uri

http://rightsstatements.org/vocab/InC/1.0/

dc.subject

contrastive learning

dc.subject

depth completion

dc.subject

geometry-aware

dc.subject

Sim-to-real

dc.title

CAGT: Sim-to-Real Depth Completion with Interactive Embedding Aggregation and Geometry Awareness for Transparent Objects

dc.type

Article

dc.type

Artikel

dc.rights.license

Urheberrechtsschutz

dc.rights.license

In Copyright

dc.identifier.doi

10.34726/9499

dc.identifier.scopus

2-s2.0-85218750148

dc.identifier.url

https://api.elsevier.com/content/abstract/scopus_id/85218750148

dc.contributor.affiliation

Southeast University, China

dc.contributor.affiliation

Southeast University, China

dc.rights.holder

dc.type.category

Original Research Article

tuw.journal.peerreviewed

true

tuw.peerreviewed

true

wb.publication.intCoWork

International Co-publication

tuw.researchTopic.id

tuw.researchTopic.name

Automation and Robotics

tuw.researchTopic.value

100

dcterms.isPartOf.title

IEEE Transactions on Circuits and Systems for Video Technology

tuw.publication.orgunit

E376-02 - Forschungsbereich Komplexe Dynamische Systeme

tuw.publisher.doi

10.1109/TCSVT.2025.3543288

dc.date.onlinefirst

2025-02-18

dc.identifier.eissn

1558-2205

dc.identifier.libraryid

AC17580857

dc.description.numberOfPages

tuw.author.orcid

0000-0002-1328-4706

tuw.author.orcid

0000-0001-7429-1742

dc.rights.identifier

Urheberrechtsschutz

dc.rights.identifier

In Copyright

wb.sci

true

wb.sciencebranch

Elektrotechnik, Elektronik, Informationstechnik

wb.sciencebranch.oefos

2020

wb.sciencebranch.value

100

item.cerifentitytype

Publications

item.languageiso639-1

item.mimetype

application/pdf

item.fulltext

with Fulltext

item.openairetype

research article

item.openaccessfulltext

Open Access

item.openairecristype

http://purl.org/coar/resource_type/c_2df8fbb1

item.grantfulltext

mixedopen

crisitem.author.dept

E376-02 - Forschungsbereich Komplexe Dynamische Systeme

crisitem.author.dept

Southeast University

crisitem.author.dept

E376-02 - Forschungsbereich Komplexe Dynamische Systeme

crisitem.author.orcid

0000-0001-7429-1742

crisitem.author.parentorg

E376 - Institut für Automatisierungs- und Regelungstechnik

crisitem.author.parentorg

E376 - Institut für Automatisierungs- und Regelungstechnik

Appears in Collections:

Article

Fulltext (Accepted Version)

Adobe PDF

(13.14 MB)

In Copyright

Show simple item record

Page view(s)

checked on May 23, 2025

Download(s)

checked on May 23, 2025

Google Scholar^TM

Check

Page view(s)

Download(s)

Google ScholarTM

Google Scholar^TM