Memory-Driven Text-to-Image Generation

Li, Bowen; Torr, Philip H. S.; Lukasiewicz, Thomas

DC Field

Value

Language

dc.contributor.author

Li, Bowen

dc.contributor.author

Torr, Philip H. S.

dc.contributor.author

Lukasiewicz, Thomas

dc.date.accessioned

2024-02-07T17:00:48Z

dc.date.available

2024-02-07T17:00:48Z

dc.date.issued

2022

dc.identifier.citation

<div class="csl-bib-body"> <div class="csl-entry">Li, B., Torr, P. H. S., & Lukasiewicz, T. (2022). Memory-Driven Text-to-Image Generation. In <i>The 33rd British Machine Vision Conference Proceedings</i>. 33rd British Machine Vision Conference, London, United Kingdom of Great Britain and Northern Ireland (the). http://hdl.handle.net/20.500.12708/193654</div> </div>

dc.identifier.uri

http://hdl.handle.net/20.500.12708/193654

dc.description.abstract

We introduce a memory-driven semi-parametric approach to text-to-image generation, which is based on both parametric and non-parametric techniques. The non-parametric component is a memory bank of image features constructed from a training set of images. The parametric component is a generative adversarial network. Given a new text description at inference time, the memory bank is used to selectively retrieve image features that are provided as basic information of target images, which enables the generator to produce realistic synthetic results. We also incorporate content information into the discriminator, together with semantic features, allowing the discriminator to make a more reliable prediction. Experimental results demonstrate that the proposed memory-driven semi-parametric approach produces realistic images, compared to purely parametric approaches, in terms of both visual fidelity and text-image semantic consistency.

dc.language.iso

dc.subject

Text-to-Image Generation

dc.subject

Memory-Driven

dc.title

Memory-Driven Text-to-Image Generation

dc.type

Inproceedings

dc.type

Konferenzbeitrag

dc.contributor.affiliation

University of Oxford, United Kingdom of Great Britain and Northern Ireland (the)

dc.contributor.affiliation

University of Oxford, United Kingdom of Great Britain and Northern Ireland (the)

dc.type.category

Full-Paper Contribution

tuw.booktitle

The 33rd British Machine Vision Conference Proceedings

tuw.book.chapter

0726

tuw.researchTopic.id

tuw.researchTopic.name

Visual Computing and Human-Centered Technology

tuw.researchTopic.name

Information Systems Engineering

tuw.researchTopic.value

tuw.publication.orgunit

E192-07 - Forschungsbereich Artificial Intelligence Techniques

tuw.publication.orgunit

E192-03 - Forschungsbereich Knowledge Based Systems

dc.description.numberOfPages

tuw.author.orcid

0000-0002-8440-543X

tuw.event.name

33rd British Machine Vision Conference

tuw.event.startdate

21-11-2022

tuw.event.enddate

24-11-2022

tuw.event.online

On Site

tuw.event.type

Event for scientific audience

tuw.event.place

London

tuw.event.country

tuw.event.presenter

Li, Bowen

wb.sciencebranch

Informatik

wb.sciencebranch

Mathematik

wb.sciencebranch.oefos

1020

wb.sciencebranch.oefos

1010

wb.sciencebranch.value

item.languageiso639-1

item.grantfulltext

none

item.openairetype

conference paper

item.openairecristype

http://purl.org/coar/resource_type/c_5794

item.cerifentitytype

Publications

item.fulltext

no Fulltext

crisitem.author.dept

University of Oxford, United Kingdom of Great Britain and Northern Ireland (the)

crisitem.author.dept

University of Oxford, United Kingdom of Great Britain and Northern Ireland (the)

crisitem.author.dept

E192-07 - Forschungsbereich Artificial Intelligence Techniques

crisitem.author.parentorg

E192 - Institut für Logic and Computation

Appears in Collections:

Conference Paper

Show simple item record

Page view(s)

224

checked on Feb 8, 2024

Google Scholar^TM

Check

Page view(s)

Google ScholarTM

Google Scholar^TM