C4Synth: Cross-Caption Cycle-Consistent Text-to-Image Synthesis

Joseph, J and Pal, Arghya and Rajanala, Sailaja and Balasubramanian, Vineeth N (2019) C4Synth: Cross-Caption Cycle-Consistent Text-to-Image Synthesis. In: IEEE Winter Conference on Applications of Computer Vision, WACV, 7-11 January 2019, Waikoloa Village, United States.

Full text not available from this repository.


Generating an image from its description is a challenging task worth solving because of its numerous practical applications, ranging from image editing to virtual reality. All existing methods use a single caption to generate a plausible image. A single caption by itself can be limited, and may not capture the variety of concepts and behavior present in the image. We propose two deep generative models that generate an image by making use of multiple captions describing it. This is achieved by ensuring ‘Cross-Caption Cycle Consistency’ between the multiple captions and the generated image(s). We report quantitative and qualitative results on the standard Caltech-UCSD Birds (CUB) and Oxford-102 Flowers datasets to validate the efficacy of the proposed approach.

IITH Creators: Balasubramanian, Vineeth N (ORCiD: UNSPECIFIED)
Item Type: Conference or Workshop Item (Paper)
Subjects: Computer science
Divisions: Department of Computer Science & Engineering
Depositing User: Library Staff
Date Deposited: 15 Apr 2019 09:08
Last Modified: 15 Apr 2019 09:08
URI: http://raiith.iith.ac.in/id/eprint/4954
Publisher URL: http://doi.org/10.1109/WACV.2019.00044