Generating Images from Captions with Attention

9 November 2015

Jimmy Lei Ba

Papers citing "Generating Images from Captions with Attention"

50 / 243 papers shown

Recurrent Affine Transformation for Text-to-image Synthesis Senmao Ye Fei Liu Mingkui Tan 9 26 0 22 Apr 2022
Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors Oran Gafni Adam Polyak Oron Ashual Shelly Sheynin Devi Parikh Yaniv Taigman DiffM 17 510 0 24 Mar 2022
DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models Jaemin Cho Abhaysinh Zala Mohit Bansal ViT 132 170 0 08 Feb 2022
VAEL: Bridging Variational Autoencoders and Probabilistic Logic Programming Eleonora Misino G. Marra Emanuele Sansone 16 21 0 07 Feb 2022
Multimodal Image Synthesis and Editing: The Generative AI Era Fangneng Zhan Yingchen Yu Rongliang Wu Jiahui Zhang Shijian Lu Lingjie Liu Adam Kortylewski Christian Theobalt Eric Xing EGVM 24 48 0 27 Dec 2021
FuseDream: Training-Free Text-to-Image Generation with Improved CLIP+GAN Space Optimization Xingchao Liu Chengyue Gong Lemeng Wu Shujian Zhang Haoran Su Qiang Liu CLIP 23 89 0 02 Dec 2021
Blended Diffusion for Text-driven Editing of Natural Images Omri Avrahami Dani Lischinski Ohad Fried DiffM 11 920 0 29 Nov 2021
Learning to Compose Visual Relations Nan Liu Shuang Li Yilun Du J. Tenenbaum Antonio Torralba CoGe OCL 21 77 0 17 Nov 2021
Multimodal Dialogue Response Generation Qingfeng Sun Yujing Wang Can Xu Kai Zheng Yaming Yang Huang Hu Fei Xu Jessica Zhang Xiubo Geng Daxin Jiang 15 43 0 16 Oct 2021
AffectGAN: Affect-Based Generative Art Driven by Semantics Theodoros Galanos Antonios Liapis Georgios N. Yannakakis GAN 25 12 0 30 Sep 2021
DAE-GAN: Dynamic Aspect-aware GAN for Text-to-Image Synthesis Shulan Ruan Yong Zhang Kun Zhang Yanbo Fan Fan Tang Qi Liu Enhong Chen 26 88 0 27 Aug 2021
Realistic Image Synthesis with Configurable 3D Scene Layouts Jaebong Jeong Jang-Won Jo Jingdong Wang Sunghyun Cho Jaesik Park 3DV 11 1 0 23 Aug 2021
Deep Image Synthesis from Intuitive User Input: A Review and Perspectives Yuan Xue Yuanchen Guo Han Zhang Tao Xu Song-Hai Zhang Xiaolei Huang EGVM 3DV 18 22 0 09 Jul 2021
D2C: Diffusion-Denoising Models for Few-shot Conditional Generation Abhishek Sinha Jiaming Song Chenlin Meng Stefano Ermon VLM DiffM 19 118 0 12 Jun 2021
MOC-GAN: Mixing Objects and Captions to Generate Realistic Images Tao Ma Yikang Li 19 0 0 06 Jun 2021
CogView: Mastering Text-to-Image Generation via Transformers Ming Ding Zhuoyi Yang Wenyi Hong Wendi Zheng Chang Zhou ... Junyang Lin Xu Zou Zhou Shao Hongxia Yang Jie Tang ViT VLM 19 759 0 26 May 2021
Adaptive Appearance Rendering Mengyao Zhai Ruizhi Deng Jiacheng Chen Lei Chen Zhiwei Deng Greg Mori 23 1 0 24 Apr 2021
Towards Adversarial Patch Analysis and Certified Defense against Crowd Counting Qiming Wu Zhikang Zou Pan Zhou Xiaoqing Ye Binghui Wang Ang Li AAML 11 4 0 22 Apr 2021
StEP: Style-based Encoder Pre-training for Multi-modal Image Synthesis Moustafa Meshry Yixuan Ren L. Davis Abhinav Shrivastava 9 11 0 14 Apr 2021
Paint by Word A. Andonian David Bau Audrey Cui YeonHwan Park Ali Jahanian Antonio Torralba A. Oliva DiffM 20 125 0 19 Mar 2021
Zero-Shot Text-to-Image Generation Aditya A. Ramesh Mikhail Pavlov Gabriel Goh Scott Gray Chelsea Voss Alec Radford Mark Chen Ilya Sutskever VLM 253 4,774 0 24 Feb 2021
Generating images from caption and vice versa via CLIP-Guided Generative Latent Space Search Federico A. Galatolo M. G. Cimino G. Vaglini VLM 28 84 0 02 Feb 2021
Lightweight Generative Adversarial Networks for Text-Guided Image Manipulation Bowen Li Xiaojuan Qi Philip H. S. Torr Thomas Lukasiewicz GAN 108 68 0 23 Oct 2020
New Ideas and Trends in Deep Multimodal Content Understanding: A Review Wei-Neng Chen Weiping Wang Li Liu M. Lew VLM 110 31 0 16 Oct 2020
Discriminative Cross-Modal Data Augmentation for Medical Imaging Applications Yue Yang P. Xie MedIm 6 0 0 07 Oct 2020
Static and Animated 3D Scene Generation from Free-form Text Descriptions Faria Huq Nafees Ahmed Anindya Iqbal 3DV 11 1 0 04 Oct 2020
TreeGAN: Incorporating Class Hierarchy into Image Generation Ruisi Zhang Luntian Mou P. Xie GAN 16 1 0 16 Sep 2020
Attribute-guided image generation from layout Ke Ma Bo-Lu Zhao Leonid Sigal 12 13 0 27 Aug 2020
Rethinking Generative Zero-Shot Learning: An Ensemble Learning Perspective for Recognising Visual Patches Zhi Chen Sen Wang Jingjing Li Zi Huang VLM 11 36 0 27 Jul 2020
Words as Art Materials: Generating Paintings with Sequential GANs A. C. Özgen H. K. Ekenel GAN 23 1 0 08 Jul 2020
PerceptionGAN: Real-world Image Construction from Provided Text through Perceptual Understanding Kanish Garg A. Singh Dorien Herremans Brejesh Lall GAN 6 4 0 02 Jul 2020
Generating Annotated High-Fidelity Images Containing Multiple Coherent Objects Bryan G. Cardenas Devanshu Arya D. K. Gupta DiffM 10 6 0 22 Jun 2020
XRayGAN: Consistency-preserving Generation of X-ray Images from Radiology Reports Xingyi Yang Nandiraju Gireesh Eric P. Xing P. Xie MedIm 6 2 0 17 Jun 2020
TIME: Text and Image Mutual-Translation Adversarial Networks Bingchen Liu Kunpeng Song Yizhe Zhu Gerard de Melo Ahmed Elgammal 6 30 0 27 May 2020
SegAttnGAN: Text to Image Generation with Segmentation Attention Yuchuan Gou Qiancheng Wu Minghao Li Bo Gong Mei Han VLM 9 22 0 25 May 2020
BachGAN: High-Resolution Image Synthesis from Salient Object Layout Yandong Li Yu Cheng Zhe Gan Licheng Yu Liqiang Wang Jingjing Liu 10 39 0 26 Mar 2020
Learning Layout and Style Reconfigurable GANs for Controllable Image Synthesis Wei Sun Tianfu Wu 17 81 0 25 Mar 2020
OpenGAN: Open Set Generative Adversarial Networks Luke Ditria Benjamin J. Meyer Tom Drummond VLM AI4CE GAN 33 20 0 18 Mar 2020
Text-to-Image Generation with Attention Based Recurrent Neural Networks Tehseen Zia Shahan Arif Shakeeb Murtaza M. A. Ullah 12 7 0 18 Jan 2020
SimEx: Express Prediction of Inter-dataset Similarity by a Fleet of Autoencoders Inseok Hwang Jinho Lee Frank Liu Minsik Cho 8 5 0 14 Jan 2020
Vision and Language: from Visual Perception to Content Creation Tao Mei Wei Zhang Ting Yao VLM 6 8 0 26 Dec 2019
CPGAN: Full-Spectrum Content-Parsing Generative Adversarial Networks for Text-to-Image Synthesis Jiadong Liang Wenjie Pei Feng Lu GAN 21 19 0 18 Dec 2019
Image Manipulation with Natural Language using Two-sided Attentive Conditional Generative Adversarial Network D. Zhu Aditya Mogadala Dietrich Klakow GAN 11 8 0 16 Dec 2019
Weak Supervision helps Emergence of Word-Object Alignment and improves Vision-Language Tasks Corentin Kervadec G. Antipov M. Baccouche Christian Wolf 19 14 0 06 Dec 2019
Multimodal Intelligence: Representation Learning, Information Fusion, and Applications Chao Zhang Zichao Yang Xiaodong He Li Deng HAI AI4TS 27 320 0 10 Nov 2019
On Architectures for Including Visual Information in Neural Language Models for Image Description Marc Tanti Albert Gatt K. Camilleri VLM 22 2 0 09 Nov 2019
A Survey and Taxonomy of Adversarial Neural Networks for Text-to-Image Synthesis Jorge Agnese Jonathan Herrera Haicheng Tao Xingquan Zhu EGVM 23 101 0 21 Oct 2019
Neuro-SERKET: Development of Integrative Cognitive System through the Composition of Deep Probabilistic Generative Models T. Taniguchi Tomoaki Nakamura Masahiro Suzuki Ryo Kuniyasu Kaede Hayashi Akira Taniguchi Takato Horii Takayuki Nagai BDL DRL 14 48 0 20 Oct 2019
Image Generation and Recognition (Emotions) Hanne Carlsson D. Kollias GAN 14 0 0 13 Oct 2019
Text-to-Image Synthesis Based on Machine Generated Captions Marco Menardi Alex Falcon Saida S. Mohamed Lorenzo Seidenari G. Serra A. Bimbo C. Tasso 17 0 0 09 Oct 2019