ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1511.02793
  4. Cited By
Generating Images from Captions with Attention

Generating Images from Captions with Attention

9 November 2015
Elman Mansimov
Emilio Parisotto
Jimmy Lei Ba
Ruslan Salakhutdinov
    VLM
ArXivPDFHTML

Papers citing "Generating Images from Captions with Attention"

50 / 243 papers shown
Title
Recurrent Affine Transformation for Text-to-image Synthesis
Recurrent Affine Transformation for Text-to-image Synthesis
Senmao Ye
Fei Liu
Mingkui Tan
9
26
0
22 Apr 2022
Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors
Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors
Oran Gafni
Adam Polyak
Oron Ashual
Shelly Sheynin
Devi Parikh
Yaniv Taigman
DiffM
17
510
0
24 Mar 2022
DALL-Eval: Probing the Reasoning Skills and Social Biases of
  Text-to-Image Generation Models
DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models
Jaemin Cho
Abhaysinh Zala
Mohit Bansal
ViT
132
170
0
08 Feb 2022
VAEL: Bridging Variational Autoencoders and Probabilistic Logic
  Programming
VAEL: Bridging Variational Autoencoders and Probabilistic Logic Programming
Eleonora Misino
G. Marra
Emanuele Sansone
16
21
0
07 Feb 2022
Multimodal Image Synthesis and Editing: The Generative AI Era
Multimodal Image Synthesis and Editing: The Generative AI Era
Fangneng Zhan
Yingchen Yu
Rongliang Wu
Jiahui Zhang
Shijian Lu
Lingjie Liu
Adam Kortylewski
Christian Theobalt
Eric Xing
EGVM
24
48
0
27 Dec 2021
FuseDream: Training-Free Text-to-Image Generation with Improved CLIP+GAN
  Space Optimization
FuseDream: Training-Free Text-to-Image Generation with Improved CLIP+GAN Space Optimization
Xingchao Liu
Chengyue Gong
Lemeng Wu
Shujian Zhang
Haoran Su
Qiang Liu
CLIP
23
89
0
02 Dec 2021
Blended Diffusion for Text-driven Editing of Natural Images
Blended Diffusion for Text-driven Editing of Natural Images
Omri Avrahami
Dani Lischinski
Ohad Fried
DiffM
11
920
0
29 Nov 2021
Learning to Compose Visual Relations
Learning to Compose Visual Relations
Nan Liu
Shuang Li
Yilun Du
J. Tenenbaum
Antonio Torralba
CoGe
OCL
21
77
0
17 Nov 2021
Multimodal Dialogue Response Generation
Multimodal Dialogue Response Generation
Qingfeng Sun
Yujing Wang
Can Xu
Kai Zheng
Yaming Yang
Huang Hu
Fei Xu
Jessica Zhang
Xiubo Geng
Daxin Jiang
15
43
0
16 Oct 2021
AffectGAN: Affect-Based Generative Art Driven by Semantics
AffectGAN: Affect-Based Generative Art Driven by Semantics
Theodoros Galanos
Antonios Liapis
Georgios N. Yannakakis
GAN
25
12
0
30 Sep 2021
DAE-GAN: Dynamic Aspect-aware GAN for Text-to-Image Synthesis
DAE-GAN: Dynamic Aspect-aware GAN for Text-to-Image Synthesis
Shulan Ruan
Yong Zhang
Kun Zhang
Yanbo Fan
Fan Tang
Qi Liu
Enhong Chen
26
88
0
27 Aug 2021
Realistic Image Synthesis with Configurable 3D Scene Layouts
Realistic Image Synthesis with Configurable 3D Scene Layouts
Jaebong Jeong
Jang-Won Jo
Jingdong Wang
Sunghyun Cho
Jaesik Park
3DV
11
1
0
23 Aug 2021
Deep Image Synthesis from Intuitive User Input: A Review and
  Perspectives
Deep Image Synthesis from Intuitive User Input: A Review and Perspectives
Yuan Xue
Yuanchen Guo
Han Zhang
Tao Xu
Song-Hai Zhang
Xiaolei Huang
EGVM
3DV
18
22
0
09 Jul 2021
D2C: Diffusion-Denoising Models for Few-shot Conditional Generation
D2C: Diffusion-Denoising Models for Few-shot Conditional Generation
Abhishek Sinha
Jiaming Song
Chenlin Meng
Stefano Ermon
VLM
DiffM
19
118
0
12 Jun 2021
MOC-GAN: Mixing Objects and Captions to Generate Realistic Images
MOC-GAN: Mixing Objects and Captions to Generate Realistic Images
Tao Ma
Yikang Li
19
0
0
06 Jun 2021
CogView: Mastering Text-to-Image Generation via Transformers
CogView: Mastering Text-to-Image Generation via Transformers
Ming Ding
Zhuoyi Yang
Wenyi Hong
Wendi Zheng
Chang Zhou
...
Junyang Lin
Xu Zou
Zhou Shao
Hongxia Yang
Jie Tang
ViT
VLM
19
759
0
26 May 2021
Adaptive Appearance Rendering
Adaptive Appearance Rendering
Mengyao Zhai
Ruizhi Deng
Jiacheng Chen
Lei Chen
Zhiwei Deng
Greg Mori
23
1
0
24 Apr 2021
Towards Adversarial Patch Analysis and Certified Defense against Crowd
  Counting
Towards Adversarial Patch Analysis and Certified Defense against Crowd Counting
Qiming Wu
Zhikang Zou
Pan Zhou
Xiaoqing Ye
Binghui Wang
Ang Li
AAML
11
4
0
22 Apr 2021
StEP: Style-based Encoder Pre-training for Multi-modal Image Synthesis
StEP: Style-based Encoder Pre-training for Multi-modal Image Synthesis
Moustafa Meshry
Yixuan Ren
L. Davis
Abhinav Shrivastava
9
11
0
14 Apr 2021
Paint by Word
Paint by Word
A. Andonian
David Bau
Audrey Cui
YeonHwan Park
Ali Jahanian
Antonio Torralba
A. Oliva
DiffM
20
125
0
19 Mar 2021
Zero-Shot Text-to-Image Generation
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
253
4,774
0
24 Feb 2021
Generating images from caption and vice versa via CLIP-Guided Generative
  Latent Space Search
Generating images from caption and vice versa via CLIP-Guided Generative Latent Space Search
Federico A. Galatolo
M. G. Cimino
G. Vaglini
VLM
28
84
0
02 Feb 2021
Lightweight Generative Adversarial Networks for Text-Guided Image
  Manipulation
Lightweight Generative Adversarial Networks for Text-Guided Image Manipulation
Bowen Li
Xiaojuan Qi
Philip H. S. Torr
Thomas Lukasiewicz
GAN
108
68
0
23 Oct 2020
New Ideas and Trends in Deep Multimodal Content Understanding: A Review
New Ideas and Trends in Deep Multimodal Content Understanding: A Review
Wei-Neng Chen
Weiping Wang
Li Liu
M. Lew
VLM
110
31
0
16 Oct 2020
Discriminative Cross-Modal Data Augmentation for Medical Imaging
  Applications
Discriminative Cross-Modal Data Augmentation for Medical Imaging Applications
Yue Yang
P. Xie
MedIm
6
0
0
07 Oct 2020
Static and Animated 3D Scene Generation from Free-form Text Descriptions
Static and Animated 3D Scene Generation from Free-form Text Descriptions
Faria Huq
Nafees Ahmed
Anindya Iqbal
3DV
11
1
0
04 Oct 2020
TreeGAN: Incorporating Class Hierarchy into Image Generation
TreeGAN: Incorporating Class Hierarchy into Image Generation
Ruisi Zhang
Luntian Mou
P. Xie
GAN
16
1
0
16 Sep 2020
Attribute-guided image generation from layout
Attribute-guided image generation from layout
Ke Ma
Bo-Lu Zhao
Leonid Sigal
12
13
0
27 Aug 2020
Rethinking Generative Zero-Shot Learning: An Ensemble Learning
  Perspective for Recognising Visual Patches
Rethinking Generative Zero-Shot Learning: An Ensemble Learning Perspective for Recognising Visual Patches
Zhi Chen
Sen Wang
Jingjing Li
Zi Huang
VLM
11
36
0
27 Jul 2020
Words as Art Materials: Generating Paintings with Sequential GANs
Words as Art Materials: Generating Paintings with Sequential GANs
A. C. Özgen
H. K. Ekenel
GAN
23
1
0
08 Jul 2020
PerceptionGAN: Real-world Image Construction from Provided Text through
  Perceptual Understanding
PerceptionGAN: Real-world Image Construction from Provided Text through Perceptual Understanding
Kanish Garg
A. Singh
Dorien Herremans
Brejesh Lall
GAN
6
4
0
02 Jul 2020
Generating Annotated High-Fidelity Images Containing Multiple Coherent
  Objects
Generating Annotated High-Fidelity Images Containing Multiple Coherent Objects
Bryan G. Cardenas
Devanshu Arya
D. K. Gupta
DiffM
10
6
0
22 Jun 2020
XRayGAN: Consistency-preserving Generation of X-ray Images from
  Radiology Reports
XRayGAN: Consistency-preserving Generation of X-ray Images from Radiology Reports
Xingyi Yang
Nandiraju Gireesh
Eric P. Xing
P. Xie
MedIm
6
2
0
17 Jun 2020
TIME: Text and Image Mutual-Translation Adversarial Networks
TIME: Text and Image Mutual-Translation Adversarial Networks
Bingchen Liu
Kunpeng Song
Yizhe Zhu
Gerard de Melo
Ahmed Elgammal
6
30
0
27 May 2020
SegAttnGAN: Text to Image Generation with Segmentation Attention
SegAttnGAN: Text to Image Generation with Segmentation Attention
Yuchuan Gou
Qiancheng Wu
Minghao Li
Bo Gong
Mei Han
VLM
9
22
0
25 May 2020
BachGAN: High-Resolution Image Synthesis from Salient Object Layout
BachGAN: High-Resolution Image Synthesis from Salient Object Layout
Yandong Li
Yu Cheng
Zhe Gan
Licheng Yu
Liqiang Wang
Jingjing Liu
10
39
0
26 Mar 2020
Learning Layout and Style Reconfigurable GANs for Controllable Image
  Synthesis
Learning Layout and Style Reconfigurable GANs for Controllable Image Synthesis
Wei Sun
Tianfu Wu
17
81
0
25 Mar 2020
OpenGAN: Open Set Generative Adversarial Networks
OpenGAN: Open Set Generative Adversarial Networks
Luke Ditria
Benjamin J. Meyer
Tom Drummond
VLM
AI4CE
GAN
33
20
0
18 Mar 2020
Text-to-Image Generation with Attention Based Recurrent Neural Networks
Text-to-Image Generation with Attention Based Recurrent Neural Networks
Tehseen Zia
Shahan Arif
Shakeeb Murtaza
M. A. Ullah
12
7
0
18 Jan 2020
SimEx: Express Prediction of Inter-dataset Similarity by a Fleet of
  Autoencoders
SimEx: Express Prediction of Inter-dataset Similarity by a Fleet of Autoencoders
Inseok Hwang
Jinho Lee
Frank Liu
Minsik Cho
8
5
0
14 Jan 2020
Vision and Language: from Visual Perception to Content Creation
Vision and Language: from Visual Perception to Content Creation
Tao Mei
Wei Zhang
Ting Yao
VLM
6
8
0
26 Dec 2019
CPGAN: Full-Spectrum Content-Parsing Generative Adversarial Networks for
  Text-to-Image Synthesis
CPGAN: Full-Spectrum Content-Parsing Generative Adversarial Networks for Text-to-Image Synthesis
Jiadong Liang
Wenjie Pei
Feng Lu
GAN
21
19
0
18 Dec 2019
Image Manipulation with Natural Language using Two-sidedAttentive
  Conditional Generative Adversarial Network
Image Manipulation with Natural Language using Two-sidedAttentive Conditional Generative Adversarial Network
D. Zhu
Aditya Mogadala
Dietrich Klakow
GAN
11
8
0
16 Dec 2019
Weak Supervision helps Emergence of Word-Object Alignment and improves
  Vision-Language Tasks
Weak Supervision helps Emergence of Word-Object Alignment and improves Vision-Language Tasks
Corentin Kervadec
G. Antipov
M. Baccouche
Christian Wolf
19
14
0
06 Dec 2019
Multimodal Intelligence: Representation Learning, Information Fusion,
  and Applications
Multimodal Intelligence: Representation Learning, Information Fusion, and Applications
Chao Zhang
Zichao Yang
Xiaodong He
Li Deng
HAI
AI4TS
27
320
0
10 Nov 2019
On Architectures for Including Visual Information in Neural Language
  Models for Image Description
On Architectures for Including Visual Information in Neural Language Models for Image Description
Marc Tanti
Albert Gatt
K. Camilleri
VLM
22
2
0
09 Nov 2019
A Survey and Taxonomy of Adversarial Neural Networks for Text-to-Image
  Synthesis
A Survey and Taxonomy of Adversarial Neural Networks for Text-to-Image Synthesis
Jorge Agnese
Jonathan Herrera
Haicheng Tao
Xingquan Zhu
EGVM
23
101
0
21 Oct 2019
Neuro-SERKET: Development of Integrative Cognitive System through the
  Composition of Deep Probabilistic Generative Models
Neuro-SERKET: Development of Integrative Cognitive System through the Composition of Deep Probabilistic Generative Models
T. Taniguchi
Tomoaki Nakamura
Masahiro Suzuki
Ryo Kuniyasu
Kaede Hayashi
Akira Taniguchi
Takato Horii
Takayuki Nagai
BDL
DRL
14
48
0
20 Oct 2019
Image Generation and Recognition (Emotions)
Image Generation and Recognition (Emotions)
Hanne Carlsson
D. Kollias
GAN
14
0
0
13 Oct 2019
Text-to-Image Synthesis Based on Machine Generated Captions
Text-to-Image Synthesis Based on Machine Generated Captions
Marco Menardi
Alex Falcon
Saida S. Mohamed
Lorenzo Seidenari
G. Serra
A. Bimbo
C. Tasso
17
0
0
09 Oct 2019
Previous
12345
Next