ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2105.10026
  4. Cited By
Improving Generation and Evaluation of Visual Stories via Semantic
  Consistency

Improving Generation and Evaluation of Visual Stories via Semantic Consistency

North American Chapter of the Association for Computational Linguistics (NAACL), 2021
20 May 2021
A. Maharana
Darryl Hannan
Joey Tianyi Zhou
    EGVM
ArXiv (abs)PDFHTMLGithub (33★)

Papers citing "Improving Generation and Evaluation of Visual Stories via Semantic Consistency"

39 / 39 papers shown
CoCoIns: Consistent Subject Generation via Contrastive Instantiated Concepts
CoCoIns: Consistent Subject Generation via Contrastive Instantiated Concepts
Lee Hsin-Ying
Kelvin Chan
Ming-Hsuan Yang
DiffM
396
1
0
31 Mar 2025
Text2Story: Advancing Video Storytelling with Text Guidance
Text2Story: Advancing Video Storytelling with Text Guidance
Taewon Kang
D. Kothandaraman
Ming C. Lin
DiffMVGen
416
3
0
08 Mar 2025
StoryAgent: Customized Storytelling Video Generation via Multi-Agent
  Collaboration
StoryAgent: Customized Storytelling Video Generation via Multi-Agent Collaboration
Panwen Hu
Jin Jiang
Jianqi Chen
Mingfei Han
Shengcai Liao
Xiaojun Chang
Xiaodan Liang
VGenDiffM
375
18
0
07 Nov 2024
KAHANI: Culturally-Nuanced Visual Storytelling Tool for Non-Western Cultures
KAHANI: Culturally-Nuanced Visual Storytelling Tool for Non-Western Cultures
Hamna
Deepthi Sudharsan
Agrima Seth
Ritvik Budhiraja
Deepika Khullar
Vyshak Jain
Kalika Bali
Aditya Vashistha
Sameer Segal
DiffM
277
0
0
25 Oct 2024
One missing piece in Vision and Language: A Survey on Comics Understanding
One missing piece in Vision and Language: A Survey on Comics Understanding
Emanuele Vivoli
Andrey Barsky
Mohamed Ali Souibgui
Artemis LLabres
Marco Bertini
Dimosthenis Karatzas
335
7
0
14 Sep 2024
What Makes a Good Story and How Can We Measure It? A Comprehensive
  Survey of Story Evaluation
What Makes a Good Story and How Can We Measure It? A Comprehensive Survey of Story Evaluation
Dingyi Yang
Qin Jin
419
16
0
26 Aug 2024
Anim-Director: A Large Multimodal Model Powered Agent for Controllable
  Animation Video Generation
Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video GenerationACM SIGGRAPH Conference and Exhibition on Computer Graphics and Interactive Techniques in Asia (SIGGRAPH Asia), 2024
Yunxin Li
Haoyuan Shi
Baotian Hu
Longyue Wang
Jiashun Zhu
Jinyi Xu
Zhen Zhao
Min Zhang
VGen
261
24
0
19 Aug 2024
DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
Huiguo He
Huan Yang
Zixi Tuo
Yuan Zhou
Qiuyue Wang
Yuhang Zhang
Zeyu Liu
Wenhao Huang
Hongyang Chao
Jian Yin
VGenDiffM
556
28
0
17 Jul 2024
SEED-Story: Multimodal Long Story Generation with Large Language Model
SEED-Story: Multimodal Long Story Generation with Large Language Model
Shuai Yang
Yuying Ge
Yang Li
Yukang Chen
Yixiao Ge
Mingyu Ding
Yingcong Chen
VGenDiffM
404
57
0
11 Jul 2024
Improving Visual Storytelling with Multimodal Large Language Models
Improving Visual Storytelling with Multimodal Large Language Models
Xiaochuan Lin
Xiangyong Chen
315
1
0
02 Jul 2024
Boosting Consistency in Story Visualization with Rich-Contextual
  Conditional Diffusion Models
Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models
Fei Shen
Hu Ye
Sibo Liu
Jun Zhang
Cong Wang
Xiao Han
Wei Yang
345
61
0
02 Jul 2024
Evolving Storytelling: Benchmarks and Methods for New Character
  Customization with Diffusion Models
Evolving Storytelling: Benchmarks and Methods for New Character Customization with Diffusion Models
Xiyu Wang
Yufei Wang
Satoshi Tsutsui
Weisi Lin
Bihan Wen
Alex C. Kot
289
9
0
20 May 2024
StoryImager: A Unified and Efficient Framework for Coherent Story
  Visualization and Completion
StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion
Ming Tao
Bing-Kun Bao
Hao Tang
Yaowei Wang
Changsheng Xu
DiffM
286
19
0
09 Apr 2024
Masked Generative Story Transformer with Character Guidance and Caption
  Augmentation
Masked Generative Story Transformer with Character Guidance and Caption Augmentation
Christos Papadimitriou
Giorgos Filandrianos
Maria Lymperaiou
Giorgos Stamou
DiffM
263
2
0
13 Mar 2024
Sora as a World Model? A Complete Survey on Text-to-Video Generation
Sora as a World Model? A Complete Survey on Text-to-Video Generation
Joseph Cho
Fachrina Dewi Puspitasari
Sheng Zheng
Jingyao Zheng
Noor Ul Eman
...
Caiyan Qin
Tae-Ho Kim
Choong Seon Hong
Yang Yang
Heng Tao Shen
EGVMVGen
284
66
0
08 Mar 2024
CogCartoon: Towards Practical Story Visualization
CogCartoon: Towards Practical Story Visualization
Zhongyang Zhu
Jie Tang
DiffM
226
7
0
17 Dec 2023
Make-A-Storyboard: A General Framework for Storyboard with Disentangled
  and Merged Control
Make-A-Storyboard: A General Framework for Storyboard with Disentangled and Merged Control
Jingkuan Song
Litao Guo
Lianli Gao
Hengtao Shen
Jingkuan Song
DiffM
188
7
0
06 Dec 2023
StoryGPT-V: Large Language Models as Consistent Story Visualizers
StoryGPT-V: Large Language Models as Consistent Story VisualizersComputer Vision and Pattern Recognition (CVPR), 2023
Xiaoqian Shen
Mohamed Elhoseiny
VLM
446
20
0
04 Dec 2023
AutoStory: Generating Diverse Storytelling Images with Minimal Human
  Effort
AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort
Wen Wang
Canyu Zhao
Hao Chen
Zhekai Chen
Kecheng Zheng
Chunhua Shen
DiffM
291
38
0
19 Nov 2023
OpenLEAF: Open-Domain Interleaved Image-Text Generation and Evaluation
OpenLEAF: Open-Domain Interleaved Image-Text Generation and Evaluation
Jie An
Zhengyuan Yang
Linjie Li
Jianfeng Wang
Kevin Qinghong Lin
Zicheng Liu
Lijuan Wang
Jiebo Luo
222
14
0
11 Oct 2023
Causal-Story: Local Causal Attention Utilizing Parameter-Efficient
  Tuning For Visual Story Synthesis
Causal-Story: Local Causal Attention Utilizing Parameter-Efficient Tuning For Visual Story SynthesisIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Tianyi Song
Jiuxin Cao
Kun Wang
Bo Liu
Xiaofeng Zhang
DiffM
266
8
0
18 Sep 2023
StoryBench: A Multifaceted Benchmark for Continuous Story Visualization
StoryBench: A Multifaceted Benchmark for Continuous Story VisualizationNeural Information Processing Systems (NeurIPS), 2023
Emanuele Bugliarello
Hernan Moraldo
Ruben Villegas
Mohammad Babaeizadeh
M. Saffar
Han Zhang
D. Erhan
V. Ferrari
Pieter-Jan Kindermans
P. Voigtlaender
VGen
331
14
0
22 Aug 2023
Story Visualization by Online Text Augmentation with Context Memory
Story Visualization by Online Text Augmentation with Context MemoryIEEE International Conference on Computer Vision (ICCV), 2023
Daechul Ahn
Daneul Kim
Gwangmo Song
Seung Wook Kim
Honglak Lee
Luan Tuyen Chau
Jonghyun Choi
DiffM
261
9
0
15 Aug 2023
Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion
  Models
Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion ModelsComputer Vision and Pattern Recognition (CVPR), 2023
Chang-rui Liu
Haoning Wu
Yujie Zhong
Xiaoyu Zhang
Yanfeng Wang
Weidi Xie
DiffMVLM
295
66
0
01 Jun 2023
TaleCrafter: Interactive Story Visualization with Multiple Characters
TaleCrafter: Interactive Story Visualization with Multiple CharactersACM SIGGRAPH Conference and Exhibition on Computer Graphics and Interactive Techniques in Asia (SIGGRAPH Asia), 2023
Yuan Gong
Youxin Pang
Xiaodong Cun
Menghan Xia
Yingqing He
...
Longyue Wang
Yong Zhang
Xintao Wang
Ying Shan
Yujiu Yang
DiffM
347
64
0
29 May 2023
Improved Visual Story Generation with Adaptive Context Modeling
Improved Visual Story Generation with Adaptive Context ModelingAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Zhangyin Feng
Yuchen Ren
Xinmiao Yu
Xiaocheng Feng
Duyu Tang
Shuming Shi
Bing Qin
DiffM
233
23
0
26 May 2023
Video Generation Beyond a Single Clip
Video Generation Beyond a Single Clip
Hsin-Ping Huang
Yu-Chuan Su
Ming-Hsuan Yang
VLMDiffMVGen
295
3
0
15 Apr 2023
The Contribution of Knowledge in Visiolinguistic Learning: A Survey on
  Tasks and Challenges
The Contribution of Knowledge in Visiolinguistic Learning: A Survey on Tasks and Challenges
Maria Lymperaiou
Giorgos Stamou
VLM
235
5
0
04 Mar 2023
Counterfactual Edits for Generative Evaluation
Counterfactual Edits for Generative Evaluation
Maria Lymperaiou
Giorgos Filandrianos
Konstantinos Thomas
Giorgos Stamou
EGVM
234
0
0
02 Mar 2023
An Impartial Transformer for Story Visualization
An Impartial Transformer for Story Visualization
N. Tsakas
Maria Lymperaiou
Giorgos Filandrianos
Giorgos Stamou
ViT
237
3
0
09 Jan 2023
Make-A-Story: Visual Memory Conditioned Consistent Story Generation
Make-A-Story: Visual Memory Conditioned Consistent Story GenerationComputer Vision and Pattern Recognition (CVPR), 2022
Tanzila Rahman
Hsin-Ying Lee
Jian Ren
Sergey Tulyakov
Shweta Mahajan
Leonid Sigal
DiffM
365
91
0
23 Nov 2022
Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models
Synthesizing Coherent Story with Auto-Regressive Latent Diffusion ModelsIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Xichen Pan
Pengda Qin
Yuhong Li
Hui Xue
Wenhu Chen
DiffM
244
79
0
20 Nov 2022
A survey on knowledge-enhanced multimodal learning
A survey on knowledge-enhanced multimodal learningArtificial Intelligence Review (Artif Intell Rev), 2022
Maria Lymperaiou
Giorgos Stamou
467
23
0
19 Nov 2022
Learning to Model Multimodal Semantic Alignment for Story Visualization
Learning to Model Multimodal Semantic Alignment for Story VisualizationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Bowen Li
Thomas Lukasiewicz
DiffM
249
3
0
14 Nov 2022
Character-Centric Story Visualization via Visual Planning and Token
  Alignment
Character-Centric Story Visualization via Visual Planning and Token AlignmentConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Hong Chen
Rujun Han
Te-Lin Wu
Hideki Nakayama
Nanyun Peng
DiffMVGen
375
36
0
16 Oct 2022
StoryDALL-E: Adapting Pretrained Text-to-Image Transformers for Story
  Continuation
StoryDALL-E: Adapting Pretrained Text-to-Image Transformers for Story ContinuationEuropean Conference on Computer Vision (ECCV), 2022
A. Maharana
Darryl Hannan
Joey Tianyi Zhou
DiffM
279
101
0
13 Sep 2022
Word-Level Fine-Grained Story Visualization
Word-Level Fine-Grained Story VisualizationEuropean Conference on Computer Vision (ECCV), 2022
Bowen Li
Thomas Lukasiewicz
DiffM3DH
354
28
0
03 Aug 2022
Integrating Visuospatial, Linguistic and Commonsense Structure into
  Story Visualization
Integrating Visuospatial, Linguistic and Commonsense Structure into Story VisualizationConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
A. Maharana
Joey Tianyi Zhou
254
70
0
21 Oct 2021
Describe What to Change: A Text-guided Unsupervised Image-to-Image
  Translation Approach
Describe What to Change: A Text-guided Unsupervised Image-to-Image Translation ApproachACM Multimedia (ACM MM), 2020
Yahui Liu
Marco De Nadai
Deng Cai
Huayang Li
Xavier Alameda-Pineda
Andrii Zadaianchuk
Bruno Lepri
274
62
0
10 Aug 2020
1