ResearchTrend.AI
Make-A-Story: Visual Memory Conditioned Consistent Story Generation

23 November 2022
Tanzila Rahman
Hsin-Ying Lee
Jian Ren
Sergey Tulyakov
Shweta Mahajan
Leonid Sigal
    DiffM

Papers citing "Make-A-Story: Visual Memory Conditioned Consistent Story Generation"

47 / 47 papers shown
One-Minute Video Generation with Test-Time Training
Karan Dalal
Daniel Koceja
Gashon Hussein
Jiarui Xu
Yue Zhao
...
Tatsunori Hashimoto
Sanmi Koyejo
Yejin Choi
Yu Sun
Xiaolong Wang
ViT
91
3
0
07 Apr 2025
Consistent Subject Generation via Contrastive Instantiated Concepts
Lee Hsin-Ying
Kelvin Chan
Ming Yang
DiffM
95
0
0
31 Mar 2025
Object Isolated Attention for Consistent Story Visualization
Xiangyang Luo
Junhao Cheng
Yifan Xie
Xin Zhang
Tao Feng
Z. Liu
Fei Ma
Fei Richard Yu
DiffM
42
1
0
30 Mar 2025
Latent Beam Diffusion Models for Decoding Image Sequences
Guilherme Fernandes
Vasco Ramos
Regev Cohen
Idan Szpektor
João Magalhães
76
0
0
26 Mar 2025
Automated Movie Generation via Multi-Agent CoT Planning
Weijia Wu
Zeyu Zhu
Mike Zheng Shou
VGen
72
1
0
10 Mar 2025
MAMo: Leveraging Memory and Attention for Monocular Video Depth Estimation
R. Yasarla
H. Cai
Jisoo Jeong
Y. Shi
Risheek Garrepalli
Fatih Porikli
MDE
63
16
0
17 Jan 2025
Generative AI for Cel-Animation: A Survey
Yunlong Tang
Junjia Guo
Pinxin Liu
Zhiyuan Wang
Hang Hua
...
Jing Bi
Mingqian Feng
X. Li
Zeliang Zhang
Chenliang Xu
VGen
88
7
0
08 Jan 2025
Urban Air Temperature Prediction using Conditional Diffusion Models
Siyang Dai
Jun Liu
Ngai-Man Cheung
82
0
0
18 Dec 2024
Autoregressive Models in Vision: A Survey
Jing Xiong
Gongye Liu
Lun Huang
Chengyue Wu
Taiqiang Wu
...
M. Zhang
Guillermo Sapiro
Jiebo Luo
Ping Luo
Ngai Wong
VGen
46
9
0
08 Nov 2024
ScriptViz: A Visualization Tool to Aid Scriptwriting based on a Large Movie Database
Anyi Rao
Jean-Peic Chou
Maneesh Agrawala
VGen
23
2
0
04 Oct 2024
SpotActor: Training-Free Layout-Controlled Consistent Image Generation
Jiahao Wang
Caixia Yan
Weizhan Zhang
Haonan Lin
Mengmeng Wang
Guang Dai
Tieliang Gong
Hao Sun
Jingdong Wang
DiffM
29
2
0
07 Sep 2024
What Makes a Good Story and How Can We Measure It? A Comprehensive Survey of Story Evaluation
Dingyi Yang
Qin Jin
36
5
0
26 Aug 2024
Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation
Yunxin Li
Haoyuan Shi
Baotian Hu
Longyue Wang
Jiashun Zhu
Jinyi Xu
Zhen Zhao
Min Zhang
VGen
40
5
0
19 Aug 2024
DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
Huiguo He
Huan Yang
Zixi Tuo
Yuan Zhou
Qiuyue Wang
Yuhang Zhang
Zeyu Liu
Wenhao Huang
Hongyang Chao
Jian Yin
DiffM
VGen
52
12
0
17 Jul 2024
Contrastive Sequential-Diffusion Learning: An approach to Multi-Scene Instructional Video Synthesis
Vasco Ramos
Yonatan Bitton
Michal Yarom
Idan Szpektor
João Magalhães
DiffM
33
0
0
16 Jul 2024
SEED-Story: Multimodal Long Story Generation with Large Language Model
Shuai Yang
Yuying Ge
Yang Li
Yukang Chen
Yixiao Ge
Ying Shan
Yingcong Chen
VGen
DiffM
73
26
0
11 Jul 2024
Replication in Visual Diffusion Models: A Survey and Outlook
Wenhao Wang
Yifan Sun
Zongxin Yang
Zhengdong Hu
Zhentao Tan
Yi Yang
68
6
0
07 Jul 2024
Improving Visual Storytelling with Multimodal Large Language Models
Xiaochuan Lin
Xiangyong Chen
39
0
0
02 Jul 2024
Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models
Fei Shen
Hu Ye
Sibo Liu
Jun Zhang
Cong Wang
Xiao Han
Wei Yang
87
34
0
02 Jul 2024
ORACLE: Leveraging Mutual Information for Consistent Character Generation with LoRAs in Diffusion Models
Kiymet Akdemir
Pinar Yanardag
DiffM
30
1
0
04 Jun 2024
RefDrop: Controllable Consistency in Image or Video Generation via Reference Feature Guidance
JiaoJiao Fan
Haotian Xue
Qinsheng Zhang
Yongxin Chen
30
1
0
27 May 2024
Generating Coherent Sequences of Visual Illustrations for Real-World Manual Tasks
João Bordalo
Vasco Ramos
Rodrigo Valerio
Diogo Glória-Silva
Yonatan Bitton
Michal Yarom
Idan Szpektor
João Magalhães
31
7
0
16 May 2024
TALC: Time-Aligned Captions for Multi-Scene Text-to-Video Generation
Hritik Bansal
Yonatan Bitton
Michal Yarom
Idan Szpektor
Aditya Grover
Kai-Wei Chang
DiffM
47
11
0
07 May 2024
Pixel is a Barrier: Diffusion Models Are More Adversarially Robust Than We Think
Haotian Xue
Yongxin Chen
DiffM
AAML
39
3
0
20 Apr 2024
StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion
Ming Tao
Bing-Kun Bao
Hao Tang
Yaowei Wang
Changsheng Xu
DiffM
39
5
0
09 Apr 2024
MyVLM: Personalizing VLMs for User-Specific Queries
Yuval Alaluf
Elad Richardson
Sergey Tulyakov
Kfir Aberman
Daniel Cohen-Or
MLLM
VLM
36
18
0
21 Mar 2024
Masked Generative Story Transformer with Character Guidance and Caption Augmentation
Christos Papadimitriou
Giorgos Filandrianos
Maria Lymperaiou
Giorgos Stamou
DiffM
92
1
0
13 Mar 2024
Sora as an AGI World Model? A Complete Survey on Text-to-Video Generation
Joseph Cho
Fachrina Dewi Puspitasari
Sheng Zheng
Jingyao Zheng
Lik-Hang Lee
Tae-Ho Kim
Choong Seon Hong
Chaoning Zhang
EGVM
VGen
36
40
0
08 Mar 2024
HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting
Zhen Zhou
Fan Ma
Hehe Fan
Yi Yang
3DGS
27
23
0
09 Feb 2024
Prompting Hard or Hardly Prompting: Prompt Inversion for Text-to-Image Diffusion Models
Shweta Mahajan
Tanzila Rahman
Kwang Moo Yi
Leonid Sigal
DiffM
21
17
0
19 Dec 2023
MagicScroll: Nontypical Aspect-Ratio Image Generation for Visual Storytelling via Multi-Layered Semantic-Aware Denoising
Bingyuan Wang
Hengyu Meng
Zeyu Cai
Lanjiong Li
Yue Ma
Qifeng Chen
Zeyu Wang
DiffM
29
3
0
18 Dec 2023
CogCartoon: Towards Practical Story Visualization
Zhongyang Zhu
Jie Tang
DiffM
27
3
0
17 Dec 2023
A Survey of Generative AI for Intelligent Transportation Systems
Huan Yan
Yong Li
21
8
0
13 Dec 2023
Make-A-Storyboard: A General Framework for Storyboard with Disentangled and Merged Control
Sitong Su
Litao Guo
Lianli Gao
Hengtao Shen
Jingkuan Song
DiffM
33
3
0
06 Dec 2023
StoryGPT-V: Large Language Models as Consistent Story Visualizers
Xiaoqian Shen
Mohamed Elhoseiny
VLM
90
10
0
04 Dec 2023
Exploiting Diffusion Prior for Generalizable Dense Prediction
Hsin-Ying Lee
Hung-Yu Tseng
Hsin-Ying Lee
Ming-Hsuan Yang
DiffM
MDE
30
18
0
30 Nov 2023
AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort
Wen Wang
Canyu Zhao
Hao Chen
Zhekai Chen
Kecheng Zheng
Chunhua Shen
DiffM
16
21
0
19 Nov 2023
The Chosen One: Consistent Characters in Text-to-Image Diffusion Models
Omri Avrahami
Amir Hertz
Yael Vinker
Moab Arar
Shlomi Fruchter
Ohad Fried
Daniel Cohen-Or
Dani Lischinski
DiffM
47
32
0
16 Nov 2023
Generating Visual Scenes from Touch
Fengyu Yang
Jiacheng Zhang
Andrew Owens
DiffM
22
24
0
26 Sep 2023
StoryBench: A Multifaceted Benchmark for Continuous Story Visualization
Emanuele Bugliarello
Hernan Moraldo
Ruben Villegas
Mohammad Babaeizadeh
M. Saffar
Han Zhang
D. Erhan
V. Ferrari
Pieter-Jan Kindermans
P. Voigtlaender
VGen
28
10
0
22 Aug 2023
Painter: Teaching Auto-regressive Language Models to Draw Sketches
Reza Pourreza
Apratim Bhattacharyya
Sunny Panchal
Mingu Lee
Pulkit Madan
Roland Memisevic
27
5
0
16 Aug 2023
TaleCrafter: Interactive Story Visualization with Multiple Characters
Yuan Gong
Youxin Pang
Xiaodong Cun
Menghan Xia
Yingqing He
...
Longyue Wang
Yong Zhang
Xintao Wang
Ying Shan
Yujiu Yang
DiffM
23
45
0
29 May 2023
Text-to-image Diffusion Models in Generative AI: A Survey
Chenshuang Zhang
Chaoning Zhang
Mengchun Zhang
In So Kweon
VLM
47
263
0
14 Mar 2023
SDFusion: Multimodal 3D Shape Completion, Reconstruction, and Generation
Yen-Chi Cheng
Hsin-Ying Lee
Sergey Tulyakov
A. Schwing
Liangyan Gui
DiffM
19
244
0
08 Dec 2022
Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models
Xichen Pan
Pengda Qin
Yuhong Li
Hui Xue
Wenhu Chen
DiffM
16
62
0
20 Nov 2022
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
253
4,764
0
24 Feb 2021
Imagine This! Scripts to Compositions to Videos
Tanmay Gupta
Dustin Schwenk
Ali Farhadi
Derek Hoiem
Aniruddha Kembhavi
CoGe
VGen
109
87
0
10 Apr 2018