ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1609.02612
  4. Cited By
Generating Videos with Scene Dynamics

Generating Videos with Scene Dynamics

8 September 2016
Carl Vondrick
Hamed Pirsiavash
Antonio Torralba
    GAN
    VGen
ArXivPDFHTML

Papers citing "Generating Videos with Scene Dynamics"

50 / 739 papers shown
Title
Beyond the Frame: Generating 360° Panoramic Videos from Perspective Videos
Beyond the Frame: Generating 360° Panoramic Videos from Perspective Videos
Rundong Luo
Matthew Wallingford
Ali Farhadi
Noah Snavely
Wei-Chiu Ma
VGen
19
0
0
10 Apr 2025
A Large-Scale Analysis on Contextual Self-Supervised Video Representation Learning
A Large-Scale Analysis on Contextual Self-Supervised Video Representation Learning
Akash Kumar
Ashlesha Kumar
Vibhav Vineet
Y. S. Rawat
SSL
141
0
0
08 Apr 2025
SkyReels-A2: Compose Anything in Video Diffusion Transformers
SkyReels-A2: Compose Anything in Video Diffusion Transformers
Zhengcong Fei
D. Li
Di Qiu
J. Wang
Yikun Dou
...
J. Xu
Mingyuan Fan
Guibin Chen
Yang Li
Yahui Zhou
DiffM
VGen
63
2
0
03 Apr 2025
VideoMage: Multi-Subject and Motion Customization of Text-to-Video Diffusion Models
VideoMage: Multi-Subject and Motion Customization of Text-to-Video Diffusion Models
Chi-Pin Huang
Yen-Siang Wu
Hung-Kai Chung
Kai-Po Chang
Fu-En Yang
Yu-Jie Wang
DiffM
VGen
55
0
0
27 Mar 2025
Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency
Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency
T. Liu
Z. Huang
Zhaoxi Chen
Guangcong Wang
Shoukang Hu
Liao Shen
Huiqiang Sun
Z. Cao
Wei Li
Z. Liu
VGen
3DGS
79
0
0
26 Mar 2025
FullDiT: Multi-Task Video Generative Foundation Model with Full Attention
FullDiT: Multi-Task Video Generative Foundation Model with Full Attention
Xuan Ju
Weicai Ye
Quande Liu
Qiulin Wang
Xintao Wang
Pengfei Wan
Di Zhang
Kun Gai
Qiang Xu
VGen
39
1
0
25 Mar 2025
Joint Self-Supervised Video Alignment and Action Segmentation
Joint Self-Supervised Video Alignment and Action Segmentation
Ali Shah Ali
Syed Ahmed Mahmood
Mubin Saeed
Andrey Konin
M. Zia
Quoc-Huy Tran
OT
75
0
0
21 Mar 2025
WonderVerse: Extendable 3D Scene Generation with Video Generative Models
WonderVerse: Extendable 3D Scene Generation with Video Generative Models
Hao Feng
Zhi Zuo
Jia-Hui Pan
Ka-Hei Hui
Yihua Shao
Qi Dou
Wei Xie
Zhengzhe Liu
VGen
47
1
0
12 Mar 2025
MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation
MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation
Sihyun Yu
Meera Hahn
Dan Kondratyuk
Jinwoo Shin
Agrim Gupta
José Lezama
Irfan Essa
David A. Ross
Jonathan Huang
DiffM
VGen
72
0
0
18 Feb 2025
Object-Centric Image to Video Generation with Language Guidance
Object-Centric Image to Video Generation with Language Guidance
Angel Villar-Corrales
Gjergj Plepi
Sven Behnke
DiffM
VGen
OCL
71
0
0
17 Feb 2025
Towards Precise Scaling Laws for Video Diffusion Transformers
Towards Precise Scaling Laws for Video Diffusion Transformers
Yuanyang Yin
Yaqi Zhao
Mingwu Zheng
Ke Lin
Jiarong Ou
...
Pengfei Wan
Di Zhang
Baoqun Yin
Wentao Zhang
Kun Gai
122
2
0
03 Jan 2025
DTSGAN: Learning Dynamic Textures via Spatiotemporal Generative
  Adversarial Network
DTSGAN: Learning Dynamic Textures via Spatiotemporal Generative Adversarial Network
Xiangtian Li
Xiaobo Wang
Zhen Qi
Han Cao
Zhaoyang Zhang
Ao Xiang
GAN
TTA
82
2
0
22 Dec 2024
Do Language Models Understand Time?
Do Language Models Understand Time?
Xi Ding
Lei Wang
170
0
0
18 Dec 2024
$\texttt{DINO-Foresight}$: Looking into the Future with DINO
DINO-Foresight\texttt{DINO-Foresight}DINO-Foresight: Looking into the Future with DINO
Efstathios Karypidis
Ioannis Kakogeorgiou
Spyros Gidaris
N. Komodakis
AI4CE
79
1
0
16 Dec 2024
A comprehensive GeoAI review: Progress, Challenges and Outlooks
A comprehensive GeoAI review: Progress, Challenges and Outlooks
Anasse Boutayeb
Iyad Lahsen-cherif
Ahmed El Khadimi
79
0
0
16 Dec 2024
Can video generation replace cinematographers? Research on the cinematic language of generated video
Can video generation replace cinematographers? Research on the cinematic language of generated video
X. Li
Kai WU
Siyi Yang
YiZhan Qu
Guohua. Zhang
...
Mingliang Xiong
Hao Deng
Qingwen Liu
Gang Li
Bin He
VGen
DiffM
90
1
0
16 Dec 2024
Video Diffusion Transformers are In-Context Learners
Video Diffusion Transformers are In-Context Learners
Zhengcong Fei
Di Qiu
Changqian Yu
Debang Li
Mingyuan Fan
VGen
DiffM
148
2
0
14 Dec 2024
KDC-MAE: Knowledge Distilled Contrastive Mask Auto-Encoder
KDC-MAE: Knowledge Distilled Contrastive Mask Auto-Encoder
Maheswar Bora
Saurabh Atreya
Aritra Mukherjee
Abhijit Das
83
0
0
19 Nov 2024
Artificial Intelligence for Biomedical Video Generation
Artificial Intelligence for Biomedical Video Generation
Linyuan Li
Jianing Qiu
Anujit Saha
Lin Li
Poyuan Li
Mengxian He
Ziyu Guo
Wu Yuan
VGen
58
1
0
12 Nov 2024
Asymptotic Analysis of Sample-averaged Q-learning
Asymptotic Analysis of Sample-averaged Q-learning
Saunak Kumar Panda
Ruiqi Liu
Yisha Xiang
OnRL
52
8
0
14 Oct 2024
Loong: Generating Minute-level Long Videos with Autoregressive Language Models
Loong: Generating Minute-level Long Videos with Autoregressive Language Models
Yuqing Wang
Tianwei Xiong
Daquan Zhou
Zhijie Lin
Yang Zhao
Bingyi Kang
Jiashi Feng
Xihui Liu
VGen
46
23
0
03 Oct 2024
Multi-Modal Generative AI: Multi-modal LLM, Diffusion and Beyond
Multi-Modal Generative AI: Multi-modal LLM, Diffusion and Beyond
Hong Chen
Xin Wang
Yuwei Zhou
Bin Huang
Yipeng Zhang
Wei Feng
Houlun Chen
Zeyang Zhang
Siao Tang
Wenwu Zhu
DiffM
47
7
0
23 Sep 2024
Advancing Video Quality Assessment for AIGC
Advancing Video Quality Assessment for AIGC
Xinli Yue
Jianhui Sun
Han Kong
Liangchao Yao
Tianyi Wang
...
Jing Lv
Fan Xia
Yuetang Deng
Qian Wang
Lingchen Zhao
VGen
EGVM
24
0
0
23 Sep 2024
JVID: Joint Video-Image Diffusion for Visual-Quality and
  Temporal-Consistency in Video Generation
JVID: Joint Video-Image Diffusion for Visual-Quality and Temporal-Consistency in Video Generation
Hadrien Reynaud
Matthew Baugh
Mischa Dombrowski
Sarah Cechnicka
Qingjie Meng
Bernhard Kainz
VLM
31
0
0
21 Sep 2024
Real-Time Video Generation with Pyramid Attention Broadcast
Real-Time Video Generation with Pyramid Attention Broadcast
Xuanlei Zhao
Xiaolong Jin
Kai Wang
Yang You
VGen
DiffM
69
31
0
22 Aug 2024
Fréchet Video Motion Distance: A Metric for Evaluating Motion
  Consistency in Videos
Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos
Jiahe Liu
Youran Qu
Qi Yan
Xiaohui Zeng
Lele Wang
Renjie Liao
VGen
EGVM
44
12
0
23 Jul 2024
MiraData: A Large-Scale Video Dataset with Long Durations and Structured
  Captions
MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions
Xuan Ju
Yiming Gao
Zhaoyang Zhang
Ziyang Yuan
Xintao Wang
Ailing Zeng
Yu Xiong
Qiang Xu
Ying Shan
VGen
64
39
0
08 Jul 2024
Diffusion Model-Based Video Editing: A Survey
Diffusion Model-Based Video Editing: A Survey
Wenhao Sun
Rong-Cheng Tu
Jingyi Liao
Dacheng Tao
VGen
58
22
0
26 Jun 2024
Dreamitate: Real-World Visuomotor Policy Learning via Video Generation
Dreamitate: Real-World Visuomotor Policy Learning via Video Generation
Junbang Liang
Ruoshi Liu
Ege Ozguroglu
Sruthi Sudhakar
Achal Dave
P. Tokmakov
Shuran Song
Carl Vondrick
VGen
40
22
0
24 Jun 2024
FacEnhance: Facial Expression Enhancing with Recurrent DDPMs
FacEnhance: Facial Expression Enhancing with Recurrent DDPMs
Hamza Bouzid
Lahoucine Ballihi
DiffM
34
1
0
13 Jun 2024
Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing
  Reliability,Reproducibility, and Practicality
Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability,Reproducibility, and Practicality
Tianle Zhang
Langtian Ma
Yuchen Yan
Yuchen Zhang
Kai Wang
...
Wenqi Shao
Yang You
Yu Qiao
Ping Luo
Kaipeng Zhang
VGen
61
2
0
13 Jun 2024
Visual Representation Learning with Stochastic Frame Prediction
Visual Representation Learning with Stochastic Frame Prediction
Huiwon Jang
Dongyoung Kim
Junsu Kim
Jinwoo Shin
Pieter Abbeel
Younggyo Seo
34
2
0
11 Jun 2024
Searching Priors Makes Text-to-Video Synthesis Better
Searching Priors Makes Text-to-Video Synthesis Better
Haoran Cheng
Liang Peng
Linxuan Xia
Yuepeng Hu
Hengjia Li
Qinglin Lu
Xiaofei He
Boxi Wu
VGen
DiffM
28
0
0
05 Jun 2024
Unleashing Generalization of End-to-End Autonomous Driving with
  Controllable Long Video Generation
Unleashing Generalization of End-to-End Autonomous Driving with Controllable Long Video Generation
Enhui Ma
Lijun Zhou
Tao Tang
Zhan Zhang
Dong Han
...
Peng Jia
Xianpeng Lang
Haiyang Sun
Di Lin
Kaicheng Yu
VGen
18
20
0
03 Jun 2024
SNED: Superposition Network Architecture Search for Efficient Video
  Diffusion Model
SNED: Superposition Network Architecture Search for Efficient Video Diffusion Model
Zhengang Li
Yan Kang
Yuchen Liu
Difan Liu
Tobias Hinz
Feng Liu
Yanzhi Wang
DiffM
19
1
0
31 May 2024
GaussianPrediction: Dynamic 3D Gaussian Prediction for Motion
  Extrapolation and Free View Synthesis
GaussianPrediction: Dynamic 3D Gaussian Prediction for Motion Extrapolation and Free View Synthesis
Boming Zhao
Yuan Li
Ziyu Sun
Lin Zeng
Yujun Shen
Rui-ya Ma
Yinda Zhang
Hujun Bao
Zhaopeng Cui
3DGS
3DV
42
4
0
30 May 2024
Vista: A Generalizable Driving World Model with High Fidelity and
  Versatile Controllability
Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability
Shenyuan Gao
Jiazhi Yang
Li Chen
Kashyap Chitta
Yihang Qiu
Andreas Geiger
Jun Zhang
Hongyang Li
60
75
0
27 May 2024
Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video
  Diffusion Models
Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models
Hanwen Liang
Yuyang Yin
Dejia Xu
Hanxue Liang
Zhangyang Wang
Konstantinos N. Plataniotis
Yao Zhao
Yunchao Wei
VGen
53
38
0
26 May 2024
Review of Deep Representation Learning Techniques for Brain-Computer
  Interfaces and Recommendations
Review of Deep Representation Learning Techniques for Brain-Computer Interfaces and Recommendations
Pierre Guetschel
Sara Ahmadi
Michael Tangermann
22
0
0
17 May 2024
From Sora What We Can See: A Survey of Text-to-Video Generation
From Sora What We Can See: A Survey of Text-to-Video Generation
Rui Sun
Yumin Zhang
Tejal Shah
Jiahao Sun
Shuoying Zhang
Wenqi Li
Haoran Duan
Bo Wei
R. Ranjan
EGVM
79
19
0
17 May 2024
TALC: Time-Aligned Captions for Multi-Scene Text-to-Video Generation
TALC: Time-Aligned Captions for Multi-Scene Text-to-Video Generation
Hritik Bansal
Yonatan Bitton
Michal Yarom
Idan Szpektor
Aditya Grover
Kai-Wei Chang
DiffM
47
11
0
07 May 2024
Matten: Video Generation with Mamba-Attention
Matten: Video Generation with Mamba-Attention
Yu Gao
Jiancheng Huang
Xiaopeng Sun
Zequn Jie
Yujie Zhong
Lin Ma
64
12
0
05 May 2024
Exploring AIGC Video Quality: A Focus on Visual Harmony, Video-Text
  Consistency and Domain Distribution Gap
Exploring AIGC Video Quality: A Focus on Visual Harmony, Video-Text Consistency and Domain Distribution Gap
Bowen Qu
Xiaoyu Liang
Shangkun Sun
Wei-Nan Gao
EGVM
30
6
0
21 Apr 2024
On the Content Bias in Fréchet Video Distance
On the Content Bias in Fréchet Video Distance
Jason S. Hoffman
Aniruddha Mahapatra
Gaurav Parmar
Jun-Yan Zhu
Jia-Bin Huang
EGVM
50
15
0
18 Apr 2024
VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
Sicheng Xu
Guojun Chen
Yu-Xiao Guo
Jiaolong Yang
Chong Li
Zhenyu Zang
Yizhong Zhang
Xin Tong
Baining Guo
40
86
0
16 Apr 2024
Motion Inversion for Video Customization
Motion Inversion for Video Customization
Luozhou Wang
Guibao Shen
Yixun Liang
Xin Tao
Pengfei Wan
Di Zhang
Yijun Li
Yingcong Chen
VGen
DiffM
34
7
0
29 Mar 2024
Minimax density estimation in the adversarial framework under local
  differential privacy
Minimax density estimation in the adversarial framework under local differential privacy
Mélisande Albert
Juliette Chevallier
Béatrice Laurent
Ousmane Sacko
19
0
0
27 Mar 2024
SSM Meets Video Diffusion Models: Efficient Video Generation with
  Structured State Spaces
SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces
Yuta Oshima
Shohei Taniguchi
Masahiro Suzuki
Yutaka Matsuo
32
7
0
12 Mar 2024
AesopAgent: Agent-driven Evolutionary System on Story-to-Video
  Production
AesopAgent: Agent-driven Evolutionary System on Story-to-Video Production
Jiuniu Wang
Zehua Du
Yuyuan Zhao
Bo Yuan
Kexiang Wang
...
Yihen Lu
Gengliang Li
Junlong Gao
Xin Tu
Zhenyu Guo
LLMAG
VGen
28
7
0
12 Mar 2024
DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video
  Generation
DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation
Guosheng Zhao
Xiaofeng Wang
Zheng Zhu
Xinze Chen
Guan Huang
Xiaoyi Bao
Xingang Wang
VGen
33
14
0
11 Mar 2024
1234...131415
Next