ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2403.05131
  4. Cited By
Sora as an AGI World Model? A Complete Survey on Text-to-Video
  Generation
v1v2 (latest)

Sora as an AGI World Model? A Complete Survey on Text-to-Video Generation

8 March 2024
Joseph Cho
Fachrina Dewi Puspitasari
Sheng Zheng
Jingyao Zheng
Lik-Hang Lee
Tae-Ho Kim
Choong Seon Hong
Chaoning Zhang
    EGVMVGen
ArXiv (abs)PDFHTML

Papers citing "Sora as an AGI World Model? A Complete Survey on Text-to-Video Generation"

39 / 39 papers shown
Title
Generative Action Tell-Tales: Assessing Human Motion in Synthesized Videos
Generative Action Tell-Tales: Assessing Human Motion in Synthesized Videos
Xavier Thomas
Youngsun Lim
Ananya Srinivasan
Audrey Zheng
Deepti Ghadiyaram
EGVMVGen
304
0
0
01 Dec 2025
PhyDetEx: Detecting and Explaining the Physical Plausibility of T2V Models
Zeqing Wang
Keze Wang
Lei Zhang
VGen
84
0
0
01 Dec 2025
Counterfactual World Models via Digital Twin-conditioned Video Diffusion
Counterfactual World Models via Digital Twin-conditioned Video Diffusion
Yiqing Shen
Aiza Maksutova
Chenjia Li
Mathias Unberath
DiffMVGen
153
0
0
21 Nov 2025
VEIL: Jailbreaking Text-to-Video Models via Visual Exploitation from Implicit Language
VEIL: Jailbreaking Text-to-Video Models via Visual Exploitation from Implicit Language
Zonghao Ying
Moyang Chen
Nizhang Li
Zhiqiang Wang
Wenxin Zhang
Quanchen Zou
Zonglei Jing
Aishan Liu
Xianglong Liu
108
0
0
17 Nov 2025
PipeDiT: Accelerating Diffusion Transformers in Video Generation with Task Pipelining and Model Decoupling
PipeDiT: Accelerating Diffusion Transformers in Video Generation with Task Pipelining and Model Decoupling
S. Wang
Qiang Wang
Shaohuai Shi
VGen
122
0
0
15 Nov 2025
Simulating the Visual World with Artificial Intelligence: A Roadmap
Simulating the Visual World with Artificial Intelligence: A Roadmap
Jingtong Yue
Z. Huang
Z. Chen
Xintao Wang
Pengfei Wan
Ziwei Liu
VGenLM&Ro
374
0
0
11 Nov 2025
Embodied AI: From LLMs to World Models
Embodied AI: From LLMs to World Models
Tongtong Feng
Xin Wang
Yu Jiang
Wenwu Zhu
LM&Ro
325
8
0
24 Sep 2025
InPhyRe Discovers: Large Multimodal Models Struggle in Inductive Physical Reasoning
InPhyRe Discovers: Large Multimodal Models Struggle in Inductive Physical Reasoning
Gautam Sreekumar
Vishnu Boddeti
ReLMLRM
133
0
0
12 Sep 2025
Video Understanding by Design: How Datasets Shape Architectures and Insights
Video Understanding by Design: How Datasets Shape Architectures and Insights
Lei Wang
Piotr Koniusz
Yongsheng Gao
3DVVGenAI4TS
225
0
0
11 Sep 2025
CineTrans: Learning to Generate Videos with Cinematic Transitions via Masked Diffusion Models
CineTrans: Learning to Generate Videos with Cinematic Transitions via Masked Diffusion Models
Xiaoxue Wu
Bingjie Gao
Yu Qiao
Yaohui Wang
Xinyuan Chen
DiffMVGen
173
5
0
15 Aug 2025
NarrLV: Towards a Comprehensive Narrative-Centric Evaluation for Long Video Generation
NarrLV: Towards a Comprehensive Narrative-Centric Evaluation for Long Video Generation
X. Feng
H. Yu
M. Wu
Shuyan Hu
J. Chen
C. Zhu
J. Wu
X. Chu
K. Huang
DiffMEGVMVGen
481
4
0
15 Jul 2025
Vision Technologies with Applications in Traffic Surveillance Systems: A Holistic SurveyACM Computing Surveys (ACM CSUR), 2024
Wei Zhou
Lei Zhao
Lei Zhao
Runyu Zhang
Yifan Cui
Hongpu Huang
Kun Qie
Chen Wang
AI4TS
472
8
0
01 Jul 2025
G4Seg: Generation for Inexact Segmentation Refinement with Diffusion Models
G4Seg: Generation for Inexact Segmentation Refinement with Diffusion Models
Tianjiao Zhang
Fei Zhang
Jiangchao Yao
Ya Zhang
Yanfeng Wang
DiffM
317
4
0
02 Jun 2025
MOVi: Training-free Text-conditioned Multi-Object Video Generation
MOVi: Training-free Text-conditioned Multi-Object Video Generation
Aimon Rahman
Jiang Liu
Ze Wang
Ximeng Sun
Jialian Wu
Xiaodong Yu
Yusheng Su
Vishal M. Patel
Zicheng Liu
Emad Barsoum
DiffMVGen
257
0
0
29 May 2025
A Challenge to Build Neuro-Symbolic Video Agents
A Challenge to Build Neuro-Symbolic Video Agents
Sahil Shah
Harsh Goel
Sai Shankar Narasimhan
Minkyu Choi
S P Sharan
Oguzhan Akcin
Sandeep Chinchali
AI4TS
231
1
0
20 May 2025
We'll Fix it in Post: Improving Text-to-Video Generation with Neuro-Symbolic Feedback
We'll Fix it in Post: Improving Text-to-Video Generation with Neuro-Symbolic Feedback
Minkyu Choi
S P Sharan
Harsh Goel
Sahil Shah
Sandeep Chinchali
DiffMVGen
386
4
0
24 Apr 2025
Morpheus: Benchmarking Physical Reasoning of Video Generative Models with Real Physical Experiments
Morpheus: Benchmarking Physical Reasoning of Video Generative Models with Real Physical Experiments
Chenyu Zhang
Daniil Cherniavskii
Antonios Tragoudaras
Antonios Vozikis
Thijmen Nijdam
Thijmen Nijdam
Mark Bodracska
Mark Bodracska
Andrii Zadaianchuk
E. Gavves
EGVMVGen
257
11
0
03 Apr 2025
HumanDreamer: Generating Controllable Human-Motion Videos via Decoupled Generation
HumanDreamer: Generating Controllable Human-Motion Videos via Decoupled GenerationComputer Vision and Pattern Recognition (CVPR), 2025
Boyuan Wang
Xiaofeng Wang
Chaojun Ni
Guosheng Zhao
Zhiqin Yang
...
Yukun Zhou
Xinze Chen
Guan Huang
Lihong Liu
Xingang Wang
VGen
348
16
0
31 Mar 2025
A Self-supervised Motion Representation for Portrait Video Generation
A Self-supervised Motion Representation for Portrait Video Generation
Qiyuan Zhang
Chenyu Wu
Wenzhang Sun
Huaize Liu
Donglin Di
Wei Chen
Changqing Zou
VGen
253
0
0
13 Mar 2025
FuseChat-3.0: Preference Optimization Meets Heterogeneous Model Fusion
Ziyi Yang
Fanqi Wan
Longguang Zhong
Canbin Huang
Guosheng Liang
Xiaojun Quan
MoMe
260
9
0
06 Mar 2025
BounTCHA: A CAPTCHA Utilizing Boundary Identification in Guided Generative AI-extended Videos
BounTCHA: A CAPTCHA Utilizing Boundary Identification in Guided Generative AI-extended Videos
Lehao Lin
Ke Wang
Maha Abdallah
Wei Cai
AAML
325
0
0
30 Jan 2025
Generative AI for Cel-Animation: A Survey
Generative AI for Cel-Animation: A Survey
Yunlong Tang
Junjia Guo
Pinxin Liu
Zhiyuan Wang
Hang Hua
...
Jing Bi
Mingqian Feng
Xuzhao Li
Zeliang Zhang
Chenliang Xu
VGen
655
17
0
08 Jan 2025
Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric
Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric
Zhichao Zhang
Wei Sun
Xinyue Li
Yunhao Li
Qihang Ge
...
Zhongpeng Ji
Fengyu Sun
Shangling Jui
Xiongkuo Min
Guoquan Zheng
EGVM
477
10
0
25 Nov 2024
Understanding World or Predicting Future? A Comprehensive Survey of World Models
Understanding World or Predicting Future? A Comprehensive Survey of World ModelsACM Computing Surveys (ACM CSUR), 2024
Jingtao Ding
Yunke Zhang
Yu Shang
Yuheng Zhang
Zefang Zong
...
Fengli Xu
Yong Li
Chen Gao
Fengli Xu
Yong Li
VGenSyDa
445
17
0
21 Nov 2024
Jailbreak Attacks and Defenses against Multimodal Generative Models: A
  Survey
Jailbreak Attacks and Defenses against Multimodal Generative Models: A Survey
Xuannan Liu
Xing Cui
Peipei Li
Zekun Li
Huaibo Huang
Shuhan Xia
Miaoxuan Zhang
Yueying Zou
Ran He
AAML
517
23
0
14 Nov 2024
Artificial Intelligence for Biomedical Video Generation
Artificial Intelligence for Biomedical Video Generation
Linyuan Li
Jianing Qiu
Anujit Saha
Lin Li
Poyuan Li
Mengxian He
Ziyu Guo
Wu Yuan
VGen
366
3
0
12 Nov 2024
Survey of User Interface Design and Interaction Techniques in Generative
  AI Applications
Survey of User Interface Design and Interaction Techniques in Generative AI Applications
Reuben Luera
Ryan Rossi
Alexa F. Siu
Franck Dernoncourt
Tong Yu
...
Hanieh Salehy
Jian Zhao
Samyadeep Basu
Puneet Mathur
Nedim Lipka
AI4TS
270
5
0
28 Oct 2024
A Transformer Based Generative Chemical Language AI Model for Structural
  Elucidation of Organic Compounds
A Transformer Based Generative Chemical Language AI Model for Structural Elucidation of Organic CompoundsJournal of Cheminformatics (J Cheminform), 2024
Xiaofeng Tan
131
3
0
13 Oct 2024
K-Sort Arena: Efficient and Reliable Benchmarking for Generative Models via K-wise Human Preferences
K-Sort Arena: Efficient and Reliable Benchmarking for Generative Models via K-wise Human PreferencesComputer Vision and Pattern Recognition (CVPR), 2024
Zhikai Li
Xuewen Liu
Dongrong Fu
Jianquan Li
Qingyi Gu
Kurt Keutzer
Zhen Dong
EGVMVGenDiffM
326
8
0
26 Aug 2024
LessonPlanner: Assisting Novice Teachers to Prepare Pedagogy-Driven
  Lesson Plans with Large Language Models
LessonPlanner: Assisting Novice Teachers to Prepare Pedagogy-Driven Lesson Plans with Large Language ModelsACM Symposium on User Interface Software and Technology (UIST), 2024
Haoxiang Fan
Guanzheng Chen
Xingbo Wang
Zhenhui Peng
AI4Ed
191
11
0
02 Aug 2024
Benchmarking AIGC Video Quality Assessment: A Dataset and Unified Model
Benchmarking AIGC Video Quality Assessment: A Dataset and Unified Model
Zhichao Zhang
Xinyue Li
Wei Sun
Jun Jia
Xiongkuo Min
...
Puyi Wang
Zhongpeng Ji
Fengyu Sun
Shangling Jui
Guangtao Zhai
EGVM
206
0
0
31 Jul 2024
A Comprehensive Survey on Human Video Generation: Challenges, Methods,
  and Insights
A Comprehensive Survey on Human Video Generation: Challenges, Methods, and Insights
Wentao Lei
Jinting Wang
Fengji Ma
Guanjie Huang
Li Liu
VGenEGVM
261
16
0
11 Jul 2024
Latent Energy-Based Odyssey: Black-Box Optimization via Expanded
  Exploration in the Energy-Based Latent Space
Latent Energy-Based Odyssey: Black-Box Optimization via Expanded Exploration in the Energy-Based Latent Space
Peiyu Yu
Dinghuai Zhang
Hengzhi He
Xiaojian Ma
Ruiyao Miao
...
Deqian Kong
Ruiqi Gao
Jianwen Xie
Guang Cheng
Ying Nian Wu
296
9
0
27 May 2024
Sora and V-JEPA Have Not Learned The Complete Real World Model -- A
  Philosophical Analysis of Video AIs Through the Theory of Productive
  Imagination
Sora and V-JEPA Have Not Learned The Complete Real World Model -- A Philosophical Analysis of Video AIs Through the Theory of Productive Imagination
Jianqiu Zhang
VGen
91
0
0
06 May 2024
Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond
Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond
Zheng Zhu
Xiaofeng Wang
Wangbo Zhao
Chen Min
Nianchen Deng
...
Dawei Zhao
Liang Xiao
Jian-jun Zhao
Jiwen Lu
Guan Huang
VGenLM&Ro
310
77
0
06 May 2024
DeepFake-O-Meter v2.0: An Open Platform for DeepFake Detection
DeepFake-O-Meter v2.0: An Open Platform for DeepFake Detection
Yan Ju
Chengzhe Sun
Shan Jia
Shuwei Hou
Zhaofeng Si
Soumyya Kanti Datta
Lipeng Ke
Riky Zhou
Anita Nikolich
Siwei Lyu
269
6
0
19 Apr 2024
BEND: Bagging Deep Learning Training Based on Efficient Neural Network
  Diffusion
BEND: Bagging Deep Learning Training Based on Efficient Neural Network Diffusion
Jia Wei
Xingjun Zhang
Witold Pedrycz
DiffM
168
0
0
23 Mar 2024
A Roadmap Towards Automated and Regulated Robotic Systems
A Roadmap Towards Automated and Regulated Robotic Systems
Yihao Liu
Mehran Armand
181
3
0
21 Mar 2024
Xception: Deep Learning with Depthwise Separable Convolutions
Xception: Deep Learning with Depthwise Separable ConvolutionsComputer Vision and Pattern Recognition (CVPR), 2016
François Chollet
MDEBDLPINN
2.8K
16,553
0
07 Oct 2016
1