2406.06087
GAIA: Rethinking Action Quality Assessment for AI-Generated Videos
10 June 2024
Zijian Chen
Wei Sun
Yuan Tian
Jun Jia
Zicheng Zhang
Jiarui Wang
Ru Huang
Xiongkuo Min
Guangtao Zhai
Wenjun Zhang
EGVM
Papers citing
"GAIA: Rethinking Action Quality Assessment for AI-Generated Videos"
12 / 12 papers shown
OBI-Bench: Can LMMs Aid in Study of Ancient Script on Oracle Bones?
Z. Chen
Tingzhu Chen
Wenjun Zhang
Guangtao Zhai
80
3
0
02 Dec 2024
LOGO: A Long-Form Video Dataset for Group Action Quality Assessment
Shiyi Zhang
Wen-Dao Dai
Sujia Wang
Xiangwei Shen
Jiwen Lu
Jie Zhou
Yansong Tang
42
25
0
07 Apr 2024
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
Roberto Henschel
Levon Khachatryan
Daniil Hayrapetyan
Hayk Poghosyan
Vahram Tadevosyan
Zhangyang Wang
Shant Navasardyan
Humphrey Shi
DiffM
VGen
88
76
0
21 Mar 2024
InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model
Xiao-wen Dong
Pan Zhang
Yuhang Zang
Yuhang Cao
Bin Wang
...
Conghui He
Xingcheng Zhang
Yu Qiao
Dahua Lin
Jiaqi Wang
VLM
MLLM
73
89
0
29 Jan 2024
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Haoxin Chen
Yong Zhang
Xiaodong Cun
Menghan Xia
Xintao Wang
Chao-Liang Weng
Ying Shan
VGen
DiffM
115
269
0
17 Jan 2024
MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation
Weimin Wang
Jiawei Liu
Zhijie Lin
Jiangqiao Yan
Shuo Chen
...
Jie Wu
Jun Hao Liew
Hanshu Yan
Daquan Zhou
Jiashi Feng
VGen
DiffM
68
17
0
09 Jan 2024
BAND-2k: Banding Artifact Noticeable Database for Banding Detection and Quality Assessment
Zijian Chen
Wei Sun
Jun Jia
Fangfang Lu
Zicheng Zhang
Jing Liu
Ru Huang
Xiongkuo Min
Guangtao Zhai
31
7
0
29 Nov 2023
FAST-VQA: Efficient End-to-end Video Quality Assessment with Fragment Sampling
Haoning Wu
Chaofeng Chen
Jingwen Hou
Liang Liao
Annan Wang
Wenxiu Sun
Qiong Yan
Weisi Lin
65
76
0
06 Jul 2022
CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers
Wenyi Hong
Ming Ding
Wendi Zheng
Xinghan Liu
Jie Tang
DiffM
235
556
0
29 May 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
S. Hoi
MLLM
BDL
VLM
CLIP
380
4,010
0
28 Jan 2022
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
283
5,723
0
29 Apr 2021
U-Net: Convolutional Networks for Biomedical Image Segmentation
Olaf Ronneberger
Philipp Fischer
Thomas Brox
SSeg
3DV
229
74,467
0
18 May 2015