Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2405.16382
Cited By
Video Prediction Models as General Visual Encoders
25 May 2024
James Maier
Nishanth Mohankumar
VGen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Video Prediction Models as General Visual Encoders"
3 / 3 papers shown
Title
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
S. Hoi
MLLM
BDL
VLM
CLIP
382
4,010
0
28 Jan 2022
VideoGPT: Video Generation using VQ-VAE and Transformers
Wilson Yan
Yunzhi Zhang
Pieter Abbeel
A. Srinivas
ViT
VGen
237
482
0
20 Apr 2021
U-Net: Convolutional Networks for Biomedical Image Segmentation
Olaf Ronneberger
Philipp Fischer
Thomas Brox
SSeg
3DV
229
74,467
0
18 May 2015
1