Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.02076
Cited By
VTG-GPT: Tuning-Free Zero-Shot Video Temporal Grounding with GPT
4 March 2024
Yifang Xu
Yunzhuo Sun
Zien Xie
Benxiang Zhai
Sidan Du
Re-assign community
ArXiv
PDF
HTML
Papers citing
"VTG-GPT: Tuning-Free Zero-Shot Video Temporal Grounding with GPT"
5 / 5 papers shown
Title
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning
Jun Chen
Deyao Zhu
Xiaoqian Shen
Xiang Li
Zechun Liu
Pengchuan Zhang
Raghuraman Krishnamoorthi
Vikas Chandra
Yunyang Xiong
Mohamed Elhoseiny
MLLM
152
280
0
14 Oct 2023
MH-DETR: Video Moment and Highlight Detection with Cross-modal Transformer
Yifang Xu
Yunzhuo Sun
Yang Li
Yilei Shi
Xiaoxia Zhu
S. Du
ViT
35
16
0
29 Apr 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
244
4,186
0
30 Jan 2023
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
255
7,337
0
11 Nov 2021
Zero-shot Natural Language Video Localization
Jinwoo Nam
Daechul Ahn
Dongyeop Kang
S. Ha
Jonghyun Choi
75
43
0
29 Aug 2021
1