2310.12724
Cited By
Query-aware Long Video Localization and Relation Discrimination for Deep Video Understanding
19 October 2023
Yuanxing Xu
Yuting Wei
Bin Wu
Papers citing
"Query-aware Long Video Localization and Relation Discrimination for Deep Video Understanding"
3 / 3 papers shown
Title
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
S. Hoi
MLLM
BDL
VLM
CLIP
388
4,110
0
28 Jan 2022
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding
Hu Xu
Gargi Ghosh
Po-Yao (Bernie) Huang
Dmytro Okhonko
Armen Aghajanyan
Florian Metze
Luke Zettlemoyer
Christoph Feichtenhofer
CLIP
VLM
245
554
0
28 Sep 2021
Bridge to Answer: Structure-aware Graph Interaction Network for Video Question Answering
Jungin Park
Jiyoung Lee
K. Sohn
123
99
0
29 Apr 2021