Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2306.13380
Cited By
First Place Solution to the CVPR'2023 AQTC Challenge: A Function-Interaction Centric Approach with Spatiotemporal Visual-Language Alignment
23 June 2023
Tom Tongjia Chen
Hongshan Yu
Zhengeng Yang
Ming Li
Zechuan Li
Jingwen Wang
Wei Miao
Wei Sun
Chen Chen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"First Place Solution to the CVPR'2023 AQTC Challenge: A Function-Interaction Centric Approach with Spatiotemporal Visual-Language Alignment"
2 / 2 papers shown
Title
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
S. Hoi
MLLM
BDL
VLM
CLIP
388
4,110
0
28 Jan 2022
Ego4D: Around the World in 3,000 Hours of Egocentric Video
Kristen Grauman
Andrew Westbury
Eugene Byrne
Zachary Chavis
Antonino Furnari
...
Mike Zheng Shou
Antonio Torralba
Lorenzo Torresani
Mingfei Yan
Jitendra Malik
EgoV
224
1,017
0
13 Oct 2021
1