Language as the Medium: Multimodal Video Classification through text only

19 September 2023

Papers citing "Language as the Medium: Multimodal Video Classification through text only"


Title: BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Authors: Junnan Li, Dongxu Li, Silvio Savarese, Steven C. H. Hoi
Tags: VLM, MLLM
244 4,186 0
Date: 30 Jan 2023