ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2407.14834
  4. Cited By
Can VLMs be used on videos for action recognition? LLMs are Visual
  Reasoning Coordinators

Can VLMs be used on videos for action recognition? LLMs are Visual Reasoning Coordinators

20 July 2024
Harsh Lunia
ArXivPDFHTML

Papers citing "Can VLMs be used on videos for action recognition? LLMs are Visual Reasoning Coordinators"

1 / 1 papers shown
Title
BLIP: Bootstrapping Language-Image Pre-training for Unified
  Vision-Language Understanding and Generation
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
S. Hoi
MLLM
BDL
VLM
CLIP
388
4,010
0
28 Jan 2022
1