ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2412.08158
  4. Cited By
How Vision-Language Tasks Benefit from Large Pre-trained Models: A
  Survey

How Vision-Language Tasks Benefit from Large Pre-trained Models: A Survey

11 December 2024
Yayun Qi
Hongxi Li
Yiqi Song
Xinxiao Wu
Jiebo Luo
    LRMVLM
ArXiv (abs)PDFHTMLGithub

Papers citing "How Vision-Language Tasks Benefit from Large Pre-trained Models: A Survey"

3 / 3 papers shown
TIP and Polish: Text-Image-Prototype Guided Multi-Modal Generation via Commonality-Discrepancy Modeling and Refinement
TIP and Polish: Text-Image-Prototype Guided Multi-Modal Generation via Commonality-Discrepancy Modeling and Refinement
Zhiyong Ma
Jiahao Chen
Qingyuan Chuai
Zhengping Li
123
0
0
12 Nov 2025
Multi-Level LVLM Guidance for Untrimmed Video Action Recognition
Multi-Level LVLM Guidance for Untrimmed Video Action Recognition
Liyang Peng
Sihan Zhu
Yunjie Guo
185
0
0
24 Aug 2025
DeepInsert: Early Layer Bypass for Efficient and Performant Multimodal Understanding
DeepInsert: Early Layer Bypass for Efficient and Performant Multimodal Understanding
Moulik Choraria
Xinbo Wu
Akhil Bhimaraju
Nitesh Sekhar
Yue Wu
Xu Zhang
Prateek Singhal
Lav Varshney
433
0
0
27 Apr 2025
1
Page 1 of 1