ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2410.02740
  4. Cited By
Revisit Large-Scale Image-Caption Data in Pre-training Multimodal
  Foundation Models

Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models

3 October 2024
Zhengfeng Lai
Vasileios Saveris
C. L. P. Chen
Hong-You Chen
Haotian Zhang
Bowen Zhang
Juan Lao Tebar
Wenze Hu
Zhe Gan
Peter Grasch
Meng Cao
Yinfei Yang
    VLM
ArXivPDFHTML

Papers citing "Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models"

1 / 1 papers shown
Title
MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning
MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning
Haotian Zhang
Mingfei Gao
Zhe Gan
Philipp Dufter
Nina Wenzel
...
Haoxuan You
Zirui Wang
Afshin Dehghan
Peter Grasch
Yinfei Yang
VLM
MLLM
36
32
1
30 Sep 2024
1