ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2405.17247
  4. Cited By
An Introduction to Vision-Language Modeling

An Introduction to Vision-Language Modeling

27 May 2024
Florian Bordes
Richard Yuanzhe Pang
Anurag Ajay
Alexander C. Li
Adrien Bardes
Suzanne Petryk
Oscar Manas
Zhiqiu Lin
Anas Mahmoud
Bargav Jayaraman
Mark Ibrahim
Melissa Hall
Yunyang Xiong
Jonathan Lebensold
Candace Ross
Srihari Jayakumar
Chuan Guo
Diane Bouchacourt
Haider Al-Tahan
Karthik Padthe
Vasu Sharma
Huijuan Xu
Xiaoqing Ellen Tan
Megan Richards
Samuel Lavoie
Pietro Astolfi
Reyhane Askari Hemmat
Jun Chen
Kushal Tirumala
Rim Assouel
Mazda Moayeri
Arjang Talattof
Kamalika Chaudhuri
Zechun Liu
Xilun Chen
Q. Garrido
Karen Ullrich
Aishwarya Agrawal
Kate Saenko
Asli Celikyilmaz
Vikas Chandra
    VLM
ArXiv (abs)PDFHTML

Papers citing "An Introduction to Vision-Language Modeling"

6 / 56 papers shown
Title
General Vision Encoder Features as Guidance in Medical Image
  Registration
General Vision Encoder Features as Guidance in Medical Image Registration
Fryderyk Kogl
Anna Reithmeir
Vasiliki Sideri-Lampretsa
Ines P. Machado
R. Braren
Daniel Rückert
Julia A. Schnabel
Veronika A. Zimmer
MedIm
73
2
0
18 Jul 2024
MUMU: Bootstrapping Multimodal Image Generation from Text-to-Image Data
MUMU: Bootstrapping Multimodal Image Generation from Text-to-Image Data
William Berman
A. Peysakhovich
91
4
0
26 Jun 2024
Generative AI Systems: A Systems-based Perspective on Generative AI
Generative AI Systems: A Systems-based Perspective on Generative AI
Jakub M. Tomczak
95
1
0
25 Jun 2024
Multimodal Structured Generation: CVPR's 2nd MMFM Challenge Technical Report
Multimodal Structured Generation: CVPR's 2nd MMFM Challenge Technical Report
Franz Louis Cesista
VGen
133
6
0
17 Jun 2024
What is the Visual Cognition Gap between Humans and Multimodal LLMs?
What is the Visual Cognition Gap between Humans and Multimodal LLMs?
Xu Cao
Bolin Lai
Wenqian Ye
Yunsheng Ma
Joerg Heintz
Jintai Chen
Jianguo Cao
James M. Rehg
104
11
0
14 Jun 2024
The Vector Grounding Problem
The Vector Grounding Problem
Dimitri Coelho Mollo
Raphael Milliere
146
28
0
04 Apr 2023
Previous
12