arXiv:2407.13766
Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark
18 July 2024
Tsung-Han Wu
Giscard Biamby
Jerome Quenum
Ritwik Gupta
Joseph E. Gonzalez
Trevor Darrell
David M. Chan
VLM
Papers citing "Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark"
4 / 4 papers shown
Title
LLaVA-ReID: Selective Multi-image Questioner for Interactive Person Re-Identification
Yiding Lu
Mouxing Yang
Dezhong Peng
Peng Hu
Yijie Lin
Xi Peng
39
0
0
14 Apr 2025
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Bin Lin
Yang Ye
Bin Zhu
Jiaxi Cui
Munan Ning
Peng Jin
Li-ming Yuan
VLM
MLLM
182
576
0
16 Nov 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
244
4,186
0
30 Jan 2023
Gender and Racial Bias in Visual Question Answering Datasets
Yusuke Hirota
Yuta Nakashima
Noa Garcia
FaML
116
46
0
17 May 2022