Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2504.05299
Cited By
SmolVLM: Redefining small and efficient multimodal models
7 April 2025
Andres Marafioti
Orr Zohar
Miquel Farré
Merve Noyan
Elie Bakouch
Pedro Cuenca
Cyril Zakka
Loubna Ben Allal
Anton Lozhkov
Nouamane Tazi
Vaibhav Srivastav
Joshua Lochner
Hugo Larcher
Mathieu Morlon
Lewis Tunstall
Leandro von Werra
Thomas Wolf
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SmolVLM: Redefining small and efficient multimodal models"
4 / 4 papers shown
Title
VideoHallu: Evaluating and Mitigating Multi-modal Hallucinations for Synthetic Videos
Zongxia Li
Xiyang Wu
Yubin Qin
Guangyao Shi
Hongyang Du
Dinesh Manocha
Tianyi Zhou
Jordan Boyd-Graber
MLLM
41
0
0
02 May 2025
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models
Jinguo Zhu
Weiyun Wang
Zhe Chen
Z. Liu
Shenglong Ye
...
D. Lin
Yu Qiao
Jifeng Dai
Wenhai Wang
W. Wang
MLLM
VLM
63
6
1
14 Apr 2025
One Pic is All it Takes: Poisoning Visual Document Retrieval Augmented Generation with a Single Image
Ezzeldin Shereen
Dan Ristea
Burak Hasircioglu
Shae McFadden
V. Mavroudis
Chris Hicks
42
0
0
02 Apr 2025
ComicsPAP: understanding comic strips by picking the correct panel
Emanuele Vivoli
Artemis LLabres
Mohamed Ali Soubgui
Marco Bertini
Ernest Valveny Llobet
Dimosthenis Karatzas
52
0
0
11 Mar 2025
1