Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2505.23308
Cited By
Spoken question answering for visual queries
29 May 2025
Nimrod Shabtay
Zvi Kons
Avihu Dekel
Hagai Aronowitz
R. Hoory
Assaf Arbelle
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Spoken question answering for visual queries"
2 / 2 papers shown
Title
Granite Vision: a lightweight, open-source multimodal model for enterprise Intelligence
Granite Vision Team
Leonid Karlinsky
Assaf Arbelle
Abraham Daniels
A. Nassar
...
Sriram Raghavan
Tanveer Syeda-Mahmood
Peter W. J. Staar
Tal Drory
Rogerio Feris
VLM
AI4TS
188
2
0
14 Feb 2025
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching
Yushen Chen
Zhikang Niu
Ziyang Ma
Keqi Deng
Chunhui Wang
Jian Zhao
Kai Yu
Xie Chen
135
92
0
09 Oct 2024
1