Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2412.16771
Cited By
SilVar: Speech Driven Multimodal Model for Reasoning Visual Question Answering and Object Localization
21 December 2024
Tan-Hanh Pham
Hoang-Nam Le
Phu-Vinh Nguyen
Chris Ngo
Truong Son-Hy
AuLLM
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SilVar: Speech Driven Multimodal Model for Reasoning Visual Question Answering and Object Localization"
1 / 1 papers shown
Title
SilVar-Med: A Speech-Driven Visual Language Model for Explainable Abnormality Detection in Medical Imaging
Tan-Hanh Pham
Chris Ngo
Trong-Duong Bui
Minh Luu Quang
Tan-Huong Pham
Truong Son-Hy
27
0
0
14 Apr 2025
1