ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2412.16771
  4. Cited By
SilVar: Speech Driven Multimodal Model for Reasoning Visual Question
  Answering and Object Localization

SilVar: Speech Driven Multimodal Model for Reasoning Visual Question Answering and Object Localization

21 December 2024
Tan-Hanh Pham
Hoang-Nam Le
Phu-Vinh Nguyen
Chris Ngo
Truong Son-Hy
    AuLLM
    LRM
ArXivPDFHTML

Papers citing "SilVar: Speech Driven Multimodal Model for Reasoning Visual Question Answering and Object Localization"

1 / 1 papers shown
Title
SilVar-Med: A Speech-Driven Visual Language Model for Explainable Abnormality Detection in Medical Imaging
SilVar-Med: A Speech-Driven Visual Language Model for Explainable Abnormality Detection in Medical Imaging
Tan-Hanh Pham
Chris Ngo
Trong-Duong Bui
Minh Luu Quang
Tan-Huong Pham
Truong Son-Hy
27
0
0
14 Apr 2025
1