VHASR: A Multimodal Speech Recognition System With Vision Hotwords

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
1 October 2024
Jiliang Hu
Zuchao Li
Ping Wang
Haojun Ai
Lefei Zhang
Hai Zhao

Papers citing "VHASR: A Multimodal Speech Recognition System With Vision Hotwords"

1 / 1 papers shown
Locate-and-Focus: Enhancing Terminology Translation in Speech Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Suhang Wu
Jialong Tang
Chengyi Yang
Pei Zhang
Baosong Yang
Junhui Li
Junfeng Yao
Min Zhang
Jinsong Su
24 Jul 2025