2410.00822
VHASR: A Multimodal Speech Recognition System With Vision Hotwords
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
1 October 2024
Jiliang Hu
Zuchao Li
Ping Wang
Haojun Ai
Lefei Zhang
Hai Zhao
Papers citing
"VHASR: A Multimodal Speech Recognition System With Vision Hotwords"
1 / 1 papers shown
Locate-and-Focus: Enhancing Terminology Translation in Speech Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Suhang Wu
Jialong Tang
Chengyi Yang
Pei Zhang
Baosong Yang
Junhui Li
Junfeng Yao
Min Zhang
Jinsong Su
24 Jul 2025