2406.10880
Cited By
Exploring the Potential of Multimodal LLM with Knowledge-Intensive Multimodal ASR
16 June 2024
Minghan Wang, Yuxia Wang, Thuy-Trang Vu, Ehsan Shareghi, Gholamreza Haffari
Papers citing "Exploring the Potential of Multimodal LLM with Knowledge-Intensive Multimodal ASR"
1 / 1 papers shown
Title
CogAgent: A Visual Language Model for GUI Agents
Wenyi Hong, Weihan Wang, Qingsong Lv, Jiazheng Xu, Wenmeng Yu, ..., Juanzi Li, Bin Xu, Yuxiao Dong, Ming Ding, Jie Tang
MLLM
137
310
0
14 Dec 2023
1