ResearchTrend.AI
arXiv: 2406.10880
Exploring the Potential of Multimodal LLM with Knowledge-Intensive Multimodal ASR

16 June 2024
Minghan Wang, Yuxia Wang, Thuy-Trang Vu, Ehsan Shareghi, Gholamreza Haffari

Papers citing "Exploring the Potential of Multimodal LLM with Knowledge-Intensive Multimodal ASR"

1 / 1 papers shown
CogAgent: A Visual Language Model for GUI Agents
Wenyi Hong, Weihan Wang, Qingsong Lv, Jiazheng Xu, Wenmeng Yu, ..., Juanzi Li, Bin Xu, Yuxiao Dong, Ming Ding, Jie Tang
MLLM
137
310
0
14 Dec 2023