Boosting Audio Visual Question Answering via Key Semantic-Aware Cues


30 July 2024
Guangyao Li
Henghui Du
Di Hu

Papers citing "Boosting Audio Visual Question Answering via Key Semantic-Aware Cues"

7 / 7 papers shown
Multi-Modal Scene Graph with Kolmogorov-Arnold Experts for Audio-Visual Question Answering
Z. Fu
Changsheng Lv
Mengshi Qi
Huadong Ma
160
0
0
28 Nov 2025
AV-Master: Dual-Path Comprehensive Perception Makes Better Audio-Visual Question Answering
Jiayu Zhang
Qilang Ye
Shuo Ye
Xun Lin
Zihan Song
Zitong Yu
112
0
0
21 Oct 2025
Teacher-Guided Pseudo Supervision and Cross-Modal Alignment for Audio-Visual Video Parsing
Yaru Chen
Ruohao Guo
Liting Gao
Yang Xiang
Qingyu Luo
Zhenbo Li
Wenwu Wang
152
0
0
17 Sep 2025
Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation
Kaining Ying
Henghui Ding
Guangquan Jie
Yu Jiang
VOS
321
5
0
30 Jul 2025
PreFM: Online Audio-Visual Event Parsing via Predictive Future Modeling
X. Yu
Yan Fang
Xiaojie Jin
Yao Zhao
Yunchao Wei
283
1
0
29 May 2025
Crab: A Unified Audio-Visual Scene Understanding Model with Explicit Cooperation
Computer Vision and Pattern Recognition (CVPR), 2025
Henghui Du
Guangyao Li
Chang Zhou
Chunjie Zhang
Alan Zhao
D. Hu
260
11
0
17 Mar 2025
Question-Aware Gaussian Experts for Audio-Visual Question Answering
Computer Vision and Pattern Recognition (CVPR), 2025
Hongyeob Kim
Inyoung Jung
Dayoon Suh
Youjia Zhang
Sangmin Lee
Sungeun Hong
394
5
0
06 Mar 2025