Multimodal Class-aware Semantic Enhancement Network for Audio-Visual Video Parsing

15 December 2024
Pengcheng Zhao
Jinxing Zhou
Yang Zhao
D. Guo
Yanxiang Chen

Papers citing "Multimodal Class-aware Semantic Enhancement Network for Audio-Visual Video Parsing"

1 / 1 papers shown
Title
Towards Open-Vocabulary Audio-Visual Event Localization
Jinxing Zhou
D. Guo
Ruohao Guo
Yuxin Mao
Jingjing Hu
Yiran Zhong
Xiaojun Chang
M. Wang
VLM
46
4
0
18 Nov 2024