ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.15076
  4. Cited By
Refined Semantic Enhancement towards Frequency Diffusion for Video
  Captioning
v1v2 (latest)

Refined Semantic Enhancement towards Frequency Diffusion for Video Captioning

AAAI Conference on Artificial Intelligence (AAAI), 2022
28 November 2022
Zhuo Zhou
Zipeng Li
Shuqin Chen
Kui Jiang
Chen Chen
Mang Ye
    DiffMVGen
ArXiv (abs)PDFHTMLGithub (7★)

Papers citing "Refined Semantic Enhancement towards Frequency Diffusion for Video Captioning"

6 / 6 papers shown
Title
OAD-Promoter: Enhancing Zero-shot VQA using Large Language Models with Object Attribute Description
OAD-Promoter: Enhancing Zero-shot VQA using Large Language Models with Object Attribute Description
Quanxing Xu
Ling Zhou
Feifei Zhang
Jinyu Tian
Rubing Huang
VLM
120
0
0
15 Nov 2025
SmokeBench: A Real-World Dataset for Surveillance Image Desmoking in Early-Stage Fire Scenes
SmokeBench: A Real-World Dataset for Surveillance Image Desmoking in Early-Stage Fire Scenes
Wenzhuo Jin
Q. Yang
Xianhao Wu
Hongming Chen
Pengpeng Li
Xiang-Zhong Chen
52
0
0
16 Sep 2025
CPKD: Clinical Prior Knowledge-Constrained Diffusion Models for Surgical Phase Recognition in Endoscopic Submucosal Dissection
CPKD: Clinical Prior Knowledge-Constrained Diffusion Models for Surgical Phase Recognition in Endoscopic Submucosal Dissection
Xiangning Zhang
Jinnan Chen
Qingwei Zhang
Yaqi Wang
Shilun Cai
XiaoBo Li
Dahong Qian
MedIm
140
0
0
04 Jul 2025
SPECTRUM: Semantic Processing and Emotion-informed video-Captioning
  Through Retrieval and Understanding Modalities
SPECTRUM: Semantic Processing and Emotion-informed video-Captioning Through Retrieval and Understanding Modalities
Ehsan Faghihi
Mohammedreza Zarenejad
Ali-Asghar Beheshti Shirazi
215
1
0
04 Nov 2024
Diffusion Action Segmentation
Diffusion Action SegmentationIEEE International Conference on Computer Vision (ICCV), 2023
Dao-jun Liu
Qiyue Li
A. Dinh
Ting Jiang
Mubarak Shah
Chan Xu
VGenDiffM
237
97
0
31 Mar 2023
Implicit and Explicit Commonsense for Multi-sentence Video Captioning
Implicit and Explicit Commonsense for Multi-sentence Video CaptioningComputer Vision and Image Understanding (CVIU), 2023
Shih-Han Chou
James J. Little
Leonid Sigal
138
3
0
14 Mar 2023
1