Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2211.15076
Cited By
v1
v2 (latest)
Refined Semantic Enhancement towards Frequency Diffusion for Video Captioning
AAAI Conference on Artificial Intelligence (AAAI), 2022
28 November 2022
Zhuo Zhou
Zipeng Li
Shuqin Chen
Kui Jiang
Chen Chen
Mang Ye
DiffM
VGen
Re-assign community
ArXiv (abs)
PDF
HTML
Github (7★)
Papers citing
"Refined Semantic Enhancement towards Frequency Diffusion for Video Captioning"
6 / 6 papers shown
Title
OAD-Promoter: Enhancing Zero-shot VQA using Large Language Models with Object Attribute Description
Quanxing Xu
Ling Zhou
Feifei Zhang
Jinyu Tian
Rubing Huang
VLM
120
0
0
15 Nov 2025
SmokeBench: A Real-World Dataset for Surveillance Image Desmoking in Early-Stage Fire Scenes
Wenzhuo Jin
Q. Yang
Xianhao Wu
Hongming Chen
Pengpeng Li
Xiang-Zhong Chen
52
0
0
16 Sep 2025
CPKD: Clinical Prior Knowledge-Constrained Diffusion Models for Surgical Phase Recognition in Endoscopic Submucosal Dissection
Xiangning Zhang
Jinnan Chen
Qingwei Zhang
Yaqi Wang
Shilun Cai
XiaoBo Li
Dahong Qian
MedIm
140
0
0
04 Jul 2025
SPECTRUM: Semantic Processing and Emotion-informed video-Captioning Through Retrieval and Understanding Modalities
Ehsan Faghihi
Mohammedreza Zarenejad
Ali-Asghar Beheshti Shirazi
215
1
0
04 Nov 2024
Diffusion Action Segmentation
IEEE International Conference on Computer Vision (ICCV), 2023
Dao-jun Liu
Qiyue Li
A. Dinh
Ting Jiang
Mubarak Shah
Chan Xu
VGen
DiffM
237
97
0
31 Mar 2023
Implicit and Explicit Commonsense for Multi-sentence Video Captioning
Computer Vision and Image Understanding (CVIU), 2023
Shih-Han Chou
James J. Little
Leonid Sigal
138
3
0
14 Mar 2023
1