v1v2 (latest)

Semantic Grouping Network for Video Captioning

AAAI Conference on Artificial Intelligence (AAAI), 2021

1 February 2021

Papers citing "Semantic Grouping Network for Video Captioning"

29 / 29 papers shown

SGCap: Decoding Semantic Group for Zero-shot Video Captioning

124

02 Aug 2025

Towards Efficient Partially Relevant Video Retrieval with Active Moment DiscoveringIEEE transactions on multimedia (TMM), 2025

199

15 Apr 2025

The Devil is in the Distributions: Explicit Modeling of Scene Content is Key in Zero-Shot Video Captioning

250

31 Mar 2025

Exploring Temporal Event Cues for Dense Video Captioning in Cyclic Co-learningAAAI Conference on Artificial Intelligence (AAAI), 2024

407

16 Dec 2024

Pseudo-labeling with Keyword Refining for Few-Supervised Video CaptioningPattern Recognition (Pattern Recogn.), 2024

214

06 Nov 2024

SPECTRUM: Semantic Processing and Emotion-informed video-Captioning Through Retrieval and Understanding Modalities

Ehsan Faghihi

Mohammedreza Zarenejad

Ali-Asghar Beheshti Shirazi

271

04 Nov 2024

MCF-VC: Mitigate Catastrophic Forgetting in Class-Incremental Learning for Multimodal Video Captioning

223

27 Feb 2024

SnapCap: Efficient Snapshot Compressive Video Captioning

402

10 Jan 2024

Set Prediction Guided by Semantic Concepts for Diverse Video Captioning

Bing Li

167

25 Dec 2023

Subject-Oriented Video Captioning

Guorong Li

Qi Wu

192

20 Dec 2023

Towards Surveillance Video-and-Language Understanding: New Dataset, Baselines, and ChallengesComputer Vision and Pattern Recognition (CVPR), 2023

308

25 Sep 2023

Accurate and Fast Compressed Video CaptioningIEEE International Conference on Computer Vision (ICCV), 2023

Yaojie Shen

Kai Xu

189

22 Sep 2023

Collaborative Three-Stream Transformers for Video CaptioningComputer Vision and Image Understanding (CVIU), 2023

196

18 Sep 2023

Video Captioning with Aggregated Features Based on Dual Graphs and Gated Fusion

Yutao Jin

Yinan Han

Jing Wang

161

13 Aug 2023

A Review of Deep Learning for Video CaptioningIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023

...

Fatih Porikli

221

22 Apr 2023

SEM-POS: Grammatically and Semantically Correct Video Captioning

200

26 Mar 2023

Text with Knowledge Graph Augmented Transformer for Video CaptioningComputer Vision and Pattern Recognition (CVPR), 2023

Yufei Wang

211

22 Mar 2023

Refined Semantic Enhancement towards Frequency Diffusion for Video CaptioningAAAI Conference on Artificial Intelligence (AAAI), 2022

213

28 Nov 2022

Aligning Source Visual and Target Language Domains for Unpaired Video CaptioningIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021

242

22 Nov 2022

Visual Commonsense-aware Representation Network for Video CaptioningIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022

Pengpeng Zeng

Haonan Zhang

Lianli Gao

Xiangpeng Li

Jin Qian

Hengtao Shen

157

17 Nov 2022

Thinking Hallucination for Video CaptioningAsian Conference on Computer Vision (ACCV), 2022

Nasib Ullah

Partha Pratim Mohanta

VLM

160

28 Sep 2022

GL-RG: Global-Local Representation Granularity for Video CaptioningInternational Joint Conference on Artificial Intelligence (IJCAI), 2022

240

22 May 2022

Support-set based Multi-modal Representation Enhancement for Video CaptioningIEEE International Conference on Multimedia and Expo (ICME), 2022

Xiaoya Chen

Jingkuan Song

Pengpeng Zeng

Lianli Gao

Hengtao Shen

134

19 May 2022

Video Captioning: a comparative review of where we are and which could be the routeComputer Vision and Image Understanding (CVIU), 2022

Daniela Moctezuma

Tania A. Ramirez-delreal

Guillermo Ruiz

Othón González-Chávez

208

12 Apr 2022

Hierarchical Modular Network for Video Captioning

Hanhua Ye

Guorong Li

Yuankai Qi

Shuhui Wang

Qingming Huang

Ming-Hsuan Yang

227

24 Nov 2021

Visual-aware Attention Dual-stream Decoder for Video Captioning

170

16 Oct 2021

O2NA: An Object-Oriented Non-Autoregressive Approach for Controllable Video CaptioningFindings (Findings), 2021

Xuancheng Ren

242

05 Aug 2021

Boosting Video Captioning with Dynamic Loss Network

Nasib Ullah

Partha Pratim Mohanta

205

25 Jul 2021

Rethinking and Improving Natural Language Generation with Layer-Wise Multi-View Decoding

Xuancheng Ren

482

16 May 2020