ResearchTrend.AI
GL-RG: Global-Local Representation Granularity for Video Captioning

22 May 2022
Liqi Yan
Qifan Wang
Yiming Cui
Fuli Feng
Xiaojun Quan
X. Zhang
Dongfang Liu

Papers citing "GL-RG: Global-Local Representation Granularity for Video Captioning"

18 / 18 papers shown
CodingHomo: Bootstrapping Deep Homography With Video Coding
Yike Liu
Haipeng Li
Shuaicheng Liu
B. Zeng
40
2
0
16 Apr 2025
HOTVCOM: Generating Buzzworthy Comments for Videos
Yuyan Chen
Yiwen Qian
Songzhou Yan
Jiyuan Jia
Zhixu Li
Yanghua Xiao
Xiaobo Li
Ming Yang
Qingpei Guo
23
7
0
23 Sep 2024
Basketball-SORT: An Association Method for Complex Multi-object Occlusion Problems in Basketball Multi-object Tracking
Qingrui Hu
Atom Scott
Calvin Yeung
Keisuke Fujii
VOT
20
3
0
28 Jun 2024
SSGA-Net: Stepwise Spatial Global-local Aggregation Networks for for Autonomous Driving
Yiming Cui
Cheng Han
Dongfang Liu
14
0
0
29 May 2024
Deep video representation learning: a survey
Elham Ravanbakhsh
Yongqing Liang
J. Ramanujam
Xin Li
34
3
0
10 May 2024
Subject-Oriented Video Captioning
Yunchuan Ma
Chang Teng
Yuankai Qi
Guorong Li
Laiyun Qing
Qi Wu
Qingming Huang
14
0
0
20 Dec 2023
CML-MOTS: Collaborative Multi-task Learning for Multi-Object Tracking and Segmentation
Yiming Cui
Cheng Han
Dongfang Liu
VOT
33
16
0
02 Nov 2023
Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation
Yiming Cui
L. Yang
Hai-ping Yu
13
8
0
23 Jul 2023
The Staged Knowledge Distillation in Video Classification: Harmonizing Student Progress by a Complementary Weakly Supervised Framework
Chao Wang
Zhenghang Tang
11
1
0
11 Jul 2023
FineEHR: Refine Clinical Note Representations to Improve Mortality Prediction
Jun Wu
Xuesong Ye
Chengjie Mou
Weina Dai
49
18
0
24 Apr 2023
SEM-POS: Grammatically and Semantically Correct Video Captioning
Asmar Nadeem
A. Hilton
R. Dawes
Graham A. Thomas
A. Mustafa
11
8
0
26 Mar 2023
FAQ: Feature Aggregated Queries for Transformer-based Video Object Detectors
Yiming Cui
Linjie Yang
ViT
4
15
0
15 Mar 2023
Solve the Puzzle of Instance Segmentation in Videos: A Weakly Supervised Framework with Spatio-Temporal Collaboration
Liqi Yan
Qifan Wang
Siqi Ma
Jingang Wang
Changbin (Brad) Yu
VOS
15
38
0
15 Dec 2022
DFA: Dynamic Feature Aggregation for Efficient Video Object Detection
Yiming Cui
26
8
0
02 Oct 2022
Towards Unbiased Label Distribution Learning for Facial Pose Estimation Using Anisotropic Spherical Gaussian
Zhiwen Cao
Dongfang Liu
Qifan Wang
Victor Y. Chen
CVBM
16
16
0
19 Aug 2022
GeoSegNet: Point Cloud Semantic Segmentation via Geometric Encoder-Decoder Modeling
Chen Chen
Yisen Wang
Honghua Chen
Xu Yan
Da-Dui Ren
Ya Guo
H. Xie
F. Wang
Mingqiang Wei
3DPC
17
12
0
14 Jul 2022
Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network
Bairui Wang
Lin Ma
Wei Zhang
Wenhao Jiang
Jingwen Wang
Wei Liu
60
158
0
27 Aug 2019
Aggregated Residual Transformations for Deep Neural Networks
Saining Xie
Ross B. Girshick
Piotr Dollár
Z. Tu
Kaiming He
261
10,106
0
16 Nov 2016