2205.10706
Cited By
GL-RG: Global-Local Representation Granularity for Video Captioning
22 May 2022
Liqi Yan
Qifan Wang
Yiming Cui
Fuli Feng
Xiaojun Quan
X. Zhang
Dongfang Liu
Papers citing
"GL-RG: Global-Local Representation Granularity for Video Captioning"
18 / 18 papers shown
Title
CodingHomo: Bootstrapping Deep Homography With Video Coding
Yike Liu
Haipeng Li
Shuaicheng Liu
B. Zeng
33
2
0
16 Apr 2025
HOTVCOM: Generating Buzzworthy Comments for Videos
Yuyan Chen
Yiwen Qian
Songzhou Yan
Jiyuan Jia
Zhixu Li
Yanghua Xiao
Xiaobo Li
Ming Yang
Qingpei Guo
21
7
0
23 Sep 2024
Basketball-SORT: An Association Method for Complex Multi-object Occlusion Problems in Basketball Multi-object Tracking
Qingrui Hu
Atom Scott
Calvin Yeung
Keisuke Fujii
VOT
18
3
0
28 Jun 2024
SSGA-Net: Stepwise Spatial Global-local Aggregation Networks for Autonomous Driving
Yiming Cui
Cheng Han
Dongfang Liu
14
0
0
29 May 2024
Deep video representation learning: a survey
Elham Ravanbakhsh
Yongqing Liang
J. Ramanujam
Xin Li
34
3
0
10 May 2024
Subject-Oriented Video Captioning
Yunchuan Ma
Chang Teng
Yuankai Qi
Guorong Li
Laiyun Qing
Qi Wu
Qingming Huang
14
0
0
20 Dec 2023
CML-MOTS: Collaborative Multi-task Learning for Multi-Object Tracking and Segmentation
Yiming Cui
Cheng Han
Dongfang Liu
VOT
33
16
0
02 Nov 2023
Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation
Yiming Cui
L. Yang
Hai-ping Yu
11
8
0
23 Jul 2023
The Staged Knowledge Distillation in Video Classification: Harmonizing Student Progress by a Complementary Weakly Supervised Framework
Chao Wang
Zhenghang Tang
11
1
0
11 Jul 2023
FineEHR: Refine Clinical Note Representations to Improve Mortality Prediction
Jun Wu
Xuesong Ye
Chengjie Mou
Weina Dai
40
18
0
24 Apr 2023
SEM-POS: Grammatically and Semantically Correct Video Captioning
Asmar Nadeem
A. Hilton
R. Dawes
Graham A. Thomas
A. Mustafa
11
8
0
26 Mar 2023
FAQ: Feature Aggregated Queries for Transformer-based Video Object Detectors
Yiming Cui
Linjie Yang
ViT
4
15
0
15 Mar 2023
Solve the Puzzle of Instance Segmentation in Videos: A Weakly Supervised Framework with Spatio-Temporal Collaboration
Liqi Yan
Qifan Wang
Siqi Ma
Jingang Wang
Changbin (Brad) Yu
VOS
15
38
0
15 Dec 2022
DFA: Dynamic Feature Aggregation for Efficient Video Object Detection
Yiming Cui
23
8
0
02 Oct 2022
Towards Unbiased Label Distribution Learning for Facial Pose Estimation Using Anisotropic Spherical Gaussian
Zhiwen Cao
Dongfang Liu
Qifan Wang
Victor Y. Chen
CVBM
14
16
0
19 Aug 2022
GeoSegNet: Point Cloud Semantic Segmentation via Geometric Encoder-Decoder Modeling
Chen Chen
Yisen Wang
Honghua Chen
Xu Yan
Da-Dui Ren
Ya Guo
H. Xie
F. Wang
Mingqiang Wei
3DPC
15
12
0
14 Jul 2022
Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network
Bairui Wang
Lin Ma
Wei Zhang
Wenhao Jiang
Jingwen Wang
Wei Liu
60
158
0
27 Aug 2019
Aggregated Residual Transformations for Deep Neural Networks
Saining Xie
Ross B. Girshick
Piotr Dollár
Z. Tu
Kaiming He
261
10,106
0
16 Nov 2016