Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2006.14582
Cited By
SmallBigNet: Integrating Core and Contextual Views for Video Classification
25 June 2020
Xianhang Li
Yali Wang
Zhipeng Zhou
Yu Qiao
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Github (41★)
Papers citing
"SmallBigNet: Integrating Core and Contextual Views for Video Classification"
43 / 43 papers shown
Title
Uni-AdaFocus: Spatial-temporal Dynamic Computation for Video Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Yulin Wang
Haoji Zhang
Yang Yue
Shiji Song
Chao Deng
Junlan Feng
Gao Huang
201
11
0
15 Dec 2024
PosMLP-Video: Spatial and Temporal Relative Position Encoding for Efficient Video Recognition
Y. Hao
Diansong Zhou
Zhicai Wang
Chong-Wah Ngo
Meng Wang
ViT
154
9
0
03 Jul 2024
Video Recognition in Portrait Mode
Mingfei Han
Linjie Yang
Xiaojie Jin
Jiashi Feng
Xiaojun Chang
Heng Wang
137
5
0
21 Dec 2023
ConViViT -- A Deep Neural Network Combining Convolutions and Factorized Self-Attention for Human Activity Recognition
IEEE International Workshop on Multimedia Signal Processing (MMSP), 2023
Rachid Reda Dokkar
F. Chaieb
Hassen Drira
Arezki Aberkane
ViT
146
2
0
22 Oct 2023
Temporally-Adaptive Models for Efficient Video Understanding
Ziyuan Huang
Shiwei Zhang
Liang Pan
Zhiwu Qing
Yingya Zhang
Ziwei Liu
Marcelo H. Ang
133
16
0
10 Aug 2023
What Can Simple Arithmetic Operations Do for Temporal Modeling?
IEEE International Conference on Computer Vision (ICCV), 2023
Wenhao Wu
Yuxin Song
Zhun Sun
Jingdong Wang
Chang Xu
Wanli Ouyang
149
15
0
18 Jul 2023
AZTR: Aerial Video Action Recognition with Auto Zoom and Temporal Reasoning
IEEE International Conference on Robotics and Automation (ICRA), 2023
Xijun Wang
Ruiqi Xian
Tianrui Guan
Celso M. de Melo
Stephen M. Nogar
Aniket Bera
Tianyi Zhou
112
14
0
02 Mar 2023
Look More but Care Less in Video Recognition
Neural Information Processing Systems (NeurIPS), 2022
Yitian Zhang
Yue Bai
Haiquan Wang
Yi Xu
Yun Fu
130
12
0
18 Nov 2022
Dynamic Temporal Filtering in Video Models
European Conference on Computer Vision (ECCV), 2022
Fuchen Long
Zhaofan Qiu
Yingwei Pan
Ting Yao
Chong-Wah Ngo
Tao Mei
AI4TS
203
23
0
15 Nov 2022
DCVQE: A Hierarchical Transformer for Video Quality Assessment
Asian Conference on Computer Vision (ACCV), 2022
Zu-Hua Li
Lei Yang
ViT
120
3
0
10 Oct 2022
An Overview of Violence Detection Techniques: Current Challenges and Future Directions
Artificial Intelligence Review (Artif Intell Rev), 2022
N. Mumtaz
N. Ejaz
Shabana Habib
Syed Muhammad Mohsin
Prayag Tiwari
Shahab S. Band
Neeraj Kumar
138
32
0
21 Sep 2022
Long-term Leap Attention, Short-term Periodic Shift for Video Classification
ACM Multimedia (ACM MM), 2022
Huatian Zhang
Lechao Cheng
Y. Hao
Chong-Wah Ngo
ViT
150
10
0
12 Jul 2022
Bi-Calibration Networks for Weakly-Supervised Video Representation Learning
International Journal of Computer Vision (IJCV), 2022
Fuchen Long
Ting Yao
Zhaofan Qiu
Xinmei Tian
Jiebo Luo
Tao Mei
126
8
0
21 Jun 2022
Stand-Alone Inter-Frame Attention in Video Models
Computer Vision and Pattern Recognition (CVPR), 2022
Fuchen Long
Zhaofan Qiu
Yingwei Pan
Ting Yao
Jiebo Luo
Tao Mei
ViT
97
56
0
14 Jun 2022
MLP-3D: A MLP-like 3D Architecture with Grouped Time Mixing
Computer Vision and Pattern Recognition (CVPR), 2022
Zhaofan Qiu
Ting Yao
Chong-Wah Ngo
Tao Mei
ViT
131
17
0
13 Jun 2022
In Defense of Image Pre-Training for Spatiotemporal Recognition
European Conference on Computer Vision (ECCV), 2022
Xianhang Li
Huiyu Wang
Chen Wei
Jieru Mei
Alan Yuille
Yuyin Zhou
Cihang Xie
111
1
0
03 May 2022
Attention in Attention: Modeling Context Correlation for Efficient Video Classification
Y. Hao
Shuo Wang
P. Cao
Xinjian Gao
Tong Xu
Jinmeng Wu
Xiangnan He
137
48
0
20 Apr 2022
Long Movie Clip Classification with State-Space Video Models
European Conference on Computer Vision (ECCV), 2022
Md. Mohaiminul Islam
Gedas Bertasius
VLM
235
132
0
04 Apr 2022
Group Contextualization for Video Recognition
Computer Vision and Pattern Recognition (CVPR), 2022
Y. Hao
Haotong Zhang
Chong-Wah Ngo
Xiangnan He
80
31
0
18 Mar 2022
Motion-driven Visual Tempo Learning for Video-based Action Recognition
IEEE Transactions on Image Processing (IEEE TIP), 2022
Yuanzhong Liu
Junsong Yuan
Zhigang Tu
131
70
0
24 Feb 2022
UniFormer: Unifying Convolution and Self-attention for Visual Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Kunchang Li
Yali Wang
Junhao Zhang
Shiyang Feng
Guanglu Song
Yu Liu
Hongsheng Li
Yu Qiao
ViT
367
484
0
24 Jan 2022
Action Keypoint Network for Efficient Video Recognition
IEEE Transactions on Image Processing (IEEE TIP), 2022
Xu Chen
Yahong Han
Xiaohan Wang
Yifang Sun
Yi Yang
3DPC
175
8
0
17 Jan 2022
UniFormer: Unified Transformer for Efficient Spatiotemporal Representation Learning
International Conference on Learning Representations (ICLR), 2022
Kunchang Li
Yali Wang
Shiyang Feng
Guanglu Song
Yu Liu
Hongsheng Li
Yu Qiao
ViT
319
305
0
12 Jan 2022
Temporal Transformer Networks with Self-Supervision for Action Recognition
Yongkang Zhang
Jun Li
Guoming Wu
Hanjie Zhang
Zhiping Shi
Zhaoxun Liu
Zizhang Wu
ViT
104
7
0
14 Dec 2021
Auto-X3D: Ultra-Efficient Video Understanding via Finer-Grained Neural Architecture Search
Lezhi Li
Xinyu Gong
Junru Wu
Humphrey Shi
Zhicheng Yan
Zhangyang Wang
VGen
125
1
0
09 Dec 2021
STSM: Spatio-Temporal Shift Module for Efficient Action Recognition
Zhaoqilin Yang
Gaoyun An
128
6
0
05 Dec 2021
MorphMLP: An Efficient MLP-Like Backbone for Spatial-Temporal Representation Learning
David Junhao Zhang
Kunchang Li
Yali Wang
Yuxiang Chen
Shashwat Chandra
Yu Qiao
Luoqi Liu
Mike Zheng Shou
AI4TS
163
33
0
24 Nov 2021
ST-ABN: Visual Explanation Taking into Account Spatio-temporal Information for Video Recognition
Masahiro Mitsuhara
Tsubasa Hirakawa
Takayoshi Yamashita
H. Fujiyoshi
116
1
0
29 Oct 2021
Temporal-attentive Covariance Pooling Networks for Video Recognition
Zilin Gao
Qilong Wang
Bingbing Zhang
Q. Hu
P. Li
172
28
0
27 Oct 2021
The Dawn of Quantum Natural Language Processing
R. Sipio
Jia-Hong Huang
Samuel Yen-Chi Chen
Stefano Mangini
Marcel Worring
197
101
0
13 Oct 2021
TAda! Temporally-Adaptive Convolutions for Video Understanding
International Conference on Learning Representations (ICLR), 2021
Ziyuan Huang
Shiwei Zhang
Liang Pan
Zhiwu Qing
Mingqian Tang
Ziwei Liu
M. Ang
293
64
0
12 Oct 2021
Video Is Graph: Structured Graph Module for Video Action Recognition
Rongjie Li
Xiaojun Wu
Tianyang Xu
226
14
0
12 Oct 2021
ActionCLIP: A New Paradigm for Video Action Recognition
Mengmeng Wang
Jiazheng Xing
Yong Liu
VLM
324
444
0
17 Sep 2021
Searching for Two-Stream Models in Multivariate Space for Video Recognition
IEEE International Conference on Computer Vision (ICCV), 2021
Xinyu Gong
Heng Wang
Zheng Shou
Matt Feiszli
Zhangyang Wang
Zhicheng Yan
117
9
0
30 Aug 2021
CT-Net: Channel Tensorization Network for Video Classification
International Conference on Learning Representations (ICLR), 2021
Kunchang Li
Xianhang Li
Yali Wang
Jun Wang
Yu Qiao
ViT
97
64
0
03 Jun 2021
TSI: Temporal Saliency Integration for Video Action Recognition
Haisheng Su
Kunchang Li
Jinyuan Feng
Dongliang Wang
Weihao Gan
Wei Wu
Yu Qiao
103
4
0
02 Jun 2021
Busy-Quiet Video Disentangling for Video Classification
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2021
Guoxi Huang
A. Bors
206
8
0
29 Mar 2021
Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition
IEEE International Conference on Computer Vision (ICCV), 2021
Heeseung Kwon
Manjin Kim
Suha Kwak
Minsu Cho
TTA
145
47
0
14 Feb 2021
TDN: Temporal Difference Networks for Efficient Action Recognition
Computer Vision and Pattern Recognition (CVPR), 2020
Limin Wang
Zhan Tong
Bin Ji
Gangshan Wu
283
448
0
18 Dec 2020
Discovering Dynamic Salient Regions for Spatio-Temporal Graph Neural Networks
Neural Information Processing Systems (NeurIPS), 2020
Iulia Duta
Andrei Liviu Nicolicioiu
Marius Leordeanu
194
7
0
17 Sep 2020
Comparison of Spatiotemporal Networks for Learning Video Related Tasks
Logan Courtney
R. Sreenivas
68
1
0
15 Sep 2020
Region-based Non-local Operation for Video Classification
International Conference on Pattern Recognition (ICPR), 2020
Guoxi Huang
A. Bors
298
12
0
17 Jul 2020
Omni-Scale CNNs: a simple and effective kernel size configuration for time series classification
International Conference on Learning Representations (ICLR), 2020
Wensi Tang
Guodong Long
Lu Liu
Wanrong Zhu
Michael Blumenstein
Jing Jiang
AI4TS
230
130
0
24 Feb 2020
1