SmallBigNet: Integrating Core and Contextual Views for Video
  Classification

SmallBigNet: Integrating Core and Contextual Views for Video Classification

    ViT

Papers citing "SmallBigNet: Integrating Core and Contextual Views for Video Classification"

43 / 43 papers shown
Title
Video Recognition in Portrait Mode
Video Recognition in Portrait Mode
Mingfei Han
Linjie Yang
Xiaojie Jin
Jiashi Feng
Xiaojun Chang
Heng Wang
137
5
0
21 Dec 2023
What Can Simple Arithmetic Operations Do for Temporal Modeling?
What Can Simple Arithmetic Operations Do for Temporal Modeling?IEEE International Conference on Computer Vision (ICCV), 2023
Wenhao Wu
Yuxin Song
Zhun Sun
Jingdong Wang
Chang Xu
Wanli Ouyang
149
14
0
18 Jul 2023
AZTR: Aerial Video Action Recognition with Auto Zoom and Temporal
  Reasoning
AZTR: Aerial Video Action Recognition with Auto Zoom and Temporal ReasoningIEEE International Conference on Robotics and Automation (ICRA), 2023
108
12
0
02 Mar 2023
Look More but Care Less in Video Recognition
Look More but Care Less in Video RecognitionNeural Information Processing Systems (NeurIPS), 2022
130
12
0
18 Nov 2022
Dynamic Temporal Filtering in Video Models
Dynamic Temporal Filtering in Video ModelsEuropean Conference on Computer Vision (ECCV), 2022
Fuchen Long
Zhaofan Qiu
Yingwei Pan
Ting Yao
Chong-Wah Ngo
Tao Mei
187
23
0
15 Nov 2022
DCVQE: A Hierarchical Transformer for Video Quality Assessment
DCVQE: A Hierarchical Transformer for Video Quality AssessmentAsian Conference on Computer Vision (ACCV), 2022
120
3
0
10 Oct 2022
Stand-Alone Inter-Frame Attention in Video Models
Stand-Alone Inter-Frame Attention in Video ModelsComputer Vision and Pattern Recognition (CVPR), 2022
97
56
0
14 Jun 2022
MLP-3D: A MLP-like 3D Architecture with Grouped Time Mixing
MLP-3D: A MLP-like 3D Architecture with Grouped Time MixingComputer Vision and Pattern Recognition (CVPR), 2022
Zhaofan Qiu
Ting Yao
Chong-Wah Ngo
Tao Mei
131
17
0
13 Jun 2022
In Defense of Image Pre-Training for Spatiotemporal Recognition
In Defense of Image Pre-Training for Spatiotemporal RecognitionEuropean Conference on Computer Vision (ECCV), 2022
111
1
0
03 May 2022
Long Movie Clip Classification with State-Space Video Models
Long Movie Clip Classification with State-Space Video ModelsEuropean Conference on Computer Vision (ECCV), 2022
Md. Mohaiminul Islam
Gedas Bertasius
235
132
0
04 Apr 2022
Group Contextualization for Video Recognition
Group Contextualization for Video RecognitionComputer Vision and Pattern Recognition (CVPR), 2022
80
31
0
18 Mar 2022
Motion-driven Visual Tempo Learning for Video-based Action Recognition
Motion-driven Visual Tempo Learning for Video-based Action RecognitionIEEE Transactions on Image Processing (IEEE TIP), 2022
131
70
0
24 Feb 2022
UniFormer: Unifying Convolution and Self-attention for Visual
  Recognition
UniFormer: Unifying Convolution and Self-attention for Visual RecognitionIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
363
484
0
24 Jan 2022
Action Keypoint Network for Efficient Video Recognition
Action Keypoint Network for Efficient Video RecognitionIEEE Transactions on Image Processing (IEEE TIP), 2022
175
8
0
17 Jan 2022
CT-Net: Channel Tensorization Network for Video Classification
CT-Net: Channel Tensorization Network for Video ClassificationInternational Conference on Learning Representations (ICLR), 2021
97
64
0
03 Jun 2021
Busy-Quiet Video Disentangling for Video Classification
Busy-Quiet Video Disentangling for Video ClassificationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2021
206
8
0
29 Mar 2021
TDN: Temporal Difference Networks for Efficient Action Recognition
TDN: Temporal Difference Networks for Efficient Action RecognitionComputer Vision and Pattern Recognition (CVPR), 2020
283
447
0
18 Dec 2020
Region-based Non-local Operation for Video Classification
Region-based Non-local Operation for Video ClassificationInternational Conference on Pattern Recognition (ICPR), 2020
298
11
0
17 Jul 2020

We use cookies and other tracking technologies to improve your browsing experience on our website, to show you personalized content and targeted ads, to analyze our website traffic, and to understand where our visitors are coming from. See our policy.