ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2006.14582
  4. Cited By
SmallBigNet: Integrating Core and Contextual Views for Video
  Classification

SmallBigNet: Integrating Core and Contextual Views for Video Classification

25 June 2020
Xianhang Li
Yali Wang
Zhipeng Zhou
Yu Qiao
    ViT
ArXiv (abs)PDFHTMLGithub (41★)

Papers citing "SmallBigNet: Integrating Core and Contextual Views for Video Classification"

43 / 43 papers shown
Title
Uni-AdaFocus: Spatial-temporal Dynamic Computation for Video Recognition
Uni-AdaFocus: Spatial-temporal Dynamic Computation for Video RecognitionIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Yulin Wang
Haoji Zhang
Yang Yue
Shiji Song
Chao Deng
Junlan Feng
Gao Huang
201
11
0
15 Dec 2024
PosMLP-Video: Spatial and Temporal Relative Position Encoding for
  Efficient Video Recognition
PosMLP-Video: Spatial and Temporal Relative Position Encoding for Efficient Video Recognition
Y. Hao
Diansong Zhou
Zhicai Wang
Chong-Wah Ngo
Meng Wang
ViT
154
9
0
03 Jul 2024
Video Recognition in Portrait Mode
Video Recognition in Portrait Mode
Mingfei Han
Linjie Yang
Xiaojie Jin
Jiashi Feng
Xiaojun Chang
Heng Wang
137
5
0
21 Dec 2023
ConViViT -- A Deep Neural Network Combining Convolutions and Factorized
  Self-Attention for Human Activity Recognition
ConViViT -- A Deep Neural Network Combining Convolutions and Factorized Self-Attention for Human Activity RecognitionIEEE International Workshop on Multimedia Signal Processing (MMSP), 2023
Rachid Reda Dokkar
F. Chaieb
Hassen Drira
Arezki Aberkane
ViT
146
2
0
22 Oct 2023
Temporally-Adaptive Models for Efficient Video Understanding
Temporally-Adaptive Models for Efficient Video Understanding
Ziyuan Huang
Shiwei Zhang
Liang Pan
Zhiwu Qing
Yingya Zhang
Ziwei Liu
Marcelo H. Ang
133
16
0
10 Aug 2023
What Can Simple Arithmetic Operations Do for Temporal Modeling?
What Can Simple Arithmetic Operations Do for Temporal Modeling?IEEE International Conference on Computer Vision (ICCV), 2023
Wenhao Wu
Yuxin Song
Zhun Sun
Jingdong Wang
Chang Xu
Wanli Ouyang
149
15
0
18 Jul 2023
AZTR: Aerial Video Action Recognition with Auto Zoom and Temporal
  Reasoning
AZTR: Aerial Video Action Recognition with Auto Zoom and Temporal ReasoningIEEE International Conference on Robotics and Automation (ICRA), 2023
Xijun Wang
Ruiqi Xian
Tianrui Guan
Celso M. de Melo
Stephen M. Nogar
Aniket Bera
Tianyi Zhou
112
14
0
02 Mar 2023
Look More but Care Less in Video Recognition
Look More but Care Less in Video RecognitionNeural Information Processing Systems (NeurIPS), 2022
Yitian Zhang
Yue Bai
Haiquan Wang
Yi Xu
Yun Fu
130
12
0
18 Nov 2022
Dynamic Temporal Filtering in Video Models
Dynamic Temporal Filtering in Video ModelsEuropean Conference on Computer Vision (ECCV), 2022
Fuchen Long
Zhaofan Qiu
Yingwei Pan
Ting Yao
Chong-Wah Ngo
Tao Mei
AI4TS
203
23
0
15 Nov 2022
DCVQE: A Hierarchical Transformer for Video Quality Assessment
DCVQE: A Hierarchical Transformer for Video Quality AssessmentAsian Conference on Computer Vision (ACCV), 2022
Zu-Hua Li
Lei Yang
ViT
120
3
0
10 Oct 2022
An Overview of Violence Detection Techniques: Current Challenges and
  Future Directions
An Overview of Violence Detection Techniques: Current Challenges and Future DirectionsArtificial Intelligence Review (Artif Intell Rev), 2022
N. Mumtaz
N. Ejaz
Shabana Habib
Syed Muhammad Mohsin
Prayag Tiwari
Shahab S. Band
Neeraj Kumar
138
32
0
21 Sep 2022
Long-term Leap Attention, Short-term Periodic Shift for Video
  Classification
Long-term Leap Attention, Short-term Periodic Shift for Video ClassificationACM Multimedia (ACM MM), 2022
Huatian Zhang
Lechao Cheng
Y. Hao
Chong-Wah Ngo
ViT
150
10
0
12 Jul 2022
Bi-Calibration Networks for Weakly-Supervised Video Representation
  Learning
Bi-Calibration Networks for Weakly-Supervised Video Representation LearningInternational Journal of Computer Vision (IJCV), 2022
Fuchen Long
Ting Yao
Zhaofan Qiu
Xinmei Tian
Jiebo Luo
Tao Mei
126
8
0
21 Jun 2022
Stand-Alone Inter-Frame Attention in Video Models
Stand-Alone Inter-Frame Attention in Video ModelsComputer Vision and Pattern Recognition (CVPR), 2022
Fuchen Long
Zhaofan Qiu
Yingwei Pan
Ting Yao
Jiebo Luo
Tao Mei
ViT
97
56
0
14 Jun 2022
MLP-3D: A MLP-like 3D Architecture with Grouped Time Mixing
MLP-3D: A MLP-like 3D Architecture with Grouped Time MixingComputer Vision and Pattern Recognition (CVPR), 2022
Zhaofan Qiu
Ting Yao
Chong-Wah Ngo
Tao Mei
ViT
131
17
0
13 Jun 2022
In Defense of Image Pre-Training for Spatiotemporal Recognition
In Defense of Image Pre-Training for Spatiotemporal RecognitionEuropean Conference on Computer Vision (ECCV), 2022
Xianhang Li
Huiyu Wang
Chen Wei
Jieru Mei
Alan Yuille
Yuyin Zhou
Cihang Xie
111
1
0
03 May 2022
Attention in Attention: Modeling Context Correlation for Efficient Video
  Classification
Attention in Attention: Modeling Context Correlation for Efficient Video Classification
Y. Hao
Shuo Wang
P. Cao
Xinjian Gao
Tong Xu
Jinmeng Wu
Xiangnan He
137
48
0
20 Apr 2022
Long Movie Clip Classification with State-Space Video Models
Long Movie Clip Classification with State-Space Video ModelsEuropean Conference on Computer Vision (ECCV), 2022
Md. Mohaiminul Islam
Gedas Bertasius
VLM
235
132
0
04 Apr 2022
Group Contextualization for Video Recognition
Group Contextualization for Video RecognitionComputer Vision and Pattern Recognition (CVPR), 2022
Y. Hao
Haotong Zhang
Chong-Wah Ngo
Xiangnan He
80
31
0
18 Mar 2022
Motion-driven Visual Tempo Learning for Video-based Action Recognition
Motion-driven Visual Tempo Learning for Video-based Action RecognitionIEEE Transactions on Image Processing (IEEE TIP), 2022
Yuanzhong Liu
Junsong Yuan
Zhigang Tu
131
70
0
24 Feb 2022
UniFormer: Unifying Convolution and Self-attention for Visual
  Recognition
UniFormer: Unifying Convolution and Self-attention for Visual RecognitionIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Kunchang Li
Yali Wang
Junhao Zhang
Shiyang Feng
Guanglu Song
Yu Liu
Hongsheng Li
Yu Qiao
ViT
367
484
0
24 Jan 2022
Action Keypoint Network for Efficient Video Recognition
Action Keypoint Network for Efficient Video RecognitionIEEE Transactions on Image Processing (IEEE TIP), 2022
Xu Chen
Yahong Han
Xiaohan Wang
Yifang Sun
Yi Yang
3DPC
175
8
0
17 Jan 2022
UniFormer: Unified Transformer for Efficient Spatiotemporal
  Representation Learning
UniFormer: Unified Transformer for Efficient Spatiotemporal Representation LearningInternational Conference on Learning Representations (ICLR), 2022
Kunchang Li
Yali Wang
Shiyang Feng
Guanglu Song
Yu Liu
Hongsheng Li
Yu Qiao
ViT
319
305
0
12 Jan 2022
Temporal Transformer Networks with Self-Supervision for Action
  Recognition
Temporal Transformer Networks with Self-Supervision for Action Recognition
Yongkang Zhang
Jun Li
Guoming Wu
Hanjie Zhang
Zhiping Shi
Zhaoxun Liu
Zizhang Wu
ViT
104
7
0
14 Dec 2021
Auto-X3D: Ultra-Efficient Video Understanding via Finer-Grained Neural
  Architecture Search
Auto-X3D: Ultra-Efficient Video Understanding via Finer-Grained Neural Architecture Search
Lezhi Li
Xinyu Gong
Junru Wu
Humphrey Shi
Zhicheng Yan
Zhangyang Wang
VGen
125
1
0
09 Dec 2021
STSM: Spatio-Temporal Shift Module for Efficient Action Recognition
STSM: Spatio-Temporal Shift Module for Efficient Action Recognition
Zhaoqilin Yang
Gaoyun An
128
6
0
05 Dec 2021
MorphMLP: An Efficient MLP-Like Backbone for Spatial-Temporal
  Representation Learning
MorphMLP: An Efficient MLP-Like Backbone for Spatial-Temporal Representation Learning
David Junhao Zhang
Kunchang Li
Yali Wang
Yuxiang Chen
Shashwat Chandra
Yu Qiao
Luoqi Liu
Mike Zheng Shou
AI4TS
163
33
0
24 Nov 2021
ST-ABN: Visual Explanation Taking into Account Spatio-temporal
  Information for Video Recognition
ST-ABN: Visual Explanation Taking into Account Spatio-temporal Information for Video Recognition
Masahiro Mitsuhara
Tsubasa Hirakawa
Takayoshi Yamashita
H. Fujiyoshi
116
1
0
29 Oct 2021
Temporal-attentive Covariance Pooling Networks for Video Recognition
Temporal-attentive Covariance Pooling Networks for Video Recognition
Zilin Gao
Qilong Wang
Bingbing Zhang
Q. Hu
P. Li
172
28
0
27 Oct 2021
The Dawn of Quantum Natural Language Processing
The Dawn of Quantum Natural Language Processing
R. Sipio
Jia-Hong Huang
Samuel Yen-Chi Chen
Stefano Mangini
Marcel Worring
197
101
0
13 Oct 2021
TAda! Temporally-Adaptive Convolutions for Video Understanding
TAda! Temporally-Adaptive Convolutions for Video UnderstandingInternational Conference on Learning Representations (ICLR), 2021
Ziyuan Huang
Shiwei Zhang
Liang Pan
Zhiwu Qing
Mingqian Tang
Ziwei Liu
M. Ang
293
64
0
12 Oct 2021
Video Is Graph: Structured Graph Module for Video Action Recognition
Video Is Graph: Structured Graph Module for Video Action Recognition
Rongjie Li
Xiaojun Wu
Tianyang Xu
226
14
0
12 Oct 2021
ActionCLIP: A New Paradigm for Video Action Recognition
ActionCLIP: A New Paradigm for Video Action Recognition
Mengmeng Wang
Jiazheng Xing
Yong Liu
VLM
324
444
0
17 Sep 2021
Searching for Two-Stream Models in Multivariate Space for Video
  Recognition
Searching for Two-Stream Models in Multivariate Space for Video RecognitionIEEE International Conference on Computer Vision (ICCV), 2021
Xinyu Gong
Heng Wang
Zheng Shou
Matt Feiszli
Zhangyang Wang
Zhicheng Yan
117
9
0
30 Aug 2021
CT-Net: Channel Tensorization Network for Video Classification
CT-Net: Channel Tensorization Network for Video ClassificationInternational Conference on Learning Representations (ICLR), 2021
Kunchang Li
Xianhang Li
Yali Wang
Jun Wang
Yu Qiao
ViT
97
64
0
03 Jun 2021
TSI: Temporal Saliency Integration for Video Action Recognition
TSI: Temporal Saliency Integration for Video Action Recognition
Haisheng Su
Kunchang Li
Jinyuan Feng
Dongliang Wang
Weihao Gan
Wei Wu
Yu Qiao
103
4
0
02 Jun 2021
Busy-Quiet Video Disentangling for Video Classification
Busy-Quiet Video Disentangling for Video ClassificationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2021
Guoxi Huang
A. Bors
206
8
0
29 Mar 2021
Learning Self-Similarity in Space and Time as Generalized Motion for
  Video Action Recognition
Learning Self-Similarity in Space and Time as Generalized Motion for Video Action RecognitionIEEE International Conference on Computer Vision (ICCV), 2021
Heeseung Kwon
Manjin Kim
Suha Kwak
Minsu Cho
TTA
145
47
0
14 Feb 2021
TDN: Temporal Difference Networks for Efficient Action Recognition
TDN: Temporal Difference Networks for Efficient Action RecognitionComputer Vision and Pattern Recognition (CVPR), 2020
Limin Wang
Zhan Tong
Bin Ji
Gangshan Wu
283
448
0
18 Dec 2020
Discovering Dynamic Salient Regions for Spatio-Temporal Graph Neural
  Networks
Discovering Dynamic Salient Regions for Spatio-Temporal Graph Neural NetworksNeural Information Processing Systems (NeurIPS), 2020
Iulia Duta
Andrei Liviu Nicolicioiu
Marius Leordeanu
194
7
0
17 Sep 2020
Comparison of Spatiotemporal Networks for Learning Video Related Tasks
Comparison of Spatiotemporal Networks for Learning Video Related Tasks
Logan Courtney
R. Sreenivas
68
1
0
15 Sep 2020
Region-based Non-local Operation for Video Classification
Region-based Non-local Operation for Video ClassificationInternational Conference on Pattern Recognition (ICPR), 2020
Guoxi Huang
A. Bors
298
12
0
17 Jul 2020
Omni-Scale CNNs: a simple and effective kernel size configuration for
  time series classification
Omni-Scale CNNs: a simple and effective kernel size configuration for time series classificationInternational Conference on Learning Representations (ICLR), 2020
Wensi Tang
Guodong Long
Lu Liu
Wanrong Zhu
Michael Blumenstein
Jing Jiang
AI4TS
230
130
0
24 Feb 2020
1