Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2207.10379
Cited By
Temporal Saliency Query Network for Efficient Video Recognition
21 July 2022
Boyang Xia
Zhihao Wang
Wenhao Wu
Haoran Wang
Jungong Han
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Temporal Saliency Query Network for Efficient Video Recognition"
15 / 15 papers shown
Title
Uni-AdaFocus: Spatial-temporal Dynamic Computation for Video Recognition
Yulin Wang
Haoji Zhang
Yang Yue
Shiji Song
Chao Deng
Junlan Feng
Gao Huang
74
3
0
15 Dec 2024
HaltingVT: Adaptive Token Halting Transformer for Efficient Video Recognition
Qian Wu
Ruoxuan Cui
Yuke Li
Haoqi Zhu
ViT
17
1
0
10 Jan 2024
Differentiable Resolution Compression and Alignment for Efficient Video Classification and Retrieval
Rui Deng
Qian Wu
Yuke Li
Haoran Fu
8
2
0
15 Sep 2023
View while Moving: Efficient Video Recognition in Long-untrimmed Videos
Ye Tian
Meng Yang
Lanshan Zhang
Zhizhen Zhang
Yang Liu
Xiao-Zhu Xie
Xirong Que
Wendong Wang
19
7
0
09 Aug 2023
What Can Simple Arithmetic Operations Do for Temporal Modeling?
Wenhao Wu
Yuxin Song
Zhun Sun
Jingdong Wang
Chang Xu
Wanli Ouyang
33
8
0
18 Jul 2023
A Large-scale Study of Spatiotemporal Representation Learning with a New Benchmark on Action Recognition
Andong Deng
Taojiannan Yang
C. L. P. Chen
AI4TS
16
12
0
23 Mar 2023
Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models
Wenhao Wu
Xiaohan Wang
Haipeng Luo
Jingdong Wang
Yi Yang
Wanli Ouyang
94
47
0
31 Dec 2022
Vision Transformers for Action Recognition: A Survey
Anwaar Ulhaq
Naveed Akhtar
Ganna Pogrebna
Ajmal Saeed Mian
ViT
19
43
0
13 Sep 2022
NSNet: Non-saliency Suppression Sampler for Efficient Video Recognition
Boyang Xia
Wenhao Wu
Haoran Wang
Rui Su
Dongliang He
Haosen Yang
Xiaoran Fan
Wanli Ouyang
8
21
0
21 Jul 2022
Revisiting Classifier: Transferring Vision-Language Models for Video Recognition
Wenhao Wu
Zhun Sun
Wanli Ouyang
VLM
87
93
0
04 Jul 2022
Mobile-Former: Bridging MobileNet and Transformer
Yinpeng Chen
Xiyang Dai
Dongdong Chen
Mengchen Liu
Xiaoyi Dong
Lu Yuan
Zicheng Liu
ViT
169
462
0
12 Aug 2021
AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition
Yue Meng
Rameswar Panda
Chung-Ching Lin
P. Sattigeri
Leonid Karlinsky
Kate Saenko
A. Oliva
Rogerio Feris
66
62
0
10 Feb 2021
MVFNet: Multi-View Fusion Network for Efficient Video Recognition
Wenhao Wu
Dongliang He
Tianwei Lin
Fu Li
Chuang Gan
Errui Ding
85
68
0
13 Dec 2020
AdaFrame: Adaptive Frame Selection for Fast Video Recognition
Zuxuan Wu
Caiming Xiong
Chih-Yao Ma
R. Socher
L. Davis
110
194
0
29 Nov 2018
Efficient Estimation of Word Representations in Vector Space
Tomáš Mikolov
Kai Chen
G. Corrado
J. Dean
3DV
228
29,632
0
16 Jan 2013
1