Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2004.04730
Cited By
X3D: Expanding Architectures for Efficient Video Recognition
9 April 2020
Christoph Feichtenhofer
Re-assign community
ArXiv
PDF
HTML
Papers citing
"X3D: Expanding Architectures for Efficient Video Recognition"
50 / 526 papers shown
Title
Survey: Exploiting Data Redundancy for Optimization of Deep Learning
Jou-An Chen
Wei Niu
Bin Ren
Yanzhi Wang
Xipeng Shen
17
11
0
29 Aug 2022
Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling
Rui Wang
Zuxuan Wu
Dongdong Chen
Yinpeng Chen
Xiyang Dai
Mengchen Liu
Luowei Zhou
Lu Yuan
Yu-Gang Jiang
ViT
25
4
0
25 Aug 2022
Enabling Weakly-Supervised Temporal Action Localization from On-Device Learning of the Video Stream
Yue Tang
Yawen Wu
Peipei Zhou
Jingtong Hu
6
2
0
25 Aug 2022
Lane Change Classification and Prediction with Action Recognition Networks
Kai-Bin Liang
Jun Wang
A. Bhalerao
14
2
0
24 Aug 2022
Efficient Attention-free Video Shift Transformers
Adrian Bulat
Brais Martínez
Georgios Tzimiropoulos
ViT
14
1
0
23 Aug 2022
Hierarchical Compositional Representations for Few-shot Action Recognition
Chang-bo Li
Jie M. Zhang
Shuzhe Wu
Xin Jin
Shiguang Shan
17
20
0
19 Aug 2022
Frozen CLIP Models are Efficient Video Learners
Ziyi Lin
Shijie Geng
Renrui Zhang
Peng Gao
Gerard de Melo
Xiaogang Wang
Jifeng Dai
Yu Qiao
Hongsheng Li
CLIP
VLM
10
199
0
06 Aug 2022
Combined CNN Transformer Encoder for Enhanced Fine-grained Human Action Recognition
M. C. Leong
Haosong Zhang
Huibin Tan
Liyuan Li
J. Lim
ViT
20
8
0
03 Aug 2022
Two-Stream Transformer Architecture for Long Video Understanding
Edward Fish
Jon Weinbren
Andrew Gilbert
ViT
17
6
0
02 Aug 2022
Video Question Answering with Iterative Video-Text Co-Tokenization
A. Piergiovanni
K. Morton
Weicheng Kuo
Michael S. Ryoo
A. Angelova
10
17
0
01 Aug 2022
Object-ABN: Learning to Generate Sharp Attention Maps for Action Recognition
Tomoya Nitta
Tsubasa Hirakawa
H. Fujiyoshi
Toru Tamaki
27
0
0
27 Jul 2022
Spatiotemporal Self-attention Modeling with Temporal Patch Shift for Action Recognition
Wangmeng Xiang
C. Li
Biao Wang
Xihan Wei
Xiangpei Hua
Lei Zhang
ViT
15
26
0
27 Jul 2022
MAR: Masked Autoencoders for Efficient Action Recognition
Zhiwu Qing
Shiwei Zhang
Ziyuan Huang
Xiang Wang
Yuehuang Wang
Yiliang Lv
Changxin Gao
Nong Sang
11
42
0
24 Jul 2022
TinyViT: Fast Pretraining Distillation for Small Vision Transformers
Kan Wu
Jinnian Zhang
Houwen Peng
Mengchen Liu
Bin Xiao
Jianlong Fu
Lu Yuan
ViT
11
236
0
21 Jul 2022
Temporal Saliency Query Network for Efficient Video Recognition
Boyang Xia
Zhihao Wang
Wenhao Wu
Haoran Wang
Jungong Han
26
15
0
21 Jul 2022
ViGAT: Bottom-up event recognition and explanation in video using factorized graph attention network
Nikolaos Gkalelis
Dimitrios Daskalakis
Vasileios Mezaris
8
8
0
20 Jul 2022
Is Appearance Free Action Recognition Possible?
Filip Ilic
T. Pock
Richard P. Wildes
6
15
0
13 Jul 2022
Long-term Leap Attention, Short-term Periodic Shift for Video Classification
H. M. Zhang
Lechao Cheng
Y. Hao
Chong-Wah Ngo
ViT
10
8
0
12 Jul 2022
VidConv: A modernized 2D ConvNet for Efficient Video Recognition
Chuong H. Nguyen
Su Huynh
Vinh Nguyen
Ngoc-Khanh Nguyen
ViT
14
3
0
08 Jul 2022
Large-scale Robustness Analysis of Video Action Recognition Models
Madeline Chantry Schiappa
Naman Biyani
Prudvi Kamtam
Shruti Vyas
Hamid Palangi
Vibhav Vineet
Y. S. Rawat
AAML
14
24
0
04 Jul 2022
GraphVid: It Only Takes a Few Nodes to Understand a Video
Eitan Kosman
Dotan Di Castro
GNN
27
5
0
04 Jul 2022
Revisiting Classifier: Transferring Vision-Language Models for Video Recognition
Wenhao Wu
Zhun Sun
Wanli Ouyang
VLM
87
93
0
04 Jul 2022
Enabling Harmonious Human-Machine Interaction with Visual-Context Augmented Dialogue System: A Review
Hao Wang
Bin Guo
Y. Zeng
Yasan Ding
Chen Qiu
Ying Zhang
Li Yao
Zhiwen Yu
22
2
0
02 Jul 2022
Exploring Temporally Dynamic Data Augmentation for Video Recognition
Taeoh Kim
Jinhyung Kim
Minho Shim
Sangdoo Yun
Myunggu Kang
Dongyoon Wee
Sangyoun Lee
AI4TS
15
10
0
30 Jun 2022
ST-Adapter: Parameter-Efficient Image-to-Video Transfer Learning
Junting Pan
Ziyi Lin
Xiatian Zhu
Jing Shao
Hongsheng Li
4
188
0
27 Jun 2022
Programmatic Concept Learning for Human Motion Description and Synthesis
Sumith Kulal
Jiayuan Mao
A. Aiken
Jiajun Wu
17
6
0
27 Jun 2022
SLIC: Self-Supervised Learning with Iterative Clustering for Human Action Videos
S. H. Khorasgani
Yuxuan Chen
Florian Shkurti
SSL
33
22
0
25 Jun 2022
DisCoVQA: Temporal Distortion-Content Transformers for Video Quality Assessment
Haoning Wu
Chao-Yu Chen
Liang Liao
Jingwen Hou
Wenxiu Sun
Qiong Yan
Weisi Lin
ViT
17
49
0
20 Jun 2022
It's Time for Artistic Correspondence in Music and Video
Dídac Surís
Carl Vondrick
Bryan C. Russell
Justin Salamon
11
37
0
14 Jun 2022
Stand-Alone Inter-Frame Attention in Video Models
Fuchen Long
Zhaofan Qiu
Yingwei Pan
Ting Yao
Jiebo Luo
Tao Mei
ViT
18
46
0
14 Jun 2022
MLP-3D: A MLP-like 3D Architecture with Grouped Time Mixing
Zhaofan Qiu
Ting Yao
Chong-Wah Ngo
Tao Mei
ViT
22
15
0
13 Jun 2022
Words are all you need? Language as an approximation for human similarity judgments
Raja Marjieh
Pol van Rijn
Ilia Sucholutsky
T. Sumers
Harin Lee
Thomas L. Griffiths
Nori Jacoby
21
18
0
08 Jun 2022
A Deeper Dive Into What Deep Spatiotemporal Networks Encode: Quantifying Static vs. Dynamic Information
M. Kowal
Mennatullah Siam
Md. Amirul Islam
Neil D. B. Bruce
Richard P. Wildes
Konstantinos G. Derpanis
18
25
0
06 Jun 2022
Revisiting the "Video" in Video-Language Understanding
S. Buch
Cristobal Eyzaguirre
Adrien Gaidon
Jiajun Wu
L. Fei-Fei
Juan Carlos Niebles
6
154
0
03 Jun 2022
A Survey on Video Action Recognition in Sports: Datasets, Methods and Applications
Fei Wu
Qingzhong Wang
Jian Bian
Haoyi Xiong
Ning Ding
Feixiang Lu
Junqing Cheng
Dejing Dou
AI4TS
13
48
0
02 Jun 2022
PYSKL: Towards Good Practices for Skeleton Action Recognition
Haodong Duan
Jiaqi Wang
Kai-xiang Chen
Dahua Lin
VLM
19
133
0
19 May 2022
i-Code: An Integrative and Composable Multimodal Learning Framework
Ziyi Yang
Yuwei Fang
Chenguang Zhu
Reid Pryzant
Dongdong Chen
...
Bin Xiao
Yuanxun Lu
Takuya Yoshioka
Michael Zeng
Xuedong Huang
35
45
0
03 May 2022
In Defense of Image Pre-Training for Spatiotemporal Recognition
Xianhang Li
Huiyu Wang
Chen Wei
Jieru Mei
Alan Yuille
Yuyin Zhou
Cihang Xie
14
0
0
03 May 2022
Cross-modal Representation Learning for Zero-shot Action Recognition
Chung-Ching Lin
Kevin Qinghong Lin
Linjie Li
Lijuan Wang
Zicheng Liu
ViT
11
30
0
03 May 2022
Preserve Pre-trained Knowledge: Transfer Learning With Self-Distillation For Action Recognition
Yang Zhou
Zhanhao He
Ke Lu
Guanhong Wang
Gaoang Wang
CLL
SLR
6
2
0
01 May 2022
HuMMan: Multi-Modal 4D Human Dataset for Versatile Sensing and Modeling
Zhongang Cai
Daxuan Ren
Ailing Zeng
Zhengyu Lin
Tao Yu
...
Fangzhou Hong
Mingyuan Zhang
Chen Change Loy
Lei Yang
Ziwei Liu
3DH
21
99
0
28 Apr 2022
The Wisdom of Crowds: Temporal Progressive Attention for Early Action Prediction
Alexandros Stergiou
Dima Damen
AI4TS
EgoV
EDL
17
7
0
28 Apr 2022
Temporal Relevance Analysis for Video Action Models
Quanfu Fan
Donghyun Kim
Chun-Fu Chen
Chen
Stan Sclaroff
Kate Saenko
Sarah Adel Bargal
FAtt
12
0
0
25 Apr 2022
Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications
Han Cai
Ji Lin
Yujun Lin
Zhijian Liu
Haotian Tang
Hanrui Wang
Ligeng Zhu
Song Han
12
106
0
25 Apr 2022
The 6th AI City Challenge
M. Naphade
Shuo Wang
D. Anastasiu
Zheng Tang
Ming-Ching Chang
...
Stan Sclaroff
Pranamesh Chakraborty
Alice Li
Shangru Li
Rama Chellappa
11
70
0
21 Apr 2022
THORN: Temporal Human-Object Relation Network for Action Recognition
Mohammed Guermal
Rui Dai
F. Brémond
EgoV
6
3
0
20 Apr 2022
Attention in Attention: Modeling Context Correlation for Efficient Video Classification
Y. Hao
Shuo Wang
P. Cao
Xinjian Gao
Tong Bill Xu
Jinmeng Wu
Xiangnan He
17
40
0
20 Apr 2022
A Survey of Video-based Action Quality Assessment
Shunli Wang
Dingkang Yang
Peng Zhai
Qing Yu
Tao Suo
Zhan Sun
Ka Li
Lihua Zhang
15
16
0
20 Apr 2022
Performance Evaluation of Action Recognition Models on Low Quality Videos
Aoi Otani
Ryota Hashiguchi
Kazuki Omi
Norishige Fukushima
Toru Tamaki
8
6
0
19 Apr 2022
Animal Kingdom: A Large and Diverse Dataset for Animal Behavior Understanding
Xun Long Ng
Kian Eng Ong
Qichen Zheng
Yun Ni
S. Yeo
J. Liu
VGen
6
81
0
18 Apr 2022
Previous
1
2
3
...
10
11
6
7
8
9
Next