1812.03982
SlowFast Networks for Video Recognition
10 December 2018
Christoph Feichtenhofer
Haoqi Fan
Jitendra Malik
Kaiming He
Papers citing
"SlowFast Networks for Video Recognition"
50 / 506 papers shown
Title
Training a Large Video Model on a Single Machine in a Day
Yue Zhao
Philipp Krahenbuhl
VLM
29
15
0
28 Sep 2023
SlowFast Network for Continuous Sign Language Recognition
Junseok Ahn
Youngjoon Jang
Joon Son Chung
SLR
33
10
0
21 Sep 2023
CPR-Coach: Recognizing Composite Error Actions based on Single-class Training
Shunli Wang
Qing Yu
Shuai Wang
Dingkang Yang
Liuzhen Su
Xiao Zhao
Haopeng Kuang
Pei Zhang
Peng Zhai
Lihua Zhang
31
3
0
21 Sep 2023
SkeleTR: Towards Skeleton-based Action Recognition in the Wild
Haodong Duan
Mingze Xu
Bing Shuai
Davide Modolo
Zhuowen Tu
Joseph Tighe
Alessandro Bergamo
ViT
32
1
0
20 Sep 2023
Differentiable Resolution Compression and Alignment for Efficient Video Classification and Retrieval
Rui Deng
Qian Wu
Yuke Li
Haoran Fu
18
2
0
15 Sep 2023
EgoPCA: A New Framework for Egocentric Hand-Object Interaction Understanding
Yue Xu
Yong-Lu Li
Zhemin Huang
Michael Xu Liu
Cewu Lu
Yu-Wing Tai
Chi-Keung Tang
EgoV
22
9
0
05 Sep 2023
UnLoc: A Unified Framework for Video Localization Tasks
Shengjia Yan
Xuehan Xiong
Arsha Nagrani
Anurag Arnab
Zhonghao Wang
Weina Ge
David A. Ross
Cordelia Schmid
24
53
0
21 Aug 2023
MGMAE: Motion Guided Masking for Video Masked Autoencoding
Bingkun Huang
Zhiyu Zhao
Guozhen Zhang
Yu Qiao
Limin Wang
28
30
0
21 Aug 2023
Inherent Redundancy in Spiking Neural Networks
Man Yao
J. Hu
Guangshe Zhao
Yaoyuan Wang
Ziyang Zhang
Boxing Xu
Guoqi Li
22
15
0
16 Aug 2023
ARGUS: Visualization of AI-Assisted Task Guidance in AR
Sonia Castelo
Joao Rulff
Erin McGowan
Bea Steers
Guande Wu
...
Qinghong Sun
Huy Q. Vo
J. P. Bello
M. Krone
Claudio Silva
31
18
0
11 Aug 2023
Constructing Holistic Spatio-Temporal Scene Graph for Video Semantic Role Labeling
Yu Zhao
Hao Fei
Yixin Cao
Bobo Li
Meishan Zhang
Jianguo Wei
M. Zhang
Tat-Seng Chua
17
13
0
09 Aug 2023
View while Moving: Efficient Video Recognition in Long-untrimmed Videos
Ye Tian
Meng Yang
Lanshan Zhang
Zhizhen Zhang
Yang Liu
Xiao-Zhu Xie
Xirong Que
Wendong Wang
24
7
0
09 Aug 2023
Long-Distance Gesture Recognition using Dynamic Neural Networks
Shubhang Bhatnagar
S. Gopal
N. Ahuja
Liu Ren
26
3
0
09 Aug 2023
SSTFormer: Bridging Spiking Neural Network and Memory Support Transformer for Frame-Event based Recognition
Xiao Wang
Zong-Yao Wu
Yao Rong
Lin Zhu
Bowei Jiang
Jin Tang
Yonghong Tian
ViT
71
15
0
08 Aug 2023
M³Net: Multi-view Encoding, Matching, and Fusion for Few-shot Fine-grained Action Recognition
Hao Tang
Jun Liu
Shuanglin Yan
Rui Yan
Zechao Li
Jinhui Tang
19
37
0
06 Aug 2023
A Survey on Deep Learning-based Spatio-temporal Action Detection
Peng Wang
Fanwei Zeng
Yu Qian
26
5
0
03 Aug 2023
Scene Separation & Data Selection: Temporal Segmentation Algorithm for Real-Time Video Stream Analysis
Yuelin Xin
Zihan Zhou
Yuxuan Xia
13
1
0
01 Aug 2023
AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?
Qi Zhao
Shijie Wang
Ce Zhang
Changcheng Fu
Minh Quan Do
Nakul Agarwal
Kwonjoon Lee
Chen Sun
LM&Ro
48
49
0
31 Jul 2023
Robotic Vision for Human-Robot Interaction and Collaboration: A Survey and Systematic Review
Nicole L. Robinson
Brendan Tidd
Dylan Campbell
Dana Kulić
Peter Corke
35
54
0
28 Jul 2023
Sample Less, Learn More: Efficient Action Recognition via Frame Feature Restoration
Harry Cheng
Yangyang Guo
Liqiang Nie
Zhiyong Cheng
Mohan S. Kankanhalli
37
7
0
27 Jul 2023
Revisiting Event-based Video Frame Interpolation
Jiaben Chen
Yi‐Wen Zhu
Dongze Lian
Jiaqi Yang
Yifu Wang
Renrui Zhang
Xinhang Liu
Shenhan Qian
L. Kneip
Shenghua Gao
23
2
0
24 Jul 2023
GLSFormer: Gated - Long, Short Sequence Transformer for Step Recognition in Surgical Videos
Nisarg A. Shah
S. Sikder
S. Vedula
Vishal M. Patel
ViT
MedIm
14
7
0
20 Jul 2023
NTIRE 2023 Quality Assessment of Video Enhancement Challenge
Xiaohong Liu
Xiongkuo Min
Wei Sun
Yulun Zhang
K. Zhang
...
Te Shi
Azadeh Mansouri
Hossein Motamednia
Amirhossein Bakhtiari
Ahmad Mahmoudi-Aznaveh
27
18
0
19 Jul 2023
What Can Simple Arithmetic Operations Do for Temporal Modeling?
Wenhao Wu
Yuxin Song
Zhun Sun
Jingdong Wang
Chang Xu
Wanli Ouyang
40
8
0
18 Jul 2023
Atlas-Based Interpretable Age Prediction In Whole-Body MR Images
Sophie Starck
Yadunandan Vivekanand Kini
J. Ritter
R. Braren
Daniel Rueckert
Tamara T. Mueller
23
1
0
14 Jul 2023
A Study on Differentiable Logic and LLMs for EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition 2023
Yi Cheng
Ziwei Xu
Fen Fang
Dongyun Lin
Hehe Fan
Yongkang Wong
Ying Sun
Mohan S. Kankanhalli
24
0
0
13 Jul 2023
How can objects help action recognition?
Xingyi Zhou
Anurag Arnab
Chen Sun
Cordelia Schmid
35
14
0
20 Jun 2023
Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Alignment
Zihui Xue
Kristen Grauman
EgoV
31
30
0
08 Jun 2023
Atrial Septal Defect Detection in Children Based on Ultrasound Video Using Multiple Instances Learning
Yiman Liu
Q. Huang
Xiaoxiang Han
Tongtong Liang
Zhi-fang Zhang
...
Angelos Stefanidis
Jionglong Su
Jiangang Chen
Qingli Li
Yuqi Zhang
21
7
0
06 Jun 2023
Human-Object Interaction Prediction in Videos through Gaze Following
Zhifan Ni
Esteve Valls Mascaro
Hyemin Ahn
Dongheui Lee
24
10
0
06 Jun 2023
VR.net: A Real-world Dataset for Virtual Reality Motion Sickness Research
Elliott Wen
Chitralekha Gupta
P. Sasikumar
Mark Billinghurst
James P Wilmott
Emily Skow
Arindam Dey
Suranga Nanayakkara
27
11
0
06 Jun 2023
VIPriors 3: Visual Inductive Priors for Data-Efficient Deep Learning Challenges
Robert-Jan Bruintjes
A. Lengyel
Marcos Baptista-Rios
O. Kayhan
Davide Zambrano
Nergis Tomen
J. C. V. Gemert
18
9
0
31 May 2023
TranSFormer: Slow-Fast Transformer for Machine Translation
Bei Li
Yi Jing
Xu Tan
Zhen Xing
Tong Xiao
Jingbo Zhu
41
7
0
26 May 2023
CVB: A Video Dataset of Cattle Visual Behaviors
Ali Zia
Renuka Sharma
Reza Arablouei
G. Bishop-Hurley
Jody McNally
N. Bagnall
V. Rolland
Brano Kusy
L. Petersson
A. Ingham
23
2
0
26 May 2023
Deep Neural Networks in Video Human Action Recognition: A Review
Zihan Wang
Yang Yang
Zhi Liu
Y. Zheng
53
4
0
25 May 2023
Flexible and Inherently Comprehensible Knowledge Representation for Data-Efficient Learning and Trustworthy Human-Machine Teaming in Manufacturing Environments
Vedran Galetić
Alistair Nottle
27
1
0
19 May 2023
SB-VQA: A Stack-Based Video Quality Assessment Framework for Video Enhancement
Ding-Jiun Huang
Yu-Ting Kao
Tieh-Hung Chuang
Ya-Chun Tsai
Jing-Kai Lou
Shuen-Huei Guan
21
2
0
15 May 2023
Is end-to-end learning enough for fitness activity recognition?
Antoine Mercier
Guillaume Berger
Sunny Panchal
Florian Letsch
Cornelius Boehm
Nahua Kang
Ingo Bax
Roland Memisevic
21
2
0
14 May 2023
Transformer-Based Model for Monocular Visual Odometry: A Video Understanding Approach
André O. Françani
Marcos R. O. A. Máximo
25
8
0
10 May 2023
End-to-End Spatio-Temporal Action Localisation with Video Transformers
A. Gritsenko
Xuehan Xiong
Josip Djolonga
Mostafa Dehghani
Chen Sun
Mario Lucic
Cordelia Schmid
Anurag Arnab
ViT
32
13
0
24 Apr 2023
Efficient Video Action Detection with Token Dropout and Context Refinement
Lei Chen
Zhan Tong
Yibing Song
Gangshan Wu
Limin Wang
36
14
0
17 Apr 2023
Zoom-VQA: Patches, Frames and Clips Integration for Video Quality Assessment
Kai Zhao
Kun Yuan
Ming-Ting Sun
Xingsen Wen
10
20
0
13 Apr 2023
Isolated Sign Language Recognition based on Tree Structure Skeleton Images
David Laines
G. Bejarano
M. González-Mendoza
Gilberto Ochoa-Ruiz
SLR
24
12
0
10 Apr 2023
Machine Learning with Requirements: a Manifesto
Eleonora Giunchiglia
F. Imrie
M. Schaar
Thomas Lukasiewicz
AI4TS
OffRL
VLM
32
5
0
07 Apr 2023
Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting
Syed Talal Wasim
Muzammal Naseer
Salman Khan
F. Khan
M. Shah
VLM
VPVLM
30
73
0
06 Apr 2023
VicTR: Video-conditioned Text Representations for Activity Recognition
Kumara Kahatapitiya
Anurag Arnab
Arsha Nagrani
Michael S. Ryoo
31
19
0
05 Apr 2023
Personality-aware Human-centric Multimodal Reasoning: A New Task, Dataset and Baselines
Yaochen Zhu
Xiangqing Shen
Rui Xia
19
5
0
05 Apr 2023
Bodily expressed emotion understanding through integrating Laban movement analysis
Chenyan Wu
Dolzodmaa Davaasuren
T. Shafir
Rachelle Tsachor
James Z. Wang
30
6
0
05 Apr 2023
On the Benefits of 3D Pose and Tracking for Human Action Recognition
Jathushan Rajasegaran
Georgios Pavlakos
Angjoo Kanazawa
Christoph Feichtenhofer
Jitendra Malik
30
30
0
03 Apr 2023
DOAD: Decoupled One Stage Action Detection Network
Shuning Chang
Pichao Wang
Fan Wang
Jiashi Feng
Mike Zheng Shou
13
4
0
01 Apr 2023