1712.00636
Compressed Video Action Recognition
2 December 2017
Chao-Yuan Wu
Manzil Zaheer
Hexiang Hu
R. Manmatha
Alex Smola
Philipp Krahenbuhl
Papers citing
"Compressed Video Action Recognition"
46 / 46 papers shown
Enhancing Video Understanding: Deep Neural Networks for Spatiotemporal Analysis
Amir Hosein Fadaei
M. Dehaqani
45
0
0
11 Feb 2025
Principles of Visual Tokens for Efficient Video Understanding
Xinyue Hao
Gen Li
Shreyank N. Gowda
Robert B Fisher
Jonathan Huang
Anurag Arnab
Laura Sevilla-Lara
98
0
0
20 Nov 2024
Task-Aware Encoder Control for Deep Video Compression
Xingtong Ge
Jixiang Luo
Xinjie Zhang
Tongda Xu
Guo Lu
Dailan He
Jing Geng
Yan Wang
Jun Zhang
Hongwei Qin
31
5
0
07 Apr 2024
Collaboratively Self-supervised Video Representation Learning for Action Recognition
Jie Zhang
Zhifan Wan
Lanqing Hu
Stephen Lin
Shuzhe Wu
Shiguang Shan
TTA
67
1
0
15 Jan 2024
Efficient Temporally-Aware DeepFake Detection using H.264 Motion Vectors
Peter Grönquist
Yufan Ren
Qingyi He
Alessio Verardo
Sabine Süsstrunk
29
0
0
17 Nov 2023
Training a Large Video Model on a Single Machine in a Day
Yue Zhao
Philipp Krahenbuhl
VLM
34
15
0
28 Sep 2023
How can objects help action recognition?
Xingyi Zhou
Anurag Arnab
Chen Sun
Cordelia Schmid
45
14
0
20 Jun 2023
VideoComposer: Compositional Video Synthesis with Motion Controllability
Xiang Wang
Hangjie Yuan
Shiwei Zhang
Dayou Chen
Jiuniu Wang
Yingya Zhang
Yujun Shen
Deli Zhao
Jingren Zhou
VGen
DiffM
33
318
0
03 Jun 2023
Efficient Video Action Detection with Token Dropout and Context Refinement
Lei Chen
Zhan Tong
Yibing Song
Gangshan Wu
Limin Wang
36
14
0
17 Apr 2023
Efficient Semantic Segmentation by Altering Resolutions for Compressed Videos
Yubin Hu
Yuze He
Yanghao Li
Jisheng Li
Yuxing Han
Jiangtao Wen
Yong-Jin Liu
VOS
21
11
0
13 Mar 2023
HierVL: Learning Hierarchical Video-Language Embeddings
Kumar Ashutosh
Rohit Girdhar
Lorenzo Torresani
Kristen Grauman
VLM
AI4TS
28
53
0
05 Jan 2023
MAiVAR: Multimodal Audio-Image and Video Action Recognizer
Muhammad Bilal Shaikh
Douglas Chai
S. Islam
Naveed Akhtar
32
5
0
11 Sep 2022
Multi-Attention Network for Compressed Video Referring Object Segmentation
Weidong Chen
Dexiang Hong
Yuankai Qi
Zhenjun Han
Shuhui Wang
Laiyun Qing
Qingming Huang
Guorong Li
VOS
20
35
0
26 Jul 2022
CoVA: Exploiting Compressed-Domain Analysis to Accelerate Video Analytics
Jinwoo Hwang
Minsu Kim
Daeun Kim
Seungho Nam
Yoonsung Kim
Dohee Kim
Hardik Sharma
Jongse Park
46
14
0
02 Jul 2022
Real-time Online Multi-Object Tracking in Compressed Domain
Qiankun Liu
B. Liu
Yue Wu
Weihai Li
Nenghai Yu
VOT
29
44
0
05 Apr 2022
Deformable Video Transformer
Jue Wang
Lorenzo Torresani
ViT
30
28
0
31 Mar 2022
End-to-End Compressed Video Representation Learning for Generic Event Boundary Detection
Congcong Li
Xinyao Wang
Longyin Wen
Dexiang Hong
Tiejian Luo
Libo Zhang
23
16
0
29 Mar 2022
Learnable Irrelevant Modality Dropout for Multimodal Action Recognition on Modality-Specific Annotated Videos
Saghir Alfasly
Jian Lu
C. Xu
Yuru Zou
42
18
0
06 Mar 2022
MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition
Chao-Yuan Wu
Yanghao Li
K. Mangalam
Haoqi Fan
Bo Xiong
Jitendra Malik
Christoph Feichtenhofer
ViT
48
198
0
20 Jan 2022
TCGL: Temporal Contrastive Graph for Self-supervised Video Representation Learning
Yang Liu
Keze Wang
Lingbo Liu
Hao Lan
Liang Lin
SSL
AI4TS
53
113
0
07 Dec 2021
Efficient Video Transformers with Spatial-Temporal Token Selection
Junke Wang
Xitong Yang
Hengduo Li
Li Liu
Zuxuan Wu
Yu-Gang Jiang
ViT
21
63
0
23 Nov 2021
TEAM-Net: Multi-modal Learning for Video Action Recognition with Partial Decoding
Zhengwei Wang
Qi She
A. Smolic
21
9
0
17 Oct 2021
MM-ViT: Multi-Modal Video Transformer for Compressed Video Action Recognition
Jiawei Chen
C. Ho
ViT
26
77
0
20 Aug 2021
Accelerating Video Object Segmentation with Compressed Video
Kai Xu
Angela Yao
VOS
25
21
0
26 Jul 2021
MutualNet: Adaptive ConvNet via Mutual Learning from Different Model Configurations
Taojiannan Yang
Sijie Zhu
Matías Mendieta
Pu Wang
Ravikumar Balakrishnan
Minwoo Lee
T. Han
M. Shah
Chong Chen
3DH
OOD
28
23
0
14 May 2021
Change Detection in Synthetic Aperture Radar Images Using a Dual-Domain Network
Xiao-Xia Qu
Feng Gao
Junyu Dong
Q. Du
Hengchao Li
16
86
0
14 Apr 2021
Real-Time and Accurate Object Detection in Compressed Video by Long Short-term Feature Aggregation
Xinggang Wang
Zhaojin Huang
Bencheng Liao
Lichao Huang
Yongchao Gong
Chang Huang
25
21
0
25 Mar 2021
Identity-aware Facial Expression Recognition in Compressed Video
Xiaofeng Liu
Linghao Jin
Xu Han
Jun Lu
J. You
Lingsheng Kong
CVBM
48
20
0
01 Jan 2021
A Comprehensive Study of Deep Video Action Recognition
Yi Zhu
Xinyu Li
Chunhui Liu
Mohammadreza Zolfaghari
Yuanjun Xiong
Chongruo Wu
Zhi-Li Zhang
Joseph Tighe
R. Manmatha
Mu Li
VLM
AI4TS
38
185
0
11 Dec 2020
AssembleNet++: Assembling Modality Representations via Attention Connections
Michael S. Ryoo
A. Piergiovanni
Juhana Kangaspunta
A. Angelova
15
44
0
18 Aug 2020
X3D: Expanding Architectures for Efficient Video Recognition
Christoph Feichtenhofer
73
1,001
0
09 Apr 2020
Learning in the Frequency Domain
Kai Xu
Minghai Qin
Fei Sun
Yuhao Wang
Yen-kuang Chen
Fengbo Ren
39
395
0
27 Feb 2020
Audiovisual SlowFast Networks for Video Recognition
Fanyi Xiao
Yong Jae Lee
Kristen Grauman
Jitendra Malik
Christoph Feichtenhofer
197
207
0
23 Jan 2020
Listen to Look: Action Recognition by Previewing Audio
Ruohan Gao
Tae-Hyun Oh
Kristen Grauman
Lorenzo Torresani
VLM
29
251
0
10 Dec 2019
Bypass Enhancement RGB Stream Model for Pedestrian Action Recognition of Autonomous Vehicles
Dong Cao
Lisha Xu
17
2
0
15 Aug 2019
Hallucinating Optical Flow Features for Video Classification
Yongyi Tang
Lin Ma
Lianqiang Zhou
19
19
0
28 May 2019
Exploring Temporal Information for Improved Video Understanding
Yi Zhu
23
0
0
25 May 2019
IF-TTN: Information Fused Temporal Transformation Network for Video Action Recognition
Ke Yang
Peng Qiao
Dongsheng Li
Y. Dou
ViT
35
8
0
26 Feb 2019
DMC-Net: Generating Discriminative Motion Cues for Fast Compressed Video Action Recognition
Zheng Shou
Xudong Lin
Yannis Kalantidis
Laura Sevilla-Lara
Marcus Rohrbach
Shih-Fu Chang
Zhicheng Yan
VGen
37
120
0
11 Jan 2019
Long-Term Feature Banks for Detailed Video Understanding
Chao-Yuan Wu
Christoph Feichtenhofer
Haoqi Fan
Kaiming He
Philipp Krahenbuhl
Ross B. Girshick
62
477
0
12 Dec 2018
Relational Long Short-Term Memory for Video Action Recognition
Zexi Chen
B. Ramachandra
Tianfu Wu
Ranga Raju Vatsavai
24
5
0
16 Nov 2018
Rate-Accuracy Trade-Off In Video Classification With Deep Convolutional Neural Networks
M. Jubran
Alhabib Abbas
Aaron Chadha
Y. Andreopoulos
10
12
0
27 Sep 2018
Multi-Fiber Networks for Video Recognition
Yunpeng Chen
Yannis Kalantidis
Jianshu Li
Shuicheng Yan
Jiashi Feng
CVBM
19
216
0
30 Jul 2018
Video Compression through Image Interpolation
Chao-Yuan Wu
Nayan Singhal
Philipp Krahenbuhl
VGen
34
317
0
18 Apr 2018
Pixel Recurrent Neural Networks
Aaron van den Oord
Nal Kalchbrenner
Koray Kavukcuoglu
SSeg
GAN
272
2,552
0
25 Jan 2016
Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting
Xingjian Shi
Zhourong Chen
Hao Wang
Dit-Yan Yeung
W. Wong
W. Woo
239
7,916
0
13 Jun 2015