1406.2199
Two-Stream Convolutional Networks for Action Recognition in Videos
Neural Information Processing Systems (NeurIPS), 2014
9 June 2014
Karen Simonyan
Andrew Zisserman
Papers citing "Two-Stream Convolutional Networks for Action Recognition in Videos"
50 / 2,340 papers shown
Multi-Modal Machine Learning for Assessing Gaming Skills in Online Streaming: A Case Study with CS:GO
Longxiang Zhang
Wenping Wang
280
1
0
23 Jul 2023
What Can Simple Arithmetic Operations Do for Temporal Modeling?
IEEE International Conference on Computer Vision (ICCV), 2023
Wenhao Wu
Yuxin Song
Zhun Sun
Jingdong Wang
Chang Xu
Wanli Ouyang
212
17
0
18 Jul 2023
Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition
IEEE International Conference on Computer Vision (ICCV), 2023
Syed Talal Wasim
Muhammad Uzair Khattak
Muzammal Naseer
Salman Khan
M. Shah
Fahad Shahbaz Khan
ViT
257
27
0
13 Jul 2023
Transformer-based end-to-end classification of variable-length volumetric data
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2023
Marzieh Oghbaie
Teresa Araújo
T. Emre
U. Schmidt-Erfurth
Hrvoje Bogunović
ViT
MedIm
267
8
0
13 Jul 2023
Reasoning over the Behaviour of Objects in Video-Clips for Adverb-Type Recognition
Amrithaa Seshadri
Alessandra Russo
282
0
0
09 Jul 2023
Task-Specific Alignment and Multiple Level Transformer for Few-Shot Action Recognition
Neurocomputing, 2023
Fei-Yu Guo
Li Zhu
Yiwang Wang
Jing Sun
ViT
235
10
0
05 Jul 2023
Synchronous Image-Label Diffusion Probability Model with Application to Stroke Lesion Segmentation on Non-contrast CT
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Jianhai Zhang
Tonghua Wan
Ethan MacDonald
B. Menon
A. Ganesh
Qiu Wu
DiffM
MedIm
213
1
0
04 Jul 2023
Look, Remember and Reason: Grounded reasoning in videos with language models
International Conference on Learning Representations (ICLR), 2023
Apratim Bhattacharyya
Sunny Panchal
Mingu Lee
Reza Pourreza
Pulkit Madan
Roland Memisevic
LRM
470
13
0
30 Jun 2023
Deep Equilibrium Multimodal Fusion
Jinhong Ni
Yalong Bai
Wei Zhang
Ting Yao
Tao Mei
273
8
0
29 Jun 2023
Differentially Private Video Activity Recognition
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Zelun Luo
Yuliang Zou
Yijin Yang
Zane Durante
De-An Huang
Zhiding Yu
Chaowei Xiao
L. Fei-Fei
Anima Anandkumar
PICV
259
7
0
27 Jun 2023
Spiking Two-Stream Methods with Unsupervised STDP-based Learning for Action Recognition
Signal Processing: Image Communication (SPIC), 2023
Mireille el Assal
Pierre Tirilly
Ioan Marius Bilasco
179
4
0
23 Jun 2023
Variance-Covariance Regularization Improves Representation Learning
Jiachen Zhu
Katrina Evtimova
Yubei Chen
Ravid Shwartz-Ziv
Yann LeCun
SSL
250
8
0
23 Jun 2023
Learning Scene Flow With Skeleton Guidance For 3D Action Recognition
Vasileios Magoulianitis
A. Psaltis
3DH
3DPC
217
0
0
23 Jun 2023
A Reliable and Interpretable Framework of Multi-view Learning for Liver Fibrosis Staging
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2023
Zheyao Gao
Yuanye Liu
Fuping Wu
N. Shi
Yuxin Shi
Xiahai Zhuang
EDL
112
20
0
21 Jun 2023
Vision-Language Models can Identify Distracted Driver Behavior from Naturalistic Videos
Md Zahid Hasan
Jiajing Chen
Jiyang Wang
Mohammed Shaiqur Rahman
Ameya Joshi
Senem Velipasalar
Chinmay Hegde
Anuj Sharma
Soumik Sarkar
VLM
362
41
0
16 Jun 2023
Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision Transformers
Dominick Reilly
Vasu Sharma
Srijan Das
ViT
258
4
0
15 Jun 2023
E2E-LOAD: End-to-End Long-form Online Action Detection
IEEE International Conference on Computer Vision (ICCV), 2023
Shuyuan Cao
Weihua Luo
Bairui Wang
Wei Emma Zhang
Lin Ma
216
12
0
13 Jun 2023
Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Alignment
Neural Information Processing Systems (NeurIPS), 2023
Zihui Xue
Kristen Grauman
EgoV
285
47
0
08 Jun 2023
Optimizing ViViT Training: Time and Memory Reduction for Action Recognition
Shreyank N. Gowda
Anurag Arnab
Jonathan Huang
ViT
182
4
0
07 Jun 2023
Atrial Septal Defect Detection in Children Based on Ultrasound Video Using Multiple Instances Learning
Yiman Liu
Qingming Huang
Xiaoxiang Han
Tongtong Liang
Zhi-fang Zhang
...
Angelos Stefanidis
Jionglong Su
Jiangang Chen
Qingli Li
Yuqi Zhang
139
13
0
06 Jun 2023
Human-Object Interaction Prediction in Videos through Gaze Following
Computer Vision and Image Understanding (CVIU), 2023
Zhifan Ni
Esteve Valls Mascaro
Hyemin Ahn
Dongheui Lee
230
16
0
06 Jun 2023
A Multi-Modal Transformer Network for Action Detection
Pattern Recognition (Pattern Recogn.), 2023
Matthew Korban
Scott T. Acton
Peter Youngs
ViT
187
18
0
31 May 2023
Discovering Novel Actions from Open World Egocentric Videos with Object-Grounded Visual Commonsense Reasoning
European Conference on Computer Vision (ECCV), 2023
Sanjoy Kundu
Shubham Trehan
Sathyanarayanan N. Aakur
LRM
LM&Ro
306
5
0
26 May 2023
Action Sensitivity Learning for Temporal Action Localization
IEEE International Conference on Computer Vision (ICCV), 2023
Jiayi Shao
Xiaohan Wang
Ruijie Quan
Junjun Zheng
Jiang Yang
Yezhou Yang
334
41
0
25 May 2023
Cross-view Action Recognition Understanding From Exocentric to Egocentric Perspective
Neurocomputing, 2023
Thanh-Dat Truong
Khoa Luu
EgoV
389
15
0
25 May 2023
Continual Learning through Human-Robot Interaction: Human Perceptions of a Continual Learning Robot in Repeated Interactions
Ali Ayub
Zachary De Francesco
Patrick Holthaus
Chrystopher L. Nehaniv
Kerstin Dautenhahn
CLL
HAI
259
13
0
22 May 2023
Exploring Few-Shot Adaptation for Activity Recognition on Diverse Domains
Kunyu Peng
Di Wen
David Schneider
Kailai Li
Kailun Yang
M. Sarfraz
Rainer Stiefelhagen
Alina Roitberg
335
3
0
15 May 2023
Is end-to-end learning enough for fitness activity recognition?
Antoine Mercier
Guillaume Berger
Sunny Panchal
Florian Letsch
Cornelius Boehm
Nahua Kang
Ingo Bax
Roland Memisevic
203
2
0
14 May 2023
Lightweight Delivery Detection on Doorbell Cameras
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Pirazh Khorramshahi
Zhe Wu
Tianchen Wang
Luke Deluccia
Hongcheng Wang
190
0
0
13 May 2023
Active Semantic Localization with Graph Neural Embedding
Asian Conference on Pattern Recognition (ACPR), 2023
Mitsuki Yoshida
Kanji Tanaka
Ryo Yamamoto
Daiki Iwata
390
2
0
10 May 2023
Group Activity Recognition via Dynamic Composition and Interaction
Youliang Zhang
Zhuo Zhou
Wenxuan Liu
Danni Xu
Zheng Wang
181
3
0
09 May 2023
Video-Specific Query-Key Attention Modeling for Weakly-Supervised Temporal Action Localization
Xijun Wang
Aggelos K. Katsaggelos
334
0
0
07 May 2023
ItoV: Efficiently Adapting Deep Learning-based Image Watermarking to Video Watermarking
Guanhui Ye
Jiashi Gao
Yuchen Wang
Liyan Song
Xue-Ming Wei
229
12
0
04 May 2023
Weakly-supervised Micro- and Macro-expression Spotting Based on Multi-level Consistency
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Wang-Wang Yu
Kai-Fu Yang
Hong-Mei Yan
Yong-Jie Li
232
4
0
04 May 2023
Local and Global Contextual Features Fusion for Pedestrian Intention Prediction
Mohsen Azarmi
Mahdi Rezaei
Tanveer Hussain
Chenghao Qian
220
12
0
01 May 2023
Physical Adversarial Attacks for Surveillance: A Survey
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Kien Nguyen Thanh
Tharindu Fernando
Clinton Fookes
Sridha Sridharan
AAML
385
26
0
01 May 2023
Weakly-Supervised Temporal Action Localization with Bidirectional Semantic Consistency Constraint
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Guozhang Li
De Cheng
Xinpeng Ding
N. Wang
Jie Li
Xinbo Gao
306
9
0
25 Apr 2023
MRSN: Multi-Relation Support Network for Video Action Detection
IEEE International Conference on Multimedia and Expo (ICME), 2023
Yin-Dong Zheng
Guo Chen
Minglei Yuan
Tong Lu
256
9
0
24 Apr 2023
Implicit Temporal Modeling with Learnable Alignment for Video Recognition
IEEE International Conference on Computer Vision (ICCV), 2023
S. Tu
Jingdong Sun
Zuxuan Wu
Zhi-Qi Cheng
Hang-Rui Hu
Yu-Gang Jiang
309
58
0
20 Apr 2023
Search-Map-Search: A Frame Selection Paradigm for Action Recognition
Computer Vision and Pattern Recognition (CVPR), 2023
Mingjun Zhao
Yu
Xiaoli Wang
Lei Yang
Di Niu
203
11
0
20 Apr 2023
Video-based Contrastive Learning on Decision Trees: from Action Recognition to Autism Diagnosis
ACM SIGMM Conference on Multimedia Systems (MMSys), 2023
Mindi Ruan
Xiang Yu
Naifeng Zhang
Chuanbo Hu
Shuo Wang
Xin Li
303
14
0
20 Apr 2023
Self-Supervised 3D Action Representation Learning with Skeleton Cloud Colorization
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Siyuan Yang
Jun Liu
Shijian Lu
Er Meng Hwa
Yongjian Hu
Alex C. Kot
3DPC
3DH
299
30
0
18 Apr 2023
Multimodal Short Video Rumor Detection System Based on Contrastive Learning
Yuxing Yang
Junhao Zhao
Siyi Wang
Xiangyu Min
Peifeng Wang
Haizhou Wang
118
4
0
17 Apr 2023
Unsupervised Learning Optical Flow in Multi-frame Dynamic Environment Using Temporal Dynamic Modeling
Zitang Sun
Shin'ya Nishida
Zhengbo Luo
174
1
0
14 Apr 2023
Explaining, Analyzing, and Probing Representations of Self-Supervised Learning Models for Sensor-based Human Activity Recognition
Bulat Khaertdinov
S. Asteriadis
191
3
0
14 Apr 2023
PMI Sampler: Patch Similarity Guided Frame Selection for Aerial Action Recognition
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Ruiqi Xian
Xijun Wang
D. Kothandaraman
Tianyi Zhou
195
13
0
14 Apr 2023
DNeRV: Modeling Inherent Dynamics via Difference Neural Representation for Videos
Computer Vision and Pattern Recognition (CVPR), 2023
Qi Zhao
M. Salman Asif
Zhan Ma
246
46
0
13 Apr 2023
VARS: Video Assistant Referee System for Automated Soccer Decision Making from Multiple Views
Jan Held
A. Cioppa
Silvio Giancola
Abdullah Hamdi
Guohao Li
Marc Van Droogenbroeck
180
48
0
10 Apr 2023
On the Benefits of 3D Pose and Tracking for Human Action Recognition
Computer Vision and Pattern Recognition (CVPR), 2023
Jathushan Rajasegaran
Georgios Pavlakos
Angjoo Kanazawa
Christoph Feichtenhofer
Jitendra Malik
408
45
0
03 Apr 2023
AutoLabel: CLIP-based framework for Open-set Video Domain Adaptation
Computer Vision and Pattern Recognition (CVPR), 2023
Giacomo Zara
Subhankar Roy
Paolo Rota
Elisa Ricci
VLM
244
24
0
03 Apr 2023