Attentional Pooling for Action Recognition

4 November 2017

Papers citing "Attentional Pooling for Action Recognition"

33 / 33 papers shown

Title
Topological Pooling on Graphs Yuzhou Chen Yulia R. Gel 17 10 0 25 Mar 2023
3Mformer: Multi-order Multi-mode Transformer for Skeletal Action Recognition Lei Wang Piotr Koniusz ViT 23 45 0 25 Mar 2023
Multimodality Helps Unimodality: Cross-Modal Few-Shot Learning with Multimodal Models Zhiqiu Lin Samuel Yu Zhiyi Kuang Deepak Pathak Deva Ramana VLM 15 100 0 16 Jan 2023
A Survey on Human Action Recognition Zhou Shuchang 29 0 0 20 Dec 2022
Inductive Attention for Video Action Anticipation Tsung-Ming Tai G. Fiameni Cheng-Kuang Lee Simon See O. Lanz 31 1 0 17 Dec 2022
DroneAttention: Sparse Weighted Temporal Attention for Drone-Camera Based Activity Recognition Santosh Kumar Yadav Achleshwar Luthra Esha Pahwa K. Tiwari Heena Rathore Hari Mohan Pandey Peter Corcoran 28 12 0 07 Dec 2022
Object-ABN: Learning to Generate Sharp Attention Maps for Action Recognition Tomoya Nitta Tsubasa Hirakawa H. Fujiyoshi Toru Tamaki 55 0 0 27 Jul 2022
RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning Xiaojian Ma Weili Nie Zhiding Yu Huaizu Jiang Chaowei Xiao Yuke Zhu Song-Chun Zhu Anima Anandkumar ViT LRM 22 19 0 24 Apr 2022
Gate-Shift-Fuse for Video Action Recognition Swathikiran Sudhakaran Sergio Escalera O. Lanz 20 22 0 16 Mar 2022
The Overlooked Classifier in Human-Object Interaction Recognition Ying Jin Yinpeng Chen Lijuan Wang Jianfeng Wang Pei Yu Lin Liang Jenq-Neng Hwang Zicheng Liu VLM 45 8 0 10 Mar 2022
Temporal-attentive Covariance Pooling Networks for Video Recognition Zilin Gao Qilong Wang Bingbing Zhang Q. Hu P. Li 18 24 0 27 Oct 2021
High-order Tensor Pooling with Attention for Action Recognition Lei Wang Ke Sun Piotr Koniusz 22 14 0 11 Oct 2021
ViViT: A Video Vision Transformer Anurag Arnab Mostafa Dehghani G. Heigold Chen Sun Mario Lucic Cordelia Schmid ViT 30 2,086 0 29 Mar 2021
Learning to Recognize Actions on Objects in Egocentric Video with Attention Dictionaries Swathikiran Sudhakaran Sergio Escalera O. Lanz EgoV 25 15 0 16 Feb 2021
Coarse Temporal Attention Network (CTA-Net) for Driver's Activity Recognition Zachary Wharton Ardhendu Behera Yonghuai Liu Nikolaos Bessis 39 35 0 17 Jan 2021
SMART Frame Selection for Action Recognition Shreyank N. Gowda Marcus Rohrbach Laura Sevilla-Lara 15 141 0 19 Dec 2020
Into the Wild with AudioScope: Unsupervised Audio-Visual Separation of On-Screen Sounds Efthymios Tzinis Scott Wisdom A. Jansen Shawn Hershey Tal Remez D. Ellis J. Hershey 26 68 0 02 Nov 2020
Detecting Hands and Recognizing Physical Contact in the Wild Supreeth Narasimhaswamy Trung Nguyen Minh Hoai 26 40 0 19 Oct 2020
Approximated Bilinear Modules for Temporal Modeling Xinqi Zhu Chang Xu Langwen Hui Cewu Lu Dacheng Tao 17 23 0 25 Jul 2020
Multi-Objective Matrix Normalization for Fine-grained Visual Recognition Shaobo Min Hantao Yao Hongtao Xie Zhengjun Zha Yongdong Zhang 20 65 0 30 Mar 2020
Actor-Transformers for Group Activity Recognition Kirill Gavrilyuk Ryan Sanford Mehrsan Javan Cees G. M. Snoek ViT 19 178 0 28 Mar 2020
PIC: Permutation Invariant Convolution for Recognizing Long-range Activities Noureldien Hussein E. Gavves A. Smeulders VLM 18 13 0 18 Mar 2020
Adversarial Cross-Domain Action Recognition with Co-Attention Boxiao Pan Zhangjie Cao Ehsan Adeli Juan Carlos Niebles ViT 16 103 0 22 Dec 2019
Frontal Low-rank Random Tensors for Fine-grained Action Segmentation Yan Zhang Krikamol Muandet Qianli Ma Heiko Neumann Siyu Tang 26 3 0 03 Jun 2019
Video Action Transformer Network Rohit Girdhar João Carreira Carl Doersch Andrew Zisserman ViT 28 702 0 06 Dec 2018
Timeception for Complex Action Recognition Noureldien Hussein E. Gavves A. Smeulders 16 212 0 04 Dec 2018
Learning to match transient sound events using attentional similarity for few-shot sound recognition Szu-Yu Chou Kai-Hsiang Cheng J. Jang Yi-Hsuan Yang 13 59 0 04 Dec 2018
Interpretable Spatio-temporal Attention for Video Action Recognition Lili Meng Bo-Lu Zhao B. Chang Gao Huang Wei Sun Fred Tung Leonid Sigal 23 82 0 01 Oct 2018
Interaction-aware Spatio-temporal Pyramid Attention Networks for Action Classification Yang Du Chunfen Yuan Bing Li Lili Zhao Yangxi Li Weiming Hu 67 79 0 03 Aug 2018
Actor-Centric Relation Network Chen Sun Abhinav Shrivastava Carl Vondrick Kevin Patrick Murphy Rahul Sukthankar Cordelia Schmid 36 220 0 28 Jul 2018
Deep Attentional Structured Representation Learning for Visual Recognition K. K. Nakka Mathieu Salzmann 20 10 0 14 May 2018
Detect-and-Track: Efficient Pose Estimation in Videos Rohit Girdhar Georgia Gkioxari Lorenzo Torresani Manohar Paluri Du Tran 3DH 18 229 0 26 Dec 2017
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification Saining Xie Chen Sun Jonathan Huang Z. Tu Kevin Patrick Murphy 3DH 11 1,307 0 13 Dec 2017