Attentional Pooling for Action Recognition

4 November 2017
Rohit Girdhar
Deva Ramanan

Papers citing "Attentional Pooling for Action Recognition"

33 / 33 papers shown
Topological Pooling on Graphs
Yuzhou Chen
Yulia R. Gel
17
10
0
25 Mar 2023
3Mformer: Multi-order Multi-mode Transformer for Skeletal Action Recognition
Lei Wang
Piotr Koniusz
ViT
23
45
0
25 Mar 2023
Multimodality Helps Unimodality: Cross-Modal Few-Shot Learning with Multimodal Models
Zhiqiu Lin
Samuel Yu
Zhiyi Kuang
Deepak Pathak
Deva Ramanan
VLM
15
100
0
16 Jan 2023
A Survey on Human Action Recognition
Zhou Shuchang
29
0
0
20 Dec 2022
Inductive Attention for Video Action Anticipation
Tsung-Ming Tai
G. Fiameni
Cheng-Kuang Lee
Simon See
O. Lanz
31
1
0
17 Dec 2022
DroneAttention: Sparse Weighted Temporal Attention for Drone-Camera Based Activity Recognition
Santosh Kumar Yadav
Achleshwar Luthra
Esha Pahwa
K. Tiwari
Heena Rathore
Hari Mohan Pandey
Peter Corcoran
28
12
0
07 Dec 2022
Object-ABN: Learning to Generate Sharp Attention Maps for Action Recognition
Tomoya Nitta
Tsubasa Hirakawa
H. Fujiyoshi
Toru Tamaki
55
0
0
27 Jul 2022
RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning
Xiaojian Ma
Weili Nie
Zhiding Yu
Huaizu Jiang
Chaowei Xiao
Yuke Zhu
Song-Chun Zhu
Anima Anandkumar
ViT
LRM
22
19
0
24 Apr 2022
Gate-Shift-Fuse for Video Action Recognition
Swathikiran Sudhakaran
Sergio Escalera
O. Lanz
20
22
0
16 Mar 2022
The Overlooked Classifier in Human-Object Interaction Recognition
Ying Jin
Yinpeng Chen
Lijuan Wang
Jianfeng Wang
Pei Yu
Lin Liang
Jenq-Neng Hwang
Zicheng Liu
VLM
45
8
0
10 Mar 2022
Temporal-attentive Covariance Pooling Networks for Video Recognition
Zilin Gao
Qilong Wang
Bingbing Zhang
Q. Hu
P. Li
18
24
0
27 Oct 2021
High-order Tensor Pooling with Attention for Action Recognition
Lei Wang
Ke Sun
Piotr Koniusz
22
14
0
11 Oct 2021
ViViT: A Video Vision Transformer
Anurag Arnab
Mostafa Dehghani
G. Heigold
Chen Sun
Mario Lucic
Cordelia Schmid
ViT
30
2,086
0
29 Mar 2021
Learning to Recognize Actions on Objects in Egocentric Video with Attention Dictionaries
Swathikiran Sudhakaran
Sergio Escalera
O. Lanz
EgoV
25
15
0
16 Feb 2021
Coarse Temporal Attention Network (CTA-Net) for Driver's Activity Recognition
Zachary Wharton
Ardhendu Behera
Yonghuai Liu
Nikolaos Bessis
39
35
0
17 Jan 2021
SMART Frame Selection for Action Recognition
Shreyank N. Gowda
Marcus Rohrbach
Laura Sevilla-Lara
15
141
0
19 Dec 2020
Into the Wild with AudioScope: Unsupervised Audio-Visual Separation of On-Screen Sounds
Efthymios Tzinis
Scott Wisdom
A. Jansen
Shawn Hershey
Tal Remez
D. Ellis
J. Hershey
26
68
0
02 Nov 2020
Detecting Hands and Recognizing Physical Contact in the Wild
Supreeth Narasimhaswamy
Trung Nguyen
Minh Hoai
26
40
0
19 Oct 2020
Approximated Bilinear Modules for Temporal Modeling
Xinqi Zhu
Chang Xu
Langwen Hui
Cewu Lu
Dacheng Tao
17
23
0
25 Jul 2020
Multi-Objective Matrix Normalization for Fine-grained Visual Recognition
Shaobo Min
Hantao Yao
Hongtao Xie
Zhengjun Zha
Yongdong Zhang
20
65
0
30 Mar 2020
Actor-Transformers for Group Activity Recognition
Kirill Gavrilyuk
Ryan Sanford
Mehrsan Javan
Cees G. M. Snoek
ViT
19
178
0
28 Mar 2020
PIC: Permutation Invariant Convolution for Recognizing Long-range Activities
Noureldien Hussein
E. Gavves
A. Smeulders
VLM
18
13
0
18 Mar 2020
Adversarial Cross-Domain Action Recognition with Co-Attention
Boxiao Pan
Zhangjie Cao
Ehsan Adeli
Juan Carlos Niebles
ViT
16
103
0
22 Dec 2019
Frontal Low-rank Random Tensors for Fine-grained Action Segmentation
Yan Zhang
Krikamol Muandet
Qianli Ma
Heiko Neumann
Siyu Tang
26
3
0
03 Jun 2019
Video Action Transformer Network
Rohit Girdhar
João Carreira
Carl Doersch
Andrew Zisserman
ViT
28
702
0
06 Dec 2018
Timeception for Complex Action Recognition
Noureldien Hussein
E. Gavves
A. Smeulders
16
212
0
04 Dec 2018
Learning to match transient sound events using attentional similarity for few-shot sound recognition
Szu-Yu Chou
Kai-Hsiang Cheng
J. Jang
Yi-Hsuan Yang
13
59
0
04 Dec 2018
Interpretable Spatio-temporal Attention for Video Action Recognition
Lili Meng
Bo-Lu Zhao
B. Chang
Gao Huang
Wei Sun
Fred Tung
Leonid Sigal
23
82
0
01 Oct 2018
Interaction-aware Spatio-temporal Pyramid Attention Networks for Action Classification
Yang Du
Chunfen Yuan
Bing Li
Lili Zhao
Yangxi Li
Weiming Hu
67
79
0
03 Aug 2018
Actor-Centric Relation Network
Chen Sun
Abhinav Shrivastava
Carl Vondrick
Kevin Patrick Murphy
Rahul Sukthankar
Cordelia Schmid
36
220
0
28 Jul 2018
Deep Attentional Structured Representation Learning for Visual Recognition
K. K. Nakka
Mathieu Salzmann
20
10
0
14 May 2018
Detect-and-Track: Efficient Pose Estimation in Videos
Rohit Girdhar
Georgia Gkioxari
Lorenzo Torresani
Manohar Paluri
Du Tran
3DH
18
229
0
26 Dec 2017
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification
Saining Xie
Chen Sun
Jonathan Huang
Z. Tu
Kevin Patrick Murphy
3DH
11
1,307
0
13 Dec 2017