End-to-end Learning of Action Detection from Frame Glimpses in Videos

22 November 2015
Serena Yeung
Olga Russakovsky
Greg Mori
Li Fei-Fei
    EgoV

Papers citing "End-to-end Learning of Action Detection from Frame Glimpses in Videos"

50 / 278 papers shown
Title
Prompting Visual-Language Models for Efficient Video Understanding
Chen Ju
Tengda Han
Kunhao Zheng
Ya Zhang
Weidi Xie
VPVLM VLM
356
457
0
08 Dec 2021
DCAN: Improving Temporal Action Detection via Dual Context Aggregation
Guo Chen
Yin-Dong Zheng
Limin Wang
Tong Lu
AI4TS
197
82
0
07 Dec 2021
SEAL: Self-supervised Embodied Active Learning using Exploration and 3D Consistency
Devendra Singh Chaplot
Murtaza Dalal
Saurabh Gupta
Jitendra Malik
Ruslan Salakhutdinov
274
86
0
02 Dec 2021
Graph Convolutional Module for Temporal Action Localization in Videos
Runhao Zeng
Wenbing Huang
Zhuliang Yu
Yu Rong
P. Zhao
Junzhou Huang
Chuang Gan
178
83
0
01 Dec 2021
Weakly-guided Self-supervised Pretraining for Temporal Activity Detection
AAAI Conference on Artificial Intelligence (AAAI), 2021
Kumara Kahatapitiya
Zhou Ren
Haoxiang Li
Zhenyu Wu
Michael S. Ryoo
G. Hua
ViT
161
7
0
26 Nov 2021
Efficient Video Transformers with Spatial-Temporal Token Selection
Junke Wang
Xitong Yang
Hengduo Li
Li Liu
Zuxuan Wu
Yu-Gang Jiang
ViT
166
82
0
23 Nov 2021
Graph Convolution Neural Network For Weakly Supervised Abnormality Localization In Long Capsule Endoscopy Videos
Sodiq Adewole
Philip Fernandez
James A. Jablonski
Andrew Copland
Michael D. Porter
Sana Syed
Donald E. Brown
MedIm
116
2
0
18 Oct 2021
Object-Region Video Transformers
Roei Herzig
Elad Ben-Avraham
K. Mangalam
Amir Bar
Gal Chechik
Anna Rohrbach
Trevor Darrell
Amir Globerson
ViT
361
97
0
13 Oct 2021
Deep Learning-based Action Detection in Untrimmed Videos: A Survey
Elahe Vahdani
Yingli Tian
349
84
0
30 Sep 2021
A Survey on Temporal Sentence Grounding in Videos
Xiaohan Lan
Yitian Yuan
Xin Eric Wang
Zhi Wang
Wenwu Zhu
307
57
0
16 Sep 2021
Efficient Action Recognition Using Confidence Distillation
Shervin Manzuri Shalmani
Fei Chiang
Ronghuo Zheng
296
6
0
05 Sep 2021
Efficient Visual Recognition with Deep Neural Networks: A Survey on Recent Advances and New Directions
Machine Intelligence Research (MIR), 2021
Yang Wu
Dingheng Wang
Xiaotong Lu
Fan Yang
Guoqi Li
Weiming Dong
Jianbo Shi
356
18
0
30 Aug 2021
Autonomous Curiosity for Real-Time Training Onboard Robotic Agents
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2019
Ervin Teng
Bob Iannucci
106
6
0
29 Aug 2021
Multi-Modulation Network for Audio-Visual Event Localization
Hao Wang
Zhengjun Zha
Liang Li
Xuejin Chen
Jiebo Luo
133
2
0
26 Aug 2021
Shifted Chunk Transformer for Spatio-Temporal Representational Learning
Neural Information Processing Systems (NeurIPS), 2021
Xuefan Zha
Wentao Zhu
Tingxun Lv
Sen Yang
Ji Liu
AI4TS ViT
248
30
0
26 Aug 2021
Dynamic Network Quantization for Efficient Video Inference
IEEE International Conference on Computer Vision (ICCV), 2021
Ximeng Sun
Yikang Shen
Chun-Fu Chen
A. Oliva
Rogerio Feris
Kate Saenko
238
52
0
23 Aug 2021
Group-aware Contrastive Regression for Action Quality Assessment
Xumin Yu
Yongming Rao
Wenliang Zhao
Jiwen Lu
Jie Zhou
AI4TS
164
130
0
17 Aug 2021
Foreground-Action Consistency Network for Weakly Supervised Temporal Action Localization
IEEE International Conference on Computer Vision (ICCV), 2021
Linjiang Huang
Liang Wang
Jiaming Song
150
85
0
14 Aug 2021
Enriching Local and Global Contexts for Temporal Action Localization
IEEE International Conference on Computer Vision (ICCV), 2021
Zixin Zhu
Wei Tang
Le Wang
N. Zheng
G. Hua
261
129
0
27 Jul 2021
SRF-Net: Selective Receptive Field Network for Anchor-Free Temporal Action Detection
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021
Ranyu Ning
Can Zhang
Yuexian Zou
105
7
0
29 Jun 2021
Hear Me Out: Fusional Approaches for Audio Augmented Temporal Action Localization
VISIGRAPP (VISIGRAPP), 2021
Anurag Bagchi
Jazib Mahmood
Dolton Fernandes
Ravi Kiran Sarvadevabhatla
355
32
0
27 Jun 2021
Winning the CVPR'2021 Kinetics-GEBD Challenge: Contrastive Learning Approach
Hyolim Kang
Jinwoo Kim
Kyungmin Kim
Taehyun Kim
Seon Joo Kim
95
19
0
22 Jun 2021
End-to-end Temporal Action Detection with Transformer
IEEE Transactions on Image Processing (TIP), 2021
Xiaolong Liu
Qimeng Wang
Yao Hu
Xu Tang
Shiwei Zhang
S. Bai
X. Bai
ViT
278
287
0
18 Jun 2021
FineAction: A Fine-Grained Video Dataset for Temporal Action Localization
IEEE Transactions on Image Processing (TIP), 2021
Lu Dong
Limin Wang
Yali Wang
Xiao Ma
Yu Qiao
257
77
0
24 May 2021
AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition
IEEE International Conference on Computer Vision (ICCV), 2021
Yikang Shen
Chun-Fu Chen
Quanfu Fan
Ximeng Sun
Kate Saenko
A. Oliva
Rogerio Feris
217
58
0
11 May 2021
Adaptive Focus for Efficient Video Recognition
IEEE International Conference on Computer Vision (ICCV), 2021
Yulin Wang
Zhaoxi Chen
Haojun Jiang
Shiji Song
Yizeng Han
Gao Huang
252
110
0
07 May 2021
Unsupervised Discriminative Embedding for Sub-Action Learning in Complex Activities
International Conference on Information Photonics (ICIP), 2021
S. Swetha
Hilde Kuehne
Yogesh S Rawat
M. Shah
121
24
0
30 Apr 2021
Action Unit Memory Network for Weakly Supervised Temporal Action Localization
Computer Vision and Pattern Recognition (CVPR), 2021
Wang Luo
Tianzhu Zhang
Wenfei Yang
Jingen Liu
Tao Mei
Feng Wu
Yongdong Zhang
159
88
0
29 Apr 2021
FrameExit: Conditional Early Exiting for Efficient Video Recognition
Computer Vision and Pattern Recognition (CVPR), 2021
Amir Ghodrati
B. Bejnordi
A. Habibian
234
97
0
27 Apr 2021
Reinforced Attention for Few-Shot Learning and Beyond
Computer Vision and Pattern Recognition (CVPR), 2021
Jie Hong
Pengfei Fang
Weihao Li
Tong Zhang
Christian Simon
Mehrtash Harandi
L. Petersson
158
54
0
09 Apr 2021
The SARAS Endoscopic Surgeon Action Detection (ESAD) dataset: Challenges and methods
V. Bawa
Gurkirt Singh
Francis KapingA
I. Skarga-Bandurova
Elettra Oleari
...
Li Li
Armando Stabile
Francesco Setti
R. Muradore
Fabio Cuzzolin
219
44
0
07 Apr 2021
Anchor-Constrained Viterbi for Set-Supervised Action Segmentation
Computer Vision and Pattern Recognition (CVPR), 2021
Jun Li
S. Todorovic
107
22
0
05 Apr 2021
Attention, please! A survey of Neural Attention Models in Deep Learning
Artificial Intelligence Review (AIR), 2021
Alana de Santana Correia
Esther Luna Colombini
HAI
328
254
0
31 Mar 2021
No frame left behind: Full Video Action Recognition
Computer Vision and Pattern Recognition (CVPR), 2021
X. Liu
S. Pintea
Fatemeh Karimi Nejadasl
Olaf Booij
Jan van Gemert
235
45
0
29 Mar 2021
Variable-rate discrete representation learning
Sander Dieleman
C. Nash
Jesse Engel
Karen Simonyan
BDL DRL
194
32
0
10 Mar 2021
PcmNet: Position-Sensitive Context Modeling Network for Temporal Action Localization
Neurocomputing (Neurocomputing), 2021
Xin Qin
Hanbin Zhao
Guangchen Lin
Hao Zeng
Songcen Xu
Xi Li
179
18
0
09 Mar 2021
Modeling Multi-Label Action Dependencies for Temporal Action Localization
Computer Vision and Pattern Recognition (CVPR), 2021
Praveen Tirupattur
Kevin Duarte
Yogesh S Rawat
M. Shah
250
65
0
04 Mar 2021
Coarse-Fine Networks for Temporal Activity Detection in Videos
Computer Vision and Pattern Recognition (CVPR), 2021
Kumara Kahatapitiya
Michael S. Ryoo
AI4TS
283
45
0
01 Mar 2021
VA-RED$^2$: Video Adaptive Redundancy Reduction
International Conference on Learning Representations (ICLR), 2021
Bowen Pan
Yikang Shen
Camilo Luciano Fosco
Chung-Ching Lin
A. Andonian
Yue Meng
Kate Saenko
A. Oliva
Rogerio Feris
272
19
0
15 Feb 2021
AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition
International Conference on Learning Representations (ICLR), 2021
Yue Meng
Yikang Shen
Chung-Ching Lin
P. Sattigeri
Leonid Karlinsky
Kate Saenko
A. Oliva
Rogerio Feris
291
70
0
10 Feb 2021
Dynamic Neural Networks: A Survey
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Yizeng Han
Gao Huang
Shiji Song
Le Yang
Honghui Wang
Yulin Wang
3DH AI4TS AI4CE
383
794
0
09 Feb 2021
GCF-Net: Gated Clip Fusion Network for Video Action Recognition
Jenhao Hsiao
Jiawei Chen
C. Ho
74
6
0
02 Feb 2021
Activity Graph Transformer for Temporal Action Localization
Megha Nawhal
Greg Mori
247
81
0
21 Jan 2021
3D Human motion anticipation and classification
Emad Barsoum
J. Kender
Zicheng Liu
3DH
117
2
0
31 Dec 2020
SMART Frame Selection for Action Recognition
AAAI Conference on Artificial Intelligence (AAAI), 2020
Shreyank N. Gowda
Marcus Rohrbach
Laura Sevilla-Lara
217
162
0
19 Dec 2020
Multi-shot Temporal Event Localization: a Benchmark
Computer Vision and Pattern Recognition (CVPR), 2020
Xiaolong Liu
Yao Hu
S. Bai
Fei Ding
X. Bai
Juil Sock
191
95
0
17 Dec 2020
A Comprehensive Study of Deep Video Action Recognition
Yi Zhu
Xinyu Li
Chunhui Liu
Mohammadreza Zolfaghari
Yuanjun Xiong
Chongruo Wu
Zhi-Li Zhang
Joseph Tighe
R. Manmatha
Mu Li
VLM AI4TS
272
208
0
11 Dec 2020
TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks
Humam Alwassel
Silvio Giancola
Guohao Li
217
139
0
23 Nov 2020
3D CNNs with Adaptive Temporal Feature Resolutions
Mohsen Fayyaz
Emad Bahrami Rad
Ali Diba
M. Noroozi
Ehsan Adeli
Luc Van Gool
Juergen Gall
3DPC
193
39
0
17 Nov 2020
LAP-Net: Adaptive Features Sampling via Learning Action Progression for Online Action Detection
Sanqing Qu
Guang Chen
Dan Xu
Jinhu Dong
Fan Lu
Alois C. Knoll
129
22
0
16 Nov 2020