Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1411.4389
Cited By
v1
v2
v3
v4 (latest)
Long-term Recurrent Convolutional Networks for Visual Recognition and Description
Computer Vision and Pattern Recognition (CVPR), 2014
17 November 2014
Jeff Donahue
Lisa Anne Hendricks
Marcus Rohrbach
Subhashini Venugopalan
S. Guadarrama
Kate Saenko
Trevor Darrell
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Long-term Recurrent Convolutional Networks for Visual Recognition and Description"
50 / 1,728 papers shown
Knowledge-enriched Attention Network with Group-wise Semantic for Visual Storytelling
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Tengpeng Li
Hanli Wang
Bin He
Changan Chen
DiffM
244
16
0
10 Mar 2022
Live Laparoscopic Video Retrieval with Compressed Uncertainty
Tong Yu
Pietro Mascagni
J. Verde
J. Marescaux
Didier Mutter
N. Padoy
247
9
0
08 Mar 2022
Parallel Training of GRU Networks with a Multi-Grid Solver for Long Sequences
International Conference on Learning Representations (ICLR), 2022
G. Moon
E. Cyr
143
8
0
07 Mar 2022
Vision-Language Intelligence: Tasks, Representation Learning, and Large Models
Feng Li
Hao Zhang
Yi-Fan Zhang
Shixuan Liu
Jian Guo
L. Ni
Pengchuan Zhang
Lei Zhang
AI4TS
VLM
204
41
0
03 Mar 2022
Temporal Context Matters: Enhancing Single Image Prediction with Disease Progression Representations
Computer Vision and Pattern Recognition (CVPR), 2022
Aishik Konwer
Xuan Xu
Joseph Bae
Chaoyu Chen
Prateek Prasanna
MedIm
258
21
0
02 Mar 2022
ADVISE: ADaptive Feature Relevance and VISual Explanations for Convolutional Neural Networks
The Visual Computer (TVC), 2022
Mohammad Mahdi Dehshibi
Mona Ashtari-Majlan
Gereziher W. Adhane
David Masip
AAML
FAtt
175
4
0
02 Mar 2022
Rethinking Pretraining as a Bridge from ANNs to SNNs
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022
Yihan Lin
Yifan Hu
Shiji Ma
Guo-Qi Li
Dongjie Yu
274
17
0
02 Mar 2022
Colar: Effective and Efficient Online Action Detection by Consulting Exemplars
Computer Vision and Pattern Recognition (CVPR), 2022
Le Yang
Junwei Han
Dingwen Zhang
326
48
0
02 Mar 2022
Skeleton Sequence and RGB Frame Based Multi-Modality Feature Fusion Network for Action Recognition
Xiaoguang Zhu
Ye Zhu
Haoyu Wang
Honglin Wen
Yan Yan
Peilin Liu
242
38
0
23 Feb 2022
Exploiting long-term temporal dynamics for video captioning
World wide web (Bussum) (WWW), 2018
Yuyu Guo
Jingqiu Zhang
Lianli Gao
131
18
0
22 Feb 2022
CaMEL: Mean Teacher Learning for Image Captioning
International Conference on Pattern Recognition (ICPR), 2022
Manuele Barraco
Matteo Stefanini
Marcella Cornia
S. Cascianelli
Lorenzo Baraldi
Rita Cucchiara
ViT
VLM
194
37
0
21 Feb 2022
Shift-Memory Network for Temporal Scene Segmentation
Guo Cheng
J. Zheng
256
0
0
17 Feb 2022
When Did It Happen? Duration-informed Temporal Localization of Narrated Actions in Vlogs
Oana Ignat
Santiago Castro
Yuhang Zhou
Jiajun Bao
Dandan Shan
Amélie Reymond
217
3
0
16 Feb 2022
The influence of labeling techniques in classifying human manipulation movement of different speed
International Conference on Pattern Recognition Applications and Methods (ICPRAM), 2022
Sadique Adnan Siddiqui
L. Gutzeit
Frank Kirchner
60
0
0
04 Feb 2022
Natural Language Descriptions of Deep Visual Features
International Conference on Learning Representations (ICLR), 2022
Evan Hernandez
Sarah Schwettmann
David Bau
Teona Bagashvili
Antonio Torralba
Jacob Andreas
MILM
967
148
0
26 Jan 2022
An Integrated Approach for Video Captioning and Applications
Soheyla Amirian
T. Taha
Khaled Rasheed
H. Arabnia
149
1
0
23 Jan 2022
LTC-GIF: Attracting More Clicks on Feature-length Sports Videos
Ghulam Mujtaba
Jaehyuk Choi
Eun‐Seok Ryu
103
0
0
22 Jan 2022
MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition
Computer Vision and Pattern Recognition (CVPR), 2022
Chao-Yuan Wu
Yanghao Li
K. Mangalam
Haoqi Fan
Bo Xiong
Jitendra Malik
Christoph Feichtenhofer
ViT
479
245
0
20 Jan 2022
Hand-Object Interaction Reasoning
Advanced Video and Signal Based Surveillance (AVSS), 2022
Jian Ma
Dima Damen
182
7
0
13 Jan 2022
OCSampler: Compressing Videos to One Clip with Single-step Sampling
Computer Vision and Pattern Recognition (CVPR), 2022
Jintao Lin
Haodong Duan
Kai-xiang Chen
Dahua Lin
Limin Wang
179
31
0
12 Jan 2022
Adaptive Memory Networks with Self-supervised Learning for Unsupervised Anomaly Detection
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2022
Yu-xin Zhang
Yongfeng Zhang
Yiqiang Chen
Hanchao Yu
Tao Qin
AI4TS
186
78
0
03 Jan 2022
AdaFocus V2: End-to-End Training of Spatial Dynamic Networks for Video Recognition
Computer Vision and Pattern Recognition (CVPR), 2021
Yulin Wang
Yang Yue
Yuanze Lin
Haojun Jiang
Zihang Lai
V. Kulikov
Nikita Orlov
Humphrey Shi
Gao Huang
231
63
0
28 Dec 2021
Synchronized Audio-Visual Frames with Fractional Positional Encoding for Transformers in Video-to-Text Translation
International Conference on Information Photonics (ICIP), 2021
Philipp Harzig
Moritz Einfalt
Rainer Lienhart
ViT
153
3
0
28 Dec 2021
3D Skeleton-based Few-shot Action Recognition with JEANIE is not so Naïve
Lei Wang
Jun Liu
Piotr Koniusz
164
24
0
23 Dec 2021
Wholesale Electricity Price Forecasting using Integrated Long-term Recurrent Convolutional Network Model
Energies (Energies), 2021
Vasudharini Sridharan
Mingjian Tuo
Xingpeng Li
105
32
0
23 Dec 2021
A Survey of Natural Language Generation
ACM Computing Surveys (CSUR), 2021
Chenhe Dong
Hai-Tao Zheng
Haifan Gong
Mengzhao Chen
Junxin Li
Ying Shen
Min Yang
3DV
336
63
0
22 Dec 2021
Driver Drowsiness Detection Using Ensemble Convolutional Neural Networks on YawDD
Rais Mohammad Salman
Mahbubur Rashid
Rupal Roy
M. Ahsan
Zahed Siddique
122
12
0
20 Dec 2021
Adversarial Memory Networks for Action Prediction
Zhiqiang Tao
Yue Bai
Handong Zhao
Sheng Li
Yuanyuan Kong
Y. Fu
GAN
77
3
0
18 Dec 2021
Calorie Aware Automatic Meal Kit Generation from an Image
Ahmad Babaeian Jelodar
Yu Sun
194
2
0
18 Dec 2021
Distillation of Human-Object Interaction Contexts for Action Recognition
Muna Almushyti
Frederick W. Li
282
4
0
17 Dec 2021
Dense Video Captioning Using Unsupervised Semantic Information
Valter Estevam
Rayson Laroca
Hélio Pedrini
David Menotti
230
10
0
15 Dec 2021
Temporal Shuffling for Defending Deep Action Recognition Models against Adversarial Attacks
Ian Ryu
Huan Zhang
Jun-Ho Choi
Cho-Jui Hsieh
Jong-Seok Lee
AAML
209
9
0
15 Dec 2021
SVIP: Sequence VerIfication for Procedures in Videos
Yichen Qian
Weixin Luo
Dongze Lian
Xu Tang
P. Zhao
Shenghua Gao
ViT
327
23
0
13 Dec 2021
Auto-X3D: Ultra-Efficient Video Understanding via Finer-Grained Neural Architecture Search
Lezhi Li
Xinyu Gong
Junru Wu
Humphrey Shi
Zhicheng Yan
Zinan Lin
VGen
155
1
0
09 Dec 2021
E
2
^2
2
(GO)MOTION: Motion Augmented Event Stream for Egocentric Action Recognition
Chiara Plizzari
M. Planamente
Gabriele Goletto
Marco Cannici
Emanuele Gusso
Matteo Matteucci
Barbara Caputo
EgoV
239
69
0
07 Dec 2021
Joint Learning of Localized Representations from Medical Images and Reports
European Conference on Computer Vision (ECCV), 2021
Philipp Muller
Georgios Kaissis
Cong Zou
Daniel Munich
392
113
0
06 Dec 2021
D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding
Dave Zhenyu Chen
Qirui Wu
Matthias Nießner
Angel X. Chang
196
50
0
02 Dec 2021
BEVT: BERT Pretraining of Video Transformers
Rui Wang
Dongdong Chen
Zuxuan Wu
Yinpeng Chen
Xiyang Dai
Xiyang Dai
Yu-Gang Jiang
Luowei Zhou
Lu Yuan
ViT
282
248
0
02 Dec 2021
Neural Attention for Image Captioning: Review of Outstanding Methods
Zanyar Zohourianshahzadi
Jugal Kalita
VLM
191
57
0
29 Nov 2021
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic
Computer Vision and Pattern Recognition (CVPR), 2021
Yoad Tewel
Yoav Shalev
Idan Schwartz
Lior Wolf
VLM
327
235
0
29 Nov 2021
SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning
Computer Vision and Pattern Recognition (CVPR), 2021
Kevin Qinghong Lin
Linjie Li
Chung-Ching Lin
Faisal Ahmed
Zhe Gan
Zicheng Liu
Yumao Lu
Lijuan Wang
ViT
336
299
0
25 Nov 2021
Ice hockey player identification via transformers and weakly supervised learning
Kanav Vats
William J. McNally
Pascale Walters
David A Clausi
John S. Zelek
ViT
152
27
0
22 Nov 2021
DVCFlow: Modeling Information Flow Towards Human-like Video Captioning
Xu Yan
Zhengcong Fei
Shuhui Wang
Qingming Huang
Qi Tian
VGen
242
4
0
19 Nov 2021
An Overview of Backdoor Attacks Against Deep Neural Networks and Possible Defences
Wei Guo
B. Tondi
Mauro Barni
AAML
263
96
0
16 Nov 2021
Co-segmentation Inspired Attention Module for Video-based Computer Vision Tasks
Computer Vision and Image Understanding (CVIU), 2021
Arulkumar Subramaniam
Jayesh Vaidya
Muhammed Ameen
Athira M. Nambiar
Anurag Mittal
362
7
0
14 Nov 2021
Sparse Adversarial Video Attacks with Spatial Transformations
British Machine Vision Conference (BMVC), 2021
Ronghui Mu
Wenjie Ruan
Leandro Soriano Marcolino
Q. Ni
AAML
298
22
0
10 Nov 2021
D-Flow: A Real Time Spatial Temporal Model for Target Area Segmentation
Wentao Lu
Claude Sammut
148
0
0
08 Nov 2021
Evaluating deep transfer learning for whole-brain cognitive decoding
Journal of the Franklin Institute (J. Franklin Inst.), 2021
A. Thomas
U. Lindenberger
Wojciech Samek
K. Müller
AI4CE
134
13
0
01 Nov 2021
Multi-Task and Multi-Modal Learning for RGB Dynamic Gesture Recognition
IEEE Sensors Journal (IEEE Sens. J.), 2021
Dinghao Fan
Hengjie Lu
Shugong Xu
Shan Cao
203
20
0
29 Oct 2021
Attacking Video Recognition Models with Bullet-Screen Comments
AAAI Conference on Artificial Intelligence (AAAI), 2021
Kai-xiang Chen
Zhipeng Wei
Yue Yu
Zuxuan Wu
Yu-Gang Jiang
AAML
227
26
0
29 Oct 2021
Previous
1
2
3
...
5
6
7
...
33
34
35
Next