ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1411.4389
  4. Cited By
Long-term Recurrent Convolutional Networks for Visual Recognition and
  Description
v1v2v3v4 (latest)

Long-term Recurrent Convolutional Networks for Visual Recognition and Description

Computer Vision and Pattern Recognition (CVPR), 2014
17 November 2014
Jeff Donahue
Lisa Anne Hendricks
Marcus Rohrbach
Subhashini Venugopalan
S. Guadarrama
Kate Saenko
Trevor Darrell
    VLM
ArXiv (abs)PDFHTML

Papers citing "Long-term Recurrent Convolutional Networks for Visual Recognition and Description"

50 / 1,728 papers shown
Title
Exposing Deepfake with Pixel-wise AR and PPG Correlation from Faint
  Signals
Exposing Deepfake with Pixel-wise AR and PPG Correlation from Faint Signals
Maoyu Mao
Jun Yang
CVBMPICV
168
7
0
29 Oct 2021
Audio-visual Representation Learning for Anomaly Events Detection in
  Crowds
Audio-visual Representation Learning for Anomaly Events Detection in Crowds
Junyuan Gao
Maoguo Gong
Xuelong Li
197
33
0
28 Oct 2021
A Variational Graph Autoencoder for Manipulation Action Recognition and
  Prediction
A Variational Graph Autoencoder for Manipulation Action Recognition and Prediction
Gamze Akyol
Sanem Sariel
E. Aksoy
GNNDRLBDL
154
3
0
25 Oct 2021
Recurrence along Depth: Deep Convolutional Neural Networks with
  Recurrent Layer Aggregation
Recurrence along Depth: Deep Convolutional Neural Networks with Recurrent Layer AggregationNeural Information Processing Systems (NeurIPS), 2021
Jingyu Zhao
Yanwen Fang
Guodong Li
118
27
0
22 Oct 2021
Rethinking Generalization Performance of Surgical Phase Recognition with
  Expert-Generated Annotations
Rethinking Generalization Performance of Surgical Phase Recognition with Expert-Generated Annotations
Seungbum Hong
Jiwon Lee
Bokyung Park
Ahmed A. Alwusaibie
Anwar H. Alfadhel
Sunghyun Park
W. Hyung
Min-Kook Choi
108
2
0
22 Oct 2021
Adaptive Bridge between Training and Inference for Dialogue
Adaptive Bridge between Training and Inference for DialogueConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Haoran Xu
Hainan Zhang
Yanyan Zou
Hongshen Chen
Zhuoye Ding
Yanyan Lan
CVBM
88
8
0
22 Oct 2021
ASFormer: Transformer for Action Segmentation
ASFormer: Transformer for Action Segmentation
Fangqiu Yi
Hongyu Wen
Tingting Jiang
ViT
547
235
0
16 Oct 2021
Hand Me Your PIN! Inferring ATM PINs of Users Typing with a Covered Hand
Hand Me Your PIN! Inferring ATM PINs of Users Typing with a Covered Hand
M. Cardaioli
Stefano Cecconello
Mauro Conti
Simone Milani
S. Picek
Eugen Saraci
AAML
152
8
0
15 Oct 2021
Object-Region Video Transformers
Object-Region Video Transformers
Roei Herzig
Elad Ben-Avraham
K. Mangalam
Amir Bar
Gal Chechik
Anna Rohrbach
Trevor Darrell
Amir Globerson
ViT
305
94
0
13 Oct 2021
Video Is Graph: Structured Graph Module for Video Action Recognition
Video Is Graph: Structured Graph Module for Video Action Recognition
Rongjie Li
Xiaojun Wu
Tianyang Xu
322
14
0
12 Oct 2021
Player Tracking and Identification in Ice Hockey
Player Tracking and Identification in Ice Hockey
Kanav Vats
Pascale Walters
M. Fani
David A Clausi
John S. Zelek
229
48
0
06 Oct 2021
Let there be a clock on the beach: Reducing Object Hallucination in
  Image Captioning
Let there be a clock on the beach: Reducing Object Hallucination in Image Captioning
Ali Furkan Biten
L. G. I. Bigorda
Dimosthenis Karatzas
314
80
0
04 Oct 2021
Transfer Learning Approaches for Knowledge Discovery in Grid-based
  Geo-Spatiotemporal Data
Transfer Learning Approaches for Knowledge Discovery in Grid-based Geo-Spatiotemporal Data
Aishwarya Sarkar
Jien Zhang
Chaoqun Lu
Ali Jannesari
AI4CE
254
3
0
02 Oct 2021
Unsupervised Motion Representation Learning with Capsule Autoencoders
Unsupervised Motion Representation Learning with Capsule Autoencoders
Ziwei Xu
Xudong Shen
Yongkang Wong
Mohan S. Kankanhalli
151
26
0
01 Oct 2021
Google Neural Network Models for Edge Devices: Analyzing and Mitigating
  Machine Learning Inference Bottlenecks
Google Neural Network Models for Edge Devices: Analyzing and Mitigating Machine Learning Inference Bottlenecks
Amirali Boroumand
Saugata Ghose
Berkin Akin
Ravi Narayanaswami
Geraldo F. Oliveira
Xiaoyu Ma
Eric Shiu
O. Mutlu
152
97
0
29 Sep 2021
Geometry-Entangled Visual Semantic Transformer for Image Captioning
Geometry-Entangled Visual Semantic Transformer for Image Captioning
Ling Cheng
Wei Wei
Feida Zhu
Yong Liu
Chunyan Miao
ViT
154
3
0
29 Sep 2021
TSM: Temporal Shift Module for Efficient and Scalable Video
  Understanding on Edge Device
TSM: Temporal Shift Module for Efficient and Scalable Video Understanding on Edge DeviceIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Ji Lin
Chuang Gan
Kuan-Chieh Wang
Song Han
163
78
0
27 Sep 2021
Scene Graph Generation for Better Image Captioning?
Scene Graph Generation for Better Image Captioning?
Maximilian Mozes
Martin Schmitt
Vladimir Golkov
Hinrich Schütze
Zorah Lähner
GNN
172
5
0
23 Sep 2021
Prediction of severe thunderstorm events with ensemble deep learning and
  radar data
Prediction of severe thunderstorm events with ensemble deep learning and radar data
Sabrina Guastavino
Michele Piana
Marco Tizzi
F. Cassola
A. Iengo
D. Sacchetti
Enrico Solazzo
F. Benvenuto
156
38
0
20 Sep 2021
Towards High-Quality Temporal Action Detection with Sparse Proposals
Towards High-Quality Temporal Action Detection with Sparse Proposals
Jiannan Wu
Pei Sun
Shoufa Chen
Jiewen Yang
Zihao Qi
Lan Ma
Ping Luo
ViT
125
11
0
18 Sep 2021
Towards optimized actions in critical situations of soccer games with
  deep reinforcement learning
Towards optimized actions in critical situations of soccer games with deep reinforcement learning
Pegah Rahimian
Afshin Oroojlooy
László Toka
68
8
0
14 Sep 2021
PAT: Pseudo-Adversarial Training For Detecting Adversarial Videos
PAT: Pseudo-Adversarial Training For Detecting Adversarial Videos
Nupur Thakur
Baoxin Li
AAML
309
2
0
13 Sep 2021
Learning to Discriminate Information for Online Action Detection:
  Analysis and Application
Learning to Discriminate Information for Online Action Detection: Analysis and ApplicationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Sumin Lee
Hyunjun Eun
Jinyoung Moon
Seokeon Choi
Yoonhyung Kim
Chanho Jung
Changick Kim
191
11
0
08 Sep 2021
Sensor-Augmented Egocentric-Video Captioning with Dynamic Modal
  Attention
Sensor-Augmented Egocentric-Video Captioning with Dynamic Modal AttentionACM Multimedia (ACM MM), 2021
Katsuyuki Nakamura
Hiroki Ohashi
Mitsuhiro Okada
EgoV
192
14
0
07 Sep 2021
DeepFakes: Detecting Forged and Synthetic Media Content Using Machine
  Learning
DeepFakes: Detecting Forged and Synthetic Media Content Using Machine LearningAdvanced Sciences and Technologies for Security Applications (ASTSA), 2021
S. Zobaed
Md Fazle Rabby
Md Istiaq Hossain
Ekram Hossain
Sazib Hasan
Asif Karim
Khan Md. Hasib
AAML
152
19
0
07 Sep 2021
Vision Guided Generative Pre-trained Language Models for Multimodal
  Abstractive Summarization
Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive SummarizationConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Tiezheng Yu
Wenliang Dai
Zihan Liu
Pascale Fung
263
79
0
06 Sep 2021
Efficient Action Recognition Using Confidence Distillation
Efficient Action Recognition Using Confidence Distillation
Shervin Manzuri Shalmani
Fei Chiang
Ronghuo Zheng
268
6
0
05 Sep 2021
Stimuli-Aware Visual Emotion Analysis
Stimuli-Aware Visual Emotion Analysis
Jingyuan Yang
Jie Li
Xiumei Wang
Yuxuan Ding
Xinbo Gao
109
74
0
04 Sep 2021
TrouSPI-Net: Spatio-temporal attention on parallel atrous convolutions
  and U-GRUs for skeletal pedestrian crossing prediction
TrouSPI-Net: Spatio-temporal attention on parallel atrous convolutions and U-GRUs for skeletal pedestrian crossing prediction
Joseph Gesnouin
Steve Pechberti
B. Stanciulescu
Fabien Moutarde
170
33
0
02 Sep 2021
SlowFast Rolling-Unrolling LSTMs for Action Anticipation in Egocentric
  Videos
SlowFast Rolling-Unrolling LSTMs for Action Anticipation in Egocentric Videos
Nada Osman
Guglielmo Camporese
Pasquale Coscia
Lamberto Ballan
EgoV
223
26
0
02 Sep 2021
Conditional Extreme Value Theory for Open Set Video Domain Adaptation
Conditional Extreme Value Theory for Open Set Video Domain AdaptationACM Multimedia Asia (MA), 2021
Zhuoxiao Chen
Yadan Luo
Mahsa Baktash
172
11
0
01 Sep 2021
Reinforcement Learning Based Sparse Black-box Adversarial Attack on
  Video Recognition Models
Reinforcement Learning Based Sparse Black-box Adversarial Attack on Video Recognition ModelsInternational Joint Conference on Artificial Intelligence (IJCAI), 2021
Zeyuan Wang
Chaofeng Sha
Su Yang
AAML
180
19
0
29 Aug 2021
Automated Generation of Accurate \& Fluent Medical X-ray Reports
Automated Generation of Accurate \& Fluent Medical X-ray ReportsConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Hoang T.N. Nguyen
Dong Nie
Taivanbat Badamdorj
Yujie Liu
Yingying Zhu
J. Truong
Li Cheng
MedImLM&MA
121
46
0
27 Aug 2021
Deep Learning-based Spacecraft Relative Navigation Methods: A Survey
Deep Learning-based Spacecraft Relative Navigation Methods: A Survey
Jianing Song
Duarte Rondao
Nabil Aouf
198
89
0
19 Aug 2021
Channel-Temporal Attention for First-Person Video Domain Adaptation
Channel-Temporal Attention for First-Person Video Domain Adaptation
Xianyuan Liu
Shuo Zhou
Tao Lei
Haiping Lu
EgoV
83
0
0
17 Aug 2021
Neural Twins Talk & Alternative Calculations
Neural Twins Talk & Alternative CalculationsInternational Journal of Semantic Computing (IJSC), 2021
Zanyar Zohourianshahzadi
Jugal Kalita
118
0
0
05 Aug 2021
Dual Graph Convolutional Networks with Transformer and Curriculum
  Learning for Image Captioning
Dual Graph Convolutional Networks with Transformer and Curriculum Learning for Image CaptioningACM Multimedia (ACM MM), 2021
Xinzhi Dong
Chengjiang Long
Wenju Xu
Chunxia Xiao
ViT
249
71
0
05 Aug 2021
Brain-Inspired Deep Imitation Learning for Autonomous Driving Systems
Brain-Inspired Deep Imitation Learning for Autonomous Driving Systems
Hasan Bayarov Ahmedov
Dewei Yi
Jien-De Sui
94
5
0
30 Jul 2021
Unsupervised Deep Anomaly Detection for Multi-Sensor Time-Series Signals
Unsupervised Deep Anomaly Detection for Multi-Sensor Time-Series SignalsIEEE Transactions on Knowledge and Data Engineering (TKDE), 2021
Yu-xin Zhang
Yiqiang Chen
Yongfeng Zhang
Zhiwen Pan
AI4TS
171
268
0
27 Jul 2021
Towards Efficient Tensor Decomposition-Based DNN Model Compression with
  Optimization Framework
Towards Efficient Tensor Decomposition-Based DNN Model Compression with Optimization FrameworkComputer Vision and Pattern Recognition (CVPR), 2021
Miao Yin
Yang Sui
Siyu Liao
Bo Yuan
129
96
0
26 Jul 2021
Spatial-Temporal Transformer for Dynamic Scene Graph Generation
Spatial-Temporal Transformer for Dynamic Scene Graph GenerationIEEE International Conference on Computer Vision (ICCV), 2021
Yuren Cong
Wentong Liao
H. Ackermann
Bodo Rosenhahn
M. Yang
ViT
238
148
0
26 Jul 2021
From Show to Tell: A Survey on Deep Learning-based Image Captioning
From Show to Tell: A Survey on Deep Learning-based Image CaptioningIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Matteo Stefanini
Marcella Cornia
Lorenzo Baraldi
S. Cascianelli
G. Fiameni
Rita Cucchiara
3DVVLMMLLM
379
343
0
14 Jul 2021
Split, embed and merge: An accurate table structure recognizer
Split, embed and merge: An accurate table structure recognizer
Zhenrong Zhang
Jianshu Zhang
Jun Du
LMTD
429
69
0
12 Jul 2021
Universal 3-Dimensional Perturbations for Black-Box Attacks on Video
  Recognition Systems
Universal 3-Dimensional Perturbations for Black-Box Attacks on Video Recognition SystemsIEEE Symposium on Security and Privacy (IEEE S&P), 2021
Shangyu Xie
Zheng Chen
Yu Kong
Yuan Hong
AAML
206
29
0
09 Jul 2021
Long Short-Term Transformer for Online Action Detection
Long Short-Term Transformer for Online Action DetectionNeural Information Processing Systems (NeurIPS), 2021
Mingze Xu
Yuanjun Xiong
Hao Chen
Xinyu Li
Wei Xia
Zhuowen Tu
Stefano Soatto
ViT
226
169
0
07 Jul 2021
Egocentric Image Captioning for Privacy-Preserved Passive Dietary Intake
  Monitoring
Egocentric Image Captioning for Privacy-Preserved Passive Dietary Intake MonitoringIEEE Transactions on Cybernetics (IEEE Trans. Cybern.), 2021
Jianing Qiu
Frank P.-W. Lo
Xiao Gu
M. Jobarteh
Wenyan Jia
...
M. McCrory
Edward Sazonov
Mingui Sun
Gary Frost
Benny Lo
EgoV
126
20
0
01 Jul 2021
iMiGUE: An Identity-free Video Dataset for Micro-Gesture Understanding
  and Emotion Analysis
iMiGUE: An Identity-free Video Dataset for Micro-Gesture Understanding and Emotion AnalysisComputer Vision and Pattern Recognition (CVPR), 2021
Xin Liu
Henglin Shi
Zhaodong Sun
Zitong Yu
Amritpal Singh
Guoying Zhao
196
106
0
01 Jul 2021
MissFormer: (In-)attention-based handling of missing observations for
  trajectory filtering and prediction
MissFormer: (In-)attention-based handling of missing observations for trajectory filtering and predictionInternational Symposium on Visual Computing (ISVC), 2021
S. Becker
Ronny Hug
Wolfgang Hubner
Michael Arens
B. Morris
208
4
0
30 Jun 2021
Exploring Temporal Context and Human Movement Dynamics for Online Action
  Detection in Videos
Exploring Temporal Context and Human Movement Dynamics for Online Action Detection in VideosEuropean Signal Processing Conference (EUSIPCO), 2021
V. Vasileiou
N. Kardaris
Petros Maragos
87
2
0
26 Jun 2021
Vision-based Behavioral Recognition of Novelty Preference in Pigs
Vision-based Behavioral Recognition of Novelty Preference in Pigs
Aniket Shirke
Rebecca K. Golden
Mrinal Gautam
Angela Green-Miller
M. Caesar
R. Dilger
83
5
0
23 Jun 2021
Previous
123...678...333435
Next