Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Title
Home
Papers
1705.06950
Cited By
The Kinetics Human Action Video Dataset
19 May 2017
W. Kay
João Carreira
Karen Simonyan
Brian Zhang
Chloe Hillier
Sudheendra Vijayanarasimhan
Fabio Viola
Tim Green
T. Back
Apostol Natsev
Mustafa Suleyman
Andrew Zisserman
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"The Kinetics Human Action Video Dataset"
50 / 2,151 papers shown
Title
Boosting Adversarial Transferability across Model Genus by Deformation-Constrained Warping
AAAI Conference on Artificial Intelligence (AAAI), 2024
Qinliang Lin
Cheng Luo
Zenghao Niu
Xilin He
Weicheng Xie
Yuanbo Hou
Linlin Shen
Siyang Song
AAML
241
25
0
06 Feb 2024
VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation
AAAI Conference on Artificial Intelligence (AAAI), 2024
Jialu Li
Aishwarya Padmakumar
Gaurav Sukhatme
Mohit Bansal
277
10
0
05 Feb 2024
Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization
International Conference on Machine Learning (ICML), 2024
Yang Jin
Zhicheng Sun
Kun Xu
Kun Xu
Liwei Chen
...
Yuliang Liu
Chen Zhang
Yang Song
Kun Gai
Yadong Mu
VGen
213
76
0
05 Feb 2024
Taylor Videos for Action Recognition
International Conference on Machine Learning (ICML), 2024
Lei Wang
Xiuyuan Yuan
Tom Gedeon
Liang Zheng
493
13
0
05 Feb 2024
Time-, Memory- and Parameter-Efficient Visual Adaptation
Computer Vision and Pattern Recognition (CVPR), 2024
Otniel-Bogdan Mercea
Alexey Gritsenko
Cordelia Schmid
Anurag Arnab
VLM
182
22
0
05 Feb 2024
Classification of Tennis Actions Using Deep Learning
Emil Hovad
Therese Hougaard-Jensen
L. H. Clemmensen
70
6
0
04 Feb 2024
Region-Based Representations Revisited
Michal Shlapentokh-Rothman
Ansel Blume
Yao Xiao
Yuqun Wu
TV Sethuraman
Heyi Tao
Jae Yong Lee
Wilfredo Torres
Yu-Xiong Wang
Derek Hoiem
436
14
0
04 Feb 2024
NeuroCine: Decoding Vivid Video Sequences from Human Brain Activties
Jingyuan Sun
Mingxiao Li
Zijiao Chen
Marie-Francine Moens
VGen
255
13
0
02 Feb 2024
A Survey on Generative AI and LLM for Video Generation, Understanding, and Streaming
Pengyuan Zhou
Lin Wang
Zhi Liu
Yanbin Hao
Pan Hui
Sasu Tarkoma
J. Kangasharju
VGen
224
46
0
30 Jan 2024
Computer Vision for Primate Behavior Analysis in the Wild
Richard Vogg
Timo Lüddecke
Jonathan Henrich
Sharmita Dey
Matthias Nuske
...
Alexander Gail
Stefan Treue
H. Scherberger
Florentin Wörgötter
Alexander S. Ecker
388
14
0
29 Jan 2024
MV2MAE: Multi-View Video Masked Autoencoders
Ketul Shah
Robert Crandall
Jie Xu
Peng Zhou
Marian George
Mayank Bansal
Rama Chellappa
227
6
0
29 Jan 2024
Multi-model learning by sequential reading of untrimmed videos for action recognition
Kodai Kamiya
Toru Tamaki
223
0
0
26 Jan 2024
Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities
Computer Vision and Pattern Recognition (CVPR), 2024
Yiyuan Zhang
Xiaohan Ding
Kaixiong Gong
Yixiao Ge
Ying Shan
Xiangyu Yue
ViT
265
11
0
25 Jan 2024
PanAf20K: A Large Video Dataset for Wild Ape Detection and Behaviour Recognition
International Journal of Computer Vision (IJCV), 2024
Otto Brookes
Majid Mirmehdi
Colleen Stephens
Samuel Angedakin
Katherine Corogenes
...
Klaus Zuberbühler
Christophe Boesch
M. Arandjelovic
H. Kühl
T. Burghardt
191
30
0
24 Jan 2024
Interleaving One-Class and Weakly-Supervised Models with Adaptive Thresholding for Unsupervised Video Anomaly Detection
European Conference on Computer Vision (ECCV), 2024
Yongwei Nie
Hao Huang
Chengjiang Long
Qing Zhang
Pradipta Maji
Hongmin Cai
251
6
0
24 Jan 2024
Deep Learning for Computer Vision based Activity Recognition and Fall Detection of the Elderly: a Systematic Review
F. X. Gaya-Morey
Cristina Manresa-Yee
Jose Maria Buades Rubio
143
46
0
22 Jan 2024
ActionHub: A Large-scale Action Video Description Dataset for Zero-shot Action Recognition
Jiaming Zhou
Junwei Liang
Kun-Yu Lin
Jinrui Yang
Wei-Shi Zheng
VLM
266
13
0
22 Jan 2024
M2-CLIP: A Multimodal, Multi-task Adapting Framework for Video Action Recognition
AAAI Conference on Artificial Intelligence (AAAI), 2024
Mengmeng Wang
Jiazheng Xing
Boyuan Jiang
Jun Chen
Jianbiao Mei
Xingxing Zuo
Guang Dai
Jingdong Wang
Yong-Jin Liu
VLM
179
8
0
22 Jan 2024
Detecting Multimedia Generated by Large AI Models: A Survey
Li Lin
Neeraj Gupta
Yue Zhang
Hainan Ren
Chun-Hao Liu
Feng Ding
Xin Eric Wang
Xin Li
Luisa Verdoliva
Shu Hu
821
88
0
22 Jan 2024
Exploring Missing Modality in Multimodal Egocentric Datasets
Merey Ramazanova
Alejandro Pardo
Humam Alwassel
Guohao Li
EgoV
271
7
0
21 Jan 2024
Adversarial Augmentation Training Makes Action Recognition Models More Robust to Realistic Video Distribution Shifts
International Conferences on Pattern Recognition and Artificial Intelligence (ICCPRAI), 2024
Kiyoon Kim
Shreyank N. Gowda
Panagiotis Eustratiadis
Antreas Antoniou
Robert B Fisher
330
2
0
21 Jan 2024
Deep Reinforcement Learning Empowered Activity-Aware Dynamic Health Monitoring Systems
Ziqiang Ye
Yulan Gao
Yue Xiao
Zehui Xiong
Dusit Niyato
59
2
0
19 Jan 2024
GPT4Ego: Unleashing the Potential of Pre-trained Models for Zero-Shot Egocentric Action Recognition
Guangzhao Dai
Xiangbo Shu
Wenhao Wu
Rui Yan
Jiachao Zhang
VLM
389
9
0
18 Jan 2024
Depth Over RGB: Automatic Evaluation of Open Surgery Skills Using Depth Camera
Ido Zuckerman
Nicole Werner
Jonathan Kouchly
Emma Huston
Shannon DiMarco
Paul D Dimusto
S. Laufer
151
3
0
18 Jan 2024
From Coarse to Fine: Efficient Training for Audio Spectrogram Transformers
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Jiu Feng
Mehmet Hamza Erol
Joon Son Chung
Arda Senocak
137
2
0
16 Jan 2024
Transformer-based Video Saliency Prediction with High Temporal Dimension Decoding
Morteza Moradi
S. Palazzo
C. Spampinato
187
7
0
15 Jan 2024
FiGCLIP: Fine-Grained CLIP Adaptation via Densely Annotated Videos
S. DarshanSingh
Zeeshan Khan
Makarand Tapaswi
VLM
CLIP
190
6
0
15 Jan 2024
Collaboratively Self-supervised Video Representation Learning for Action Recognition
IEEE Transactions on Information Forensics and Security (IEEE TIFS), 2024
Jie Zhang
Zhifan Wan
Lanqing Hu
Stephen Lin
Shuzhe Wu
Shiguang Shan
TTA
347
2
0
15 Jan 2024
Hierarchical Augmentation and Distillation for Class Incremental Audio-Visual Video Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Yukun Zuo
Hantao Yao
Liansheng Zhuang
Changsheng Xu
284
5
0
11 Jan 2024
HaltingVT: Adaptive Token Halting Transformer for Efficient Video Recognition
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Qian Wu
Ruoxuan Cui
Yuke Li
Haoqi Zhu
ViT
205
5
0
10 Jan 2024
Dr
2
^2
2
Net: Dynamic Reversible Dual-Residual Networks for Memory-Efficient Finetuning
Computer Vision and Pattern Recognition (CVPR), 2024
Chen Zhao
Shuming Liu
K. Mangalam
Guocheng Qian
Fatimah Zohra
Abdulmohsen Alghannam
Jitendra Malik
Guohao Li
221
7
0
08 Jan 2024
Efficient Multiscale Multimodal Bottleneck Transformer for Audio-Video Classification
Wentao Zhu
247
7
0
08 Jan 2024
Efficient Selective Audio Masked Multimodal Bottleneck Transformer for Audio-Video Classification
Wentao Zhu
130
5
0
08 Jan 2024
MERBench: A Unified Evaluation Benchmark for Multimodal Emotion Recognition
Zheng Lian
Guoying Zhao
Yong Ren
Hao Gu
Haiyang Sun
Lan Chen
Yinan Han
Jianhua Tao
358
26
0
07 Jan 2024
Efficient Bitrate Ladder Construction using Transfer Learning and Spatio-Temporal Features
A. Falahati
Mohammad Karim Safavi
Ardavan Elahi
Farhad Pakdaman
Moncef Gabbouj
AI4TS
123
2
0
06 Jan 2024
Subjective and Objective Analysis of Indian Social Media Video Quality
Sandeep Mishra
Mukul Jha
A. Bovik
178
1
0
05 Jan 2024
SAR-RARP50: Segmentation of surgical instrumentation and Action Recognition on Robot-Assisted Radical Prostatectomy Challenge
Dimitrios Psychogyios
Emanuele Colleoni
Beatrice van Amsterdam
Chih-Yang Li
Shu-Yu Huang
...
Santiago Rodriguez
Juanita Puentes
Pablo Arbelaez
Omid Mohareri
Danail Stoyanov
155
38
0
31 Dec 2023
Masked Modeling for Self-supervised Representation Learning on Vision and Beyond
Siyuan Li
Luyuan Zhang
Zedong Wang
Di Wu
Lirong Wu
...
Jun Xia
Cheng Tan
Yang Liu
Baigui Sun
Stan Z. Li
SSL
263
27
0
31 Dec 2023
A Large-Scale Re-identification Analysis in Sporting Scenarios: the Betrayal of Reaching a Critical Point
David Freire-Obregón
J. Lorenzo-Navarro
Oliverio J. Santana
Daniel Hernández-Sosa
Modesto Castrillón-Santana
CVBM
161
4
0
29 Dec 2023
Multiscale Vision Transformers meet Bipartite Matching for efficient single-stage Action Localization
Computer Vision and Pattern Recognition (CVPR), 2023
Ioanna Ntinou
Enrique Sanchez
Georgios Tzimiropoulos
234
7
0
29 Dec 2023
Video Understanding with Large Language Models: A Survey
Yunlong Tang
Jing Bi
Siting Xu
Luchuan Song
Susan Liang
...
Feng Zheng
Jianguo Zhang
Chenliang Xu
Jiebo Luo
Chenliang Xu
VLM
675
160
0
29 Dec 2023
3DTINC: Time-Equivariant Non-Contrastive Learning for Predicting Disease Progression from Longitudinal OCTs
T. Emre
A. Chakravarty
Antoine Rivail
Dmitrii Lachinov
Oliver Leingang
...
S. Sivaprasad
Daniel Rueckert
A. Lotery
U. Schmidt-Erfurth
Hrvoje Bogunović
MedIm
237
8
0
28 Dec 2023
Deformable Audio Transformer for Audio Event Detection
Wentao Zhu
151
0
0
24 Dec 2023
Classifying Soccer Ball-on-Goal Position Through Kicker Shooting Action
Javier Torón-Artiles
Daniel Hernández-Sosa
Oliverio J. Santana
J. Lorenzo-Navarro
David Freire-Obregón
120
3
0
23 Dec 2023
Video Recognition in Portrait Mode
Mingfei Han
Linjie Yang
Xiaojie Jin
Jiashi Feng
Xiaojun Chang
Heng Wang
189
6
0
21 Dec 2023
Bootstrap Masked Visual Modeling via Hard Patches Mining
Haochen Wang
Junsong Fan
Yuxi Wang
Kaiyou Song
Tiancai Wang
Xiangyu Zhang
Zhaoxiang Zhang
191
6
0
21 Dec 2023
SADA: Semantic adversarial unsupervised domain adaptation for Temporal Action Localization
David Pujol-Perich
Albert Clapés
Sergio Escalera
580
1
0
20 Dec 2023
Collaborative Weakly Supervised Video Correlation Learning for Procedure-Aware Instructional Video Analysis
Tianyao He
Huabin Liu
Yuxi Li
Xiao Ma
Cheng Zhong
Yang Zhang
Weiyao Lin
290
7
0
18 Dec 2023
Traffic Incident Database with Multiple Labels Including Various Perspective Environmental Information
Shota Nishiyama
Takuma Saito
Ryo Nakamura
Go Ohtani
Hirokatsu Kataoka
Kensho Hara
153
0
0
17 Dec 2023
CMOSE: Comprehensive Multi-Modality Online Student Engagement Dataset with High-Quality Labels
Chi-hsuan Wu
Shih-yang Liu
Xijie Huang
Xingbo Wang
Rong Zhang
Luca Minciullo
Wong Kai Yiu
Kenny Kwan
Kwang-Ting Cheng
233
5
0
14 Dec 2023
Previous
1
2
3
...
10
11
12
...
42
43
44
Next