Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2004.04968
Cited By
Would Mega-scale Datasets Further Enhance Spatiotemporal 3D CNNs?
10 April 2020
Hirokatsu Kataoka
Tenga Wakamiya
Kensho Hara
Y. Satoh
3DPC
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Would Mega-scale Datasets Further Enhance Spatiotemporal 3D CNNs?"
30 / 30 papers shown
ERDES: A Benchmark Video Dataset for Retinal Detachment and Macular Status Classification in Ocular Ultrasound
Pouyan Navard
Yasemin Ozkut
S. Adhikari
Elaine Situ-LaCasse
Josie Acuña
Adrienne Yarnish
Alper Yilmaz
85
0
0
05 Aug 2025
Physics-Guided Motion Loss for Video Generation Model
Bowen Xue
G. C. Guarnera
Shuang Zhao
Zahra Montazeri
DiffM
VGen
251
0
0
02 Jun 2025
Measuring Error Alignment for Decision-Making Systems
AAAI Conference on Artificial Intelligence (AAAI), 2024
Binxia Xu
Antonis Bikakis
Daniel Onah
A. Vlachidis
Luke Dickens
476
1
0
03 Jan 2025
Traffic Incident Database with Multiple Labels Including Various Perspective Environmental Information
Shota Nishiyama
Takuma Saito
Ryo Nakamura
Go Ohtani
Hirokatsu Kataoka
Kensho Hara
198
0
0
17 Dec 2023
QAFE-Net: Quality Assessment of Facial Expressions with Landmark Heatmaps
Shuchao Duan
Amirhossein Dadashzadeh
Alan Whone
Majid Mirmehdi
CVBM
408
2
0
01 Dec 2023
FireMatch: A Semi-Supervised Video Fire Detection Network Based on Consistency and Distribution Alignment
Qinghua Lin
Zuoyong Li
Kun Zeng
Haoyi Fan
Wei Li
Xiaoguang Zhou
226
14
0
09 Nov 2023
Subtle Signals: Video-based Detection of Infant Non-nutritive Sucking as a Neurodevelopmental Cue
Shaotong Zhu
Michael Wan
Sai Kumar Reddy Manne
Emily B. Zimmerman
Sarah Ostadabbas
178
2
0
24 Oct 2023
Chop & Learn: Recognizing and Generating Object-State Compositions
IEEE International Conference on Computer Vision (ICCV), 2023
Nirat Saini
Hanyu Wang
Archana Swaminathan
Vinoj Jayasundara
Bo He
Kamal Gupta
Abhinav Shrivastava
CoGe
318
20
0
25 Sep 2023
Multimodal Distillation for Egocentric Action Recognition
IEEE International Conference on Computer Vision (ICCV), 2023
Gorjan Radevski
Dusan Grujicic
Marie-Francine Moens
Matthew Blaschko
Tinne Tuytelaars
EgoV
407
38
0
14 Jul 2023
Anatomically aware dual-hop learning for pulmonary embolism detection in CT pulmonary angiograms
Florin Condrea
S. Rapaka
Lucian Itu
Puneet Sharma
J. Sperl
Mohamed Ali
Marius Leordeanu
178
9
0
30 Mar 2023
Hand Gestures Recognition in Videos Taken with Lensless Camera
Optics Express (OE), 2022
Yinger Zhang
Zhouyi Wu
Peiying Lin
Yang Pan
Yuting Wu
Liufang Zhang
J. Huangfu
3DH
197
6
0
15 Oct 2022
Adaptive occlusion sensitivity analysis for visually explaining video recognition networks
Tomoki Uchiyama
Naoya Sogi
S. Iizuka
Koichiro Niinuma
Kazuhiro Fukui
357
3
0
26 Jul 2022
Analysis and Extensions of Adversarial Training for Video Classification
K. A. Kinfu
René Vidal
AAML
271
16
0
16 Jun 2022
Deep Neural Network approaches for Analysing Videos of Music Performances
F. Liwicki
Richa Upadhyay
Prakash Chandra Chhipa
Killian Murphy
F. Visi
S. Östersjö
Marcus Liwicki
186
1
0
05 May 2022
On Negative Sampling for Audio-Visual Contrastive Learning from Movies
Mahdi M. Kalayeh
Shervin Ardeshir
Lingyi Liu
Nagendra Kamath
Ashok Chandrashekar
SSL
210
3
0
29 Apr 2022
Learning from Temporal Gradient for Semi-supervised Action Recognition
Computer Vision and Pattern Recognition (CVPR), 2021
Junfei Xiao
Longlong Jing
Lin Zhang
Ju He
Qi She
Zongwei Zhou
Alan Yuille
Yingwei Li
293
68
0
25 Nov 2021
Advancing COVID-19 Diagnosis with Privacy-Preserving Collaboration in Artificial Intelligence
Xiang Bai
Hanchen Wang
Liya Ma
Yongchao Xu
Jiefeng Gan
...
C. Zheng
Jianming Wang
Zhen Li
Carola-Bibiane Schönlieb
Tian Xia
FedML
147
73
0
18 Nov 2021
Unsupervised Action Localization Crop in Video Retargeting for 3D ConvNets
IEEE Region 10 Conference (TENCON), 2021
Prithwish Jana
Swarnabja Bhaumik
Partha Pratim Mohanta
199
4
0
14 Nov 2021
Revisiting spatio-temporal layouts for compositional action recognition
British Machine Vision Conference (BMVC), 2021
Gorjan Radevski
Marie-Francine Moens
Tinne Tuytelaars
278
29
0
02 Nov 2021
AdaPool: Exponential Adaptive Pooling for Information-Retaining Downsampling
IEEE Transactions on Image Processing (TIP), 2021
Alexandros Stergiou
R. Poppe
345
141
0
01 Nov 2021
Sign Language Recognition via Skeleton-Aware Multi-Model Ensemble
Songyao Jiang
Bin Sun
Lichen Wang
Yue Bai
Kunpeng Li
Y. Fu
SLR
278
55
0
12 Oct 2021
PIP: Physical Interaction Prediction via Mental Simulation with Span Selection
European Conference on Computer Vision (ECCV), 2021
Jiafei Duan
Samson Yu
Soujanya Poria
Bihan Wen
Cheston Tan
335
9
0
10 Sep 2021
VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer
Zineng Tang
Jaemin Cho
Hao Tan
Joey Tianyi Zhou
VLM
271
35
0
06 Jul 2021
Watching Too Much Television is Good: Self-Supervised Audio-Visual Representation Learning from Movies and TV Shows
Mahdi M. Kalayeh
Nagendra Kamath
Lingyi Liu
Ashok Chandrashekar
SSL
140
3
0
16 Jun 2021
Skimming and Scanning for Untrimmed Video Action Recognition
Yunyan Hong
Ailing Zeng
Min Li
Cewu Lu
Li Jiang
Qiang Xu
211
0
0
21 Apr 2021
Skeleton Aware Multi-modal Sign Language Recognition
Songyao Jiang
Bin Sun
Lichen Wang
Yue Bai
Kunpeng Li
Y. Fu
SLR
314
252
0
16 Mar 2021
TCLR: Temporal Contrastive Learning for Video Representation
Computer Vision and Image Understanding (CVIU), 2021
I. Dave
Rohit Gupta
Mamshad Nayeem Rizve
Mubarak Shah
SSL
AI4TS
475
220
0
20 Jan 2021
Refining activation downsampling with SoftPool
IEEE International Conference on Computer Vision (ICCV), 2021
Alexandros Stergiou
R. Poppe
Grigorios Kalliatakis
392
194
0
02 Jan 2021
Multi-Temporal Convolutions for Human Action Recognition in Videos
Alexandros Stergiou
R. Poppe
247
1
0
08 Nov 2020
Actor-Action Video Classification CSC 249/449 Spring 2020 Challenge Report
Jing Shi
Zhiheng Li
Haitian Zheng
Yihang Xu
Tianyou Xiao
...
R. Magnotti
A. Sexton
Jeet Thaker
Oscar Su
Chenliang Xu
179
1
0
01 Aug 2020
1
Page 1 of 1