VideoMix: Rethinking Data Augmentation for Video Classification


7 December 2020
Sangdoo Yun
Seong Joon Oh
Byeongho Heo
Dongyoon Han
Jinhyung Kim

Papers citing "VideoMix: Rethinking Data Augmentation for Video Classification"

46 / 46 papers shown
Augmenting Moment Retrieval: Zero-Dependency Two-Stage Learning
Zhengxuan Wei
Jiajin Tang
Sibei Yang
VLM
160
0
0
22 Oct 2025
MoExDA: Domain Adaptation for Edge-based Action Recognition
Takuya Sugimoto
Ning Ding
Toru Tamaki
180
0
0
05 Aug 2025
MUG: Pseudo Labeling Augmented Audio-Visual Mamba Network for Audio-Visual Video Parsing
Langyu Wang
Bingke Zhu
Yingying Chen
Yiyuan Zhang
Ming Tang
Jinqiao Wang
VLM
315
1
0
02 Jul 2025
The Role of Video Generation in Enhancing Data-Limited Action Understanding
International Joint Conference on Artificial Intelligence (IJCAI), 2025
Wei Li
Dezhao Luo
Dongbao Yang
Zhenhang Li
Weiping Wang
Can Ma
DiffM
VGen
611
0
0
26 May 2025
MomentMix Augmentation with Length-Aware DETR for Temporally Robust Moment Retrieval
Sangkwon Park
Jiho Choi
Kyungjune Baek
Hyunjung Shim
258
0
0
30 Dec 2024
VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by Video Spatiotemporal Augmentation
Computer Vision and Pattern Recognition (CVPR), 2024
Weiming Ren
Huan Yang
Jie Min
Cong Wei
Lei Ma
912
9
0
01 Dec 2024
SDI-Paste: Synthetic Dynamic Instance Copy-Paste for Video Instance Segmentation
Sahir Shrestha
Weihao Li
Gao Zhu
Nick Barnes
DiffM
218
1
0
16 Oct 2024
Data Collection-free Masked Video Modeling
European Conference on Computer Vision (ECCV), 2024
Yuchi Ishikawa
Masayoshi Kondo
Yoshimitsu Aoki
ViT
210
1
0
10 Sep 2024
A Simple Background Augmentation Method for Object Detection with Diffusion Model
European Conference on Computer Vision (ECCV), 2024
Yuhang Li
Jun Gao
Chen Chen
Yue Zhang
Jielei Zhang
DiffM
306
15
0
01 Aug 2024
SIAVC: Semi-Supervised Framework for Industrial Accident Video Classification
Zuoyong Li
Qinghua Lin
Haoyi Fan
Tiesong Zhao
David Zhang
271
5
0
23 May 2024
Benchmarking the Robustness of Temporal Action Detection Models Against Temporal Corruptions
Runhao Zeng
Xiaoyong Chen
Jiaming Liang
Huisi Wu
Guangzhong Cao
Yong Guo
AAML
341
7
0
29 Mar 2024
Don't Judge by the Look: Towards Motion Coherent Video Representation
International Conference on Learning Representations (ICLR), 2024
Yitian Zhang
Yue Bai
Huan Wang
Yizhou Wang
Yun Fu
258
3
0
14 Mar 2024
PASCL: Supervised Contrastive Learning with Perturbative Augmentation for Particle Decay Reconstruction
Junjian Lu
Houcheng Su
Dmitrii Kobylianski
Etienne Dreyer
Eilam Gross
176
3
0
18 Feb 2024
Reimagining Reality: A Comprehensive Survey of Video Inpainting Techniques
Shreyank N. Gowda
Yash Thakre
Shashank Narayana Gowda
Xiaobo Jin
212
0
0
31 Jan 2024
Green Edge AI: A Contemporary Survey
Proceedings of the IEEE (Proc. IEEE), 2023
Yuyi Mao
X. Yu
Kaibin Huang
Ying-Jun Angela Zhang
Jun Zhang
398
56
0
01 Dec 2023
Learning Human Action Recognition Representations Without Real Humans
Neural Information Processing Systems (NeurIPS), 2023
Howard Zhong
Samarth Mishra
Donghyun Kim
SouYoung Jin
Yikang Shen
Hildegard Kuehne
Leonid Karlinsky
Venkatesh Saligrama
Aude Oliva
Rogerio Feris
273
6
0
10 Nov 2023
Training Robust Deep Physiological Measurement Models with Synthetic Video-based Data
Yuxuan Ou
Yuzhe Zhang
Yuntang Wang
Shwetak N. Patel
Daniel J. McDuff
Yuzhe Yang
Xin Liu
341
0
0
09 Nov 2023
FireMatch: A Semi-Supervised Video Fire Detection Network Based on Consistency and Distribution Alignment
Qinghua Lin
Zuoyong Li
Kun Zeng
Haoyi Fan
Wei Li
Xiaoguang Zhou
176
12
0
09 Nov 2023
S3Aug: Segmentation, Sampling, and Shift for Action Recognition
Taiki Sugiura
Toru Tamaki
AI4TS
211
6
0
23 Oct 2023
Hate Speech Detection in Limited Data Contexts using Synthetic Data Generation
Aman Khullar
Daniel K. Nkemelu
Cuong V. Nguyen
Michael L. Best
229
10
0
04 Oct 2023
Selective Volume Mixup for Video Action Recognition
Yi Tan
Zhaofan Qiu
Y. Hao
Ting Yao
Xiangnan He
Tao Mei
ViT
212
4
0
18 Sep 2023
Leveraging the Power of Data Augmentation for Transformer-based Tracking
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Jie Zhao
Johan Edstedt
Michael Felsberg
D. Wang
Huchuan Lu
ViT
246
15
0
15 Sep 2023
Video OWL-ViT: Temporally-consistent open-world localization in video
IEEE International Conference on Computer Vision (ICCV), 2023
G. Heigold
Matthias Minderer
A. Gritsenko
Alex Bewley
Daniel Keysers
Mario Lučić
Feng Yu
Thomas Kipf
VLM
259
18
0
22 Aug 2023
The Effects of Mixed Sample Data Augmentation are Class Dependent
Haeil Lee
Han S. Lee
Junmo Kim
299
1
0
18 Jul 2023
Video4MRI: An Empirical Study on Brain Magnetic Resonance Image Analytics with CNN-based Video Classification Frameworks
IEEE International Symposium on Biomedical Imaging (ISBI), 2023
Yuxuan Zhang
Qingzhong Wang
Jiang Bian
Yi Liu
Yanwu Xu
Dejing Dou
Haoyi Xiong
128
6
0
24 Feb 2023
A Survey of Mix-based Data Augmentation: Taxonomy, Methods, Applications, and Explainability
ACM Computing Surveys (ACM CSUR), 2022
Chengtai Cao
Fan Zhou
Yurou Dai
Jianping Wang
Kunpeng Zhang
AAML
313
51
0
21 Dec 2022
Test-Time Mixup Augmentation for Data and Class-Specific Uncertainty Estimation in Deep Learning Image Classification
Han S. Lee
Haeil Lee
H. Hong
Junmo Kim
UQCV
417
0
0
01 Dec 2022
Mitigating and Evaluating Static Bias of Action Representations in the Background and the Foreground
IEEE International Conference on Computer Vision (ICCV), 2022
Haoxin Li
Yuan Liu
Hanwang Zhang
Boyang Li
281
25
0
23 Nov 2022
A Unified Multimodal De- and Re-coupling Framework for RGB-D Motion Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Benjia Zhou
Pichao Wang
Jun Wan
Yan-Ni Liang
Fan Wang
213
29
0
16 Nov 2022
Beyond Instance Discrimination: Relation-aware Contrastive Self-supervised Learning
IEEE Transactions on Multimedia (IEEE TMM), 2022
Yifei Zhang
Chang-rui Liu
Can Ma
Weiping Wang
QiXiang Ye
Xiangyang Ji
SSL
ISeg
BDL
196
10
0
02 Nov 2022
Baby Physical Safety Monitoring in Smart Home Using Action Recognition System
SoutheastCon (SoutheastCon), 2022
Victor A. Adewopo
Nelly Elsayed
Kelly Anderson
207
8
0
22 Oct 2022
A Unified Analysis of Mixed Sample Data Augmentation: A Loss Function Perspective
Neural Information Processing Systems (NeurIPS), 2022
Chanwoo Park
Sangdoo Yun
Sanghyuk Chun
AAML
207
39
0
21 Aug 2022
A Feature-space Multimodal Data Augmentation Technique for Text-video Retrieval
ACM Multimedia (ACM MM), 2022
Alex Falcon
G. Serra
Oswald Lanz
VGen
203
29
0
03 Aug 2022
An Efficient Framework for Few-shot Skeleton-based Temporal Action Segmentation
Leiyang Xu
Qianqian Wang
Xiaotian Lin
Lin Yuan
MedIm
115
6
0
20 Jul 2022
Exploring Temporally Dynamic Data Augmentation for Video Recognition
International Conference on Learning Representations (ICLR), 2022
Taeoh Kim
Jinhyung Kim
Minho Shim
Sangdoo Yun
Myunggu Kang
Dongyoon Wee
Sangyoun Lee
AI4TS
219
17
0
30 Jun 2022
Learn2Augment: Learning to Composite Videos for Data Augmentation in Action Recognition
European Conference on Computer Vision (ECCV), 2022
Shreyank N. Gowda
Marcus Rohrbach
Frank Keller
Laura Sevilla-Lara
215
46
0
09 Jun 2022
ObjectMix: Data Augmentation by Copy-Pasting Objects in Videos for Action Recognition
ACM Multimedia Asia (MA), 2022
Jun Kimata
Tomoya Nitta
Toru Tamaki
264
13
0
01 Apr 2022
AugLy: Data Augmentations for Robustness
Zoe Papakipos
Joanna Bitton
AAML
168
55
0
17 Jan 2022
Cross-modal Manifold Cutmix for Self-supervised Video Representation Learning
Srijan Das
Michael S. Ryoo
SSL
278
1
0
07 Dec 2021
ViewCLR: Learning Self-supervised Video Representation for Unseen Viewpoints
Srijan Das
Michael S. Ryoo
SSL
211
30
0
07 Dec 2021
Evaluating Transformers for Lightweight Action Recognition
Raivo Koot
Markus Hennerbichler
Haiping Lu
ViT
224
8
0
18 Nov 2021
Overview of Tencent Multi-modal Ads Video Understanding Challenge
Zhenzhi Wang
Liyu Wu
Zhimin Li
Jiangfeng Xiong
Qinglin Lu
144
5
0
16 Sep 2021
Scene-aware Learning Network for Radar Object Detection
Zangwei Zheng
Xiangyu Yue
Kurt Keutzer
Alberto L. Sangiovanni-Vincentelli
259
12
0
03 Jul 2021
VideoLT: Large-scale Long-tailed Video Recognition
IEEE International Conference on Computer Vision (ICCV), 2021
Xing Zhang
Zuxuan Wu
Zejia Weng
Huazhu Fu
Yue Yu
Yu-Gang Jiang
Larry S. Davis
278
48
0
06 May 2021
Learning Representational Invariances for Data-Efficient Action Recognition
Computer Vision and Image Understanding (CVIU), 2021
Yuliang Zou
Jinwoo Choi
Qitong Wang
Jia-Bin Huang
303
45
0
30 Mar 2021
An Empirical Study and Analysis on Open-Set Semi-Supervised Learning
Huixiang Luo
Hao Cheng
Fanxu Meng
Yuting Gao
Ke Li
Mengdan Zhang
Xing Sun
186
8
0
19 Jan 2021