ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2304.04175
  4. Cited By
Token Boosting for Robust Self-Supervised Visual Transformer
  Pre-training

Token Boosting for Robust Self-Supervised Visual Transformer Pre-training

9 April 2023
Tianjiao Li
Lin Geng Foo
Ping Hu
Xindi Shang
Hossein Rahmani
Zehuan Yuan
J. Liu
ArXivPDFHTML

Papers citing "Token Boosting for Robust Self-Supervised Visual Transformer Pre-training"

11 / 11 papers shown
Title
MultiFuser: Multimodal Fusion Transformer for Enhanced Driver Action
  Recognition
MultiFuser: Multimodal Fusion Transformer for Enhanced Driver Action Recognition
Ruoyu Wang
Wenqian Wang
Jianjun Gao
Dan Lin
Kim-Hui Yap
Bingbing Li
20
0
0
03 Aug 2024
HumMUSS: Human Motion Understanding using State Space Models
HumMUSS: Human Motion Understanding using State Space Models
Arnab Kumar Mondal
Stefano Alletto
Denis Tome
26
4
0
16 Apr 2024
Robust 6DoF Pose Estimation Against Depth Noise and a Comprehensive
  Evaluation on a Mobile Dataset
Robust 6DoF Pose Estimation Against Depth Noise and a Comprehensive Evaluation on a Mobile Dataset
Zixun Huang
Keling Yao
Seth Z. Zhao
Chuanyu Pan
Chenfeng Xu
Kathy Zhuang
Tianjian Xu
Weiyu Feng
Allen Y. Yang
6
0
0
24 Sep 2023
Recent Advances of Local Mechanisms in Computer Vision: A Survey and
  Outlook of Recent Work
Recent Advances of Local Mechanisms in Computer Vision: A Survey and Outlook of Recent Work
Qiangchang Wang
Yilong Yin
8
0
0
02 Jun 2023
Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual
  Tasks
Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks
Zhiyang Chen
Yousong Zhu
Zhaowen Li
Fan Yang
Wei Li
...
Chaoyang Zhao
Liwei Wu
Rui Zhao
Jinqiao Wang
Ming Tang
VLM
VOS
51
15
0
28 Sep 2022
Masked Autoencoders Are Scalable Vision Learners
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
255
7,337
0
11 Nov 2021
Are Transformers More Robust Than CNNs?
Are Transformers More Robust Than CNNs?
Yutong Bai
Jieru Mei
Alan Yuille
Cihang Xie
ViT
AAML
165
212
0
10 Nov 2021
3D Human Action Representation Learning via Cross-View Consistency
  Pursuit
3D Human Action Representation Learning via Cross-View Consistency Pursuit
Linguo Li
Minsi Wang
Bingbing Ni
Hang Wang
Jiancheng Yang
Wenjun Zhang
108
154
0
29 Apr 2021
Emerging Properties in Self-Supervised Vision Transformers
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
283
5,723
0
29 Apr 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy
  Text Supervision
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
293
2,875
0
11 Feb 2021
Feedback Graph Convolutional Network for Skeleton-based Action
  Recognition
Feedback Graph Convolutional Network for Skeleton-based Action Recognition
Hao-Yu Yang
D. Yan
Ling Zhang
Dong Li
Yunda Sun
Shaodi You
Stephen J. Maybank
28
91
0
17 Mar 2020
1