ResearchTrend.AI
ClusterFormer: Clustering As A Universal Visual Learner

22 September 2023
James Liang
Yiming Cui
Qifan Wang
Tong Geng
Wenguan Wang
Dongfang Liu
    VLM

Papers citing "ClusterFormer: Clustering As A Universal Visual Learner"

10 / 10 papers shown
DVHGNN: Multi-Scale Dilated Vision HGNN for Efficient Vision Recognition
Caoshuo Li
Tanzhe Li
Xiaobin Hu
Donghao Luo
Taisong Jin
53
0
0
19 Mar 2025
Lightweight Improved Residual Network for Efficient Inverse Tone Mapping
Liqi Xue
Tian-Ming Xu
Yong-ji Song
Yan Liu
Lei Zhang
Xiantong Zhen
Jun Xu
21
0
0
08 Jul 2023
CLUSTSEG: Clustering for Universal Segmentation
James Liang
Tianfei Zhou
Dongfang Liu
Wenguan Wang
VLM
59
47
0
03 May 2023
Image as Set of Points
Xu Ma
Yuqian Zhou
Huan Wang
Can Qin
Bin Sun
Chang Liu
Yun Fu
VLM
35
48
0
02 Mar 2023
Robust Domain Adaptive Object Detection with Unified Multi-Granularity Alignment
Libo Zhang
Wenzhang Zhou
Heng Fan
Tiejian Luo
Haibin Ling
ObjD
19
12
0
01 Jan 2023
Visual Recognition with Deep Nearest Centroids
Wenguan Wang
Cheng Han
Tianfei Zhou
Dongfang Liu
49
89
0
15 Sep 2022
GroupViT: Semantic Segmentation Emerges from Text Supervision
Jiarui Xu
Shalini De Mello
Sifei Liu
Wonmin Byeon
Thomas Breuel
Jan Kautz
X. Wang
ViT
VLM
175
494
0
22 Feb 2022
TF-Blender: Temporal Feature Blender for Video Object Detection
Yiming Cui
Liqi Yan
Zhiwen Cao
Dongfang Liu
ViT
48
97
0
12 Aug 2021
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
263
3,538
0
24 Feb 2021
ImageNet Large Scale Visual Recognition Challenge
Olga Russakovsky
Jia Deng
Hao Su
J. Krause
S. Satheesh
...
A. Karpathy
A. Khosla
Michael S. Bernstein
Alexander C. Berg
Li Fei-Fei
VLM
ObjD
279
39,083
0
01 Sep 2014