Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2011.09315
Cited By
v1
v2 (latest)
End-to-End Object Detection with Adaptive Clustering Transformer
British Machine Vision Conference (BMVC), 2020
18 November 2020
Minghang Zheng
Shiyang Feng
Renrui Zhang
Kunchang Li
Xiaogang Wang
Jiaming Song
Hao Dong
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Github (172★)
Papers citing
"End-to-End Object Detection with Adaptive Clustering Transformer"
50 / 104 papers shown
End-to-End On-Device Quantization-Aware Training for LLMs at Inference Cost
Qitao Tan
Xiaoying Song
Jin Lu
Guoming Li
Jun Liu
...
Jundong Li
Xiaoming Zhai
Shaoyi Huang
Wei Niu
Geng Yuan
MQ
271
0
0
21 Aug 2025
A 2D Semantic-Aware Position Encoding for Vision Transformers
Xi Chen
Shiyang Zhou
Muqi Huang
Jiaxu Feng
Yun Xiong
...
Yujiao Shi
Huishuai Bao
Sijia Peng
Chong Li
Feng Shi
ViT
393
2
0
14 May 2025
Context Aware Grounded Teacher for Source Free Object Detection
Tajamul Ashraf
Rajes Manna
Partha Sarathi Purkayastha
Tavaheed Tariq
Janibul Bashir
388
0
0
21 Apr 2025
Spectral-Adaptive Modulation Networks for Visual Perception
Guhnoo Yun
J. Yoo
Kijung Kim
Jeongho Lee
Paul Hongsuck Seo
Dong Hwan Kim
522
0
1
31 Mar 2025
Efficient Token Compression for Vision Transformer with Spatial Information Preserved
Junzhu Mao
Yang Shen
Jinyang Guo
Yazhou Yao
Xiansheng Hua
ViT
475
2
0
30 Mar 2025
LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight Detection
Pengcheng Zhao
Zhixian He
Fuwei Zhang
Shujin Lin
Fan Zhou
413
5
0
18 Jan 2025
Object Detection using Event Camera: A MoE Heat Conduction based Detector and A New Benchmark Dataset
Computer Vision and Pattern Recognition (CVPR), 2024
Xinyu Wang
Yu Jin
Wentao Wu
Wei Zhang
Lin Zhu
Bo Jiang
Yonghong Tian
304
22
0
09 Dec 2024
ENACT: Entropy-based Clustering of Attention Input for Reducing the Computational Needs of Object Detection Transformers
International Conference on Information Photonics (ICIP), 2024
Giorgos Savathrakis
Antonis Argyros
ViT
193
0
0
11 Sep 2024
A Review of Transformer-Based Models for Computer Vision Tasks: Capturing Global Context and Spatial Relationships
Gracile Astlin Pereira
Muhammad Hussain
ViT
320
39
0
27 Aug 2024
Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Yifan Feng
Jiangang Huang
Shaoyi Du
Shihui Ying
Jun-Hai Yong
Yipeng Li
Guiguang Ding
Rongrong Ji
Yue Gao
ObjD
270
216
0
09 Aug 2024
Neural-based Video Compression on Solar Dynamics Observatory Images
Atefeh Khoshkhahtinat
Ali Zafari
P. Mehta
Nasser M. Nasrabadi
Barbara J. Thompson
M. Kirk
D. D. Silva
374
1
0
12 Jul 2024
Learning to Adapt Category Consistent Meta-Feature of CLIP for Few-Shot Classification
Jiaying Shi
Xuetong Xue
Shenghui Xu
VLM
362
0
0
08 Jul 2024
Diff3Dformer: Leveraging Slice Sequence Diffusion for Enhanced 3D CT Classification with Transformer Networks
Zihao Jin
D. C. Marshall
Jiahao Huang
Caiwen Xu
Simon Walsh
Guang Yang
MedIm
399
2
0
24 Jun 2024
ALGM: Adaptive Local-then-Global Token Merging for Efficient Semantic Segmentation with Plain Vision Transformers
Computer Vision and Pattern Recognition (CVPR), 2024
Narges Norouzi
Svetlana Orlova
Daan de Geus
Gijs Dubbelman
ViT
FedML
260
33
0
14 Jun 2024
Weight Copy and Low-Rank Adaptation for Few-Shot Distillation of Vision Transformers
Diana-Nicoleta Grigore
Mariana-Iuliana Georgescu
J. A. Justo
T. Johansen
Andreea-Iuliana Ionescu
Radu Tudor Ionescu
400
4
0
14 Apr 2024
CathFlow: Self-Supervised Segmentation of Catheters in Interventional Ultrasound Using Optical Flow and Transformers
Alex Ranne
Liming Kuang
Yordanka Velikova
Nassir Navab
F. R. Y. Baena
OOD
212
2
0
21 Mar 2024
PEEB: Part-based Image Classifiers with an Explainable and Editable Language Bottleneck
Thang M. Pham
Peijie Chen
Tin Nguyen
Seunghyun Yoon
Trung Bui
Peijie Chen
VLM
408
11
0
08 Mar 2024
DEYO: DETR with YOLO for End-to-End Object Detection
Haodong Ouyang
222
19
0
26 Feb 2024
Semi-supervised Counting via Pixel-by-pixel Density Distribution Modelling
Hui Lin
Zhiheng Ma
Rongrong Ji
Yao Wang
Zhou Su
Xiaopeng Hong
Deyu Meng
202
9
0
23 Feb 2024
Weakly Supervised Open-Vocabulary Object Detection
Jianghang Lin
Chunjiang Ge
Bingquan Wang
Shaohui Lin
Ke Li
Liujuan Cao
WSOD
428
18
0
19 Dec 2023
PixelLM: Pixel Reasoning with Large Multimodal Model
Computer Vision and Pattern Recognition (CVPR), 2023
Zhongwei Ren
Zhicheng Huang
Yunchao Wei
Yao-Min Zhao
Dongmei Fu
Jiashi Feng
Xiaojie Jin
VLM
MLLM
LRM
518
239
0
04 Dec 2023
AiluRus: A Scalable ViT Framework for Dense Prediction
Neural Information Processing Systems (NeurIPS), 2023
Jin Li
Yaoming Wang
Xiaopeng Zhang
Bowen Shi
Dongsheng Jiang
Chenglin Li
Wenrui Dai
Hongkai Xiong
Qi Tian
376
16
0
02 Nov 2023
Improving Robustness for Vision Transformer with a Simple Dynamic Scanning Augmentation
Shashank Kotyan
Danilo Vasconcellos Vargas
ViT
290
7
0
01 Nov 2023
UniTime: A Language-Empowered Unified Model for Cross-Domain Time Series Forecasting
Xu Liu
Junfeng Hu
Yuan N. Li
Shizhe Diao
Yuxuan Liang
Bryan Hooi
Roger Zimmermann
AI4TS
346
189
0
15 Oct 2023
Anchor-Intermediate Detector: Decoupling and Coupling Bounding Boxes for Accurate Object Detection
IEEE International Conference on Computer Vision (ICCV), 2023
Yilong Lv
Min Li
Yujie He
Shaopeng Li
Zhuzhen He
Aitao Yang
154
4
0
09 Oct 2023
Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy
International Conference on Learning Representations (ICLR), 2023
Pingzhi Li
Zhenyu Zhang
Prateek Yadav
Yi-Lin Sung
Yu Cheng
Mohit Bansal
Tianlong Chen
MoMe
342
93
0
02 Oct 2023
ClusterFormer: Clustering As A Universal Visual Learner
James Liang
Yiming Cui
Qifan Wang
Tong Geng
Wenguan Wang
Dongfang Liu
VLM
488
20
0
22 Sep 2023
CL-MAE: Curriculum-Learned Masked Autoencoders
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Neelu Madan
Nicolae-Cătălin Ristea
Kamal Nasrollahi
T. Moeslund
Radu Tudor Ionescu
533
27
0
31 Aug 2023
SPANet: Frequency-balancing Token Mixer using Spectral Pooling Aggregation Modulation
IEEE International Conference on Computer Vision (ICCV), 2023
Guhnoo Yun
J. Yoo
Kijung Kim
Jeongho Lee
Dong Hwan Kim
MoE
369
34
0
22 Aug 2023
ASAG: Building Strong One-Decoder-Layer Sparse Detectors via Adaptive Sparse Anchor Generation
IEEE International Conference on Computer Vision (ICCV), 2023
Sheng-Hsiang Fu
Junkai Yan
Yipeng Gao
Xiaohua Xie
Wei-Shi Zheng
229
8
0
18 Aug 2023
Revisiting Vision Transformer from the View of Path Ensemble
IEEE International Conference on Computer Vision (ICCV), 2023
Shuning Chang
Pichao Wang
Haowen Luo
Fan Wang
Mike Zheng Shou
ViT
264
8
0
12 Aug 2023
Graph Ladling: Shockingly Simple Parallel GNN Training without Intermediate Communication
International Conference on Machine Learning (ICML), 2023
A. Jaiswal
Shiwei Liu
Tianlong Chen
Ying Ding
Zinan Lin
GNN
310
8
0
18 Jun 2023
Instant Soup: Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models
International Conference on Machine Learning (ICML), 2023
A. Jaiswal
Shiwei Liu
Tianlong Chen
Ying Ding
Zinan Lin
VLM
281
24
0
18 Jun 2023
The Emergence of Essential Sparsity in Large Pre-trained Models: The Weights that Matter
Neural Information Processing Systems (NeurIPS), 2023
Ajay Jaiswal
Shiwei Liu
Tianlong Chen
Zinan Lin
VLM
353
44
0
06 Jun 2023
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation
AAAI Conference on Artificial Intelligence (AAAI), 2023
Shilin Yan
Renrui Zhang
Ziyu Guo
Wenchao Chen
Wei Zhang
Guoying Gu
Yu Qiao
Hao Dong
Zhongjiang He
Shiyang Feng
VOS
376
66
0
25 May 2023
SSD-MonoDETR: Supervised Scale-aware Deformable Transformer for Monocular 3D Object Detection
IEEE Transactions on Intelligent Vehicles (TIV), 2023
Xuan He
Fan Yang
Kailun Yang
Jiacheng Lin
Haolong Fu
Ming Wang
Jin Yuan
Zhiyong Li
ViT
412
25
0
12 May 2023
AutoFocusFormer: Image Segmentation off the Grid
Computer Vision and Pattern Recognition (CVPR), 2023
Chen Ziwen
K. Patnaik
Shuangfei Zhai
Alvin Wan
Zhile Ren
Alex Schwing
Alex Colburn
Li Fuxin
373
18
0
24 Apr 2023
Transformer-Based Visual Segmentation: A Survey
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Xiangtai Li
Henghui Ding
Haobo Yuan
Wenwei Zhang
Jiangmiao Pang
Guangliang Cheng
Kai-xiang Chen
Ziwei Liu
Chen Change Loy
ViT
MedIm
576
280
0
19 Apr 2023
Sparsifiner: Learning Sparse Instance-Dependent Attention for Efficient Vision Transformers
Computer Vision and Pattern Recognition (CVPR), 2023
Cong Wei
Brendan Duke
R. Jiang
P. Aarabi
Graham W. Taylor
Florian Shkurti
ViT
222
27
0
24 Mar 2023
OcTr: Octree-based Transformer for 3D Object Detection
Computer Vision and Pattern Recognition (CVPR), 2023
Chao Zhou
Yanan Zhang
Jiaxin Chen
Di Huang
3DPC
ViT
348
71
0
22 Mar 2023
Making Vision Transformers Efficient from A Token Sparsification View
Computer Vision and Pattern Recognition (CVPR), 2023
Shuning Chang
Pichao Wang
Ming Lin
Fan Wang
David Junhao Zhang
Rong Jin
Mike Zheng Shou
ViT
334
45
0
15 Mar 2023
PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D Object Detection
Computer Vision and Pattern Recognition (CVPR), 2023
Anthony Chen
Kevin Zhang
Renrui Zhang
Zihan Wang
Yuheng Lu
Yandong Guo
Shanghang Zhang
3DPC
350
103
0
14 Mar 2023
HiCLIP: Contrastive Language-Image Pretraining with Hierarchy-aware Attention
International Conference on Learning Representations (ICLR), 2023
Shijie Geng
Jianbo Yuan
Yu Tian
Yuxiao Chen
Zelong Li
CLIP
VLM
305
59
0
06 Mar 2023
Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers
International Conference on Learning Representations (ICLR), 2023
Tianlong Chen
Zhenyu Zhang
Ajay Jaiswal
Shiwei Liu
Zinan Lin
MoE
387
76
0
02 Mar 2023
iQuery: Instruments as Queries for Audio-Visual Sound Separation
Computer Vision and Pattern Recognition (CVPR), 2022
Jiaben Chen
Renrui Zhang
Dongze Lian
Jiaqi Yang
Ziyao Zeng
Jianbo Shi
332
42
0
07 Dec 2022
Vision Transformer Computation and Resilience for Dynamic Inference
IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2022
Kavya Sreedhar
Jason Clemons
Rangharajan Venkatesan
S. Keckler
M. Horowitz
387
2
0
06 Dec 2022
Vision Transformer with Super Token Sampling
Huaibo Huang
Xiaoqiang Zhou
Jie Cao
Xiao-Yu Zhang
Tieniu Tan
ViT
330
107
0
21 Nov 2022
Vision Transformers in Medical Imaging: A Review
Emerald U. Henry
Onyeka Emebob
C. Omonhinmin
ViT
MedIm
285
67
0
18 Nov 2022
Pair DETR: Contrastive Learning Speeds Up DETR Training
M. Iranmanesh
Xiaotong Chen
Kuo-Chin Lien
ViT
325
0
0
29 Oct 2022
Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Weicong Liang
Yuhui Yuan
Henghui Ding
Xiao Luo
Weihong Lin
Ding Jia
Zheng Zhang
Chao Zhang
Hanhua Hu
350
44
0
03 Oct 2022
1
2
3
Next
Page 1 of 3