ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2011.09315
  4. Cited By
End-to-End Object Detection with Adaptive Clustering Transformer
v1v2 (latest)

End-to-End Object Detection with Adaptive Clustering Transformer

British Machine Vision Conference (BMVC), 2020
18 November 2020
Minghang Zheng
Shiyang Feng
Renrui Zhang
Kunchang Li
Xiaogang Wang
Jiaming Song
Hao Dong
    ViT
ArXiv (abs)PDFHTMLGithub (172★)

Papers citing "End-to-End Object Detection with Adaptive Clustering Transformer"

50 / 104 papers shown
End-to-End On-Device Quantization-Aware Training for LLMs at Inference Cost
End-to-End On-Device Quantization-Aware Training for LLMs at Inference Cost
Qitao Tan
Xiaoying Song
Jin Lu
Guoming Li
Jun Liu
...
Jundong Li
Xiaoming Zhai
Shaoyi Huang
Wei Niu
Geng Yuan
MQ
271
0
0
21 Aug 2025
A 2D Semantic-Aware Position Encoding for Vision Transformers
A 2D Semantic-Aware Position Encoding for Vision Transformers
Xi Chen
Shiyang Zhou
Muqi Huang
Jiaxu Feng
Yun Xiong
...
Yujiao Shi
Huishuai Bao
Sijia Peng
Chong Li
Feng Shi
ViT
393
2
0
14 May 2025
Context Aware Grounded Teacher for Source Free Object Detection
Context Aware Grounded Teacher for Source Free Object Detection
Tajamul Ashraf
Rajes Manna
Partha Sarathi Purkayastha
Tavaheed Tariq
Janibul Bashir
388
0
0
21 Apr 2025
Spectral-Adaptive Modulation Networks for Visual Perception
Spectral-Adaptive Modulation Networks for Visual Perception
Guhnoo Yun
J. Yoo
Kijung Kim
Jeongho Lee
Paul Hongsuck Seo
Dong Hwan Kim
522
0
1
31 Mar 2025
Efficient Token Compression for Vision Transformer with Spatial Information Preserved
Efficient Token Compression for Vision Transformer with Spatial Information Preserved
Junzhu Mao
Yang Shen
Jinyang Guo
Yazhou Yao
Xiansheng Hua
ViT
475
2
0
30 Mar 2025
LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight Detection
LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight Detection
Pengcheng Zhao
Zhixian He
Fuwei Zhang
Shujin Lin
Fan Zhou
413
5
0
18 Jan 2025
Object Detection using Event Camera: A MoE Heat Conduction based
  Detector and A New Benchmark Dataset
Object Detection using Event Camera: A MoE Heat Conduction based Detector and A New Benchmark DatasetComputer Vision and Pattern Recognition (CVPR), 2024
Xinyu Wang
Yu Jin
Wentao Wu
Wei Zhang
Lin Zhu
Bo Jiang
Yonghong Tian
304
22
0
09 Dec 2024
ENACT: Entropy-based Clustering of Attention Input for Reducing the Computational Needs of Object Detection Transformers
ENACT: Entropy-based Clustering of Attention Input for Reducing the Computational Needs of Object Detection TransformersInternational Conference on Information Photonics (ICIP), 2024
Giorgos Savathrakis
Antonis Argyros
ViT
193
0
0
11 Sep 2024
A Review of Transformer-Based Models for Computer Vision Tasks:
  Capturing Global Context and Spatial Relationships
A Review of Transformer-Based Models for Computer Vision Tasks: Capturing Global Context and Spatial Relationships
Gracile Astlin Pereira
Muhammad Hussain
ViT
320
39
0
27 Aug 2024
Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation
Hyper-YOLO: When Visual Object Detection Meets Hypergraph ComputationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Yifan Feng
Jiangang Huang
Shaoyi Du
Shihui Ying
Jun-Hai Yong
Yipeng Li
Guiguang Ding
Rongrong Ji
Yue Gao
ObjD
270
216
0
09 Aug 2024
Neural-based Video Compression on Solar Dynamics Observatory Images
Neural-based Video Compression on Solar Dynamics Observatory Images
Atefeh Khoshkhahtinat
Ali Zafari
P. Mehta
Nasser M. Nasrabadi
Barbara J. Thompson
M. Kirk
D. D. Silva
374
1
0
12 Jul 2024
Learning to Adapt Category Consistent Meta-Feature of CLIP for Few-Shot
  Classification
Learning to Adapt Category Consistent Meta-Feature of CLIP for Few-Shot Classification
Jiaying Shi
Xuetong Xue
Shenghui Xu
VLM
362
0
0
08 Jul 2024
Diff3Dformer: Leveraging Slice Sequence Diffusion for Enhanced 3D CT
  Classification with Transformer Networks
Diff3Dformer: Leveraging Slice Sequence Diffusion for Enhanced 3D CT Classification with Transformer Networks
Zihao Jin
D. C. Marshall
Jiahao Huang
Caiwen Xu
Simon Walsh
Guang Yang
MedIm
399
2
0
24 Jun 2024
ALGM: Adaptive Local-then-Global Token Merging for Efficient Semantic
  Segmentation with Plain Vision Transformers
ALGM: Adaptive Local-then-Global Token Merging for Efficient Semantic Segmentation with Plain Vision TransformersComputer Vision and Pattern Recognition (CVPR), 2024
Narges Norouzi
Svetlana Orlova
Daan de Geus
Gijs Dubbelman
ViTFedML
260
33
0
14 Jun 2024
Weight Copy and Low-Rank Adaptation for Few-Shot Distillation of Vision
  Transformers
Weight Copy and Low-Rank Adaptation for Few-Shot Distillation of Vision Transformers
Diana-Nicoleta Grigore
Mariana-Iuliana Georgescu
J. A. Justo
T. Johansen
Andreea-Iuliana Ionescu
Radu Tudor Ionescu
400
4
0
14 Apr 2024
CathFlow: Self-Supervised Segmentation of Catheters in Interventional
  Ultrasound Using Optical Flow and Transformers
CathFlow: Self-Supervised Segmentation of Catheters in Interventional Ultrasound Using Optical Flow and Transformers
Alex Ranne
Liming Kuang
Yordanka Velikova
Nassir Navab
F. R. Y. Baena
OOD
212
2
0
21 Mar 2024
PEEB: Part-based Image Classifiers with an Explainable and Editable
  Language Bottleneck
PEEB: Part-based Image Classifiers with an Explainable and Editable Language Bottleneck
Thang M. Pham
Peijie Chen
Tin Nguyen
Seunghyun Yoon
Trung Bui
Peijie Chen
VLM
408
11
0
08 Mar 2024
DEYO: DETR with YOLO for End-to-End Object Detection
DEYO: DETR with YOLO for End-to-End Object Detection
Haodong Ouyang
222
19
0
26 Feb 2024
Semi-supervised Counting via Pixel-by-pixel Density Distribution
  Modelling
Semi-supervised Counting via Pixel-by-pixel Density Distribution Modelling
Hui Lin
Zhiheng Ma
Rongrong Ji
Yao Wang
Zhou Su
Xiaopeng Hong
Deyu Meng
202
9
0
23 Feb 2024
Weakly Supervised Open-Vocabulary Object Detection
Weakly Supervised Open-Vocabulary Object Detection
Jianghang Lin
Chunjiang Ge
Bingquan Wang
Shaohui Lin
Ke Li
Liujuan Cao
WSOD
428
18
0
19 Dec 2023
PixelLM: Pixel Reasoning with Large Multimodal Model
PixelLM: Pixel Reasoning with Large Multimodal ModelComputer Vision and Pattern Recognition (CVPR), 2023
Zhongwei Ren
Zhicheng Huang
Yunchao Wei
Yao-Min Zhao
Dongmei Fu
Jiashi Feng
Xiaojie Jin
VLMMLLMLRM
518
239
0
04 Dec 2023
AiluRus: A Scalable ViT Framework for Dense Prediction
AiluRus: A Scalable ViT Framework for Dense PredictionNeural Information Processing Systems (NeurIPS), 2023
Jin Li
Yaoming Wang
Xiaopeng Zhang
Bowen Shi
Dongsheng Jiang
Chenglin Li
Wenrui Dai
Hongkai Xiong
Qi Tian
376
16
0
02 Nov 2023
Improving Robustness for Vision Transformer with a Simple Dynamic
  Scanning Augmentation
Improving Robustness for Vision Transformer with a Simple Dynamic Scanning Augmentation
Shashank Kotyan
Danilo Vasconcellos Vargas
ViT
290
7
0
01 Nov 2023
UniTime: A Language-Empowered Unified Model for Cross-Domain Time Series
  Forecasting
UniTime: A Language-Empowered Unified Model for Cross-Domain Time Series Forecasting
Xu Liu
Junfeng Hu
Yuan N. Li
Shizhe Diao
Yuxuan Liang
Bryan Hooi
Roger Zimmermann
AI4TS
346
189
0
15 Oct 2023
Anchor-Intermediate Detector: Decoupling and Coupling Bounding Boxes for
  Accurate Object Detection
Anchor-Intermediate Detector: Decoupling and Coupling Bounding Boxes for Accurate Object DetectionIEEE International Conference on Computer Vision (ICCV), 2023
Yilong Lv
Min Li
Yujie He
Shaopeng Li
Zhuzhen He
Aitao Yang
154
4
0
09 Oct 2023
Merge, Then Compress: Demystify Efficient SMoE with Hints from Its
  Routing Policy
Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing PolicyInternational Conference on Learning Representations (ICLR), 2023
Pingzhi Li
Zhenyu Zhang
Prateek Yadav
Yi-Lin Sung
Yu Cheng
Mohit Bansal
Tianlong Chen
MoMe
342
93
0
02 Oct 2023
ClusterFormer: Clustering As A Universal Visual Learner
ClusterFormer: Clustering As A Universal Visual Learner
James Liang
Yiming Cui
Qifan Wang
Tong Geng
Wenguan Wang
Dongfang Liu
VLM
488
20
0
22 Sep 2023
CL-MAE: Curriculum-Learned Masked Autoencoders
CL-MAE: Curriculum-Learned Masked AutoencodersIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Neelu Madan
Nicolae-Cătălin Ristea
Kamal Nasrollahi
T. Moeslund
Radu Tudor Ionescu
533
27
0
31 Aug 2023
SPANet: Frequency-balancing Token Mixer using Spectral Pooling
  Aggregation Modulation
SPANet: Frequency-balancing Token Mixer using Spectral Pooling Aggregation ModulationIEEE International Conference on Computer Vision (ICCV), 2023
Guhnoo Yun
J. Yoo
Kijung Kim
Jeongho Lee
Dong Hwan Kim
MoE
369
34
0
22 Aug 2023
ASAG: Building Strong One-Decoder-Layer Sparse Detectors via Adaptive
  Sparse Anchor Generation
ASAG: Building Strong One-Decoder-Layer Sparse Detectors via Adaptive Sparse Anchor GenerationIEEE International Conference on Computer Vision (ICCV), 2023
Sheng-Hsiang Fu
Junkai Yan
Yipeng Gao
Xiaohua Xie
Wei-Shi Zheng
229
8
0
18 Aug 2023
Revisiting Vision Transformer from the View of Path Ensemble
Revisiting Vision Transformer from the View of Path EnsembleIEEE International Conference on Computer Vision (ICCV), 2023
Shuning Chang
Pichao Wang
Haowen Luo
Fan Wang
Mike Zheng Shou
ViT
264
8
0
12 Aug 2023
Graph Ladling: Shockingly Simple Parallel GNN Training without
  Intermediate Communication
Graph Ladling: Shockingly Simple Parallel GNN Training without Intermediate CommunicationInternational Conference on Machine Learning (ICML), 2023
A. Jaiswal
Shiwei Liu
Tianlong Chen
Ying Ding
Zinan Lin
GNN
310
8
0
18 Jun 2023
Instant Soup: Cheap Pruning Ensembles in A Single Pass Can Draw Lottery
  Tickets from Large Models
Instant Soup: Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large ModelsInternational Conference on Machine Learning (ICML), 2023
A. Jaiswal
Shiwei Liu
Tianlong Chen
Ying Ding
Zinan Lin
VLM
281
24
0
18 Jun 2023
The Emergence of Essential Sparsity in Large Pre-trained Models: The
  Weights that Matter
The Emergence of Essential Sparsity in Large Pre-trained Models: The Weights that MatterNeural Information Processing Systems (NeurIPS), 2023
Ajay Jaiswal
Shiwei Liu
Tianlong Chen
Zinan Lin
VLM
353
44
0
06 Jun 2023
Referred by Multi-Modality: A Unified Temporal Transformer for Video
  Object Segmentation
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object SegmentationAAAI Conference on Artificial Intelligence (AAAI), 2023
Shilin Yan
Renrui Zhang
Ziyu Guo
Wenchao Chen
Wei Zhang
Guoying Gu
Yu Qiao
Hao Dong
Zhongjiang He
Shiyang Feng
VOS
376
66
0
25 May 2023
SSD-MonoDETR: Supervised Scale-aware Deformable Transformer for
  Monocular 3D Object Detection
SSD-MonoDETR: Supervised Scale-aware Deformable Transformer for Monocular 3D Object DetectionIEEE Transactions on Intelligent Vehicles (TIV), 2023
Xuan He
Fan Yang
Kailun Yang
Jiacheng Lin
Haolong Fu
Ming Wang
Jin Yuan
Zhiyong Li
ViT
412
25
0
12 May 2023
AutoFocusFormer: Image Segmentation off the Grid
AutoFocusFormer: Image Segmentation off the GridComputer Vision and Pattern Recognition (CVPR), 2023
Chen Ziwen
K. Patnaik
Shuangfei Zhai
Alvin Wan
Zhile Ren
Alex Schwing
Alex Colburn
Li Fuxin
373
18
0
24 Apr 2023
Transformer-Based Visual Segmentation: A Survey
Transformer-Based Visual Segmentation: A SurveyIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Xiangtai Li
Henghui Ding
Haobo Yuan
Wenwei Zhang
Jiangmiao Pang
Guangliang Cheng
Kai-xiang Chen
Ziwei Liu
Chen Change Loy
ViTMedIm
576
280
0
19 Apr 2023
Sparsifiner: Learning Sparse Instance-Dependent Attention for Efficient
  Vision Transformers
Sparsifiner: Learning Sparse Instance-Dependent Attention for Efficient Vision TransformersComputer Vision and Pattern Recognition (CVPR), 2023
Cong Wei
Brendan Duke
R. Jiang
P. Aarabi
Graham W. Taylor
Florian Shkurti
ViT
222
27
0
24 Mar 2023
OcTr: Octree-based Transformer for 3D Object Detection
OcTr: Octree-based Transformer for 3D Object DetectionComputer Vision and Pattern Recognition (CVPR), 2023
Chao Zhou
Yanan Zhang
Jiaxin Chen
Di Huang
3DPCViT
348
71
0
22 Mar 2023
Making Vision Transformers Efficient from A Token Sparsification View
Making Vision Transformers Efficient from A Token Sparsification ViewComputer Vision and Pattern Recognition (CVPR), 2023
Shuning Chang
Pichao Wang
Ming Lin
Fan Wang
David Junhao Zhang
Rong Jin
Mike Zheng Shou
ViT
334
45
0
15 Mar 2023
PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D
  Object Detection
PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D Object DetectionComputer Vision and Pattern Recognition (CVPR), 2023
Anthony Chen
Kevin Zhang
Renrui Zhang
Zihan Wang
Yuheng Lu
Yandong Guo
Shanghang Zhang
3DPC
350
103
0
14 Mar 2023
HiCLIP: Contrastive Language-Image Pretraining with Hierarchy-aware
  Attention
HiCLIP: Contrastive Language-Image Pretraining with Hierarchy-aware AttentionInternational Conference on Learning Representations (ICLR), 2023
Shijie Geng
Jianbo Yuan
Yu Tian
Yuxiao Chen
Zelong Li
CLIPVLM
305
59
0
06 Mar 2023
Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable
  Transformers
Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable TransformersInternational Conference on Learning Representations (ICLR), 2023
Tianlong Chen
Zhenyu Zhang
Ajay Jaiswal
Shiwei Liu
Zinan Lin
MoE
387
76
0
02 Mar 2023
iQuery: Instruments as Queries for Audio-Visual Sound Separation
iQuery: Instruments as Queries for Audio-Visual Sound SeparationComputer Vision and Pattern Recognition (CVPR), 2022
Jiaben Chen
Renrui Zhang
Dongze Lian
Jiaqi Yang
Ziyao Zeng
Jianbo Shi
332
42
0
07 Dec 2022
Vision Transformer Computation and Resilience for Dynamic Inference
Vision Transformer Computation and Resilience for Dynamic InferenceIEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2022
Kavya Sreedhar
Jason Clemons
Rangharajan Venkatesan
S. Keckler
M. Horowitz
387
2
0
06 Dec 2022
Vision Transformer with Super Token Sampling
Vision Transformer with Super Token Sampling
Huaibo Huang
Xiaoqiang Zhou
Jie Cao
Xiao-Yu Zhang
Tieniu Tan
ViT
330
107
0
21 Nov 2022
Vision Transformers in Medical Imaging: A Review
Vision Transformers in Medical Imaging: A Review
Emerald U. Henry
Onyeka Emebob
C. Omonhinmin
ViTMedIm
285
67
0
18 Nov 2022
Pair DETR: Contrastive Learning Speeds Up DETR Training
Pair DETR: Contrastive Learning Speeds Up DETR Training
M. Iranmanesh
Xiaotong Chen
Kuo-Chin Lien
ViT
325
0
0
29 Oct 2022
Expediting Large-Scale Vision Transformer for Dense Prediction without
  Fine-tuning
Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuningIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Weicong Liang
Yuhui Yuan
Henghui Ding
Xiao Luo
Weihong Lin
Ding Jia
Zheng Zhang
Chao Zhang
Hanhua Hu
350
44
0
03 Oct 2022
123
Next
Page 1 of 3