v1v2 (latest)

End-to-End Object Detection with Adaptive Clustering Transformer

British Machine Vision Conference (BMVC), 2020

18 November 2020

Minghang Zheng

ArXiv (abs)PDF HTML Github (172★)

Papers citing "End-to-End Object Detection with Adaptive Clustering Transformer"

50 / 104 papers shown

End-to-End On-Device Quantization-Aware Training for LLMs at Inference Cost

...

271

21 Aug 2025

A 2D Semantic-Aware Position Encoding for Vision Transformers

...

393

14 May 2025

Context Aware Grounded Teacher for Source Free Object Detection

Tajamul Ashraf

Rajes Manna

Partha Sarathi Purkayastha

Tavaheed Tariq

Janibul Bashir

388

21 Apr 2025

Spectral-Adaptive Modulation Networks for Visual Perception

522

31 Mar 2025

Efficient Token Compression for Vision Transformer with Spatial Information Preserved

475

30 Mar 2025

LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight Detection

413

18 Jan 2025

Object Detection using Event Camera: A MoE Heat Conduction based Detector and A New Benchmark DatasetComputer Vision and Pattern Recognition (CVPR), 2024

304

09 Dec 2024

ENACT: Entropy-based Clustering of Attention Input for Reducing the Computational Needs of Object Detection TransformersInternational Conference on Information Photonics (ICIP), 2024

Giorgos Savathrakis

Antonis Argyros

ViT

193

11 Sep 2024

A Review of Transformer-Based Models for Computer Vision Tasks: Capturing Global Context and Spatial Relationships

Gracile Astlin Pereira

Muhammad Hussain

ViT

320

27 Aug 2024

Hyper-YOLO: When Visual Object Detection Meets Hypergraph ComputationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024

Shaoyi Du

Rongrong Ji

270

216

09 Aug 2024

Neural-based Video Compression on Solar Dynamics Observatory Images

Atefeh Khoshkhahtinat

Barbara J. Thompson

374

12 Jul 2024

Learning to Adapt Category Consistent Meta-Feature of CLIP for Few-Shot Classification

362

08 Jul 2024

Diff3Dformer: Leveraging Slice Sequence Diffusion for Enhanced 3D CT Classification with Transformer Networks

Simon Walsh

Guang Yang

MedIm

399

24 Jun 2024

ALGM: Adaptive Local-then-Global Token Merging for Efficient Semantic Segmentation with Plain Vision TransformersComputer Vision and Pattern Recognition (CVPR), 2024

260

14 Jun 2024

Weight Copy and Low-Rank Adaptation for Few-Shot Distillation of Vision Transformers

Diana-Nicoleta Grigore

Mariana-Iuliana Georgescu

J. A. Justo

T. Johansen

Andreea-Iuliana Ionescu

Radu Tudor Ionescu

400

14 Apr 2024

CathFlow: Self-Supervised Segmentation of Catheters in Interventional Ultrasound Using Optical Flow and Transformers

Nassir Navab

212

21 Mar 2024

PEEB: Part-based Image Classifiers with an Explainable and Editable Language Bottleneck

408

08 Mar 2024

DEYO: DETR with YOLO for End-to-End Object Detection

Haodong Ouyang

222

26 Feb 2024

Semi-supervised Counting via Pixel-by-pixel Density Distribution Modelling

202

23 Feb 2024

Weakly Supervised Open-Vocabulary Object Detection

Liujuan Cao

428

19 Dec 2023

PixelLM: Pixel Reasoning with Large Multimodal ModelComputer Vision and Pattern Recognition (CVPR), 2023

518

239

04 Dec 2023

AiluRus: A Scalable ViT Framework for Dense PredictionNeural Information Processing Systems (NeurIPS), 2023

376

02 Nov 2023

Improving Robustness for Vision Transformer with a Simple Dynamic Scanning Augmentation

Shashank Kotyan

Danilo Vasconcellos Vargas

ViT

290

01 Nov 2023

UniTime: A Language-Empowered Unified Model for Cross-Domain Time Series Forecasting

Xu Liu

Bryan Hooi

Roger Zimmermann

AI4TS

346

189

15 Oct 2023

Anchor-Intermediate Detector: Decoupling and Coupling Bounding Boxes for Accurate Object DetectionIEEE International Conference on Computer Vision (ICCV), 2023

154

09 Oct 2023

Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing PolicyInternational Conference on Learning Representations (ICLR), 2023

Mohit Bansal

342

02 Oct 2023

ClusterFormer: Clustering As A Universal Visual Learner

488

22 Sep 2023

CL-MAE: Curriculum-Learned Masked AutoencodersIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023

Neelu Madan

Nicolae-Cătălin Ristea

Kamal Nasrollahi

T. Moeslund

Radu Tudor Ionescu

533

31 Aug 2023

SPANet: Frequency-balancing Token Mixer using Spectral Pooling Aggregation ModulationIEEE International Conference on Computer Vision (ICCV), 2023

369

22 Aug 2023

ASAG: Building Strong One-Decoder-Layer Sparse Detectors via Adaptive Sparse Anchor GenerationIEEE International Conference on Computer Vision (ICCV), 2023

Xiaohua Xie

229

18 Aug 2023

Revisiting Vision Transformer from the View of Path EnsembleIEEE International Conference on Computer Vision (ICCV), 2023

Fan Wang

264

12 Aug 2023

Graph Ladling: Shockingly Simple Parallel GNN Training without Intermediate CommunicationInternational Conference on Machine Learning (ICML), 2023

310

18 Jun 2023

Instant Soup: Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large ModelsInternational Conference on Machine Learning (ICML), 2023

281

18 Jun 2023

The Emergence of Essential Sparsity in Large Pre-trained Models: The Weights that MatterNeural Information Processing Systems (NeurIPS), 2023

353

06 Jun 2023

Referred by Multi-Modality: A Unified Temporal Transformer for Video Object SegmentationAAAI Conference on Artificial Intelligence (AAAI), 2023

Ziyu Guo

Wei Zhang

Yu Qiao

Zhongjiang He

376

25 May 2023

SSD-MonoDETR: Supervised Scale-aware Deformable Transformer for Monocular 3D Object DetectionIEEE Transactions on Intelligent Vehicles (TIV), 2023

Xuan He

Fan Yang

Kailun Yang

412

12 May 2023

AutoFocusFormer: Image Segmentation off the GridComputer Vision and Pattern Recognition (CVPR), 2023

Zhile Ren

373

24 Apr 2023

Transformer-Based Visual Segmentation: A SurveyIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023

Xiangtai Li

576

280

19 Apr 2023

Sparsifiner: Learning Sparse Instance-Dependent Attention for Efficient Vision TransformersComputer Vision and Pattern Recognition (CVPR), 2023

222

24 Mar 2023

OcTr: Octree-based Transformer for 3D Object DetectionComputer Vision and Pattern Recognition (CVPR), 2023

Yanan Zhang

348

22 Mar 2023

Making Vision Transformers Efficient from A Token Sparsification ViewComputer Vision and Pattern Recognition (CVPR), 2023

Fan Wang

334

15 Mar 2023

PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D Object DetectionComputer Vision and Pattern Recognition (CVPR), 2023

Shanghang Zhang

350

103

14 Mar 2023

HiCLIP: Contrastive Language-Image Pretraining with Hierarchy-aware AttentionInternational Conference on Learning Representations (ICLR), 2023

Jianbo Yuan

305

06 Mar 2023

Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable TransformersInternational Conference on Learning Representations (ICLR), 2023

387

02 Mar 2023

iQuery: Instruments as Queries for Audio-Visual Sound SeparationComputer Vision and Pattern Recognition (CVPR), 2022

332

07 Dec 2022

Vision Transformer Computation and Resilience for Dynamic InferenceIEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2022

Kavya Sreedhar

Jason Clemons

Rangharajan Venkatesan

S. Keckler

M. Horowitz

387

06 Dec 2022

Vision Transformer with Super Token Sampling

330

107

21 Nov 2022

Vision Transformers in Medical Imaging: A Review

285

18 Nov 2022

Pair DETR: Contrastive Learning Speeds Up DETR Training

325

29 Oct 2022

Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuningIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022

Xiao Luo

350

03 Oct 2022