v1v2v3v4 (latest)

Deformable DETR: Deformable Transformers for End-to-End Object Detection

International Conference on Learning Representations (ICLR), 2020

8 October 2020

Weijie Su

ArXiv (abs)PDF HTML HuggingFace (1 upvotes)Github (3553★)

Papers citing "Deformable DETR: Deformable Transformers for End-to-End Object Detection"

50 / 2,782 papers shown

Cross-modulated Attention Transformer for RGBT TrackingAAAI Conference on Artificial Intelligence (AAAI), 2024

Yin Lin

229

05 Aug 2024

CAF-YOLO: A Robust Framework for Multi-Scale Lesion Detection in Biomedical ImageryIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

Zilin Chen

Shengnan Lu

MedIm

205

04 Aug 2024

A Survey and Evaluation of Adversarial Attacks for Object DetectionIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024

Khoi Nguyen Tiet Nguyen

371

04 Aug 2024

AVESFormer: Efficient Transformer Design for Real-Time Audio-Visual Segmentation

233

03 Aug 2024

Underwater Object Detection Enhancement via Channel StabilizationInternational Conference on Digital Image Computing: Techniques and Applications (DICTA), 2022

Muhammad Ali

Rita Sevastjanova

165

02 Aug 2024

Enhancing Online Road Network Perception and Reasoning with Standard Definition MapsIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024

Henrik I. Christensen

Liu Ren

232

01 Aug 2024

A Systematic Review on Long-Tailed LearningIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024

410

01 Aug 2024

DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising TrainingACM Multimedia (MM), 2024

348

01 Aug 2024

WAS: Dataset and Methods for Artistic Text Segmentation

Zhifei Zhang

Wei Xiong

269

31 Jul 2024

Open-Vocabulary Audio-Visual Semantic Segmentation

259

31 Jul 2024

Dynamic Object Queries for Transformer-based Incremental Object Detection

236

31 Jul 2024

Leveraging Adaptive Implicit Representation Mapping for Ultra High-Resolution Image Segmentation

265

31 Jul 2024

HandDAGT: A Denoising Adaptive Graph Transformer for 3D Hand Pose Estimation

235

30 Jul 2024

Classification Matters: Improving Video Action Detection with Class-Specific AttentionEuropean Conference on Computer Vision (ECCV), 2024

391

29 Jul 2024

Practical Video Object Detection via Feature Selection and Aggregation

325

29 Jul 2024

Background Semantics Matter: Cross-Task Feature Exchange Network for Clustered Infrared Small Target Detection

503

29 Jul 2024

WeCromCL: Weakly Supervised Cross-Modality Contrastive Learning for Transcription-only Supervised Text SpottingEuropean Conference on Computer Vision (ECCV), 2024

492

28 Jul 2024

XS-VID: An Extremely Small Video Object Detection Dataset

248

25 Jul 2024

CSWin-UNet: Transformer UNet with Cross-Shaped Windows for Medical Image Segmentation

221

100

25 Jul 2024

StreamMOS: Streaming Moving Object Segmentation with Multi-View Perception and Dual-Span Memory

213

25 Jul 2024

SDLNet: Statistical Deep Learning Network for Co-Occurring Object Detection and Identification

Binay Kumar Singh

Niels Da Vitoria Lobo

ObjD

24 Jul 2024

MuST: Multi-Scale Transformers for Surgical Phase Recognition

Alejandra Pérez

Santiago Rodríguez

235

24 Jul 2024

Progressive Query Refinement Framework for Bird's-Eye-View Semantic Segmentation from Surrounding Images

214

24 Jul 2024

DVPE: Divided View Position Embedding for Multi-View 3D Object Detection

238

24 Jul 2024

PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects

232

23 Jul 2024

Dynamic Retraining-Updating Mean Teacher for Source-Free Object Detection

Trinh Le Ba Khanh

Huy-Hung Nguyen

L. Pham

Duong Nguyen-Ngoc Tran

Jae Wook Jeon

304

23 Jul 2024

ESOD: Efficient Small Object Detection on High-Resolution Images

290

23 Jul 2024

SAM-CP: Marrying SAM with Composable Prompts for Versatile Segmentation

489

23 Jul 2024

Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object Detection

286

22 Jul 2024

Explore the LiDAR-Camera Dynamic Adjustment Fusion for 3D Object Detection

Xiao Tan

Xiaoqing Ye

Jingdong Wang

3DPC

157

22 Jul 2024

Navigation Instruction Generation with BEV Perception and Large Language Models

266

21 Jul 2024

Efficient Visual Transformer by Learnable Token Merging

Yancheng Wang

Yingzhen Yang

ViT

341

21 Jul 2024

RayFormer: Improving Query-Based Multi-Camera 3D Object Detection via Ray-Centric Strategies

413

20 Jul 2024

PolyR-CNN: R-CNN for end-to-end polygonal building outline extraction

205

20 Jul 2024

Bucketed Ranking-based Losses for Efficient Training of Object Detectors

302

19 Jul 2024

OCTrack: Benchmarking the Open-Corpus Multi-Object Tracking

278

19 Jul 2024

Mask2Map: Vectorized HD Map Construction Using Bird's Eye View Segmentation Masks

270

18 Jul 2024

Real-Time 3D Occupancy Prediction via Geometric-Semantic Disentanglement

289

18 Jul 2024

DFMSD: Dual Feature Masking Stage-wise Knowledge Distillation for Object Detection

181

18 Jul 2024

OE-BevSeg: An Object Informed and Environment Aware Multimodal Framework for Bird's-eye-view Vehicle Semantic Segmentation

Jian Sun

Yuqi Dai

Chi-Man Vong

Qing Xu

Shengbo Eben Li

Jianqiang Wang

Lei He

Keqiang Li

339

18 Jul 2024

ViLLa: Video Reasoning Segmentation with Large Language Model

Yu Qiao

519

18 Jul 2024

GroupMamba: Efficient Group-Based Visual State Space Model

Abdelrahman M. Shaker

Syed Talal Wasim

Salman Khan

Juergen Gall

Fahad Shahbaz Khan

Mamba

216

18 Jul 2024

AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer

353

17 Jul 2024

MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Models

203

17 Jul 2024

Hierarchical and Decoupled BEV Perception Learning Framework for Autonomous Driving

Yuqi Dai

Jian Sun

Shengbo Eben Li

Qing Xu

Jianqiang Wang

Lei He

Keqiang Li

295

17 Jul 2024

VisionTrap: Vision-Augmented Trajectory Prediction Guided by Textual Descriptions

289

17 Jul 2024

Hierarchical Separable Video Transformer for Snapshot Compressive Imaging

425

16 Jul 2024

Relation DETR: Exploring Explicit Position Relation Prior for Object Detection

265

16 Jul 2024

Bridge Past and Future: Overcoming Information Asymmetry in Incremental Object Detection

277

16 Jul 2024

Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes

402

16 Jul 2024