v1v2v3v4 (latest)

Deformable DETR: Deformable Transformers for End-to-End Object Detection

International Conference on Learning Representations (ICLR), 2020

8 October 2020

Weijie Su

ArXiv (abs)PDF HTML HuggingFace (1 upvotes)Github (3553★)

Papers citing "Deformable DETR: Deformable Transformers for End-to-End Object Detection"

50 / 2,784 papers shown

Does Self-Attention Need Separate Weights in Transformers?North American Chapter of the Association for Computational Linguistics (NAACL), 2024

Md. Kowsher

Nusrat Jahan Prottasha

Chun-Nam Yu

O. Garibay

Niloofar Yousefi

1.2K

30 Nov 2024

BGM: Background Mixup for X-ray Prohibited Items Detection

564

30 Nov 2024

On Moving Object Segmentation from Monocular Video with Transformers

Christian Homeyer

Christoph Schnörr

289

28 Nov 2024

TAPTRv3: Spatial and Temporal Context Foster Robust Tracking of Any Point in Long Video

565

27 Nov 2024

Exploring Aleatoric Uncertainty in Object Detection via Vision Foundation Models

393

26 Nov 2024

Large-Scale Data-Free Knowledge Distillation for ImageNet via Multi-Resolution Data Generation

367

26 Nov 2024

GeoFormer: A Multi-Polygon Segmentation TransformerBritish Machine Vision Conference (BMVC), 2024

Maxim Khomiakov

Michael Riis Andersen

J. Frellsen

254

25 Nov 2024

Monocular Lane Detection Based on Deep Learning: A Survey

728

25 Nov 2024

TreeFormer: Single-view Plant Skeleton Estimation via Tree-constrained Graph GenerationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024

381

25 Nov 2024

Scaling Spike-driven Transformer with Efficient Spike Firing Approximation TrainingIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024

379

25 Nov 2024

Three Cars Approaching within 100m! Enhancing Distant Geometry by Tri-Axis Voxel Scanning for Camera-based Semantic Scene CompletionComputer Vision and Pattern Recognition (CVPR), 2024

Jongseong Bae

Junwoo Ha

Ha Young Kim

431

25 Nov 2024

Edge Weight Prediction For Category-Agnostic Pose Estimation

Or Hirschorn

S. Avidan

319

25 Nov 2024

Towards RAW Object Detection in Diverse ConditionsComputer Vision and Pattern Recognition (CVPR), 2024

207

24 Nov 2024

DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous DrivingComputer Vision and Pattern Recognition (CVPR), 2024

...

683

184

22 Nov 2024

MSSF: A 4D Radar and Camera Fusion Framework With Multi-Stage Sampling for 3D Object Detection in Autonomous Driving

847

22 Nov 2024

DT-LSD: Deformable Transformer-based Line Segment DetectionIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024

Sebastian Janampa

Marios Pattichis

ViT

365

20 Nov 2024

RAWMamba: Unified sRGB-to-RAW De-rendering With State Space Model

344

18 Nov 2024

Unveiling the Hidden: Online Vectorized HD Map Construction with Clip-Level Token Interaction and PropagationNeural Information Processing Systems (NeurIPS), 2024

196

17 Nov 2024

CCi-YOLOv8n: Enhanced Fire Detection with CARAFE and Context-Guided Modules

583

17 Nov 2024

EVT: Efficient View Transformation for Multi-Modal 3D Object Detection

577

16 Nov 2024

RETR: Multi-View Radar Detection Transformer for Indoor PerceptionNeural Information Processing Systems (NeurIPS), 2024

392

15 Nov 2024

Prompt-Guided Environmentally Consistent Adversarial Patch

212

15 Nov 2024

Toward Robust and Accurate Adversarial Camouflage Generation against Vehicle DetectorsIEEE Transactions on Dependable and Secure Computing (IEEE TDSC), 2024

258

15 Nov 2024

SynCL: A Synergistic Training Strategy with Instance-Aware Contrastive Learning for End-to-End Multi-Camera 3D Tracking

317

11 Nov 2024

LSSInst: Improving Geometric Modeling in LSS-Based BEV Perception with Instance RepresentationInternational Conference on 3D Vision (3DV), 2024

327

09 Nov 2024

Moving Off-the-Grid: Scene-Grounded Video RepresentationsNeural Information Processing Systems (NeurIPS), 2024

Sjoerd van Steenkiste

...

335

08 Nov 2024

OccLoff: Learning Optimized Feature Fusion for 3D Occupancy PredictionIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024

361

06 Nov 2024

Efficient Feature Aggregation and Scale-Aware Regression for Monocular 3D Object Detection

350

05 Nov 2024

VQ-Map: Bird's-Eye-View Map Layout Estimation in Tokenized Discrete Space via Vector QuantizationNeural Information Processing Systems (NeurIPS), 2024

178

03 Nov 2024

FactorizePhys: Matrix Factorization for Multidimensional Attention in Remote Physiological SensingNeural Information Processing Systems (NeurIPS), 2024

340

03 Nov 2024

IO Transformer: Evaluating SwinV2-Based Reward Models for Computer Vision

Maxwell Meyer

Jack Spruyt

ViT

142

31 Oct 2024

Uncertainty Estimation for 3D Object Detection via Evidential Learning

370

31 Oct 2024

GigaCheck: Detecting LLM-generated Content

330

31 Oct 2024

IP-MOT: Instance Prompt Learning for Cross-Domain Multi-Object Tracking

Yunshui Li

219

30 Oct 2024

Unbiased Regression Loss for DETRs

150

30 Oct 2024

BEVPose: Unveiling Scene Semantics through Pose-Guided Multi-Modal BEV AlignmentIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024

M. Hosseinzadeh

Ian Reid

260

28 Oct 2024

Referring Human Pose and Mask Estimation in the WildNeural Information Processing Systems (NeurIPS), 2024

281

27 Oct 2024

Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation ModelsNeural Information Processing Systems (NeurIPS), 2024

314

25 Oct 2024

Prompting Continual Person SearchACM Multimedia (MM), 2024

273

25 Oct 2024

DCT-HistoTransformer: Efficient Lightweight Vision Transformer with DCT Integration for histopathological image analysisIranian Conference on Biomedical Engineering (ICBME), 2024

Mahtab Ranjbar

Mehdi Mohebbi

Mahdi Cherakhloo

Bijan Vosoughi. Vahdat

MedIm

260

24 Oct 2024

PixelGaussian: Generalizable 3D Gaussian Reconstruction from Arbitrary Views

289

24 Oct 2024

PETAH: Parameter Efficient Task Adaptation for Hybrid Transformers in a resource-limited Context

284

23 Oct 2024

AlphaChimp: Tracking and Behavior Recognition of Chimpanzees

474

22 Oct 2024

DI-MaskDINO: A Joint Object Detection and Instance Segmentation ModelNeural Information Processing Systems (NeurIPS), 2024

290

22 Oct 2024

Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text Describability

334

20 Oct 2024

YOLO-RD: Introducing Relevant and Compact Explicit Knowledge to YOLO by Retriever-DictionaryInternational Conference on Learning Representations (ICLR), 2024

460

20 Oct 2024

D-FINE: Redefine Regression Task in DETRs as Fine-grained Distribution Refinement

Yueyi Zhang

259

17 Oct 2024

Improving Multi-modal Large Language Model through Boosting Vision Capabilities

Jingdong Wang

225

17 Oct 2024

Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation

Qiong Cao

238

17 Oct 2024

UniDrive: Towards Universal Driving Perception Across Camera Configurations

467

17 Oct 2024