Deformable DETR: Deformable Transformers for End-to-End Object Detection

International Conference on Learning Representations (ICLR), 2020

8 October 2020

Weijie Su

Papers citing "Deformable DETR: Deformable Transformers for End-to-End Object Detection"

50 / 2,782 papers shown

CycleHOI: Improving Human-Object Interaction Detection with Cycle Consistency of Detection and Generation

333

16 Jul 2024

Continuity Preserving Online CenterLine Graph Learning

319

16 Jul 2024

TCFormer: Visual Recognition via Token Clustering Transformer

Wentao Liu

Wanli Ouyang

Ping Luo

Xiaogang Wang

198

16 Jul 2024

Can Textual Semantics Mitigate Sounding Object Segmentation Preference?

324

15 Jul 2024

RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception

227

15 Jul 2024

GTPT: Group-based Token Pruning Transformer for Efficient Human Pose Estimation

Gangshan Wu

294

15 Jul 2024

SEED: A Simple and Effective 3D DETR in Point Clouds

Xiaoqing Ye

Jingdong Wang

215

15 Jul 2024

Joint-Embedding Predictive Architecture for Self-Supervised Learning of Mask Classification Architecture

281

15 Jul 2024

FRI-Net: Floorplan Reconstruction via Room-wise Implicit Representation

201

15 Jul 2024

PolyRoom: Room-aware Transformer for Floorplan Reconstruction

Shuhan Shen

203

15 Jul 2024

When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset

Yi Zhang

Wang Zeng

Sheng Jin

Chao Qian

Ping Luo

Wentao Liu

265

14 Jul 2024

Plain-Det: A Plain Multi-Dataset Object Detector

250

14 Jul 2024

MutDet: Mutually Optimizing Pre-training for Remote Sensing Object Detection

Ziyue Huang

Yongchao Feng

Qingjie Liu

Yunhong Wang

ViT

333

13 Jul 2024

IFTR: An Instance-Level Fusion Transformer for Visual Collaborative Perception

326

13 Jul 2024

Image Compression for Machine and Human Vision with Spatial-Frequency Adaptation

338

13 Jul 2024

Neural-based Video Compression on Solar Dynamics Observatory Images

Atefeh Khoshkhahtinat

Barbara J. Thompson

298

12 Jul 2024

FD-SOS: Vision-Language Open-Set Detectors for Bone Fenestration and Dehiscence Detection from Intraoral Images

157

12 Jul 2024

Domain-adaptive Video Deblurring via Test-time Blurring

Jia-Hao Wu

Yen-Yu Lin

244

12 Jul 2024

DroneMOT: Drone-based Multi-Object Tracking Considering Detection Difficulties and Simultaneous Moving of Drones and Objects

Peng Wang

Yongcai Wang

Deying Li

VOT

266

12 Jul 2024

Textual Query-Driven Mask Transformer for Domain Generalized Segmentation

436

12 Jul 2024

Revealing the Dark Secrets of Extremely Large Kernel ConvNets on Robustness

300

12 Jul 2024

Semi-Supervised Object Detection: A Survey on Progress from CNN to Transformer

Tahira Shehzadi

Ifza

Didier Stricker

Muhammad Zeshan Afzal

ViT

369

11 Jul 2024

Cross Domain Object Detection via Multi-Granularity Confidence Alignment based Mean Teacher

227

10 Jul 2024

DIOR-ViT: Differential Ordinal Learning Vision Transformer for Cancer Classification in Pathology Images

Jin Tae Kwak

307

10 Jul 2024

Deformable-Heatmap-Segmentation for Automobile Visual Perception

Hongyu Jin

109

10 Jul 2024

ActionVOS: Actions as Prompts for Video Object Segmentation

222

10 Jul 2024

Exploring Camera Encoder Designs for Autonomous Driving Perception

Jose M. Alvarez

263

09 Jul 2024

D-MASTER: Mask Annealed Transformer for Unsupervised Domain Adaptation in Breast Cancer Detection from Mammograms

263

09 Jul 2024

Anatomy-guided Pathology Segmentation

Moon Kim

186

08 Jul 2024

Learning Lane Graphs from Aerial Imagery Using Transformers

Martin Büchner

Simon Dorer

Abhinav Valada

196

08 Jul 2024

Described Spatial-Temporal Video Detection

You Qin

285

08 Jul 2024

Smart Camera Parking System With Auto Parking Spot Detection

Tuan T. Nguyen

Mina Sartipi

215

07 Jul 2024

JDT3D: Addressing the Gaps in LiDAR-Based Tracking-by-Attention

Brian Cheong

Jiachen Zhou

Steven Waslander

250

06 Jul 2024

Multi-Branch Auxiliary Fusion YOLO with Re-parameterization Heterogeneous Convolutional for accurate object detection

Zhiqiang Yang

296

101

05 Jul 2024

AMD: Automatic Multi-step Distillation of Large-scale Vision Models

Lifu Huang

209

05 Jul 2024

QueryMamba: A Mamba-Based Encoder-Decoder Architecture with a Statistical Verb-Noun Interaction Module for Video Action Forecasting @ Ego4D Long-Term Action Anticipation Challenge 2024

195

04 Jul 2024

Occupancy as Set of Points

Yiang Shi

Tianheng Cheng

Qian Zhang

Wenyu Liu

Xinggang Wang

3DPC

280

04 Jul 2024

Cyclic Refiner: Object-Aware Temporal Representation Learning for Multi-View 3D Detection and Tracking

Mingzhe Guo

Zhipeng Zhang

Liping Jing

Yuan He

Ke Wang

Heng Fan

246

03 Jul 2024

Graph and Skipped Transformer: Exploiting Spatial and Temporal Modeling Capacities for Efficient 3D Human Pose Estimation

228

03 Jul 2024

CAVIS: Context-Aware Video Instance Segmentation

380

03 Jul 2024

Research on Reliable and Safe Occupancy Grid Prediction in Underground Parking Lots

JiaQi Luo

190

02 Jul 2024

CountFormer: Multi-View Crowd Counting Transformer

Cheng Yang

292

02 Jul 2024

SymPoint Revolutionized: Boosting Panoptic Symbol Spotting with Layer Feature Enhancement

222

02 Jul 2024

LPViT: Low-Power Semi-structured Pruning for Vision Transformers

Zhe Wang

Min Wu

Xiaoli Li

Weisi Lin

ViT VLM

666

02 Jul 2024

Robot Instance Segmentation with Few Annotations for Grasping

428

01 Jul 2024

SOOD++: Leveraging Unlabeled Data to Boost Oriented Object Detection

365

01 Jul 2024

Parametric Primitive Analysis of CAD Sketches with Vision Transformer

Xiaogang Wang

171

29 Jun 2024

GM-DF: Generalized Multi-Scenario Deepfake Detection

308

28 Jun 2024

Mobile Robot Oriented Large-Scale Indoor Dataset for Dynamic Scene Understanding

Fangxing Chen

Xueping Liu

Yongjin Liu

Long Zeng

262

28 Jun 2024

Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads

Ali Khaleghi Rahimian

240

27 Jun 2024