ResearchTrend.AI
arXiv:2010.04159
Deformable DETR: Deformable Transformers for End-to-End Object Detection
Versions: v1, v2, v3, v4 (latest)

International Conference on Learning Representations (ICLR), 2021
8 October 2020
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
    ViT
ArXiv (abs) | PDF | HTML | HuggingFace (1 upvote) | GitHub (3553★)

Papers citing "Deformable DETR: Deformable Transformers for End-to-End Object Detection"

50 / 2,782 papers shown
BGM: Background Mixup for X-ray Prohibited Items Detection
Wen Liu
Haoyu Wang
Hongguang Zhu
Yunda Sun
Yao Zhao
Y. X. Wei
559
2
0
30 Nov 2024
On Moving Object Segmentation from Monocular Video with Transformers
Christian Homeyer
Christoph Schnörr
268
4
0
28 Nov 2024
TAPTRv3: Spatial and Temporal Context Foster Robust Tracking of Any Point in Long Video
Jinyuan Qu
Hongyang Li
Shilong Liu
Tianhe Ren
Zhaoyang Zeng
Lei Zhang
3DPC
534
6
0
27 Nov 2024
Exploring Aleatoric Uncertainty in Object Detection via Vision Foundation Models
Peng Cui
Guande He
Dan Zhang
Zhijie Deng
Yinpeng Dong
Jun Zhu
366
3
0
26 Nov 2024
Large-Scale Data-Free Knowledge Distillation for ImageNet via Multi-Resolution Data Generation
Minh-Tuan Tran
Trung Le
Xuan-May Le
Jianfei Cai
Mehrtash Harandi
Dinh Q. Phung
344
3
0
26 Nov 2024
GeoFormer: A Multi-Polygon Segmentation Transformer
British Machine Vision Conference (BMVC), 2024
Maxim Khomiakov
Michael Riis Andersen
J. Frellsen
223
1
0
25 Nov 2024
Monocular Lane Detection Based on Deep Learning: A Survey
Xin He
Haiyun Guo
Kuan Zhu
Bingke Zhu
Xu Zhao
Jianwu Fang
Jinqiao Wang
720
13
0
25 Nov 2024
TreeFormer: Single-view Plant Skeleton Estimation via Tree-constrained Graph Generation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Xinpeng Liu
Hiroaki Santo
Yosuke Toda
Fumio Okura
344
2
0
25 Nov 2024
Scaling Spike-driven Transformer with Efficient Spike Firing Approximation Training
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Man Yao
Xuerui Qiu
Tianxiang Hu
J. Hu
Yuhong Chou
Keyu Tian
Jianxing Liao
Luziwei Leng
Bo Xu
Guoqi Li
373
47
0
25 Nov 2024
Three Cars Approaching within 100m! Enhancing Distant Geometry by Tri-Axis Voxel Scanning for Camera-based Semantic Scene Completion
Computer Vision and Pattern Recognition (CVPR), 2024
Jongseong Bae
Junwoo Ha
Ha Young Kim
409
2
0
25 Nov 2024
Edge Weight Prediction For Category-Agnostic Pose Estimation
Or Hirschorn
S. Avidan
275
1
0
25 Nov 2024
Towards RAW Object Detection in Diverse Conditions
Computer Vision and Pattern Recognition (CVPR), 2024
Zhong-Yu Li
Xin Jin
Boyuan Sun
Chun-Le Guo
Ming-Ming Cheng
203
5
0
24 Nov 2024
DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving
Computer Vision and Pattern Recognition (CVPR), 2024
Bencheng Liao
Shaoyu Chen
Haoran Yin
Bo Jiang
Cheng Wang
...
Xinbang Zhang
Xiangyu Li
Y. Zhang
Qian Zhang
Xinggang Wang
670
155
0
22 Nov 2024
MSSF: A 4D Radar and Camera Fusion Framework With Multi-Stage Sampling for 3D Object Detection in Autonomous Driving
Hongsi Liu
Jun Liu
Guangfeng Jiang
Jianfeng Dong
823
13
0
22 Nov 2024
DT-LSD: Deformable Transformer-based Line Segment Detection
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Sebastian Janampa
Marios Pattichis
ViT
357
1
0
20 Nov 2024
RAWMamba: Unified sRGB-to-RAW De-rendering With State Space Model
Hongjun Chen
Wencheng Han
Huan Zheng
Jianbing Shen
Mamba
318
0
0
18 Nov 2024
Unveiling the Hidden: Online Vectorized HD Map Construction with Clip-Level Token Interaction and Propagation
Neural Information Processing Systems (NeurIPS), 2024
Nayeon Kim
Hongje Seong
Daehyun Ji
Sujin Jang
188
8
0
17 Nov 2024
CCi-YOLOv8n: Enhanced Fire Detection with CARAFE and Context-Guided Modules
Kunwei Lv
Ruobing Wu
Suyang Chen
Ping Lan
559
13
0
17 Nov 2024
EVT: Efficient View Transformation for Multi-Modal 3D Object Detection
Yongjin Lee
Hyeon-Mun Jeong
Yurim Jeon
Sanghyun Kim
541
2
0
16 Nov 2024
RETR: Multi-View Radar Detection Transformer for Indoor Perception
Neural Information Processing Systems (NeurIPS), 2024
Ryoma Yataka
Adriano Cardace
Peng Wang
P. Boufounos
R. Takahashi
373
11
0
15 Nov 2024
Prompt-Guided Environmentally Consistent Adversarial Patch
Chaoqun Li
Huanqian Yan
Lifeng Zhou
Tairan Chen
Zhuodong Liu
Hang Su
DiffM, AAML
208
1
0
15 Nov 2024
Toward Robust and Accurate Adversarial Camouflage Generation against Vehicle Detectors
IEEE Transactions on Dependable and Secure Computing (IEEE TDSC), 2024
Jiawei Zhou
Linye Lyu
Daojing He
Yu Li
AAML
251
1
0
15 Nov 2024
SynCL: A Synergistic Training Strategy with Instance-Aware Contrastive Learning for End-to-End Multi-Camera 3D Tracking
Shubo Lin
Yutong Kou
Zirui Wu
Shaoru Wang
Bing Li
Weiming Hu
Jin Gao
VOT
271
0
0
11 Nov 2024
LSSInst: Improving Geometric Modeling in LSS-Based BEV Perception with Instance Representation
International Conference on 3D Vision (3DV), 2024
Weijie Ma
Jingwei Jiang
Yue Yang
Zhaoyu Chen
Hao Chen
325
3
0
09 Nov 2024
Moving Off-the-Grid: Scene-Grounded Video Representations
Neural Information Processing Systems (NeurIPS), 2024
Sjoerd van Steenkiste
Daniel Zoran
Yi Yang
Yulia Rubanova
Rishabh Kabra
...
Thomas Keck
João Carreira
Alexey Dosovitskiy
Mehdi S. M. Sajjadi
Thomas Kipf
308
10
0
08 Nov 2024
OccLoff: Learning Optimized Feature Fusion for 3D Occupancy Prediction
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Ji Zhang
Yiran Ding
Zixin Liu
3DPC
351
5
0
06 Nov 2024
Efficient Feature Aggregation and Scale-Aware Regression for Monocular 3D Object Detection
Yifan Wang
Xiaohu Yang
Fanqi Pu
Q. Liao
Wenming Yang
344
1
0
05 Nov 2024
VQ-Map: Bird's-Eye-View Map Layout Estimation in Tokenized Discrete Space via Vector Quantization
Neural Information Processing Systems (NeurIPS), 2024
Yiwei Zhang
Jin Gao
Fudong Ge
Guan Luo
Bing Li
Zheng Zhang
Haibin Ling
Weiming Hu
166
1
0
03 Nov 2024
FactorizePhys: Matrix Factorization for Multidimensional Attention in Remote Physiological Sensing
Neural Information Processing Systems (NeurIPS), 2024
Jitesh Joshi
Sos S. Agaian
Youngjun Cho
AI4TS
323
9
0
03 Nov 2024
IO Transformer: Evaluating SwinV2-Based Reward Models for Computer Vision
Maxwell Meyer
Jack Spruyt
ViT
131
0
0
31 Oct 2024
Uncertainty Estimation for 3D Object Detection via Evidential Learning
Nikita Durasov
Rafid Mahmood
Jiwoong Choi
Marc T. Law
James Lucas
Pascal Fua
Jose M. Alvarez
UQCV, EDL, 3DPC
349
5
0
31 Oct 2024
GigaCheck: Detecting LLM-generated Content
Irina Tolstykh
Aleksandra Tsybina
Sergey Yakubson
Aleksandr Gordeev
Vladimir Dokholyan
Maksim Kuprashevich
DeLMO
313
4
0
31 Oct 2024
IP-MOT: Instance Prompt Learning for Cross-Domain Multi-Object Tracking
Run Luo
Zikai Song
Longze Chen
Yunshui Li
Min Yang
Wei-Guo Yang
209
1
0
30 Oct 2024
Unbiased Regression Loss for DETRs
Edric
Ueta Daisuke
Kurokawa Yukimasa
Karlekar Jayashree
Sugiri Pranata
150
0
0
30 Oct 2024
BEVPose: Unveiling Scene Semantics through Pose-Guided Multi-Modal BEV Alignment
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024
M. Hosseinzadeh
Ian Reid
240
2
0
28 Oct 2024
Referring Human Pose and Mask Estimation in the Wild
Neural Information Processing Systems (NeurIPS), 2024
Bo Miao
Mingtao Feng
Zijie Wu
Mohammed Bennamoun
Yongsheng Gao
Lin Wang
237
7
0
27 Oct 2024
Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models
Neural Information Processing Systems (NeurIPS), 2024
Shenghao Fu
Junkai Yan
Q. Yang
Xihan Wei
Xiaohua Xie
Wei-Shi Zheng
VLM
301
15
0
25 Oct 2024
Prompting Continual Person Search
ACM Multimedia (MM), 2024
Pengcheng Zhang
Xiaohan Yu
Xiao Bai
Jin Zheng
Xin Ning
CLL, VLM
244
3
0
25 Oct 2024
DCT-HistoTransformer: Efficient Lightweight Vision Transformer with DCT Integration for histopathological image analysis
Iranian Conference on Biomedical Engineering (ICBME), 2024
Mahtab Ranjbar
Mehdi Mohebbi
Mahdi Cherakhloo
Bijan Vosoughi Vahdat
MedIm
254
3
0
24 Oct 2024
PixelGaussian: Generalizable 3D Gaussian Reconstruction from Arbitrary Views
Xin Fei
Wenzhao Zheng
Yueqi Duan
Weidong Zhan
Masayoshi Tomizuka
Kurt Keutzer
Jiwen Lu
3DGS
276
14
0
24 Oct 2024
PETAH: Parameter Efficient Task Adaptation for Hybrid Transformers in a resource-limited Context
Maximilian Augustin
Syed Shakib Sarwar
Mostafa Elhoushi
Sai Qian Zhang
Yuecheng Li
B. D. Salvo
260
1
0
23 Oct 2024
AlphaChimp: Tracking and Behavior Recognition of Chimpanzees
Xiaoxuan Ma
Yutang Lin
Yuan Xu
Stephan P. Kaufhold
Jack Terwilliger
Andres Meza
Yixin Zhu
Federico Rossano
Yizhou Wang
465
4
0
22 Oct 2024
DI-MaskDINO: A Joint Object Detection and Instance Segmentation Model
Neural Information Processing Systems (NeurIPS), 2024
Jingjing Jiang
Xianghong Li
Tao Xiang
Jifeng Dai
ISeg
269
9
0
22 Oct 2024
Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text Describability
Yusuke Hosoya
Masanori Suganuma
Takayuki Okatani
ObjD
288
0
0
20 Oct 2024
YOLO-RD: Introducing Relevant and Compact Explicit Knowledge to YOLO by Retriever-Dictionary
International Conference on Learning Representations (ICLR), 2024
Hao-Tang Tsui
Chien-Yao Wang
H. Liao
ObjD, VLM
401
0
0
20 Oct 2024
D-FINE: Redefine Regression Task in DETRs as Fine-grained Distribution Refinement
Yansong Peng
Hebei Li
Peixi Wu
Yueyi Zhang
Xingwu Sun
Feng Wu
244
78
0
17 Oct 2024
Improving Multi-modal Large Language Model through Boosting Vision Capabilities
Yanpeng Sun
Han Zhang
Qiang Chen
Xinyu Zhang
Nong Sang
Gang Zhang
Jingdong Wang
Zechao Li
213
10
0
17 Oct 2024
Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation
Changcheng Xiao
Qiong Cao
Yujie Zhong
Xiang Zhang
Tao Wang
Canqun Yang
L. Lan
231
3
0
17 Oct 2024
UniDrive: Towards Universal Driving Perception Across Camera Configurations
Ye Li
Wenzhao Zheng
Xiaonan Huang
Kurt Keutzer
428
1
0
17 Oct 2024
Cocoon: Robust Multi-Modal Perception with Uncertainty-Aware Sensor Fusion
Minkyoung Cho
Yulong Cao
Jiachen Sun
Qingzhao Zhang
Marco Pavone
Jeong Joon Park
Heng Yang
Z. Morley Mao
210
5
0
16 Oct 2024