Deformable DETR: Deformable Transformers for End-to-End Object Detection
International Conference on Learning Representations (ICLR), 2020
8 October 2020
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
    ViT

Papers citing "Deformable DETR: Deformable Transformers for End-to-End Object Detection"

50 / 2,788 papers shown
IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection
Computer Vision and Pattern Recognition (CVPR), 2024
Junbo Yin
Jianbing Shen
Runnan Chen
Wei Li
Ruigang Yang
Pascal Frossard
Wenguan Wang
3DPC
425
95
0
22 Mar 2024
MSCoTDet: Language-driven Multi-modal Fusion for Improved Multispectral Pedestrian Detection
Taeheon Kim
Sangyun Chung
Damin Yeom
Youngjoon Yu
Hak Gu Kim
Y. Ro
431
9
0
22 Mar 2024
Infrastructure-Assisted Collaborative Perception in Automated Valet Parking: A Safety Perspective
IEEE Vehicular Technology Conference (VTC), 2024
Yukuan Jia
Jiawen Zhang
Shimeng Lu
Baokang Fan
Ruiqing Mao
Sheng Zhou
Z. Niu
234
4
0
22 Mar 2024
Vehicle Detection Performance in Nordic Region
International Conference on Pattern Recognition (ICPR), 2024
Hamam Mokayed
Rajkumar Saini
Oluwatosin Adewumi
Lama Alkhaled
Björn Backe
P. Shivakumara
Olle Hagner
Yan Chai Hum
213
1
0
22 Mar 2024
Preventing Catastrophic Forgetting through Memory Networks in Continuous Detection
Gaurav Bhatt
James Ross
Leonid Sigal
CLL, VLM
290
10
0
21 Mar 2024
LiFT: A Surprisingly Simple Lightweight Feature Transform for Dense ViT Descriptors
Saksham Suri
Matthew Walmer
Kamal Gupta
Abhinav Shrivastava
315
22
0
21 Mar 2024
ODTFormer: Efficient Obstacle Detection and Tracking with Stereo Cameras Based on Transformer
Tianye Ding
Hongyu Li
Huaizu Jiang
229
1
0
21 Mar 2024
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
Qing Jiang
Feng Li
Zhaoyang Zeng
Tianhe Ren
Shilong Liu
Lei Zhang
VLM
433
89
0
21 Mar 2024
LDTR: Transformer-based Lane Detection with Anchor-chain Representation
Zhongyu Yang
Chen Shen
Wei Shao
Tengfei Xing
Runbo Hu
Pengfei Xu
Hua Chai
Ruini Xue
ViT
222
12
0
21 Mar 2024
Scene-Graph ViT: End-to-End Open-Vocabulary Visual Relationship Detection
Tim Salzmann
Markus Ryll
Alex Bewley
Matthias Minderer
346
9
0
21 Mar 2024
Volumetric Environment Representation for Vision-Language Navigation
Rui Liu
Wenguan Wang
Yi Yang
310
70
0
21 Mar 2024
Meta-Point Learning and Refining for Category-Agnostic Pose Estimation
Junjie Chen
Jiebin Yan
Yuming Fang
Li Niu
3DPC
362
9
0
20 Mar 2024
vid-TLDR: Training Free Token merging for Light-weight Video Transformer
Joonmyung Choi
Sanghyeok Lee
Jaewon Chu
Minhyuk Choi
Hyunwoo J. Kim
MoMe, ViT
350
46
0
20 Mar 2024
Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models
Zuyan Liu
Yuhao Dong
Yongming Rao
Jie Zhou
Jiwen Lu
LRM
285
51
0
19 Mar 2024
TAPTR: Tracking Any Point with Transformers as Detection
Hongyang Li
Hao Zhang
Shilong Liu
Zhaoyang Zeng
Tianhe Ren
Feng Li
Lei Zhang
261
48
0
19 Mar 2024
FaceXFormer: A Unified Transformer for Facial Analysis
Kartik Narayan
VS Vibashan
Rama Chellappa
Vishal M. Patel
ViT
553
40
0
19 Mar 2024
Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation
European Conference on Computer Vision (ECCV), 2024
Zixin Zhu
Xuelu Feng
Dongdong Chen
Junsong Yuan
Chunming Qiao
Gang Hua
DiffM
352
23
0
18 Mar 2024
BEVCar: Camera-Radar Fusion for BEV Map and Object Segmentation
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024
Jonas Schramm
Niclas Vodisch
Kürsat Petek
Ravi Kiran
S. Yogamani
Wolfram Burgard
Abhinav Valada
293
41
0
18 Mar 2024
Continual Forgetting for Pre-trained Vision Models
Computer Vision and Pattern Recognition (CVPR), 2024
Hongbo Zhao
Bolin Ni
Haochen Wang
Junsong Fan
Fei Zhu
Yuxi Wang
Yuntao Chen
Gaofeng Meng
Zhaoxiang Zhang
MU, VLM
379
24
0
18 Mar 2024
GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection
Ziying Song
Lei Yang
Shaoqing Xu
Lin Liu
Dongyang Xu
Caiyan Jia
Feiyang Jia
Li-e Wang
3DPC
625
57
0
18 Mar 2024
Align and Distill: Unifying and Improving Domain Adaptive Object Detection
Justin Kay
T. Haucke
Suzanne Stathatos
Siqi Deng
Erik Young
Pietro Perona
Sara Beery
Grant Van Horn
568
16
0
18 Mar 2024
Domain-Guided Masked Autoencoders for Unique Player Identification
Bavesh Balaji
Jerrin Bright
Sirisha Rambhatla
Yuhao Chen
Alexander Wong
John S. Zelek
David A Clausi
212
2
0
17 Mar 2024
NetTrack: Tracking Highly Dynamic Objects with a Net
Guang-Zheng Zheng
Shijie Lin
Haobo Zuo
Changhong Fu
Jia Pan
317
27
0
17 Mar 2024
Diffusion Models are Efficient Data Generators for Human Mesh Recovery
Yongtao Ge
Wenjia Wang
Yongfan Chen
Fanzhou Wang
Lei Yang
Hao Chen
Chunhua Shen
3DH
518
8
0
17 Mar 2024
SimPB: A Single Model for 2D and 3D Object Detection from Multiple Cameras
European Conference on Computer Vision (ECCV), 2024
Yingqi Tang
Zhaotie Meng
Guoliang Chen
Erkang Cheng
3DPC
278
5
0
15 Mar 2024
Generative Region-Language Pretraining for Open-Ended Object Detection
Computer Vision and Pattern Recognition (CVPR), 2024
Chuang Lin
Yi Jiang
Zhuang Li
Zehuan Yuan
Jianfei Cai
ObjD, VLM
249
28
0
15 Mar 2024
TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model
Jiahao Lyu
Jin Wei
Gangyan Zeng
Zeng Li
Enze Xie
Wei Wang
Can Ma
VLM
332
8
0
15 Mar 2024
SparseFusion: Efficient Sparse Multi-Modal Fusion Framework for Long-Range 3D Perception
Yiheng Li
Hongyang Li
Zehao Huang
Hong Chang
Naiyan Wang
309
11
0
15 Mar 2024
Multi-criteria Token Fusion with One-step-ahead Attention for Efficient Vision Transformers
Computer Vision and Pattern Recognition (CVPR), 2024
Sanghyeok Lee
Joonmyung Choi
Hyunwoo J. Kim
ViT
448
27
0
15 Mar 2024
ST-LDM: A Universal Framework for Text-Grounded Object Generation in Real Images
European Conference on Computer Vision (ECCV), 2024
Xiangtian Xue
Jiasong Wu
Youyong Kong
L. Senhadji
Huazhong Shu
DiffM
204
1
0
15 Mar 2024
EfficientVMamba: Atrous Selective Scan for Light Weight Visual Mamba
AAAI Conference on Artificial Intelligence (AAAI), 2024
Xiaohuan Pei
Tao Huang
Chang Xu
Mamba
347
221
0
15 Mar 2024
HyCTAS: Multi-Objective Hybrid Convolution-Transformer Architecture Search for Real-Time Image Segmentation
Hongyuan Yu
Cheng Wan
Xiyang Dai
Dongdong Chen
Bin Xiao
Xiyang Dai
Yan Huang
Yuan Lu
Liang Wang
345
8
0
15 Mar 2024
Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding
Guo Chen
Yifei Huang
Jilan Xu
Baoqi Pei
Zhe Chen
Zhiqi Li
Jiahao Wang
Kunchang Li
Tong Lu
Limin Wang
Mamba
342
138
0
14 Mar 2024
Open-Vocabulary Object Detection with Meta Prompt Representation and Instance Contrastive Optimization
British Machine Vision Conference (BMVC), 2024
Zhao Wang
Aoxue Li
Fengwei Zhou
Zhenguo Li
Qi Dou
ObjD, VLM
273
5
0
14 Mar 2024
Efficient Transferability Assessment for Selection of Pre-trained Detectors
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Zhao Wang
Aoxue Li
Zhenguo Li
Qi Dou
215
0
0
14 Mar 2024
GiT: Towards Generalist Vision Transformer through Universal Language Interface
European Conference on Computer Vision (ECCV), 2024
Haiyang Wang
Hao Tang
Li Jiang
Shaoshuai Shi
Muhammad Ferjad Naeem
Jiaming Song
Bernt Schiele
Liwei Wang
VLM
307
24
0
14 Mar 2024
PoIFusion: Multi-Modal 3D Object Detection via Fusion at Points of Interest
Jiajun Deng
Sha Zhang
Feras Dayoub
Wanli Ouyang
Yanyong Zhang
Ian Reid
3DPC
341
8
0
14 Mar 2024
PYRA: Parallel Yielding Re-Activation for Training-Inference Efficient Task Adaptation
European Conference on Computer Vision (ECCV), 2024
Yizhe Xiong
Hui Chen
Tianxiang Hao
Zijia Lin
Jungong Han
Yuesong Zhang
Guoxin Wang
Yongjun Bao
Guiguang Ding
416
26
0
14 Mar 2024
ThermoHands: A Benchmark for 3D Hand Pose Estimation from Egocentric Thermal Images
ACM International Conference on Embedded Networked Sensor Systems (SenSys), 2024
Fangqiang Ding
Yunzhou Zhu
Xiangyu Wen
Gaowen Liu
Chris Xiaoxuan Lu
605
10
0
14 Mar 2024
MonoOcc: Digging into Monocular Semantic Occupancy Prediction
IEEE International Conference on Robotics and Automation (ICRA), 2024
Yupeng Zheng
Xiang Li
Pengfei Li
Yuhang Zheng
Bu Jin
Chengliang Zhong
Xiaoxiao Long
Hao Zhao
Qichao Zhang
257
49
0
13 Mar 2024
MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning
Jialv Zou
Bencheng Liao
Qian Zhang
Wenyu Liu
Xinggang Wang
294
6
0
13 Mar 2024
Historical Astronomical Diagrams Decomposition in Geometric Primitives
IEEE International Conference on Document Analysis and Recognition (ICDAR), 2024
Syrine Kalleli
Scott Trigg
Ségolene Albouy
Mathieu Husson
Mathieu Aubry
192
2
0
13 Mar 2024
HIMap: HybrId Representation Learning for End-to-end Vectorized HD Map Construction
Computer Vision and Pattern Recognition (CVPR), 2024
Yi Zhou
Hui Zhang
Jiaqian Yu
Yifan Yang
Sangil Jung
Seungsang Park
ByungIn Yoo
3DPC
316
51
0
13 Mar 2024
TaskCLIP: Extend Large Vision-Language Model for Task Oriented Object Detection
Hanning Chen
Wenjun Huang
Yang Ni
Sanggeon Yun
Fei Wen
Hugo Latapie
Mohsen Imani
ObjD, MLLM, VLM
252
31
0
12 Mar 2024
A Survey of Vision Transformers in Autonomous Driving: Current Trends and Future Directions
Quoc-Vinh Lai-Dang
ViT
311
14
0
12 Mar 2024
ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions
Computer Vision and Pattern Recognition (CVPR), 2024
Chunlong Xia
Xinliang Wang
Feng Lv
Xin Hao
Yifeng Shi
ViT
459
137
0
12 Mar 2024
SparseLIF: High-Performance Sparse LiDAR-Camera Fusion for 3D Object Detection
European Conference on Computer Vision (ECCV), 2024
Hongcheng Zhang
Liu Liang
Pengxin Zeng
Xiao Song
Zhe Wang
409
29
0
12 Mar 2024
Unleashing HyDRa: Hybrid Fusion, Depth Consistency and Radar for Unified 3D Perception
IEEE International Conference on Robotics and Automation (ICRA), 2024
Philipp Wolters
Johannes Gilg
Torben Teepe
Fabian Herzog
Anouar Laouichi
Martin Hofmann
Gerhard Rigoll
MDE
621
41
0
12 Mar 2024
Real-time Transformer-based Open-Vocabulary Detection with Efficient Fusion Head
Tiancheng Zhao
Peng Liu
Xuan He
Lu Zhang
Kyusong Lee
ObjD
213
22
0
11 Mar 2024
Genetic Learning for Designing Sim-to-Real Data Augmentations
Bram Vanherle
Nick Michiels
F. Reeth
164
0
0
11 Mar 2024