ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2010.04159
  4. Cited By
Deformable DETR: Deformable Transformers for End-to-End Object Detection
v1v2v3v4 (latest)

Deformable DETR: Deformable Transformers for End-to-End Object Detection

International Conference on Learning Representations (ICLR), 2020
8 October 2020
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
    ViT
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)Github (3553★)

Papers citing "Deformable DETR: Deformable Transformers for End-to-End Object Detection"

50 / 2,782 papers shown
CycleHOI: Improving Human-Object Interaction Detection with Cycle
  Consistency of Detection and Generation
CycleHOI: Improving Human-Object Interaction Detection with Cycle Consistency of Detection and Generation
Yisen Wang
Yao Teng
Limin Wang
DiffM
333
6
0
16 Jul 2024
Continuity Preserving Online CenterLine Graph Learning
Continuity Preserving Online CenterLine Graph Learning
Yunhui Han
Kun Yu
Zhiwei Li
GNN3DPC
319
3
0
16 Jul 2024
TCFormer: Visual Recognition via Token Clustering Transformer
TCFormer: Visual Recognition via Token Clustering Transformer
Wang Zeng
Sheng Jin
Lumin Xu
Wentao Liu
Chao Qian
Wanli Ouyang
Ping Luo
Xiaogang Wang
198
15
0
16 Jul 2024
Can Textual Semantics Mitigate Sounding Object Segmentation Preference?
Can Textual Semantics Mitigate Sounding Object Segmentation Preference?
Yaoting Wang
Peiwen Sun
Yuanchao Li
Honggang Zhang
Di Hu
324
13
0
15 Jul 2024
RepVF: A Unified Vector Fields Representation for Multi-task 3D
  Perception
RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception
Chunliang Li
Wencheng Han
Junbo Yin
Sanyuan Zhao
Jianbing Shen
227
10
0
15 Jul 2024
GTPT: Group-based Token Pruning Transformer for Efficient Human Pose
  Estimation
GTPT: Group-based Token Pruning Transformer for Efficient Human Pose Estimation
Haonan Wang
Jie Liu
Jie Tang
Gangshan Wu
Bo Xu
Y. Kevin Chou
Yong Wang
ViT
294
8
0
15 Jul 2024
SEED: A Simple and Effective 3D DETR in Point Clouds
SEED: A Simple and Effective 3D DETR in Point Clouds
Yanfeng Guo
Jinghua Hou
Xiaoqing Ye
Tong Wang
Jingdong Wang
Xiang Bai
3DPC
215
21
0
15 Jul 2024
Joint-Embedding Predictive Architecture for Self-Supervised Learning of
  Mask Classification Architecture
Joint-Embedding Predictive Architecture for Self-Supervised Learning of Mask Classification Architecture
Donghee Kim
Sungduk Cho
Hyeonwoo Cho
Chanmin Park
Jinyoung Kim
Won Hwa Kim
281
0
0
15 Jul 2024
FRI-Net: Floorplan Reconstruction via Room-wise Implicit Representation
FRI-Net: Floorplan Reconstruction via Room-wise Implicit Representation
Honghao Xu
Juzhan Xu
Zeyu Huang
Pengfei Xu
Hui Huang
Ruizhen Hu
3DV
201
4
0
15 Jul 2024
PolyRoom: Room-aware Transformer for Floorplan Reconstruction
PolyRoom: Room-aware Transformer for Floorplan Reconstruction
Yuzhou Liu
Lingjie Zhu
Xiaodong Ma
Hanqiao Ye
Xiang Gao
Xianwei Zheng
Shuhan Shen
203
6
0
15 Jul 2024
When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model
  and Benchmark Dataset
When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset
Yi Zhang
Wang Zeng
Sheng Jin
Chao Qian
Ping Luo
Wentao Liu
265
19
0
14 Jul 2024
Plain-Det: A Plain Multi-Dataset Object Detector
Plain-Det: A Plain Multi-Dataset Object Detector
Cheng Shi
Yuchen Zhu
Sibei Yang
ObjDVLM
250
8
0
14 Jul 2024
MutDet: Mutually Optimizing Pre-training for Remote Sensing Object
  Detection
MutDet: Mutually Optimizing Pre-training for Remote Sensing Object Detection
Ziyue Huang
Yongchao Feng
Qingjie Liu
Yunhong Wang
ViT
333
3
0
13 Jul 2024
IFTR: An Instance-Level Fusion Transformer for Visual Collaborative
  Perception
IFTR: An Instance-Level Fusion Transformer for Visual Collaborative Perception
Shaohong Wang
Lu Bin
Xinyu Xiao
Zhiyu Xiang
Hangguan Shan
Eryun Liu
ViT
326
8
0
13 Jul 2024
Image Compression for Machine and Human Vision with Spatial-Frequency
  Adaptation
Image Compression for Machine and Human Vision with Spatial-Frequency Adaptation
Han Li
Shaohui Li
Shuangrui Ding
Wenrui Dai
Maida Cao
Chenglin Li
Junni Zou
Hongkai Xiong
VLM
338
28
0
13 Jul 2024
Neural-based Video Compression on Solar Dynamics Observatory Images
Neural-based Video Compression on Solar Dynamics Observatory Images
Atefeh Khoshkhahtinat
Ali Zafari
P. Mehta
Nasser M. Nasrabadi
Barbara J. Thompson
M. Kirk
D. D. Silva
298
1
0
12 Jul 2024
FD-SOS: Vision-Language Open-Set Detectors for Bone Fenestration and
  Dehiscence Detection from Intraoral Images
FD-SOS: Vision-Language Open-Set Detectors for Bone Fenestration and Dehiscence Detection from Intraoral Images
Marawan Elbatel
Keyuan Liu
Yanqi Yang
Xuelong Li
157
2
0
12 Jul 2024
Domain-adaptive Video Deblurring via Test-time Blurring
Domain-adaptive Video Deblurring via Test-time Blurring
Jin-Ting He
Fu-Jen Tsai
Jia-Hao Wu
Yan-Tsung Peng
Chung-Chi Tsai
Chia-Wen Lin
Yen-Yu Lin
244
8
0
12 Jul 2024
DroneMOT: Drone-based Multi-Object Tracking Considering Detection
  Difficulties and Simultaneous Moving of Drones and Objects
DroneMOT: Drone-based Multi-Object Tracking Considering Detection Difficulties and Simultaneous Moving of Drones and Objects
Peng Wang
Yongcai Wang
Deying Li
VOT
266
14
0
12 Jul 2024
Textual Query-Driven Mask Transformer for Domain Generalized
  Segmentation
Textual Query-Driven Mask Transformer for Domain Generalized Segmentation
Byeonghyun Pak
Byeongju Woo
Sunghwan Kim
Dae-Hwan Kim
Hoseong Kim
436
20
0
12 Jul 2024
Revealing the Dark Secrets of Extremely Large Kernel ConvNets on
  Robustness
Revealing the Dark Secrets of Extremely Large Kernel ConvNets on Robustness
Honghao Chen
Yurong Zhang
Xiaokun Feng
Xiangxiang Chu
Kaiqi Huang
AAML
300
10
0
12 Jul 2024
Semi-Supervised Object Detection: A Survey on Progress from CNN to
  Transformer
Semi-Supervised Object Detection: A Survey on Progress from CNN to Transformer
Tahira Shehzadi
Ifza
Didier Stricker
Muhammad Zeshan Afzal
ViT
369
11
0
11 Jul 2024
Cross Domain Object Detection via Multi-Granularity Confidence Alignment
  based Mean Teacher
Cross Domain Object Detection via Multi-Granularity Confidence Alignment based Mean Teacher
Jiangming Chen
Li Liu
Wanxia Deng
Zhen Liu
Yu Liu
Yingmei Wei
Yongxiang Liu
227
2
0
10 Jul 2024
DIOR-ViT: Differential Ordinal Learning Vision Transformer for Cancer
  Classification in Pathology Images
DIOR-ViT: Differential Ordinal Learning Vision Transformer for Cancer Classification in Pathology Images
Ju Cheon Lee
Keunho Byeon
Boram Song
Kyungeun Kim
Jin Tae Kwak
MedIm
307
4
0
10 Jul 2024
Deformable-Heatmap-Segmentation for Automobile Visual Perception
Deformable-Heatmap-Segmentation for Automobile Visual Perception
Hongyu Jin
109
1
0
10 Jul 2024
ActionVOS: Actions as Prompts for Video Object Segmentation
ActionVOS: Actions as Prompts for Video Object Segmentation
Liangyang Ouyang
Ruicong Liu
Yifei Huang
Ryosuke Furuta
Yoichi Sato
VOS
222
9
0
10 Jul 2024
Exploring Camera Encoder Designs for Autonomous Driving Perception
Exploring Camera Encoder Designs for Autonomous Driving Perception
Barath Lakshmanan
Joshua Chen
Shiyi Lan
Maying Shen
Zhiding Yu
Jose M. Alvarez
263
0
0
09 Jul 2024
D-MASTER: Mask Annealed Transformer for Unsupervised Domain Adaptation
  in Breast Cancer Detection from Mammograms
D-MASTER: Mask Annealed Transformer for Unsupervised Domain Adaptation in Breast Cancer Detection from Mammograms
Tajamul Ashraf
K. Rangarajan
Mohit Gambhir
Richa Gabha
Chetan Arora
MedIm
263
4
0
09 Jul 2024
Anatomy-guided Pathology Segmentation
Anatomy-guided Pathology Segmentation
A. Jaus
C. Seibold
Simon Reiß
Lukas Heine
Anton Schily
Moon Kim
F. Bahnsen
Ken Herrmann
Rainer Stiefelhagen
Jens Kleesiek
MedIm
186
9
0
08 Jul 2024
Learning Lane Graphs from Aerial Imagery Using Transformers
Learning Lane Graphs from Aerial Imagery Using Transformers
Martin Büchner
Simon Dorer
Abhinav Valada
196
0
0
08 Jul 2024
Described Spatial-Temporal Video Detection
Described Spatial-Temporal Video Detection
Wei Ji
Xiangyan Liu
Yingfei Sun
Jiajun Deng
You Qin
Ammar Nuwanna
Mengyao Qiu
Lina Wei
Roger Zimmermann
285
3
0
08 Jul 2024
Smart Camera Parking System With Auto Parking Spot Detection
Smart Camera Parking System With Auto Parking Spot Detection
Tuan T. Nguyen
Mina Sartipi
215
5
0
07 Jul 2024
JDT3D: Addressing the Gaps in LiDAR-Based Tracking-by-Attention
JDT3D: Addressing the Gaps in LiDAR-Based Tracking-by-Attention
Brian Cheong
Jiachen Zhou
Steven Waslander
250
4
0
06 Jul 2024
Multi-Branch Auxiliary Fusion YOLO with Re-parameterization
  Heterogeneous Convolutional for accurate object detection
Multi-Branch Auxiliary Fusion YOLO with Re-parameterization Heterogeneous Convolutional for accurate object detection
Zhiqiang Yang
Q. Guan
Keer Zhao
Jianmin Yang
Xinli Xu
Haixia Long
Ying Tang
296
101
0
05 Jul 2024
AMD: Automatic Multi-step Distillation of Large-scale Vision Models
AMD: Automatic Multi-step Distillation of Large-scale Vision Models
Cheng Han
Qifan Wang
S. Dianat
Majid Rabbani
Raghuveer M. Rao
Yi Fang
Qiang Guan
Lifu Huang
Dongfang Liu
VLM
209
14
0
05 Jul 2024
QueryMamba: A Mamba-Based Encoder-Decoder Architecture with a
  Statistical Verb-Noun Interaction Module for Video Action Forecasting @ Ego4D
  Long-Term Action Anticipation Challenge 2024
QueryMamba: A Mamba-Based Encoder-Decoder Architecture with a Statistical Verb-Noun Interaction Module for Video Action Forecasting @ Ego4D Long-Term Action Anticipation Challenge 2024
Zeyun Zhong
Manuel Martin
Frederik Diederichs
Juergen Beyerer
195
5
0
04 Jul 2024
Occupancy as Set of Points
Occupancy as Set of Points
Yiang Shi
Tianheng Cheng
Qian Zhang
Wenyu Liu
Xinggang Wang
3DPC
280
28
0
04 Jul 2024
Cyclic Refiner: Object-Aware Temporal Representation Learning for
  Multi-View 3D Detection and Tracking
Cyclic Refiner: Object-Aware Temporal Representation Learning for Multi-View 3D Detection and Tracking
Mingzhe Guo
Zhipeng Zhang
Liping Jing
Yuan He
Ke Wang
Heng Fan
246
3
0
03 Jul 2024
Graph and Skipped Transformer: Exploiting Spatial and Temporal Modeling
  Capacities for Efficient 3D Human Pose Estimation
Graph and Skipped Transformer: Exploiting Spatial and Temporal Modeling Capacities for Efficient 3D Human Pose Estimation
Mengmeng Cui
Kunbo Zhang
Zhenan Sun
ViT
228
0
0
03 Jul 2024
CAVIS: Context-Aware Video Instance Segmentation
CAVIS: Context-Aware Video Instance Segmentation
Seunghun Lee
Jiwan Seo
Kiljoon Han
Minwoo Choi
S. Im
VOS
380
4
0
03 Jul 2024
Research on Reliable and Safe Occupancy Grid Prediction in Underground
  Parking Lots
Research on Reliable and Safe Occupancy Grid Prediction in Underground Parking Lots
JiaQi Luo
190
0
0
02 Jul 2024
CountFormer: Multi-View Crowd Counting Transformer
CountFormer: Multi-View Crowd Counting Transformer
Hong Mo
Xiong Zhang
Jianchao Tan
Cheng Yang
Qiong Gu
Bo Hang
Wenqi Ren
292
9
0
02 Jul 2024
SymPoint Revolutionized: Boosting Panoptic Symbol Spotting with Layer
  Feature Enhancement
SymPoint Revolutionized: Boosting Panoptic Symbol Spotting with Layer Feature Enhancement
Wenlong Liu
Tianyu Yang
Qizhi Yu
Lei Zhang
222
4
0
02 Jul 2024
LPViT: Low-Power Semi-structured Pruning for Vision Transformers
LPViT: Low-Power Semi-structured Pruning for Vision Transformers
Kaixin Xu
Zhe Wang
Chunyun Chen
Xue Geng
Jie Lin
Xulei Yang
Ruibing Jin
Min Wu
Xiaoli Li
Weisi Lin
ViTVLM
666
18
0
02 Jul 2024
Robot Instance Segmentation with Few Annotations for Grasping
Robot Instance Segmentation with Few Annotations for Grasping
Moshe Kimhi
David Vainshtein
Chaim Baskin
Dotan Di Castro
428
6
0
01 Jul 2024
SOOD++: Leveraging Unlabeled Data to Boost Oriented Object Detection
SOOD++: Leveraging Unlabeled Data to Boost Oriented Object Detection
Dingkang Liang
Wei Hua
Chunsheng Shi
Zhikang Zou
Xiaoqing Ye
X. Bai
365
10
0
01 Jul 2024
Parametric Primitive Analysis of CAD Sketches with Vision Transformer
Parametric Primitive Analysis of CAD Sketches with Vision Transformer
Xiaogang Wang
Liang Wang
Hongyu Wu
Guoqiang Xiao
Kai Xu
171
4
0
29 Jun 2024
GM-DF: Generalized Multi-Scenario Deepfake Detection
GM-DF: Generalized Multi-Scenario Deepfake Detection
Yingxin Lai
Zitong Yu
Jing Yang
Bin Li
Xiangui Kang
Linlin Shen
308
19
0
28 Jun 2024
Mobile Robot Oriented Large-Scale Indoor Dataset for Dynamic Scene
  Understanding
Mobile Robot Oriented Large-Scale Indoor Dataset for Dynamic Scene Understanding
Yifan Tang
Cong Tai
Fangxing Chen
Wanting Zhang
Tao Zhang
Xueping Liu
Yongjin Liu
Long Zeng
262
10
0
28 Jun 2024
Fibottention: Inceptive Visual Representation Learning with Diverse
  Attention Across Heads
Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads
Ali Khaleghi Rahimian
Manish Kumar Govind
Subhajit Maity
Dominick Reilly
Christian Kummerle
Srijan Das
A. Dutta
240
1
0
27 Jun 2024
Previous
123...141516...545556
Next
Page 15 of 56
Pageof 56