Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2010.04159
Cited By
v1
v2
v3
v4 (latest)
Deformable DETR: Deformable Transformers for End-to-End Object Detection
International Conference on Learning Representations (ICLR), 2020
8 October 2020
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Github (3553★)
Papers citing
"Deformable DETR: Deformable Transformers for End-to-End Object Detection"
50 / 2,782 papers shown
Sparse Global Matching for Video Frame Interpolation with Large Motion
Computer Vision and Pattern Recognition (CVPR), 2024
Chunxu Liu
Guozhen Zhang
Rui Zhao
Limin Wang
300
27
0
10 Apr 2024
SparseAD: Sparse Query-Centric Paradigm for Efficient End-to-End Autonomous Driving
Diankun Zhang
Guoan Wang
Runwen Zhu
Jianbo Zhao
Xiwu Chen
...
Haotian Yao
Chi Zhang
Xiaojun Liu
Xiaoguang Di
Bin Li
253
34
0
10 Apr 2024
MOSE: Boosting Vision-based Roadside 3D Object Detection with Scene Cues
Xiahan Chen
Mingjian Chen
Sanli Tang
Yi Niu
Jiang Zhu
192
4
0
08 Apr 2024
Hyperbolic Learning with Synthetic Captions for Open-World Detection
Fanjie Kong
Yanbei Chen
Jiarui Cai
Davide Modolo
VLM
ObjD
221
14
0
07 Apr 2024
Dual-Scale Transformer for Large-Scale Single-Pixel Imaging
Gang Qu
Ping Wang
Xin Yuan
MedIm
196
10
0
07 Apr 2024
Bridging the Gap Between End-to-End and Two-Step Text Spotting
Mingxin Huang
Hongliang Li
Yuliang Liu
Xiang Bai
Lianwen Jin
213
11
0
06 Apr 2024
Rethinking Self-training for Semi-supervised Landmark Detection: A Selection-free Approach
Haibo Jin
Haoxuan Che
Hao Chen
261
6
0
06 Apr 2024
Mixed-Query Transformer: A Unified Image Segmentation Architecture
Pei Wang
Zhaowei Cai
Hao Yang
Ashwin Swaminathan
R. Manmatha
Stefano Soatto
267
4
0
06 Apr 2024
DQ-DETR: DETR with Dynamic Query for Tiny Object Detection
European Conference on Computer Vision (ECCV), 2024
Yi-Xin Huang
Hou-I Liu
Hong-Han Shuai
Wen-Huang Cheng
356
58
0
04 Apr 2024
Effective Lymph Nodes Detection in CT Scans Using Location Debiased Query Selection and Contrastive Query Representation in Transformer
European Conference on Computer Vision (ECCV), 2024
Qinji Yu
Yirui Wang
K. Yan
Haoshen Li
Dazhou Guo
...
Na Shen
Qifeng Wang
Xiaowei Ding
X. Ye
Dakai Jin
MedIm
381
2
0
04 Apr 2024
HAPNet: Toward Superior RGB-Thermal Scene Parsing via Hybrid, Asymmetric, and Progressive Heterogeneous Feature Fusion
Jiahang Li
Peng Yun
Qijun Chen
Rui Fan
Mingjian Sun
Qijun Chen
Ilin Alexander
Rui Fan
269
12
0
04 Apr 2024
DPFT: Dual Perspective Fusion Transformer for Camera-Radar-based Object Detection
IEEE Transactions on Intelligent Vehicles (TIV), 2024
F. Fent
Andras Palffy
Holger Caesar
290
24
0
03 Apr 2024
EGTR: Extracting Graph from Transformer for Scene Graph Generation
Computer Vision and Pattern Recognition (CVPR), 2024
Jinbae Im
Jeongyeon Nam
Nokyung Park
Hyungmin Lee
Seunghyun Park
ViT
603
53
0
02 Apr 2024
SelfPose3d: Self-Supervised Multi-Person Multi-View 3d Pose Estimation
Computer Vision and Pattern Recognition (CVPR), 2024
V. Srivastav
Keqi Chen
N. Padoy
364
20
0
02 Apr 2024
Towards Enhanced Analysis of Lung Cancer Lesions in EBUS-TBNA -- A Semi-Supervised Video Object Detection Method
Jyun-An Lin
Yun-Chien Cheng
Ching-Kai Lin
264
1
0
02 Apr 2024
Scene Adaptive Sparse Transformer for Event-based Object Detection
Computer Vision and Pattern Recognition (CVPR), 2024
Yansong Peng
Hebei Li
Yueyi Zhang
Xiaoyan Sun
Feng Wu
ViT
210
41
0
02 Apr 2024
Sparse Semi-DETR: Sparse Learnable Queries for Semi-Supervised Object Detection
Computer Vision and Pattern Recognition (CVPR), 2024
Tahira Shehzadi
K. Hashmi
Didier Stricker
Muhammad Zeshan Afzal
251
28
0
02 Apr 2024
Atom-Level Optical Chemical Structure Recognition with Limited Supervision
Computer Vision and Pattern Recognition (CVPR), 2024
M. Oldenhof
E. Brouwer
Adam Arany
Yves Moreau
124
4
0
02 Apr 2024
Learning Temporal Cues by Predicting Objects Move for Multi-camera 3D Object Detection
IEEE International Conference on Robotics and Automation (ICRA), 2024
Seokha Moon
Hongbeen Park
Jungphil Kwon
Jaekoo Lee
Jinkyu Kim
3DPC
195
1
0
02 Apr 2024
QuAD: Query-based Interpretable Neural Motion Planning for Autonomous Driving
IEEE International Conference on Robotics and Automation (ICRA), 2024
Sourav Biswas
Sergio Casas
Quinlan Sykora
Ben Agro
Abbas Sadat
R. Urtasun
264
13
0
01 Apr 2024
BEM: Balanced and Entropy-based Mix for Long-Tailed Semi-Supervised Learning
Hongwei Zheng
Linyuan Zhou
Han Li
Jinming Su
Xiaoming Wei
Xiaoming Xu
218
9
0
01 Apr 2024
MGMap: Mask-Guided Learning for Online Vectorized HD Map Construction
Xiaolu Liu
Song Wang
Wentong Li
Ruizi Yang
Junbo Chen
Jianke Zhu
212
45
0
01 Apr 2024
Roadside Monocular 3D Detection Prompted by 2D Detection
Yechi Ma
Shuoquan Wei
Churun Zhang
Wei Hua
557
0
0
01 Apr 2024
Dual DETRs for Multi-Label Temporal Action Detection
Yuhan Zhu
Guozhen Zhang
Jing Tan
Gangshan Wu
Limin Wang
254
25
0
31 Mar 2024
AgileFormer: Spatially Agile Transformer UNet for Medical Image Segmentation
Peijie Qiu
Jin Yang
Sayantan Kumar
S. Ghosh
Aristeidis Sotiras
MedIm
226
16
0
29 Mar 2024
Enhancing Efficiency in Vision Transformer Networks: Design Techniques and Insights
Moein Heidari
Reza Azad
Sina Ghorbani Kolahi
René Arimond
Leon Niggemeier
...
Afshin Bozorgpour
Ehsan Khodapanah Aghdam
Amirhossein Kazerouni
Ilker Hacihaliloglu
Dorit Merhof
307
14
0
28 Mar 2024
LocCa: Visual Pretraining with Location-aware Captioners
Bo Wan
Michael Tschannen
Yongqin Xian
Filip Pavetić
Ibrahim Alabdulmohsin
Xiao Wang
André Susano Pinto
Andreas Steiner
Lucas Beyer
Xiao-Qi Zhai
VLM
380
22
0
28 Mar 2024
CAT: Exploiting Inter-Class Dynamics for Domain Adaptive Object Detection
Mikhail Kennerley
Jian-Gang Wang
B. Veeravalli
R. Tan
194
31
0
28 Mar 2024
RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents
Zeren Chen
Zhelun Shi
Xiaoya Lu
Lehan He
Sucheng Qian
...
Zhen-fei Yin
Jing Shao
Jing Shao
Cewu Lu
Cewu Lu
297
13
0
28 Mar 2024
Dense Vision Transformer Compression with Few Samples
Hanxiao Zhang
Yifan Zhou
Guo-Hua Wang
Jianxin Wu
ViT
VLM
242
10
0
27 Mar 2024
Transformers-based architectures for stroke segmentation: A review
Yalda Zafari-Ghadim
Essam A. Rashed
M. Mabrok
MedIm
277
11
0
27 Mar 2024
DODA: Adapting Object Detectors to Dynamic Agricultural Environments in Real-Time with Diffusion
Shuai Xiang
Pieter M. Blok
James Burridge
Haozhou Wang
Wei Guo
418
0
0
27 Mar 2024
Cross-domain Fiber Cluster Shape Analysis for Language Performance Cognitive Score Prediction
Yui Lo
Yuqian Chen
Dongnan Liu
Wan Liu
L. Zekelman
...
Yogesh Rathi
N. Makris
A. Golby
Weidong Cai
L. O’Donnell
302
7
0
27 Mar 2024
EgoPoseFormer: A Simple Baseline for Egocentric 3D Human Pose Estimation
Chenhongyi Yang
Anastasia Tkach
Shreyas Hampali
Linguang Zhang
Elliot J. Crowley
Cem Keskin
213
0
0
26 Mar 2024
AiOS: All-in-One-Stage Expressive Human Pose and Shape Estimation
Qingping Sun
Yanjun Wang
Ailing Zeng
Wanqi Yin
Chen Wei
...
Haiyi Mei
Chi Sing Leung
Ziwei Liu
Lei Yang
Zhongang Cai
3DH
266
40
0
26 Mar 2024
A Survey on Deep Learning and State-of-the-art Applications
Mohd Halim Mohd Noor
A. O. Ige
AILaw
MLAU
214
0
0
26 Mar 2024
QKFormer: Hierarchical Spiking Transformer using Q-K Attention
Chenlin Zhou
Han Zhang
Zhaokun Zhou
Liutao Yu
Liwei Huang
Xiaopeng Fan
Liuliang Yuan
Zhengyu Ma
Huihui Zhou
Yonghong Tian
310
50
0
25 Mar 2024
RCBEVDet: Radar-camera Fusion in Bird's Eye View for 3D Object Detection
Zhiwei Lin
Zhe Liu
Zhongyu Xia
Xinhao Wang
Yongtao Wang
Shengxiang Qi
Yang Dong
Nan Dong
Le Zhang
Ce Zhu
312
111
0
25 Mar 2024
DOCTR: Disentangled Object-Centric Transformer for Point Scene Understanding
Xiaoxuan Yu
Hao Wang
Weiming Li
Qiang Wang
Soonyong Cho
Younghun Sung
3DPC
ViT
153
0
0
25 Mar 2024
Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects
Zicong Fan
Takehiko Ohkawa
Linlin Yang
Nie Lin
Zhishan Zhou
...
Kun He
Yoichi Sato
Otmar Hilliges
Hyung Jin Chang
Angela Yao
252
31
0
25 Mar 2024
Multiple Object Tracking as ID Prediction
Ruopeng Gao
Yijun Zhang
Limin Wang
580
53
0
25 Mar 2024
Salience DETR: Enhancing Detection Transformer with Hierarchical Salience Filtering Refinement
Xiuquan Hou
Meiqin Liu
Senlin Zhang
Ping Wei
Badong Chen
195
80
0
24 Mar 2024
Cross-domain Multi-modal Few-shot Object Detection via Rich Text
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Zeyu Shangguan
Daniel Seita
Mohammad Rostami
ObjD
435
2
0
24 Mar 2024
MapTracker: Tracking with Strided Memory Fusion for Consistent Vector HD Mapping
European Conference on Computer Vision (ECCV), 2024
Jiacheng Chen
Yuefan Wu
Jiaqi Tan
Hang Ma
Yasutaka Furukawa
272
54
0
23 Mar 2024
IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection
Computer Vision and Pattern Recognition (CVPR), 2024
Junbo Yin
Jianbing Shen
Runnan Chen
Wei Li
Ruigang Yang
Pascal Frossard
Wenguan Wang
3DPC
372
82
0
22 Mar 2024
MSCoTDet: Language-driven Multi-modal Fusion for Improved Multispectral Pedestrian Detection
Taeheon Kim
Sangyun Chung
Damin Yeom
Youngjoon Yu
Hak Gu Kim
Y. Ro
405
8
0
22 Mar 2024
Infrastructure-Assisted Collaborative Perception in Automated Valet Parking: A Safety Perspective
IEEE Vehicular Technology Conference (VTC), 2024
Yukuan Jia
Jiawen Zhang
Shimeng Lu
Baokang Fan
Ruiqing Mao
Sheng Zhou
Z. Niu
200
3
0
22 Mar 2024
Vehicle Detection Performance in Nordic Region
International Conference on Pattern Recognition (ICPR), 2024
Hamam Mokayed
Rajkumar Saini
Oluwatosin Adewumi
Lama Alkhaled
Björn Backe
P. Shivakumara
Olle Hagner
Yan Chai Hum
180
1
0
22 Mar 2024
Preventing Catastrophic Forgetting through Memory Networks in Continuous Detection
Gaurav Bhatt
James Ross
Leonid Sigal
CLL
VLM
265
8
0
21 Mar 2024
LiFT: A Surprisingly Simple Lightweight Feature Transform for Dense ViT Descriptors
Saksham Suri
Matthew Walmer
Kamal Gupta
Abhinav Shrivastava
276
17
0
21 Mar 2024
Previous
1
2
3
...
18
19
20
...
54
55
56
Next
Page 19 of 56
Page
of 56
Go