Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2010.04159
Cited By
v1
v2
v3
v4 (latest)
Deformable DETR: Deformable Transformers for End-to-End Object Detection
International Conference on Learning Representations (ICLR), 2020
8 October 2020
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Github (3553★)
Papers citing
"Deformable DETR: Deformable Transformers for End-to-End Object Detection"
50 / 2,782 papers shown
A lightweight model FDM-YOLO for small target improvement based on YOLOv8
Xuerui Zhang
ObjD
258
2
0
06 Mar 2025
Manboformer: Learning Gaussian Representations via Spatial-temporal Attention Mechanism
Ziyue Zhao
Qining Qi
Jianfa Ma
217
0
0
06 Mar 2025
DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance
Zhao Yang
Zezhong Qian
Xiaofan Li
Weixiang Xu
Gongpeng Zhao
Ruohong Yu
Lingsi Zhu
Longjun Liu
DiffM
VGen
368
4
0
05 Mar 2025
Boltzmann Attention Sampling for Image Analysis with Small Objects
Computer Vision and Pattern Recognition (CVPR), 2025
Theodore Zhao
Sid Kiblawi
Naoto Usuyama
Ho Hin Lee
Sam Preston
Hoifung Poon
Mu-Hsin Wei
MedIm
447
2
0
04 Mar 2025
UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface
Hao Tang
Chenwei Xie
Haiyang Wang
Xiaoyi Bao
Tingyu Weng
Nianzu Yang
Yun Zheng
Liwei Wang
ObjD
VLM
455
13
0
03 Mar 2025
MI-DETR: An Object Detection Model with Multi-time Inquiries Mechanism
Computer Vision and Pattern Recognition (CVPR), 2025
Jingjing Jiang
Xianghong Li
Jifeng Dai
Tao Xiang
361
7
0
03 Mar 2025
Generalized Diffusion Detector: Mining Robust Features from Diffusion Models for Domain-Generalized Detection
Computer Vision and Pattern Recognition (CVPR), 2025
Boyong He
Yuxiang Ji
Qianwen Ye
Zhuoyue Tan
Liaoni Wu
DiffM
541
5
0
03 Mar 2025
Evaluating Stenosis Detection with Grounding DINO, YOLO, and DINO-DETR
Muhammad Musab Ansari
258
0
0
03 Mar 2025
Composed Multi-modal Retrieval: A Survey of Approaches and Applications
Kun Zhang
Jingyu Li
Zhiyu Li
Jingjing Zhang
F. Li
...
Nan Chen
Lei Zhang
Yongdong Zhang
Zhendong Mao
S.Kevin Zhou
427
2
0
03 Mar 2025
RTGen: Real-Time Generative Detection Transformer
Chi Ruan
Jiying Zhao
Wenhu Chen
ObjD
VLM
420
0
0
28 Feb 2025
New Dataset and Methods for Fine-Grained Compositional Referring Expression Comprehension via Specialist-MLLM Collaboration
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
X. J. Yang
Jing Liu
Peng Wang
Guoqing Wang
Yue Yang
Mengqi Li
ObjD
492
5
0
27 Feb 2025
BEVDiffuser: Plug-and-Play Diffusion Model for BEV Denoising with Ground-Truth Guidance
Computer Vision and Pattern Recognition (CVPR), 2025
Xin Ye
Burhaneddin Yaman
Sheng Cheng
Feng Tao
Abhirup Mallik
Liu Ren
DiffM
438
10
0
27 Feb 2025
WalnutData: A UAV Remote Sensing Dataset of Green Walnuts and Model Evaluation
Mingjie Wu
Chenggui Yang
Huihua Wang
Chen Xue
Yibo Wang
...
Yuqi Han
R. Li
Lijun Yun
Zaiqing Chen
Siyang Song
559
0
0
27 Feb 2025
SAC-ViT: Semantic-Aware Clustering Vision Transformer with Early Exit
Youbing Hu
Yun Cheng
Anqi Lu
Dawei Wei
Zhijun Li
323
1
0
27 Feb 2025
QORT-Former: Query-optimized Real-time Transformer for Understanding Two Hands Manipulating Objects
AAAI Conference on Artificial Intelligence (AAAI), 2025
Elkhan Ismayilzada
MD Khalequzzaman Chowdhury Sayem
Yihalem Yimolal Tiruneh
Mubarrat Chowdhury
Muhammadjon Boboev
Seungryul Baek
ViT
342
2
0
27 Feb 2025
ProxyTransformation: Preshaping Point Cloud Manifold With Proxy Attention For 3D Visual Grounding
Computer Vision and Pattern Recognition (CVPR), 2025
Qihang Peng
Henry Zheng
Gao Huang
3DPC
388
3
0
26 Feb 2025
CoopDETR: A Unified Cooperative Perception Framework for 3D Detection via Object Query
IEEE International Conference on Robotics and Automation (ICRA), 2025
Liang Luo
Shaocong Xu
Xucai Zhuang
Tongda Xu
Yan Wang
Qingbin Liu
Yilun Chen
Yuanhang Zhang
387
6
0
26 Feb 2025
Automatic Vehicle Detection using DETR: A Transformer-Based Approach for Navigating Treacherous Roads
Istiaq Ahmed Fahad
Abdullah Ibne Hanif Arean
Nazmus Sakib Ahmed
Mahmudul Hasan
ViT
159
4
0
25 Feb 2025
Improving Transformer Based Line Segment Detection with Matched Predicting and Re-ranking
AAAI Conference on Artificial Intelligence (AAAI), 2025
Xin Tong
Shi Peng
Baojie Tian
Yufei Guo
Xuhui Huang
Zhe Ma
ViT
250
2
0
25 Feb 2025
Soybean pod and seed counting in both outdoor fields and indoor laboratories using unions of deep neural networks
Tianyou Jiang
Mingshun Shao
Tianyi Zhang
Xiaoyu Liu
Qun Yu
254
2
0
24 Feb 2025
DeepInteraction++: Multi-Modality Interaction for Autonomous Driving
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Zeyu Yang
Nan Song
Wei Li
Xiatian Zhu
Guang Dai
Philip H. S. Torr
483
8
0
24 Feb 2025
MVIP -- A Dataset and Methods for Application Oriented Multi-View and Multi-Modal Industrial Part Recognition
Paul Koch
Marian Schluter
Jörg Krüger
306
0
0
24 Feb 2025
A Transformer-in-Transformer Network Utilizing Knowledge Distillation for Image Recognition
Dewan Tauhid Rahman
Yeahia Sarker
Antar Mazumder
Md. Shamim Anower
ViT
197
0
0
24 Feb 2025
Cross-domain Few-shot Object Detection with Multi-modal Textual Enrichment
Zeyu Shangguan
Daniel Seita
Mohammad Rostami
ObjD
336
1
0
23 Feb 2025
OmniParser V2: Structured-Points-of-Thought for Unified Visual Text Parsing and Its Generality to Multimodal Large Language Models
Wenwen Yu
Zhibo Yang
Jianqiang Wan
Sibo Song
J. Tang
Wenqing Cheng
Yunxing Liu
Xiang Bai
344
15
0
22 Feb 2025
Visible-Thermal Tiny Object Detection: A Benchmark Dataset and Baselines
Xinyi Ying
Chao Xiao
Ruojing Li
Xu He
Boyang Li
...
Miao Li
Shilin Zhou
Wei An
Weidong Sheng
Tianpeng Liu
405
32
0
21 Feb 2025
YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-time Object Detection
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Yuming Chen
Xinbin Yuan
Ruiqi Wu
Jiabao Wang
Qibin Hou
Mingg-Ming Cheng
Ming-Ming Cheng
ObjD
470
145
0
21 Feb 2025
MindLLM: A Subject-Agnostic and Versatile Model for fMRI-to-Text Decoding
Weikang Qiu
Zheng Huang
Haoyu Hu
Aosong Feng
Yujun Yan
Rex Ying
406
10
0
18 Feb 2025
GraphMorph: Tubular Structure Extraction by Morphing Predicted Graphs
Neural Information Processing Systems (NeurIPS), 2025
Zhao Zhang
Ziwei Zhao
Dong Wang
Liwei Wang
MedIm
290
2
0
17 Feb 2025
RT-DEMT: A hybrid real-time acupoint detection model combining mamba and transformer
Shilong Yang
Qi Zang
Chulong Zhang
Lingfeng Huang
Yaoqin Xie
Mamba
513
4
0
16 Feb 2025
CLoCKDistill: Consistent Location-and-Context-aware Knowledge Distillation for DETRs
Qizhen Lan
Qing Tian
280
2
0
15 Feb 2025
Improving action segmentation via explicit similarity measurement
Kamel Aouaidjia
Wenhao Zhang
Aofan Li
Chongsheng Zhang
268
0
0
15 Feb 2025
Enhancing Video Understanding: Deep Neural Networks for Spatiotemporal Analysis
Amir Hosein Fadaei
M. Dehaqani
343
0
0
11 Feb 2025
Dense Object Detection Based on De-homogenized Queries
Yueming Huang
Chenrui Ma
Hao Zhou
Hao Wu
Guowu Yuan
419
0
0
11 Feb 2025
Cell Nuclei Detection and Classification in Whole Slide Images with Transformers
Oscar Pina
Eduard Dorca
Verónica Vilaplana
158
0
0
10 Feb 2025
Amnesia as a Catalyst for Enhancing Black Box Pixel Attacks in Image Classification and Object Detection
Neural Information Processing Systems (NeurIPS), 2025
Dongsu Song
Daehwa Ko
Jay Hoon Jung
AAML
491
0
0
10 Feb 2025
SMART: Advancing Scalable Map Priors for Driving Topology Reasoning
IEEE International Conference on Robotics and Automation (ICRA), 2025
Junjie Ye
David Paz
Hengyuan Zhang
Yuliang Guo
Xinyu Huang
Henrik I. Christensen
Yue Wang
Liu Ren
LRM
363
6
0
06 Feb 2025
Foundation Model-Based Apple Ripeness and Size Estimation for Selective Harvesting
Computers and Electronics in Agriculture (CEA), 2025
Keyi Zhu
Jiajia Li
Kaixiang Zhang
Chaaran Arunachalam
Siddhartha Bhattacharya
R. Lu
Zhaojian Li
355
4
0
03 Feb 2025
IROAM: Improving Roadside Monocular 3D Object Detection Learning from Autonomous Vehicle Data Domain
IEEE International Conference on Robotics and Automation (ICRA), 2025
Liang Luo
Xiaoliang Huo
Siqi Fan
Jingjing Liu
Ya-Qin Zhang
Yan Wang
183
0
0
30 Jan 2025
Not Every Patch is Needed: Towards a More Efficient and Effective Backbone for Video-based Person Re-identification
IEEE Transactions on Image Processing (IEEE TIP), 2025
Lanyun Zhu
Tianrun Chen
Deyi Ji
Jieping Ye
Jing Liu
423
7
0
28 Jan 2025
B-RIGHT: Benchmark Re-evaluation for Integrity in Generalized Human-Object Interaction Testing
Yoojin Jang
Junsu Kim
H. Kim
Eun-ki Lee
Eun-sol Kim
Seungryul Baek
Jaejun Yoo
186
0
0
28 Jan 2025
SliceOcc: Indoor 3D Semantic Occupancy Prediction with Vertical Slice Representation
IEEE International Conference on Robotics and Automation (ICRA), 2025
Jianing Li
Ming Lu
Hao Wang
Chenyang Gu
Wenzhao Zheng
Li Du
Shanghang Zhang
389
1
0
28 Jan 2025
V2X-DGPE: Addressing Domain Gaps and Pose Errors for Robust Collaborative 3D Object Detection
Sichao Wang
Chuang Zhang
Ming Yuan
Qing Xu
Lei He
Jianqiang Wang
357
3
0
28 Jan 2025
CSPCL: Category Semantic Prior Contrastive Learning for Deformable DETR-Based Prohibited Item Detectors
Mingyuan Li
Tong Jia
Hui Lu
Bowen Ma
Hao Wang
Shiyi Guo
Da Cai
Dongyue Chen
310
1
0
28 Jan 2025
Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection
Xiangyu Gao
Yu Dai
Benliu Qiu
Hongliang Li
Heqian Qiu
Hongliang Li
ObjD
VLM
1.0K
0
0
28 Jan 2025
Object Detection for Medical Image Analysis: Insights from the RT-DETR Model
Weijie He
Yuwei Zhang
T. Xu
Tai An
Yingbin Liang
Bo Zhang
PINN
MU
MedIm
229
23
0
27 Jan 2025
MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation
Fu Rong
Meng Lan
Qian Zhang
Guang Dai
VOS
VGen
616
3
0
23 Jan 2025
A generalizable 3D framework and model for self-supervised learning in medical imaging
Tony Xu
Sepehr Hosseini
Chris Anderson
Anthony Rinaldi
Rahul G. Krishnan
Anne L. Martel
Maged Goubran
MedIm
343
11
0
20 Jan 2025
LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight Detection
Pengcheng Zhao
Zhixian He
Fuwei Zhang
Shujin Lin
Fan Zhou
378
3
0
18 Jan 2025
Geometric Distortion Guided Transformer for Omnidirectional Image Super-Resolution
Cuixin Yang
Rongkang Dong
Jun Xiao
Cong Zhang
Kin-Man Lam
Fei Zhou
Guoping Qiu
457
8
0
17 Jan 2025
Previous
1
2
3
...
8
9
10
...
54
55
56
Next
Page 9 of 56
Page
of 56
Go