ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2010.04159
  4. Cited By
Deformable DETR: Deformable Transformers for End-to-End Object Detection
v1v2v3v4 (latest)

Deformable DETR: Deformable Transformers for End-to-End Object Detection

International Conference on Learning Representations (ICLR), 2020
8 October 2020
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
    ViT
ArXiv (abs)PDFHTMLHuggingFace (1 upvotes)Github (3553★)

Papers citing "Deformable DETR: Deformable Transformers for End-to-End Object Detection"

50 / 2,782 papers shown
AVS-Mamba: Exploring Temporal and Multi-modal Mamba for Audio-Visual Segmentation
AVS-Mamba: Exploring Temporal and Multi-modal Mamba for Audio-Visual SegmentationIEEE transactions on multimedia (TMM), 2025
Sitong Gong
Yunzhi Zhuge
Lu Zhang
Yifan Wang
Pingping Zhang
Lijun Wang
Huchuan Lu
MambaVOS
142
14
0
14 Jan 2025
BioPose: Biomechanically-accurate 3D Pose Estimation from Monocular Videos
BioPose: Biomechanically-accurate 3D Pose Estimation from Monocular VideosIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025
Farnoosh Koleini
Muhammad Usama Saleem
Pu Wang
Hongfei Xue
Ahmed Helmy
Abbey Fenwick
3DH
324
6
0
14 Jan 2025
SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing
SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing
Varun Biyyala
Bharat Chanderprakash Kathuria
Jialu Li
Youshan Zhang
327
1
0
13 Jan 2025
TimberVision: A Multi-Task Dataset and Framework for Log-Component Segmentation and Tracking in Autonomous Forestry Operations
TimberVision: A Multi-Task Dataset and Framework for Log-Component Segmentation and Tracking in Autonomous Forestry OperationsIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025
Daniel Steininger
Julia Simon
Andreas Trondl
Markus Murschitz
307
5
0
13 Jan 2025
Toward Realistic Camouflaged Object Detection: Benchmarks and Method
Toward Realistic Camouflaged Object Detection: Benchmarks and Method
Zhimeng Xin
Tianxu Wu
Shiming Chen
Shuo Ye
Zijing Xie
Yixiong Zou
Xinge You
Yufei Guo
202
0
0
13 Jan 2025
VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning
VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video CaptioningAAAI Conference on Artificial Intelligence (AAAI), 2025
Ji Soo Lee
Jongha Kim
Jeehye Na
Jinyoung Park
H. Kim
VGen
152
8
0
12 Jan 2025
MapGS: Generalizable Pretraining and Data Augmentation for Online Mapping via Novel View Synthesis
MapGS: Generalizable Pretraining and Data Augmentation for Online Mapping via Novel View Synthesis
Hengyuan Zhang
David Paz
Yuliang Guo
Xinyu Huang
Henrik I. Christensen
Liu Ren
3DGSViT
227
4
0
11 Jan 2025
YO-CSA-T: A Real-time Badminton Tracking System Utilizing YOLO Based on Contextual and Spatial Attention
YO-CSA-T: A Real-time Badminton Tracking System Utilizing YOLO Based on Contextual and Spatial Attention
Yuan Lai
Zhiwei Shi
Chengxi Zhu
80
3
0
11 Jan 2025
Semi-supervised 3D Semantic Scene Completion with 2D Vision Foundation Model Guidance
Semi-supervised 3D Semantic Scene Completion with 2D Vision Foundation Model GuidanceAAAI Conference on Artificial Intelligence (AAAI), 2024
Duc-Hai Pham
Duc Dung Nguyen
Anh Pham
Ho Lai Tuan
P. Nguyen
Khoi Duc Minh Nguyen
Rang Nguyen
3DPC
573
3
0
10 Jan 2025
UV-Attack: Physical-World Adversarial Attacks for Person Detection via Dynamic-NeRF-based UV Mapping
UV-Attack: Physical-World Adversarial Attacks for Person Detection via Dynamic-NeRF-based UV Mapping
Yanjie Li
Wenxuan Zhang
K. Liang
AAML
295
6
0
10 Jan 2025
RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark
RSAR: Restricted State Angle Resolver and Rotated SAR BenchmarkComputer Vision and Pattern Recognition (CVPR), 2025
Xinsong Zhang
Xue Yang
Yuchen Ren
Zhiqiang Wang
Ming-Ming Cheng
Xianrui Li
260
16
0
08 Jan 2025
AuxDepthNet: Real-Time Monocular 3D Object Detection with Depth-Sensitive Features
AuxDepthNet: Real-Time Monocular 3D Object Detection with Depth-Sensitive FeaturesApplied Sciences (AS), 2025
Ruochen Zhang
Hyeung-Sik Choi
Dongwook Jung
Phan Huy Nam Anh
Sang-Ki Jeong
Zihao Zhu
3DPCMDE
206
0
0
08 Jan 2025
Siamese-DETR for Generic Multi-Object Tracking
Siamese-DETR for Generic Multi-Object TrackingIEEE Transactions on Image Processing (IEEE TIP), 2023
Qiankun Liu
Yichen Li
Yuqi Jiang
Ying Fu
VOT
310
14
0
08 Jan 2025
Exploiting Boundary Loss for the Hierarchical Panoptic Segmentation of Plants and Leaves
Madeleine Darbyshire
Elizabeth I. Sklar
Simon Parsons
287
0
0
03 Jan 2025
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language TasksNeural Information Processing Systems (NeurIPS), 2024
Jiannan Wu
Muyan Zhong
Sen Xing
Zeqiang Lai
Zhaoyang Liu
...
Lewei Lu
Tong Lu
Ping Luo
Yu Qiao
Jifeng Dai
MLLMVLMLRM
868
121
0
03 Jan 2025
Open-Set Object Detection By Aligning Known Class Representations
Open-Set Object Detection By Aligning Known Class RepresentationsIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Hiran Sarkar
Vishal M. Chudasama
N. Onoe
Pankaj Wasnik
Vineeth N. Balasubramanian
ObjD
208
8
0
31 Dec 2024
Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of
  Vision-Language Multiway Transformer Model
Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer ModelInternational Conference on Information Photonics (ICIP), 2024
Yi-Chia Chen
Wei-Hua Li
Chu-Song Chen
VLM
233
2
0
25 Dec 2024
Evaluating the Adversarial Robustness of Detection Transformers
Evaluating the Adversarial Robustness of Detection Transformers
A. Nazeri
Chunheng Zhao
P. Pisu
AAML
297
4
0
25 Dec 2024
Feature Based Methods in Domain Adaptation for Object Detection: A Review Paper
Feature Based Methods in Domain Adaptation for Object Detection: A Review Paper
Helia Mohamadi
Mohammad Ali Keyvanrad
Mohammad Reza Mohammadi
339
0
0
23 Dec 2024
Towards Unsupervised Model Selection for Domain Adaptive Object
  Detection
Towards Unsupervised Model Selection for Domain Adaptive Object DetectionNeural Information Processing Systems (NeurIPS), 2024
Hengfu Yu
Jinhong Deng
Wen Li
Lixin Duan
275
4
0
23 Dec 2024
NumbOD: A Spatial-Frequency Fusion Attack Against Object Detectors
NumbOD: A Spatial-Frequency Fusion Attack Against Object DetectorsAAAI Conference on Artificial Intelligence (AAAI), 2024
Ziqi Zhou
Bowen Li
Yufei Song
Zhifei Yu
Shengshan Hu
Wei Wan
L. Zhang
Dezhong Yao
Hai Jin
AAML
369
14
0
22 Dec 2024
ImagineMap: Enhanced HD Map Construction with SD Maps
ImagineMap: Enhanced HD Map Construction with SD Maps
Yishen Ji
Zhiqi Li
Tong Lu
322
1
0
22 Dec 2024
Anchor3DLane++: 3D Lane Detection via Sample-Adaptive Sparse 3D Anchor
  Regression
Anchor3DLane++: 3D Lane Detection via Sample-Adaptive Sparse 3D Anchor RegressionIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Shaofei Huang
Zhenwei Shen
Zehao Huang
Yue Liao
Jizhong Han
Naiyan Wang
Si Liu
401
9
0
22 Dec 2024
Object Detection Approaches to Identifying Hand Images with High
  Forensic Values
Object Detection Approaches to Identifying Hand Images with High Forensic ValuesIEEE International Conference on Systems, Man and Cybernetics (SMC), 2024
Thanh Thi Nguyen
Campbell Wilson
Imad Khan
Janis Dalins
3DH
291
0
0
21 Dec 2024
LLaVA-UHD v2: an MLLM Integrating High-Resolution Semantic Pyramid via Hierarchical Window Transformer
LLaVA-UHD v2: an MLLM Integrating High-Resolution Semantic Pyramid via Hierarchical Window Transformer
Yipeng Zhang
Yi Liu
Zonghao Guo
Yidan Zhang
Xuesong Yang
...
Xingtai Lv
Zhiyuan Liu
Tat-Seng Chua
Maosong Sun
Maosong Sun
MLLMVLM
366
3
0
18 Dec 2024
Differential Alignment for Domain Adaptive Object Detection
Differential Alignment for Domain Adaptive Object Detection
Xinyu He
Xinhui Li
Xiaojie Guo
348
2
0
17 Dec 2024
GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding
GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial UnderstandingComputer Vision and Pattern Recognition (CVPR), 2024
Haoyi Jiang
Liu Liu
Tianheng Cheng
Xinjie Wang
Tianwei Lin
Zhizhong Su
Wen Liu
Xinyu Wang
3DGSViT
503
30
0
17 Dec 2024
SAMIC: Segment Anything with In-Context Spatial Prompt Engineering
SAMIC: Segment Anything with In-Context Spatial Prompt Engineering
S. Nagendra
Kashif Rashid
Chaopeng Shen
Daniel Kifer
VLM
331
4
0
16 Dec 2024
CLDA-YOLO: Visual Contrastive Learning Based Domain Adaptive YOLO
  Detector
CLDA-YOLO: Visual Contrastive Learning Based Domain Adaptive YOLO Detector
Tianheng Qiu
Ka Lung Law
Guanghua Pan
Jufei Wang
Xin Gao
Xuan Huang
Hu Wei
269
1
0
16 Dec 2024
Oriented Tiny Object Detection: A Dataset, Benchmark, and Dynamic
  Unbiased Learning
Oriented Tiny Object Detection: A Dataset, Benchmark, and Dynamic Unbiased Learning
Chang Xu
Ruixiang Zhang
Wen Yang
Haoran Zhu
Fang Xu
Jian Ding
Gui-Song Xia
ObjD
343
2
0
16 Dec 2024
V-MIND: Building Versatile Monocular Indoor 3D Detector with Diverse 2D
  Annotations
V-MIND: Building Versatile Monocular Indoor 3D Detector with Diverse 2D AnnotationsIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Jin-Cheng Jhang
Tao Tu
Fu-En Wang
Ke Zhang
Min Sun
Cheng-Hao Kuo
303
5
0
16 Dec 2024
Heterogeneous Graph Transformer for Multiple Tiny Object Tracking in
  RGB-T Videos
Heterogeneous Graph Transformer for Multiple Tiny Object Tracking in RGB-T VideosIEEE transactions on multimedia (IEEE TMM), 2024
Qingyu Xu
Longguang Wang
Weidong Sheng
Yingqian Wang
Chao Xiao
Chao Ma
Wei An
VOT
341
25
0
14 Dec 2024
Just a Few Glances: Open-Set Visual Perception with Image Prompt
  Paradigm
Just a Few Glances: Open-Set Visual Perception with Image Prompt ParadigmAAAI Conference on Artificial Intelligence (AAAI), 2024
Jinrong Zhang
Penghui Wang
Chunxiao Liu
Wei Liu
D. Jin
Qiong Zhang
Erli Meng
Zhengnan Hu
VLM
246
1
0
14 Dec 2024
SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer
SoftVQ-VAE: Efficient 1-Dimensional Continuous TokenizerComputer Vision and Pattern Recognition (CVPR), 2024
Zeyang Zhang
Zihan Wang
Xianrui Li
Xingwu Sun
Fangyi Chen
Jiang Liu
Jiadong Wang
Bhiksha Raj
Zicheng Liu
Emad Barsoum
VLM
723
32
0
14 Dec 2024
PanSR: An Object-Centric Mask Transformer for Panoptic Segmentation
PanSR: An Object-Centric Mask Transformer for Panoptic Segmentation
Lojze Žust
Matej Kristan
ViT
316
1
0
13 Dec 2024
UADet: A Remarkably Simple Yet Effective Uncertainty-Aware Open-Set
  Object Detection Framework
UADet: A Remarkably Simple Yet Effective Uncertainty-Aware Open-Set Object Detection Framework
Silin Cheng
Yuanpei Liu
Kai Han
EDL
374
0
0
12 Dec 2024
TimeRefine: Temporal Grounding with Time Refining Video LLM
TimeRefine: Temporal Grounding with Time Refining Video LLM
Xizi Wang
Feng Cheng
Ziyang Wang
Huiyu Wang
Md. Mohaiminul Islam
Lorenzo Torresani
Joey Tianyi Zhou
Gedas Bertasius
David J. Crandall
492
6
0
12 Dec 2024
Object Detection using Event Camera: A MoE Heat Conduction based
  Detector and A New Benchmark Dataset
Object Detection using Event Camera: A MoE Heat Conduction based Detector and A New Benchmark DatasetComputer Vision and Pattern Recognition (CVPR), 2024
Xinyu Wang
Yu Jin
Wentao Wu
Wei Zhang
Lin Zhu
Bo Jiang
Yonghong Tian
277
18
0
09 Dec 2024
GAQAT: gradient-adaptive quantization-aware training for domain
  generalization
GAQAT: gradient-adaptive quantization-aware training for domain generalization
Jiacheng Jiang
Yuan Meng
Chen Tang
Han Yu
Qun Li
Zhi Wang
Wenwu Zhu
MQ
294
1
0
07 Dec 2024
Beyond Boxes: Mask-Guided Spatio-Temporal Feature Aggregation for Video
  Object Detection
Beyond Boxes: Mask-Guided Spatio-Temporal Feature Aggregation for Video Object DetectionIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
K. Hashmi
Talha Uddin Sheikh
Didier Stricker
Muhammad Zeshan Afzal
289
0
0
06 Dec 2024
Cubify Anything: Scaling Indoor 3D Object Detection
Cubify Anything: Scaling Indoor 3D Object DetectionComputer Vision and Pattern Recognition (CVPR), 2024
Justin Lazarow
David Griffiths
Gefen Kohavi
Francisco Crespo
Afshin Dehghan
3DPC
254
18
0
05 Dec 2024
Towards Real-Time Open-Vocabulary Video Instance Segmentation
Towards Real-Time Open-Vocabulary Video Instance SegmentationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Bin Yan
Martin Sundermeyer
D. Tan
Huchuan Lu
F. Tombari
VLMVOS
294
3
0
05 Dec 2024
Reflective Teacher: Semi-Supervised Multimodal 3D Object Detection in
  Bird's-Eye-View via Uncertainty Measure
Reflective Teacher: Semi-Supervised Multimodal 3D Object Detection in Bird's-Eye-View via Uncertainty MeasureIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Saheli Hazra
Sudip Das
Rohit Choudhary
Arindam Das
Ganesh Sistu
Ciarán Eising
Ujjwal Bhattacharya
303
0
0
05 Dec 2024
Redundant Queries in DETR-Based 3D Detection Methods: Unnecessary and Prunable
Redundant Queries in DETR-Based 3D Detection Methods: Unnecessary and Prunable
Lizhen Xu
Zehao Wu
Wenzhao Qiu
Zehao Wu
Xiuxiu Bai
K. Mei
Jianru Xue
491
3
0
03 Dec 2024
XQ-GAN: An Open-source Image Tokenization Framework for Autoregressive
  Generation
XQ-GAN: An Open-source Image Tokenization Framework for Autoregressive Generation
Xianrui Li
Kai Qiu
Zeyang Zhang
Jason Kuen
Jiuxiang Gu
Jiadong Wang
Zhe Lin
Bhiksha Raj
VLM
421
12
0
02 Dec 2024
HandOS: 3D Hand Reconstruction in One Stage
HandOS: 3D Hand Reconstruction in One StageComputer Vision and Pattern Recognition (CVPR), 2024
Xingyu Chen
Zhuheng Song
Xiaoke Jiang
Yaoqing Hu
Junzhi Yu
Lei Zhang
3DHHAI
504
5
0
02 Dec 2024
SyncVIS: Synchronized Video Instance Segmentation
SyncVIS: Synchronized Video Instance SegmentationNeural Information Processing Systems (NeurIPS), 2024
Rongkun Zheng
Lu Qi
Xi Chen
Yi Wang
Kun Wang
Yu Qiao
Hengshuang Zhao
317
1
0
01 Dec 2024
Explaining Object Detectors via Collective Contribution of Pixels
Explaining Object Detectors via Collective Contribution of Pixels
Toshinori Yamauchi
Hiroshi Kera
K. Kawamoto
ObjDFAtt
575
3
0
01 Dec 2024
LQ-Adapter: ViT-Adapter with Learnable Queries for Gallbladder Cancer Detection from Ultrasound ImageIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Chetan Madan
Mayuna Gupta
Soumen Basu
Pankaj Gupta
Chetan Arora
385
3
0
30 Nov 2024
Does Self-Attention Need Separate Weights in Transformers?North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Md. Kowsher
Nusrat Jahan Prottasha
Chun-Nam Yu
O. Garibay
Niloofar Yousefi
1.1K
3
0
30 Nov 2024
Previous
123...91011...545556
Next
Page 10 of 56
Pageof 56