Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2010.04159
Cited By
v1
v2
v3
v4 (latest)
Deformable DETR: Deformable Transformers for End-to-End Object Detection
International Conference on Learning Representations (ICLR), 2020
8 October 2020
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Github (3553★)
Papers citing
"Deformable DETR: Deformable Transformers for End-to-End Object Detection"
50 / 2,782 papers shown
InteractionMap: Improving Online Vectorized HDMap Construction with Interaction
Computer Vision and Pattern Recognition (CVPR), 2025
Kuang Wu
Chuan Yang
Zhanbin Li
287
5
0
27 Mar 2025
Leveraging 3D Geometric Priors in 2D Rotation Symmetry Detection
Computer Vision and Pattern Recognition (CVPR), 2025
Ahyun Seo
Minsu Cho
360
2
0
26 Mar 2025
UniSTD: Towards Unified Spatio-Temporal Learning across Diverse Disciplines
Computer Vision and Pattern Recognition (CVPR), 2025
Chen Tang
Cheng Wang
Encheng Su
Xiufeng Song
Xiaohong Liu
Wei-Hong Li
Lei Bai
Wanli Ouyang
Xiangyu Yue
3DGS
AI4TS
287
0
0
26 Mar 2025
FireEdit: Fine-grained Instruction-based Image Editing via Region-aware Vision Language Model
Computer Vision and Pattern Recognition (CVPR), 2025
Zhiqiang Zhang
Jia-Nan Li
Zunnan Xu
Hanhui Li
Yiji Cheng
Fa-Ting Hong
Qin Lin
Qinglin Lu
Xiaodan Liang
DiffM
375
15
0
25 Mar 2025
BiblioPage: A Dataset of Scanned Title Pages for Bibliographic Metadata Extraction
IEEE International Conference on Document Analysis and Recognition (ICDAR), 2025
Jan Kohút
Martin Dočekal
Michal Hradiš
Marek Vaško
199
1
0
25 Mar 2025
Your ViT is Secretly an Image Segmentation Model
Computer Vision and Pattern Recognition (CVPR), 2025
Tommie Kerssies
Niccolò Cavagnero
Alexander Hermans
Narges Norouzi
Giuseppe Averta
Bastian Leibe
Gijs Dubbelman
Daan de Geus
ViT
VLM
324
24
0
24 Mar 2025
FG
2
^2
2
: Fine-Grained Cross-View Localization by Fine-Grained Feature Matching
Computer Vision and Pattern Recognition (CVPR), 2025
Zimin Xia
Alexandre Alahi
380
5
0
24 Mar 2025
CQ-DINO: Mitigating Gradient Dilution via Category Queries for Vast Vocabulary Object Detection
Zhichao Sun
Huazhang Hu
Yidong Ma
Gang Liu
Nemo Chen
Xu Tang
Feng-Long Xie
Yongchao Xu
ObjD
467
0
0
24 Mar 2025
Benchmarking Object Detectors under Real-World Distribution Shifts in Satellite Imagery
Computer Vision and Pattern Recognition (CVPR), 2025
Sara Al-Emadi
Yin Yang
Ferda Ofli
236
1
0
24 Mar 2025
An Image-like Diffusion Method for Human-Object Interaction Detection
Computer Vision and Pattern Recognition (CVPR), 2025
Xiaofei Hui
Haoxuan Qu
Hossein Rahmani
Jun Liu
DiffM
356
1
0
23 Mar 2025
SGFormer: Satellite-Ground Fusion for 3D Semantic Scene Completion
Computer Vision and Pattern Recognition (CVPR), 2025
Xiyue Guo
Jiarui Hu
Junjie Hu
Hujun Bao
Guofeng Zhang
335
3
0
21 Mar 2025
UniHDSA: A Unified Relation Prediction Approach for Hierarchical Document Structure Analysis
Pattern Recognition (Pattern Recogn.), 2025
Jiawei Wang
Kai Hu
Qiang Huo
311
2
0
20 Mar 2025
DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding
Keyan Chen
Chenyang Liu
Bowen Chen
Wenyuan Li
Zhengxia Zou
Zhenwei Shi
300
17
0
20 Mar 2025
SpiLiFormer: Enhancing Spiking Transformers with Lateral Inhibition
Zeqi Zheng
Yanchen Huang
Yingchao Yu
Zizheng Zhu
Junfeng Tang
Zhaofei Yu
Yaochu Jin
277
1
0
20 Mar 2025
3D Occupancy Prediction with Low-Resolution Queries via Prototype-aware View Transformation
Computer Vision and Pattern Recognition (CVPR), 2025
Gyeongrok Oh
Sungjune Kim
Heeju Ko
Hyung-Gun Chi
J. Kim
Dongwook Lee
Daehyun Ji
Sungjoon Choi
Sujin Jang
Sangpil Kim
252
6
0
19 Mar 2025
State Space Model Meets Transformer: A New Paradigm for 3D Object Detection
International Conference on Learning Representations (ICLR), 2025
Chuxin Wang
Wenfei Yang
Xiang Liu
Tianzhu Zhang
504
4
0
18 Mar 2025
LipShiFT: A Certifiably Robust Shift-based Vision Transformer
Rohan Menon
Nicola Franco
Stephan Günnemann
298
1
0
18 Mar 2025
SimWorld: A Unified Benchmark for Simulator-Conditioned Scene Generation via World Model
Xinqing Li
Ruiqi Song
Qingyu Xie
Ye Wu
Nanxin Zeng
Yunfeng Ai
VGen
SyDa
392
3
0
18 Mar 2025
Panoramic Distortion-Aware Tokenization for Person Detection and Localization in Overhead Fisheye Images
Nobuhiko Wakai
Satoshi Sato
Yasunori Ishii
Takayoshi Yamashita
469
0
0
18 Mar 2025
MamBEV: Enabling State Space Models to Learn Birds-Eye-View Representations
International Conference on Learning Representations (ICLR), 2025
Hongyu Ke
Jack Morris
K. Oguchi
Xiaofei Cao
Yongkang Liu
Haoxin Wang
Yi Ding
Mamba
479
1
0
18 Mar 2025
Is Discretization Fusion All You Need for Collaborative Perception?
IEEE International Conference on Robotics and Automation (ICRA), 2025
Kang Yang
Tianci Bu
L. Li
Chunxu Li
Yanjie Wang
Deying Li
441
1
0
18 Mar 2025
TGBFormer: Transformer-GraphFormer Blender Network for Video Object Detection
AAAI Conference on Artificial Intelligence (AAAI), 2025
Qiang Qi
Xiao Wang
ViT
1.1K
5
0
18 Mar 2025
8-Calves Image dataset
Xuyang Fang
S. Hannuna
Neill D. F. Campbell
Edwin Simpson
942
0
0
17 Mar 2025
AugMapNet: Improving Spatial Latent Structure via BEV Grid Augmentation for Enhanced Vectorized Online HD Map Construction
T. Monninger
Md Zafar Anwar
Stanislaw Antol
Steffen Staab
Sihao Ding
289
4
0
17 Mar 2025
Action tube generation by person query matching for spatio-temporal action detection
Kazuki Omi
Jion Oshima
Toru Tamaki
379
0
0
17 Mar 2025
L2COcc: Lightweight Camera-Centric Semantic Scene Completion via Distillation of LiDAR Model
Ruoyu Wang
Yukai Ma
Yi Yao
Sheng Tao
Haoyang Li
Zongzhi Zhu
Wenshu Fan
Xingxing Zuo
393
2
0
16 Mar 2025
Exploring Contextual Attribute Density in Referring Expression Counting
Computer Vision and Pattern Recognition (CVPR), 2025
Zhicheng Wang
Zhiyu Pan
Zhan Peng
Jian Cheng
Liwen Xiao
Wei Jiang
Zhiguo Cao
262
3
0
16 Mar 2025
History-Aware Transformation of ReID Features for Multiple Object Tracking
Ruopeng Gao
Yidan Wang
Chunxu Liu
Limin Wang
VOT
424
2
0
16 Mar 2025
Minuscule Cell Detection in AS-OCT Images with Progressive Field-of-View Focusing
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Boyu Chen
A. L. Solebo
Daqian Shi
Jinge Wu
Paul Taylor
281
0
0
15 Mar 2025
Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object Detection
International Conference on Learning Representations (ICLR), 2025
Chuhan Zhang
Chaoyang Zhu
Pingcheng Dong
Long Chen
Dong Zhang
ObjD
VLM
1.1K
4
0
14 Mar 2025
Bring Your Rear Cameras for Egocentric 3D Human Pose Estimation
Hiroyasu Akada
Jian Wang
Vladislav Golyanik
Christian Theobalt
EgoV
402
2
0
14 Mar 2025
Active Learning from Scene Embeddings for End-to-End Autonomous Driving
Wenhao Jiang
Duo Li
Menghan Hu
Chao Ma
Ke Wang
Zhipeng Zhang
406
0
0
14 Mar 2025
A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection
Shenghao Fu
Junkai Yan
Q. Yang
Xihan Wei
Xiaohua Xie
Wei-Shi Zheng
ObjD
VLM
257
3
0
13 Mar 2025
MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors
Computer Vision and Pattern Recognition (CVPR), 2024
Fanqi Pu
Yifan Wang
Jiru Deng
Wenming Yang
MDE
ViT
383
16
0
13 Mar 2025
Foundation X: Integrating Classification, Localization, and Segmentation through Lock-Release Pretraining Strategy for Chest X-ray Analysis
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025
N. Islam
Dongao Ma
Jiaxuan Pang
Shivasakthi Senthil Velan
Michael B. Gotway
Jianming Liang
248
0
0
12 Mar 2025
DitHub: A Modular Framework for Incremental Open-Vocabulary Object Detection
Chiara Cappellino
Gianluca Mancusi
Matteo Mosconi
Angelo Porrello
Simone Calderara
Rita Cucchiara
ObjD
VLM
587
1
0
12 Mar 2025
Towards Large-scale Chemical Reaction Image Parsing via a Multimodal Large Language Model
Chemical Science (Chem. Sci.), 2025
Yufan Chen
Ching Ting Leung
Jianwei Sun
Yong Huang
Linyan Li
Hao Chen
Hanyu Gao
242
4
0
11 Mar 2025
SparseVoxFormer: Sparse Voxel-based Transformer for Multi-modal 3D Object Detection
Hyeongseok Son
Jia He
Seung-In Park
Ying Min
Yunhao Zhang
ByungIn Yoo
265
1
0
11 Mar 2025
Robust Latent Matters: Boosting Image Generation with Sampling Error Synthesis
Kai Qiu
Xianrui Li
Jason Kuen
Zeyang Zhang
Xiaohao Xu
Jiuxiang Gu
Yinyi Luo
Bhiksha Raj
Zhe Lin
Marios Savvides
542
6
0
11 Mar 2025
Chameleon: Fast-slow Neuro-symbolic Lane Topology Extraction
IEEE International Conference on Robotics and Automation (ICRA), 2025
Zongzheng Zhang
Xinrun Li
Sizhe Zou
Guoxuan Chi
Siqi Li
...
Guoliang Wang
Guantian Zheng
Leichen Wang
Hang Zhao
Hao Zhao
367
9
0
10 Mar 2025
Rethinking Two-Stage Referring-by-Tracking in Referring Multi-Object Tracking: Make it Strong Again
Weize Li
Yunhao Du
Qixiang Yin
Zhicheng Zhao
Fei Su
414
0
0
10 Mar 2025
YOLOE: Real-Time Seeing Anything
Ao Wang
Lihao Liu
Hui Chen
Zijia Lin
Jiawei Han
Guiguang Ding
VLM
ObjD
549
35
0
10 Mar 2025
SimROD: A Simple Baseline for Raw Object Detection with Global and Local Enhancements
Haiyang Xie
Xi Shen
Shihua Huang
Qirui Wang
Zheng Wang
370
0
0
10 Mar 2025
LEGO-Motion: Learning-Enhanced Grids with Occupancy Instance Modeling for Class-Agnostic Motion Prediction
Kangan Qian
Jinyu Miao
Ziang Luo
Zheng Fu
and Jinchen Li
Xinyu Jiao
Yunlong Wang
Yunlong Wang
Mengmeng Yang
Ke Wang
263
6
0
10 Mar 2025
Removing Multiple Hybrid Adverse Weather in Video via a Unified Model
Yecong Wan
Mingwen Shao
Yuanshuo Cheng
Jun Shu
Shuigen Wang
256
0
0
08 Mar 2025
Rethinking Lanes and Points in Complex Scenarios for Monocular 3D Lane Detection
Computer Vision and Pattern Recognition (CVPR), 2025
Yifan Chang
Junjie Huang
Xiaofeng Wang
Yun Ye
Zhujin Liang
Yi Shan
Dalong Du
Xingang Wang
3DPC
360
2
0
08 Mar 2025
FastMap: Fast Queries Initialization Based Vectorized HD Map Reconstruction Framework
Haotian Hu
Jingwei Xu
Fanyi Wang
Toyota Li
Yaonong Wang
Laifeng Hu
Zhiwang Zhang
196
2
0
07 Mar 2025
A lightweight model FDM-YOLO for small target improvement based on YOLOv8
Xuerui Zhang
ObjD
258
2
0
06 Mar 2025
Prediction of Frozen Region Growth in Kidney Cryoablation Intervention Using a 3D Flow-Matching Model
Siyeop Yoon
Y. Oh
Matthew Tivnan
Qing Xiao
Pengfei Jin
Sekeun KimHyun Jin Cho
Hyun Jin Cho
Dufan Wu
Raul Uppot
Quanzheng Li
322
2
0
06 Mar 2025
Omnidirectional Multi-Object Tracking
Computer Vision and Pattern Recognition (CVPR), 2025
Kai Luo
Hao-miao Shi
Sheng Wu
Fei Teng
Mengfei Duan
Chang Huang
Longji Xu
Kaiwei Wang
Kailun Yang
473
5
0
06 Mar 2025
Previous
1
2
3
...
7
8
9
...
54
55
56
Next
Page 8 of 56
Page
of 56
Go