Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2010.04159
Cited By
v1
v2
v3
v4 (latest)
Deformable DETR: Deformable Transformers for End-to-End Object Detection
International Conference on Learning Representations (ICLR), 2020
8 October 2020
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Github (3553★)
Papers citing
"Deformable DETR: Deformable Transformers for End-to-End Object Detection"
50 / 2,782 papers shown
Cross-modulated Attention Transformer for RGBT Tracking
AAAI Conference on Artificial Intelligence (AAAI), 2024
Yun Xiao
Jiacong Zhao
Andong Lu
Chenglong Li
Yin Lin
Bing Yin
Cong Liu
229
16
0
05 Aug 2024
CAF-YOLO: A Robust Framework for Multi-Scale Lesion Detection in Biomedical Imagery
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Zilin Chen
Shengnan Lu
MedIm
205
17
0
04 Aug 2024
A Survey and Evaluation of Adversarial Attacks for Object Detection
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024
Khoi Nguyen Tiet Nguyen
Wenyu Zhang
Kangkang Lu
Yuhuan Wu
Xingjian Zheng
Hui Li Tan
Liangli Zhen
AAML
371
0
0
04 Aug 2024
AVESFormer: Efficient Transformer Design for Real-Time Audio-Visual Segmentation
Zili Wang
Qi Yang
Linsu Shi
Jiazhong Yu
M. Tanveer
Fei Li
Shiming Xiang
VOS
233
4
0
03 Aug 2024
Underwater Object Detection Enhancement via Channel Stabilization
International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2022
Muhammad Ali
Rita Sevastjanova
165
5
0
02 Aug 2024
Enhancing Online Road Network Perception and Reasoning with Standard Definition Maps
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024
Hengyuan Zhang
David Paz
Yuliang Guo
Arun Das
Xinyu Huang
Karsten Haug
Henrik I. Christensen
Liu Ren
232
9
0
01 Aug 2024
A Systematic Review on Long-Tailed Learning
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024
Chongsheng Zhang
G. Almpanidis
Gaojuan Fan
Binquan Deng
Yanbo Zhang
Ji Liu
Aouaidjia Kamel
Paolo Soda
Joao Gama
410
27
0
01 Aug 2024
DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training
ACM Multimedia (MM), 2024
Xi Chen
Qian Qiao
Jun Gao
Tianxiang Wu
Rahul Bhadani
Jiaqing Fan
Ziqiang Cao
Larry Head
DiffM
348
13
0
01 Aug 2024
WAS: Dataset and Methods for Artistic Text Segmentation
Xudong Xie
Yuzhe Li
Yang Liu
Zhifei Zhang
Zhaowen Wang
Wei Xiong
Xiang Bai
DiffM
269
3
0
31 Jul 2024
Open-Vocabulary Audio-Visual Semantic Segmentation
Zhenghao Zhang
Junchao Liao
Dantong Niu
Yanyu Qi
Menghao Li
Ji Shi
Bowei Xing
Xianghua Ying
VOS
VLM
259
18
0
31 Jul 2024
Dynamic Object Queries for Transformer-based Incremental Object Detection
Jichuan Zhang
Wei Li
Shuang Cheng
Yali Li
Shengjin Wang
236
7
0
31 Jul 2024
Leveraging Adaptive Implicit Representation Mapping for Ultra High-Resolution Image Segmentation
Ziyu Zhao
Xiaoguang Li
Pingping Cai
Canyu Zhang
Song Wang
265
0
0
31 Jul 2024
HandDAGT: A Denoising Adaptive Graph Transformer for 3D Hand Pose Estimation
Wencan Cheng
Eunji Kim
Jong Hwan Ko
3DH
ViT
235
3
0
30 Jul 2024
Classification Matters: Improving Video Action Detection with Class-Specific Attention
European Conference on Computer Vision (ECCV), 2024
Jinsung Lee
Taeoh Kim
Inwoong Lee
Minho Shim
Dongyoon Wee
Minsu Cho
Suha Kwak
391
1
0
29 Jul 2024
Practical Video Object Detection via Feature Selection and Aggregation
Yuheng Shi
Tong Zhang
Xiaojie Guo
ObjD
325
5
0
29 Jul 2024
Background Semantics Matter: Cross-Task Feature Exchange Network for Clustered Infrared Small Target Detection
Mengxuan Xiao
Yinfei Zhu
Yiming Zhu
Boyang Li
Kehua Guo
Huan Wang
Meng Cai
Yimian Dai
ObjD
503
16
0
29 Jul 2024
WeCromCL: Weakly Supervised Cross-Modality Contrastive Learning for Transcription-only Supervised Text Spotting
European Conference on Computer Vision (ECCV), 2024
Jingjing Wu
Zhengyao Fang
Pengyuan Lyu
Chengquan Zhang
Fanglin Chen
Guangming Lu
Wenjie Pei
492
4
0
28 Jul 2024
XS-VID: An Extremely Small Video Object Detection Dataset
Jiahao Guo
Ziyang Xu
Lianjun Wu
Fei Gao
Wenyu Liu
Xinggang Wang
ObjD
248
5
0
25 Jul 2024
CSWin-UNet: Transformer UNet with Cross-Shaped Windows for Medical Image Segmentation
Xiao Liu
Peng Gao
Tao Yu
Haiwei Yang
Ruyue Yuan
MedIm
ViT
221
100
0
25 Jul 2024
StreamMOS: Streaming Moving Object Segmentation with Multi-View Perception and Dual-Span Memory
Zhiheng Li
Yubo Cui
Jiexi Zhong
Zheng Fang
VOS
213
3
0
25 Jul 2024
SDLNet: Statistical Deep Learning Network for Co-Occurring Object Detection and Identification
Binay Kumar Singh
Niels Da Vitoria Lobo
ObjD
64
0
0
24 Jul 2024
MuST: Multi-Scale Transformers for Surgical Phase Recognition
Alejandra Pérez
Santiago Rodríguez
Nicolás Ayobi
Nicolás Aparicio
Eugénie Dessevres
Pablo Arbelaez
MedIm
235
7
0
24 Jul 2024
Progressive Query Refinement Framework for Bird's-Eye-View Semantic Segmentation from Surrounding Images
Dooseop Choi
Jungyu Kang
Taeghyun An
Kyounghwan Ahn
Kyoung‐Wook Min
214
0
0
24 Jul 2024
DVPE: Divided View Position Embedding for Multi-View 3D Object Detection
Jiasen Wang
Zhenglin Li
Ke Sun
Xianyuan Liu
Yang Zhou
238
2
0
24 Jul 2024
PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects
Junyi Li
Junfeng Wu
Weizhi Zhao
Song Bai
Xiang Bai
232
13
0
23 Jul 2024
Dynamic Retraining-Updating Mean Teacher for Source-Free Object Detection
Trinh Le Ba Khanh
Huy-Hung Nguyen
L. Pham
Duong Nguyen-Ngoc Tran
Jae Wook Jeon
304
15
0
23 Jul 2024
ESOD: Efficient Small Object Detection on High-Resolution Images
Kai-Chun Liu
Zhihang Fu
Sheng Jin
Ze Chen
Fan Zhou
Rongxin Jiang
Yao-Shen Chen
Jieping Ye
ObjD
290
26
0
23 Jul 2024
SAM-CP: Marrying SAM with Composable Prompts for Versatile Segmentation
Pengfei Chen
Lingxi Xie
Xinyue Huo
Xuehui Yu
Xiaopeng Zhang
Yingfei Sun
Zhenjun Han
Qi Tian
VLM
489
5
0
23 Jul 2024
Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object Detection
Zhili Chen
Shuangjie Xu
Maosheng Ye
Zian Qian
Xiaoyi Zou
Dit-Yan Yeung
Qifeng Chen
286
5
0
22 Jul 2024
Explore the LiDAR-Camera Dynamic Adjustment Fusion for 3D Object Detection
Yiran Yang
Xu Gao
Tong Wang
Xin Hao
Yifeng Shi
Xiao Tan
Xiaoqing Ye
Jingdong Wang
3DPC
157
0
0
22 Jul 2024
Navigation Instruction Generation with BEV Perception and Large Language Models
Sheng Fan
Rui Liu
Wenguan Wang
Yi Yang
266
20
0
21 Jul 2024
Efficient Visual Transformer by Learnable Token Merging
Yancheng Wang
Yingzhen Yang
ViT
341
11
0
21 Jul 2024
RayFormer: Improving Query-Based Multi-Camera 3D Object Detection via Ray-Centric Strategies
Xiaomeng Chu
Jiajun Deng
Guoliang You
YiFan Duan
Yao Li
Yanyong Zhang
413
5
0
20 Jul 2024
PolyR-CNN: R-CNN for end-to-end polygonal building outline extraction
Weiqin Jiao
Claudio Persello
G. Vosselman
3DV
205
17
0
20 Jul 2024
Bucketed Ranking-based Losses for Efficient Training of Object Detectors
Feyza Yavuz
Baris Can Cam
Adnan Harun Dogan
Kemal Oksuz
Emre Akbas
Sinan Kalkan
302
5
0
19 Jul 2024
OCTrack: Benchmarking the Open-Corpus Multi-Object Tracking
Zekun Qian
Ruize Han
Wei Feng
Junhui Hou
Linqi Song
Song Wang
278
1
0
19 Jul 2024
Mask2Map: Vectorized HD Map Construction Using Bird's Eye View Segmentation Masks
Sehwan Choi
Jungho Kim
Hongjae Shin
Jungwook Choi
3DPC
270
25
0
18 Jul 2024
Real-Time 3D Occupancy Prediction via Geometric-Semantic Disentanglement
Yulin He
Wei Chen
Tianci Xun
Yusong Tan
3DPC
289
1
0
18 Jul 2024
DFMSD: Dual Feature Masking Stage-wise Knowledge Distillation for Object Detection
Zhourui Zhang
Jun Li
Zhijian Wu
Jifeng Shen
Jianhua Xu
181
0
0
18 Jul 2024
OE-BevSeg: An Object Informed and Environment Aware Multimodal Framework for Bird's-eye-view Vehicle Semantic Segmentation
Jian Sun
Yuqi Dai
Chi-Man Vong
Qing Xu
Shengbo Eben Li
Jianqiang Wang
Lei He
Keqiang Li
339
3
0
18 Jul 2024
ViLLa: Video Reasoning Segmentation with Large Language Model
Rongkun Zheng
Lu Qi
Xi Chen
Yi Wang
Kun Wang
Yu Qiao
Hengshuang Zhao
VOS
LRM
519
16
0
18 Jul 2024
GroupMamba: Efficient Group-Based Visual State Space Model
Abdelrahman M. Shaker
Syed Talal Wasim
Salman Khan
Juergen Gall
Fahad Shahbaz Khan
Mamba
216
4
0
18 Jul 2024
AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer
Zhuguanyu Wu
Jiaxin Chen
Hanwen Zhong
Di Huang
Yun Wang
MQ
353
23
0
17 Jul 2024
MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Models
Leyang Shen
Gongwei Chen
Rui Shao
Weili Guan
Liqiang Nie
MoE
203
34
0
17 Jul 2024
Hierarchical and Decoupled BEV Perception Learning Framework for Autonomous Driving
Yuqi Dai
Jian Sun
Shengbo Eben Li
Qing Xu
Jianqiang Wang
Lei He
Keqiang Li
295
3
0
17 Jul 2024
VisionTrap: Vision-Augmented Trajectory Prediction Guided by Textual Descriptions
Seokha Moon
Hyun Woo
Hongbeen Park
Haeji Jung
R. Mahjourian
Hyung-Gun Chi
Hyerin Lim
Sangpil Kim
Jinkyu Kim
289
21
0
17 Jul 2024
Hierarchical Separable Video Transformer for Snapshot Compressive Imaging
Ping Wang
Yulun Zhang
Lishun Wang
Xin Yuan
ViT
425
4
0
16 Jul 2024
Relation DETR: Exploring Explicit Position Relation Prior for Object Detection
Xiuquan Hou
Mei-qin Liu
Senlin Zhang
Ping Wei
Badong Chen
Xuguang Lan
ViT
265
61
0
16 Jul 2024
Bridge Past and Future: Overcoming Information Asymmetry in Incremental Object Detection
Qijie Mo
Yipeng Gao
Shenghao Fu
Junkai Yan
Ancong Wu
Wei-Shi Zheng
CLL
277
14
0
16 Jul 2024
Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes
Zhi Cai
Yingjie Gao
Yaoyan Zheng
Nan Zhou
Di Huang
VLM
402
19
0
16 Jul 2024
Previous
1
2
3
...
13
14
15
...
54
55
56
Next
Page 14 of 56
Page
of 56
Go