Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2010.04159
Cited By
v1
v2
v3
v4 (latest)
Deformable DETR: Deformable Transformers for End-to-End Object Detection
International Conference on Learning Representations (ICLR), 2020
8 October 2020
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Github (3553★)
Papers citing
"Deformable DETR: Deformable Transformers for End-to-End Object Detection"
50 / 2,782 papers shown
CycleHOI: Improving Human-Object Interaction Detection with Cycle Consistency of Detection and Generation
Yisen Wang
Yao Teng
Limin Wang
DiffM
333
6
0
16 Jul 2024
Continuity Preserving Online CenterLine Graph Learning
Yunhui Han
Kun Yu
Zhiwei Li
GNN
3DPC
319
3
0
16 Jul 2024
TCFormer: Visual Recognition via Token Clustering Transformer
Wang Zeng
Sheng Jin
Lumin Xu
Wentao Liu
Chao Qian
Wanli Ouyang
Ping Luo
Xiaogang Wang
198
15
0
16 Jul 2024
Can Textual Semantics Mitigate Sounding Object Segmentation Preference?
Yaoting Wang
Peiwen Sun
Yuanchao Li
Honggang Zhang
Di Hu
324
13
0
15 Jul 2024
RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception
Chunliang Li
Wencheng Han
Junbo Yin
Sanyuan Zhao
Jianbing Shen
227
10
0
15 Jul 2024
GTPT: Group-based Token Pruning Transformer for Efficient Human Pose Estimation
Haonan Wang
Jie Liu
Jie Tang
Gangshan Wu
Bo Xu
Y. Kevin Chou
Yong Wang
ViT
294
8
0
15 Jul 2024
SEED: A Simple and Effective 3D DETR in Point Clouds
Yanfeng Guo
Jinghua Hou
Xiaoqing Ye
Tong Wang
Jingdong Wang
Xiang Bai
3DPC
215
21
0
15 Jul 2024
Joint-Embedding Predictive Architecture for Self-Supervised Learning of Mask Classification Architecture
Donghee Kim
Sungduk Cho
Hyeonwoo Cho
Chanmin Park
Jinyoung Kim
Won Hwa Kim
281
0
0
15 Jul 2024
FRI-Net: Floorplan Reconstruction via Room-wise Implicit Representation
Honghao Xu
Juzhan Xu
Zeyu Huang
Pengfei Xu
Hui Huang
Ruizhen Hu
3DV
201
4
0
15 Jul 2024
PolyRoom: Room-aware Transformer for Floorplan Reconstruction
Yuzhou Liu
Lingjie Zhu
Xiaodong Ma
Hanqiao Ye
Xiang Gao
Xianwei Zheng
Shuhan Shen
203
6
0
15 Jul 2024
When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset
Yi Zhang
Wang Zeng
Sheng Jin
Chao Qian
Ping Luo
Wentao Liu
265
19
0
14 Jul 2024
Plain-Det: A Plain Multi-Dataset Object Detector
Cheng Shi
Yuchen Zhu
Sibei Yang
ObjD
VLM
250
8
0
14 Jul 2024
MutDet: Mutually Optimizing Pre-training for Remote Sensing Object Detection
Ziyue Huang
Yongchao Feng
Qingjie Liu
Yunhong Wang
ViT
333
3
0
13 Jul 2024
IFTR: An Instance-Level Fusion Transformer for Visual Collaborative Perception
Shaohong Wang
Lu Bin
Xinyu Xiao
Zhiyu Xiang
Hangguan Shan
Eryun Liu
ViT
326
8
0
13 Jul 2024
Image Compression for Machine and Human Vision with Spatial-Frequency Adaptation
Han Li
Shaohui Li
Shuangrui Ding
Wenrui Dai
Maida Cao
Chenglin Li
Junni Zou
Hongkai Xiong
VLM
338
28
0
13 Jul 2024
Neural-based Video Compression on Solar Dynamics Observatory Images
Atefeh Khoshkhahtinat
Ali Zafari
P. Mehta
Nasser M. Nasrabadi
Barbara J. Thompson
M. Kirk
D. D. Silva
298
1
0
12 Jul 2024
FD-SOS: Vision-Language Open-Set Detectors for Bone Fenestration and Dehiscence Detection from Intraoral Images
Marawan Elbatel
Keyuan Liu
Yanqi Yang
Xuelong Li
157
2
0
12 Jul 2024
Domain-adaptive Video Deblurring via Test-time Blurring
Jin-Ting He
Fu-Jen Tsai
Jia-Hao Wu
Yan-Tsung Peng
Chung-Chi Tsai
Chia-Wen Lin
Yen-Yu Lin
244
8
0
12 Jul 2024
DroneMOT: Drone-based Multi-Object Tracking Considering Detection Difficulties and Simultaneous Moving of Drones and Objects
Peng Wang
Yongcai Wang
Deying Li
VOT
266
14
0
12 Jul 2024
Textual Query-Driven Mask Transformer for Domain Generalized Segmentation
Byeonghyun Pak
Byeongju Woo
Sunghwan Kim
Dae-Hwan Kim
Hoseong Kim
436
20
0
12 Jul 2024
Revealing the Dark Secrets of Extremely Large Kernel ConvNets on Robustness
Honghao Chen
Yurong Zhang
Xiaokun Feng
Xiangxiang Chu
Kaiqi Huang
AAML
300
10
0
12 Jul 2024
Semi-Supervised Object Detection: A Survey on Progress from CNN to Transformer
Tahira Shehzadi
Ifza
Didier Stricker
Muhammad Zeshan Afzal
ViT
369
11
0
11 Jul 2024
Cross Domain Object Detection via Multi-Granularity Confidence Alignment based Mean Teacher
Jiangming Chen
Li Liu
Wanxia Deng
Zhen Liu
Yu Liu
Yingmei Wei
Yongxiang Liu
227
2
0
10 Jul 2024
DIOR-ViT: Differential Ordinal Learning Vision Transformer for Cancer Classification in Pathology Images
Ju Cheon Lee
Keunho Byeon
Boram Song
Kyungeun Kim
Jin Tae Kwak
MedIm
307
4
0
10 Jul 2024
Deformable-Heatmap-Segmentation for Automobile Visual Perception
Hongyu Jin
109
1
0
10 Jul 2024
ActionVOS: Actions as Prompts for Video Object Segmentation
Liangyang Ouyang
Ruicong Liu
Yifei Huang
Ryosuke Furuta
Yoichi Sato
VOS
222
9
0
10 Jul 2024
Exploring Camera Encoder Designs for Autonomous Driving Perception
Barath Lakshmanan
Joshua Chen
Shiyi Lan
Maying Shen
Zhiding Yu
Jose M. Alvarez
263
0
0
09 Jul 2024
D-MASTER: Mask Annealed Transformer for Unsupervised Domain Adaptation in Breast Cancer Detection from Mammograms
Tajamul Ashraf
K. Rangarajan
Mohit Gambhir
Richa Gabha
Chetan Arora
MedIm
263
4
0
09 Jul 2024
Anatomy-guided Pathology Segmentation
A. Jaus
C. Seibold
Simon Reiß
Lukas Heine
Anton Schily
Moon Kim
F. Bahnsen
Ken Herrmann
Rainer Stiefelhagen
Jens Kleesiek
MedIm
186
9
0
08 Jul 2024
Learning Lane Graphs from Aerial Imagery Using Transformers
Martin Büchner
Simon Dorer
Abhinav Valada
196
0
0
08 Jul 2024
Described Spatial-Temporal Video Detection
Wei Ji
Xiangyan Liu
Yingfei Sun
Jiajun Deng
You Qin
Ammar Nuwanna
Mengyao Qiu
Lina Wei
Roger Zimmermann
285
3
0
08 Jul 2024
Smart Camera Parking System With Auto Parking Spot Detection
Tuan T. Nguyen
Mina Sartipi
215
5
0
07 Jul 2024
JDT3D: Addressing the Gaps in LiDAR-Based Tracking-by-Attention
Brian Cheong
Jiachen Zhou
Steven Waslander
250
4
0
06 Jul 2024
Multi-Branch Auxiliary Fusion YOLO with Re-parameterization Heterogeneous Convolutional for accurate object detection
Zhiqiang Yang
Q. Guan
Keer Zhao
Jianmin Yang
Xinli Xu
Haixia Long
Ying Tang
296
101
0
05 Jul 2024
AMD: Automatic Multi-step Distillation of Large-scale Vision Models
Cheng Han
Qifan Wang
S. Dianat
Majid Rabbani
Raghuveer M. Rao
Yi Fang
Qiang Guan
Lifu Huang
Dongfang Liu
VLM
209
14
0
05 Jul 2024
QueryMamba: A Mamba-Based Encoder-Decoder Architecture with a Statistical Verb-Noun Interaction Module for Video Action Forecasting @ Ego4D Long-Term Action Anticipation Challenge 2024
Zeyun Zhong
Manuel Martin
Frederik Diederichs
Juergen Beyerer
195
5
0
04 Jul 2024
Occupancy as Set of Points
Yiang Shi
Tianheng Cheng
Qian Zhang
Wenyu Liu
Xinggang Wang
3DPC
280
28
0
04 Jul 2024
Cyclic Refiner: Object-Aware Temporal Representation Learning for Multi-View 3D Detection and Tracking
Mingzhe Guo
Zhipeng Zhang
Liping Jing
Yuan He
Ke Wang
Heng Fan
246
3
0
03 Jul 2024
Graph and Skipped Transformer: Exploiting Spatial and Temporal Modeling Capacities for Efficient 3D Human Pose Estimation
Mengmeng Cui
Kunbo Zhang
Zhenan Sun
ViT
228
0
0
03 Jul 2024
CAVIS: Context-Aware Video Instance Segmentation
Seunghun Lee
Jiwan Seo
Kiljoon Han
Minwoo Choi
S. Im
VOS
380
4
0
03 Jul 2024
Research on Reliable and Safe Occupancy Grid Prediction in Underground Parking Lots
JiaQi Luo
190
0
0
02 Jul 2024
CountFormer: Multi-View Crowd Counting Transformer
Hong Mo
Xiong Zhang
Jianchao Tan
Cheng Yang
Qiong Gu
Bo Hang
Wenqi Ren
292
9
0
02 Jul 2024
SymPoint Revolutionized: Boosting Panoptic Symbol Spotting with Layer Feature Enhancement
Wenlong Liu
Tianyu Yang
Qizhi Yu
Lei Zhang
222
4
0
02 Jul 2024
LPViT: Low-Power Semi-structured Pruning for Vision Transformers
Kaixin Xu
Zhe Wang
Chunyun Chen
Xue Geng
Jie Lin
Xulei Yang
Ruibing Jin
Min Wu
Xiaoli Li
Weisi Lin
ViT
VLM
666
18
0
02 Jul 2024
Robot Instance Segmentation with Few Annotations for Grasping
Moshe Kimhi
David Vainshtein
Chaim Baskin
Dotan Di Castro
428
6
0
01 Jul 2024
SOOD++: Leveraging Unlabeled Data to Boost Oriented Object Detection
Dingkang Liang
Wei Hua
Chunsheng Shi
Zhikang Zou
Xiaoqing Ye
X. Bai
365
10
0
01 Jul 2024
Parametric Primitive Analysis of CAD Sketches with Vision Transformer
Xiaogang Wang
Liang Wang
Hongyu Wu
Guoqiang Xiao
Kai Xu
171
4
0
29 Jun 2024
GM-DF: Generalized Multi-Scenario Deepfake Detection
Yingxin Lai
Zitong Yu
Jing Yang
Bin Li
Xiangui Kang
Linlin Shen
308
19
0
28 Jun 2024
Mobile Robot Oriented Large-Scale Indoor Dataset for Dynamic Scene Understanding
Yifan Tang
Cong Tai
Fangxing Chen
Wanting Zhang
Tao Zhang
Xueping Liu
Yongjin Liu
Long Zeng
262
10
0
28 Jun 2024
Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads
Ali Khaleghi Rahimian
Manish Kumar Govind
Subhajit Maity
Dominick Reilly
Christian Kummerle
Srijan Das
A. Dutta
240
1
0
27 Jun 2024
Previous
1
2
3
...
14
15
16
...
54
55
56
Next
Page 15 of 56
Page
of 56
Go