Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2010.04159
Cited By
v1
v2
v3
v4 (latest)
Deformable DETR: Deformable Transformers for End-to-End Object Detection
International Conference on Learning Representations (ICLR), 2020
8 October 2020
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (1 upvotes)
Github (3553★)
Papers citing
"Deformable DETR: Deformable Transformers for End-to-End Object Detection"
50 / 2,782 papers shown
ODTFormer: Efficient Obstacle Detection and Tracking with Stereo Cameras Based on Transformer
Tianye Ding
Hongyu Li
Huaizu Jiang
188
1
0
21 Mar 2024
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
Qing Jiang
Feng Li
Zhaoyang Zeng
Tianhe Ren
Shilong Liu
Lei Zhang
VLM
363
82
0
21 Mar 2024
LDTR: Transformer-based Lane Detection with Anchor-chain Representation
Zhongyu Yang
Chen Shen
Wei Shao
Tengfei Xing
Runbo Hu
Pengfei Xu
Hua Chai
Ruini Xue
ViT
211
12
0
21 Mar 2024
Scene-Graph ViT: End-to-End Open-Vocabulary Visual Relationship Detection
Tim Salzmann
Markus Ryll
Alex Bewley
Matthias Minderer
288
8
0
21 Mar 2024
Volumetric Environment Representation for Vision-Language Navigation
Rui Liu
Wenguan Wang
Yi Yang
252
61
0
21 Mar 2024
Meta-Point Learning and Refining for Category-Agnostic Pose Estimation
Junjie Chen
Jiebin Yan
Yuming Fang
Li Niu
3DPC
250
8
0
20 Mar 2024
vid-TLDR: Training Free Token merging for Light-weight Video Transformer
Joonmyung Choi
Sanghyeok Lee
Jaewon Chu
Minhyuk Choi
Hyunwoo J. Kim
MoMe
ViT
289
42
0
20 Mar 2024
Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models
Zuyan Liu
Yuhao Dong
Yongming Rao
Jie Zhou
Jiwen Lu
LRM
251
44
0
19 Mar 2024
TAPTR: Tracking Any Point with Transformers as Detection
Hongyang Li
Hao Zhang
Shilong Liu
Zhaoyang Zeng
Tianhe Ren
Feng Li
Lei Zhang
202
41
0
19 Mar 2024
FaceXFormer: A Unified Transformer for Facial Analysis
Kartik Narayan
VS Vibashan
Rama Chellappa
Vishal M. Patel
ViT
499
36
0
19 Mar 2024
Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation
European Conference on Computer Vision (ECCV), 2024
Zixin Zhu
Xuelu Feng
Dongdong Chen
Junsong Yuan
Chunming Qiao
Gang Hua
DiffM
323
17
0
18 Mar 2024
BEVCar: Camera-Radar Fusion for BEV Map and Object Segmentation
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024
Jonas Schramm
Niclas Vodisch
Kürsat Petek
Ravi Kiran
S. Yogamani
Wolfram Burgard
Abhinav Valada
235
34
0
18 Mar 2024
Continual Forgetting for Pre-trained Vision Models
Computer Vision and Pattern Recognition (CVPR), 2024
Hongbo Zhao
Bolin Ni
Haochen Wang
Junsong Fan
Fei Zhu
Yuxi Wang
Yuntao Chen
Gaofeng Meng
Zhaoxiang Zhang
MU
VLM
350
23
0
18 Mar 2024
GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection
Ziying Song
Lei Yang
Shaoqing Xu
Lin Liu
Dongyang Xu
Caiyan Jia
Feiyang Jia
Li-e Wang
3DPC
529
50
0
18 Mar 2024
Align and Distill: Unifying and Improving Domain Adaptive Object Detection
Justin Kay
T. Haucke
Suzanne Stathatos
Siqi Deng
Erik Young
Pietro Perona
Sara Beery
Grant Van Horn
490
14
0
18 Mar 2024
Domain-Guided Masked Autoencoders for Unique Player Identification
Bavesh Balaji
Jerrin Bright
Sirisha Rambhatla
Yuhao Chen
Alexander Wong
John S. Zelek
David A Clausi
173
2
0
17 Mar 2024
NetTrack: Tracking Highly Dynamic Objects with a Net
Guang-Zheng Zheng
Shijie Lin
Haobo Zuo
Changhong Fu
Jia Pan
257
25
0
17 Mar 2024
Diffusion Models are Efficient Data Generators for Human Mesh Recovery
Yongtao Ge
Wenjia Wang
Yongfan Chen
Fanzhou Wang
Lei Yang
Hao Chen
Chunhua Shen
3DH
476
8
0
17 Mar 2024
SimPB: A Single Model for 2D and 3D Object Detection from Multiple Cameras
European Conference on Computer Vision (ECCV), 2024
Yingqi Tang
Zhaotie Meng
Guoliang Chen
Erkang Cheng
3DPC
248
5
0
15 Mar 2024
Generative Region-Language Pretraining for Open-Ended Object Detection
Computer Vision and Pattern Recognition (CVPR), 2024
Chuang Lin
Yi Jiang
Zhuang Li
Zehuan Yuan
Jianfei Cai
ObjD
VLM
224
27
0
15 Mar 2024
TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model
Jiahao Lyu
Jin Wei
Gangyan Zeng
Zeng Li
Enze Xie
Wei Wang
Can Ma
VLM
301
7
0
15 Mar 2024
SparseFusion: Efficient Sparse Multi-Modal Fusion Framework for Long-Range 3D Perception
Yiheng Li
Hongyang Li
Zehao Huang
Hong Chang
Naiyan Wang
273
6
0
15 Mar 2024
Multi-criteria Token Fusion with One-step-ahead Attention for Efficient Vision Transformers
Computer Vision and Pattern Recognition (CVPR), 2024
Sanghyeok Lee
Joonmyung Choi
Hyunwoo J. Kim
ViT
413
25
0
15 Mar 2024
ST-LDM: A Universal Framework for Text-Grounded Object Generation in Real Images
European Conference on Computer Vision (ECCV), 2024
Xiangtian Xue
Jiasong Wu
Youyong Kong
L. Senhadji
Huazhong Shu
DiffM
158
1
0
15 Mar 2024
EfficientVMamba: Atrous Selective Scan for Light Weight Visual Mamba
AAAI Conference on Artificial Intelligence (AAAI), 2024
Xiaohuan Pei
Tao Huang
Chang Xu
Mamba
303
189
0
15 Mar 2024
HyCTAS: Multi-Objective Hybrid Convolution-Transformer Architecture Search for Real-Time Image Segmentation
Hongyuan Yu
Cheng Wan
Xiyang Dai
Dongdong Chen
Bin Xiao
Xiyang Dai
Yan Huang
Yuan Lu
Liang Wang
315
8
0
15 Mar 2024
Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding
Guo Chen
Yifei Huang
Jilan Xu
Baoqi Pei
Zhe Chen
Zhiqi Li
Jiahao Wang
Kunchang Li
Tong Lu
Limin Wang
Mamba
281
128
0
14 Mar 2024
Open-Vocabulary Object Detection with Meta Prompt Representation and Instance Contrastive Optimization
British Machine Vision Conference (BMVC), 2024
Zhao Wang
Aoxue Li
Fengwei Zhou
Zhenguo Li
Qi Dou
ObjD
VLM
248
4
0
14 Mar 2024
Efficient Transferability Assessment for Selection of Pre-trained Detectors
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Zhao Wang
Aoxue Li
Zhenguo Li
Qi Dou
174
0
0
14 Mar 2024
GiT: Towards Generalist Vision Transformer through Universal Language Interface
European Conference on Computer Vision (ECCV), 2024
Haiyang Wang
Hao Tang
Li Jiang
Shaoshuai Shi
Muhammad Ferjad Naeem
Jiaming Song
Bernt Schiele
Liwei Wang
VLM
286
23
0
14 Mar 2024
PoIFusion: Multi-Modal 3D Object Detection via Fusion at Points of Interest
Jiajun Deng
Sha Zhang
Feras Dayoub
Wanli Ouyang
Yanyong Zhang
Ian Reid
3DPC
315
8
0
14 Mar 2024
PYRA: Parallel Yielding Re-Activation for Training-Inference Efficient Task Adaptation
European Conference on Computer Vision (ECCV), 2024
Yizhe Xiong
Hui Chen
Tianxiang Hao
Zijia Lin
Jungong Han
Yuesong Zhang
Guoxin Wang
Yongjun Bao
Guiguang Ding
343
26
0
14 Mar 2024
ThermoHands: A Benchmark for 3D Hand Pose Estimation from Egocentric Thermal Images
ACM International Conference on Embedded Networked Sensor Systems (SenSys), 2024
Fangqiang Ding
Yunzhou Zhu
Xiangyu Wen
Gaowen Liu
Chris Xiaoxuan Lu
556
9
0
14 Mar 2024
MonoOcc: Digging into Monocular Semantic Occupancy Prediction
IEEE International Conference on Robotics and Automation (ICRA), 2024
Yupeng Zheng
Xiang Li
Pengfei Li
Yuhang Zheng
Bu Jin
Chengliang Zhong
Xiaoxiao Long
Hao Zhao
Qichao Zhang
220
44
0
13 Mar 2024
MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning
Jialv Zou
Bencheng Liao
Qian Zhang
Wenyu Liu
Xinggang Wang
247
6
0
13 Mar 2024
Historical Astronomical Diagrams Decomposition in Geometric Primitives
IEEE International Conference on Document Analysis and Recognition (ICDAR), 2024
Syrine Kalleli
Scott Trigg
Ségolene Albouy
Mathieu Husson
Mathieu Aubry
162
2
0
13 Mar 2024
HIMap: HybrId Representation Learning for End-to-end Vectorized HD Map Construction
Computer Vision and Pattern Recognition (CVPR), 2024
Yi Zhou
Hui Zhang
Jiaqian Yu
Yifan Yang
Sangil Jung
Seungsang Park
ByungIn Yoo
3DPC
283
46
0
13 Mar 2024
TaskCLIP: Extend Large Vision-Language Model for Task Oriented Object Detection
Hanning Chen
Wenjun Huang
Yang Ni
Sanggeon Yun
Fei Wen
Hugo Latapie
Mohsen Imani
ObjD
MLLM
VLM
218
27
0
12 Mar 2024
A Survey of Vision Transformers in Autonomous Driving: Current Trends and Future Directions
Quoc-Vinh Lai-Dang
ViT
264
11
0
12 Mar 2024
ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions
Computer Vision and Pattern Recognition (CVPR), 2024
Chunlong Xia
Xinliang Wang
Feng Lv
Xin Hao
Yifeng Shi
ViT
430
132
0
12 Mar 2024
SparseLIF: High-Performance Sparse LiDAR-Camera Fusion for 3D Object Detection
European Conference on Computer Vision (ECCV), 2024
Hongcheng Zhang
Liu Liang
Pengxin Zeng
Xiao Song
Zhe Wang
380
23
0
12 Mar 2024
Unleashing HyDRa: Hybrid Fusion, Depth Consistency and Radar for Unified 3D Perception
IEEE International Conference on Robotics and Automation (ICRA), 2024
Philipp Wolters
Johannes Gilg
Torben Teepe
Fabian Herzog
Anouar Laouichi
Martin Hofmann
Gerhard Rigoll
MDE
527
34
0
12 Mar 2024
Real-time Transformer-based Open-Vocabulary Detection with Efficient Fusion Head
Tiancheng Zhao
Peng Liu
Xuan He
Lu Zhang
Kyusong Lee
ObjD
185
20
0
11 Mar 2024
Genetic Learning for Designing Sim-to-Real Data Augmentations
Bram Vanherle
Nick Michiels
F. Reeth
144
0
0
11 Mar 2024
SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object Detection
Neural Information Processing Systems (NeurIPS), 2024
Yuxuan Li
Kaijie Zhu
Wei-Jang Li
Qibin Hou
Tianpeng Liu
Ming-Ming Cheng
Jian Yang
472
84
0
11 Mar 2024
Enhancing 3D Object Detection with 2D Detection-Guided Query Anchors
Computer Vision and Pattern Recognition (CVPR), 2024
Haoxuanye Ji
Pengpeng Liang
Erkang Cheng
3DPC
206
17
0
10 Mar 2024
VLM-PL: Advanced Pseudo Labeling Approach for Class Incremental Object Detection via Vision-Language Model
Junsu Kim
Yunhoe Ku
Jihyeon Kim
Junuk Cha
Seungryul Baek
ObjD
VLM
335
24
0
08 Mar 2024
OccFusion: Depth Estimation Free Multi-sensor Fusion for 3D Occupancy Prediction
Asian Conference on Computer Vision (ACCV), 2024
Ji Zhang
Yiran Ding
Zixin Liu
3DPC
376
16
0
08 Mar 2024
Model Comparison for Fast Domain Adaptation in Table Service Scenario
Woo-han Yun
Minsu Jang
Jaehong Kim
110
0
0
08 Mar 2024
ActFormer: Scalable Collaborative Perception via Active Queries
IEEE International Conference on Robotics and Automation (ICRA), 2024
Suozhi Huang
Juexiao Zhang
Yiming Li
Chen Feng
234
8
0
08 Mar 2024
Previous
1
2
3
...
19
20
21
...
54
55
56
Next
Page 20 of 56
Page
of 56
Go