Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2203.03605
Cited By
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection
7 March 2022
Hao Zhang
Feng Li
Shilong Liu
Lei Zhang
Hang Su
Jun Zhu
L. Ni
H. Shum
ViT
Re-assign community
ArXiv
PDF
HTML
Papers citing
"DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
50 / 718 papers shown
Title
U-DECN: End-to-End Underwater Object Detection ConvNet with Improved DeNoising Training
Zhuoyan Liu
Bo Wang
Ye Li
ViT
30
0
0
11 Aug 2024
Embodied Uncertainty-Aware Object Segmentation
Xiaolin Fang
Leslie Pack Kaelbling
Tomás Lozano-Pérez
21
5
0
08 Aug 2024
Img-Diff: Contrastive Data Synthesis for Multimodal Large Language Models
Qirui Jiao
Daoyuan Chen
Yilun Huang
Yaliang Li
Ying Shen
VLM
27
5
0
08 Aug 2024
Data Generation Scheme for Thermal Modality with Edge-Guided Adversarial Conditional Diffusion Model
Guoqing Zhu
Honghu Pan
Qiang Wang
Chao Tian
Chao Yang
Zhenyu He
25
0
0
07 Aug 2024
Contrastive Learning for Image Complexity Representation
Shipeng Liu
Liang Zhao
Dengfeng Chen
Zhanping Song
34
2
0
06 Aug 2024
DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training
Xi Chen
Qian Qiao
Jun Gao
Tianxiang Wu
Rahul Bhadani
...
Ziqiang Cao
Larry Head
Yue Zhang
Jielei Zhang
Huyang Sun
DiffM
21
5
0
01 Aug 2024
Practical Video Object Detection via Feature Selection and Aggregation
Yuheng Shi
Tong Zhang
Xiaojie Guo
ObjD
29
2
0
29 Jul 2024
MOoSE: Multi-Orientation Sharing Experts for Open-set Scene Text Recognition
Chang Liu
Simon Corbillé
Elisa H Barney Smith
19
0
0
26 Jul 2024
VSSD: Vision Mamba with Non-Causal State Space Duality
Yuheng Shi
Minjing Dong
Mingjia Li
Chang Xu
Mamba
28
3
0
26 Jul 2024
DVPE: Divided View Position Embedding for Multi-View 3D Object Detection
Jiasen Wang
Zhenglin Li
Ke Sun
Xianyuan Liu
Yang Zhou
36
0
0
24 Jul 2024
Dynamic Retraining-Updating Mean Teacher for Source-Free Object Detection
Trinh Le Ba Khanh
Huy-Hung Nguyen
L. Pham
Duong Nguyen-Ngoc Tran
Jae Wook Jeon
33
3
0
23 Jul 2024
ESOD: Efficient Small Object Detection on High-Resolution Images
Kai-Chun Liu
Zhihang Fu
Sheng Jin
Ze Chen
Fan Zhou
Rongxin Jiang
Yao-Shen Chen
Jieping Ye
ObjD
33
2
0
23 Jul 2024
SAM-CP: Marrying SAM with Composable Prompts for Versatile Segmentation
Pengfei Chen
Lingxi Xie
Xinyue Huo
Xuehui Yu
Xiaopeng Zhang
Yingfei Sun
Zhenjun Han
Qi Tian
VLM
58
1
0
23 Jul 2024
DFMSD: Dual Feature Masking Stage-wise Knowledge Distillation for Object Detection
Zhourui Zhang
Jun Li
Zhijian Wu
Jifeng Shen
Jianhua Xu
36
0
0
18 Jul 2024
GroupMamba: Efficient Group-Based Visual State Space Model
Abdelrahman M. Shaker
Syed Talal Wasim
Salman Khan
Juergen Gall
Fahad Shahbaz Khan
Mamba
51
0
0
18 Jul 2024
Relation DETR: Exploring Explicit Position Relation Prior for Object Detection
Xiuquan Hou
Mei-qin Liu
Senlin Zhang
Ping Wei
Badong Chen
Xuguang Lan
ViT
45
15
0
16 Jul 2024
Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes
Zhi Cai
Yingjie Gao
Yaoyan Zheng
Nan Zhou
Di Huang
VLM
37
4
0
16 Jul 2024
LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction
Penghui Du
Yu Wang
Yifan Sun
Luting Wang
Yue Liao
Gang Zhang
Errui Ding
Yan Wang
Jingdong Wang
Si Liu
VLM
ObjD
33
1
0
16 Jul 2024
TCFormer: Visual Recognition via Token Clustering Transformer
Wang Zeng
Sheng Jin
Lumin Xu
Wentao Liu
Chao Qian
Wanli Ouyang
Ping Luo
Xiaogang Wang
26
3
0
16 Jul 2024
SEED: A Simple and Effective 3D DETR in Point Clouds
Zhe Liu
Jinghua Hou
Xiaoqing Ye
Tong Wang
Jingdong Wang
Xiang Bai
3DPC
33
6
0
15 Jul 2024
When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset
Yi Zhang
Wang Zeng
Sheng Jin
Chao Qian
Ping Luo
Wentao Liu
32
4
0
14 Jul 2024
Plain-Det: A Plain Multi-Dataset Object Detector
Cheng Shi
Yuchen Zhu
Sibei Yang
ObjD
VLM
24
0
0
14 Jul 2024
MutDet: Mutually Optimizing Pre-training for Remote Sensing Object Detection
Ziyue Huang
Yongchao Feng
Qingjie Liu
Yunhong Wang
ViT
56
1
0
13 Jul 2024
FD-SOS: Vision-Language Open-Set Detectors for Bone Fenestration and Dehiscence Detection from Intraoral Images
Marawan Elbatel
Keyuan Liu
Yanqi Yang
X. Li
19
0
0
12 Jul 2024
Textual Query-Driven Mask Transformer for Domain Generalized Segmentation
Byeonghyun Pak
Byeongju Woo
Sunghwan Kim
Dae-Hwan Kim
Hoseong Kim
40
3
0
12 Jul 2024
Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection
Xingyu Peng
Yan Bai
Chen Gao
Lirong Yang
Fei Xia
Beipeng Mu
Xiaofei Wang
Si Liu
ObjD
35
3
0
12 Jul 2024
D-MASTER: Mask Annealed Transformer for Unsupervised Domain Adaptation in Breast Cancer Detection from Mammograms
Tajamul Ashraf
K. Rangarajan
Mohit Gambhir
Richa Gabha
Chetan Arora
MedIm
31
1
0
09 Jul 2024
Multi-Branch Auxiliary Fusion YOLO with Re-parameterization Heterogeneous Convolutional for accurate object detection
Zhiqiang Yang
Q. Guan
Keer Zhao
Jianmin Yang
Xinli Xu
Haixia Long
Ying Tang
24
9
0
05 Jul 2024
Lift, Splat, Map: Lifting Foundation Masks for Label-Free Semantic Scene Completion
Arthur Zhang
Rainier Heijne
Joydeep Biswas
26
1
0
03 Jul 2024
Explainable vertebral fracture analysis with uncertainty estimation using differentiable rule-based classification
Victor Wåhlstrand Skärström
L. Johansson
Jennifer Alvén
M. Lorentzon
Ida Häggström
15
1
0
03 Jul 2024
SymPoint Revolutionized: Boosting Panoptic Symbol Spotting with Layer Feature Enhancement
Wenlong Liu
Tianyu Yang
Qizhi Yu
Lei Zhang
41
2
0
02 Jul 2024
Parametric Primitive Analysis of CAD Sketches with Vision Transformer
Xiaogang Wang
Liang Wang
Hongyu Wu
Guoqiang Xiao
Kai Xu
29
2
0
29 Jun 2024
From Local Concepts to Universals: Evaluating the Multicultural Understanding of Vision-Language Models
Mehar Bhatia
Sahithya Ravi
Aditya Chinchure
EunJeong Hwang
Vered Shwartz
VLM
18
2
0
28 Jun 2024
Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding
Yue Fan
Lei Ding
Ching-Chen Kuo
Shan Jiang
Yang Zhao
Xinze Guan
Jie Yang
Yi Zhang
Xin Eric Wang
39
10
0
27 Jun 2024
Segment Anything Model for automated image data annotation: empirical studies using text prompts from Grounding DINO
F. Mumuni
A. Mumuni
VLM
39
0
0
27 Jun 2024
MATE: Meet At The Embedding -- Connecting Images with Long Texts
Young Kyun Jang
Junmo Kang
Yong Jae Lee
Donghyun Kim
VLM
31
5
0
26 Jun 2024
MDHA: Multi-Scale Deformable Transformer with Hybrid Anchors for Multi-View 3D Object Detection
Michelle Adeline
Junn Yong Loo
Vishnu Monn Baskaran
44
0
0
25 Jun 2024
High-resolution open-vocabulary object 6D pose estimation
Jaime Corsetti
Davide Boscaini
Francesco Giuliari
Changjae Oh
Andrea Cavallaro
Fabio Poiesi
28
1
0
24 Jun 2024
Rethinking Remote Sensing Change Detection With A Mask View
Xiaowen Ma
Zhenkai Wu
Rongrong Lian
Wei Zhang
Siyang Song
27
3
0
21 Jun 2024
Enhanced Bank Check Security: Introducing a Novel Dataset and Transformer-Based Approach for Detection and Verification
Muhammad Gul Zain Ali Khan
Tahira Shehzadi
Rabeya Noor
Didier Stricker
Muhammad Zeshan Afzal
30
1
0
20 Jun 2024
SSAD: Self-supervised Auxiliary Detection Framework for Panoramic X-ray based Dental Disease Diagnosis
Zijian Cai
Xinquan Yang
Xuguang Li
Xiaoling Luo
Xuechen Li
Linlin Shen
He Meng
Yongqiang Deng
MedIm
26
0
0
20 Jun 2024
ViLCo-Bench: VIdeo Language COntinual learning Benchmark
Tianqi Tang
Shohreh Deldari
Hao Xue
Celso De Melo
Flora D. Salim
CLL
27
2
0
19 Jun 2024
V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results
Jiaqi Wang
Yuhang Zang
Pan Zhang
Tao Chu
Yuhang Cao
...
Kehong Yuan
Yanyan Zu
Jiayao Ha
Qiong Gao
Licheng Jiao
ObjD
39
1
0
17 Jun 2024
Technique Report of CVPR 2024 PBDL Challenges
Ying Fu
Yu Li
Shaodi You
Boxin Shi
Linwei Chen
...
Songyin Dai
Sen Jia
Junpei Zhang
Puhua Chen
Qihang Li
33
0
0
15 Jun 2024
An Image is Worth 32 Tokens for Reconstruction and Generation
Qihang Yu
Mark Weber
XueQing Deng
Xiaohui Shen
Daniel Cremers
Liang-Chieh Chen
VLM
ViT
44
79
0
11 Jun 2024
Open-World Human-Object Interaction Detection via Multi-modal Prompts
Jie-jin Yang
Bingliang Li
Ailing Zeng
L. Zhang
Ruimao Zhang
VLM
32
8
0
11 Jun 2024
Mamba YOLO: SSMs-Based YOLO For Object Detection
Zeyu Wang
Chen Li
Huiying Xu
Xinzhong Zhu
Mamba
47
13
0
09 Jun 2024
OD-DETR: Online Distillation for Stabilizing Training of Detection Transformer
Shengjian Wu
Li Sun
Qingli Li
30
0
0
09 Jun 2024
Utilizing Grounded SAM for self-supervised frugal camouflaged human detection
Matthias Pijarowski
Alexander Wolpert
Martin Heckmann
Michael Teutsch
40
0
0
09 Jun 2024
A DeNoising FPN With Transformer R-CNN for Tiny Object Detection
Hou-I Liu
Yu-Wen Tseng
Kai-Cheng Chang
Pin-Jyun Wang
Hong-Han Shuai
Wen-Huang Cheng
ViT
ObjD
40
22
0
09 Jun 2024
Previous
1
2
3
4
5
6
...
13
14
15
Next