DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection

7 March 2022

Hao Zhang

Feng Li

Shilong Liu

Lei Zhang

Hang Su

Jun Zhu

Papers citing "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"

50 / 716 papers shown

Title
ViMoE: An Empirical Study of Designing Vision Mixture-of-Experts Xumeng Han Longhui Wei Zhiyang Dou Zipeng Wang Chenhui Qiang Xin He Yingfei Sun Zhenjun Han Qi Tian MoE 33 3 0 21 Oct 2024
A Survey of Hallucination in Large Visual Language Models Wei Lan Wenyi Chen Qingfeng Chen Shirui Pan Huiyu Zhou Yi-Lun Pan LRM 28 4 0 20 Oct 2024
Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text Describability Yusuke Hosoya Masanori Suganuma Takayuki Okatani ObjD 16 0 0 20 Oct 2024
D-FINE: Redefine Regression Task in DETRs as Fine-grained Distribution Refinement Yansong Peng Hebei Li Peixi Wu Yueyi Zhang X. Sun Feng Wu 34 13 0 17 Oct 2024
OAH-Net: A Deep Neural Network for Hologram Reconstruction of Off-axis Digital Holographic Microscope Wei Liu Kerem Delikoyun Qianyu Chen Alperen Yildiz Si Ko Myo Win Sen Kuan John Tshon Yit Soong Matthew Edward Cove Oliver Hayden Hweekuan Lee 21 0 0 17 Oct 2024
VividMed: Vision Language Model with Versatile Visual Grounding for Medicine Lingxiao Luo Bingda Tang Xuanzhong Chen Rong Han Ting Chen VLM 21 2 0 16 Oct 2024
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception Zhiyuan Zhao Hengrui Kang Bin Wang Conghui He 25 9 0 16 Oct 2024
big.LITTLE Vision Transformer for Efficient Visual Recognition He Guo Yulong Wang Zixuan Ye Jifeng Dai Yuwen Xiong ViT 50 0 0 14 Oct 2024
ET-Former: Efficient Triplane Deformable Attention for 3D Semantic Scene Completion From Monocular Camera Jing Liang He Yin Xuewei Qi Jong Jin Park Min Sun R. Madhivanan Dinesh Manocha 3DPC 27 0 0 14 Oct 2024
UnSeg: One Universal Unlearnable Example Generator is Enough against All Image Segmentation Ye Sun Hao Zhang Tiehua Zhang Xingjun Ma Yu-Gang Jiang VLM 32 3 0 13 Oct 2024
Ego3DT: Tracking Every 3D Object in Ego-centric Videos Shengyu Hao Wenhao Chai Zhonghan Zhao Meiqi Sun Wendi Hu ... Yixian Zhao Qi Li Yizhou Wang Xi Li Gaoang Wang 29 1 0 11 Oct 2024
Multi-Scale Deformable Transformers for Student Learning Behavior Detection in Smart Classroom Zhifeng Wang Minghui Wang Chunyan Zeng Longlong Li 24 1 0 10 Oct 2024
Boosting Few-Shot Detection with Large Language Models and Layout-to-Image Synthesis Ahmed Abdullah Nikolas Ebert Oliver Wasenmüller ObjD 25 1 0 09 Oct 2024
Improving Object Detection via Local-global Contrastive Learning Danai Triantafyllidou Sarah Parisot A. Leonardis Steven G. McDonagh 24 0 0 07 Oct 2024
Cross Resolution Encoding-Decoding For Detection Transformers Ashish Kumar Jaesik Park ViT 21 0 0 05 Oct 2024
3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection Yang Cao Yuanliang Jv Dan Xu 3DGS 29 3 0 02 Oct 2024
Saliency-Guided DETR for Moment Retrieval and Highlight Detection Aleksandr Gordeev Vladimir Dokholyan Irina Tolstykh Maksim Kuprashevich 21 4 0 02 Oct 2024
Pose Estimation of Buried Deep-Sea Objects using 3D Vision Deep Learning Models Jerry Yan Chinmay Talegaonkar Nicholas Antipa Eric Terrill Sophia Merrifield 3DPC 26 0 0 01 Oct 2024
Class-Agnostic Visio-Temporal Scene Sketch Semantic Segmentation Aleyna Kütük Tevfik Metin Sezgin 18 1 0 30 Sep 2024
Intelligent Fish Detection System with Similarity-Aware Transformer Shengchen Li Haobo Zuo Changhong Fu Zhiyong Wang Zhiqiang Xu ViT 23 0 0 28 Sep 2024
Embed and Emulate: Contrastive representations for simulation-based inference Ruoxi Jiang Peter Y. Lu Rebecca Willett 24 0 0 27 Sep 2024
You Only Speak Once to See Wenhao Yang Jianguo Wei Wenhuan Lu Lei Li VOS 23 1 0 27 Sep 2024
ZALM3: Zero-Shot Enhancement of Vision-Language Alignment via In-Context Information in Multi-Turn Multimodal Medical Dialogue Zhangpu Li Changhong Zou Suxue Ma Zhicheng Yang Chen Du ... Xingzhi Sun Jing Xiao Kai Zhang Mei Han Mei Han LM&MA 46 1 0 26 Sep 2024
Source-Free Domain Adaptation for YOLO Object Detection Simon Varailhon Masih Aminbeidokhti M. Pedersoli Eric Granger TTA ObjD 28 4 0 25 Sep 2024
General Detection-based Text Line Recognition Raphael Baena Syrine Kalleli Mathieu Aubry 83 0 0 25 Sep 2024
COCO-OLAC: A Benchmark for Occluded Panoptic Segmentation and Image Understanding Wenbo Wei Jun Wang Abhir Bhalerao 66 0 0 19 Sep 2024
TopoMaskV2: Enhanced Instance-Mask-Based Formulation for the Road Topology Problem M. E. Kalfaoglu H. Öztürk Ozsel Kilinc A. Temi̇zel 3DPC 27 2 0 17 Sep 2024
Hydra-SGG: Hybrid Relation Assignment for One-stage Scene Graph Generation Minghan Chen Guikun Chen Wenguan Wang Yi Yang 56 3 0 16 Sep 2024
OPUS: Occupancy Prediction Using a Sparse Set Jiabao Wang Zhaojiang Liu Qiang Meng Liujiang Yan Ke Wang Jie Yang Wei Liu Qibin Hou Ming-Ming Cheng 28 9 0 14 Sep 2024
Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection Haoxuan Wang Q. He Jinlong Peng Hao Yang Mingmin Chi Yabiao Wang Mamba 34 1 0 13 Sep 2024
RT-DETRv3: Real-time End-to-End Object Detection with Hierarchical Dense Positive Supervision Shuo Wang Chunlong Xia Feng Lv Yifeng Shi PINN ViT MU 31 2 0 13 Sep 2024
From COCO to COCO-FP: A Deep Dive into Background False Positives for COCO Detectors Longfei Liu Wen Guo S. Huang Cheng Li Xi Shen ObjD 25 0 0 12 Sep 2024
ENACT: Entropy-based Clustering of Attention Input for Improving the Computational Performance of Object Detection Transformers Giorgos Savathrakis Antonis Argyros ViT 13 0 0 11 Sep 2024
Advancing SEM Based Nano-Scale Defect Analysis in Semiconductor Manufacturing for Advanced IC Nodes Bappaditya Dey Matthias Monden Victor Blanco Sandip Halder S. de Gendt 23 0 0 06 Sep 2024
PitVis-2023 Challenge: Workflow Recognition in videos of Endoscopic Pituitary Surgery Adrito Das Danyal Z. Khan Dimitrios Psychogyios Yitong Zhang John G. Hanrahan ... Santiago Rodriguez Pablo Arbelaez Danail Stoyanov Hani J. Marcus Sophia Bano 29 5 0 02 Sep 2024
IAFI-FCOS: Intra- and across-layer feature interaction FCOS model for lesion detection of CT images Q. Guan Mengjie Pan Feng Chen Zhiqiang Yang Zhongwen Yu Qianwei Zhou Haigen Hu 25 0 0 01 Sep 2024
COMOGen: A Controllable Text-to-3D Multi-object Generation Framework Shaorong Sun Shuchao Pang Yazhou Yao Xiaoshui Huang 16 1 0 01 Sep 2024
LLaVA-SG: Leveraging Scene Graphs as Visual Semantic Expression in Vision-Language Models Jingyi Wang Jianzhong Ju Jian Luan Zhidong Deng VLM 25 1 0 29 Aug 2024
Center Direction Network for Grasping Point Localization on Cloths Domen Tabernik Jon Muhovič Matej Urbas Danijel Skočaj 3DPC 16 1 0 26 Aug 2024
LSM-YOLO: A Compact and Effective ROI Detector for Medical Detection Zhongwen Yu Q. Guan Jianmin Yang Zhiqiang Yang Qianwei Zhou Yang Chen Feng Chen 30 0 0 26 Aug 2024
CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities Tao Wu Yong Zhang Xintao Wang Xianpan Zhou Guangcong Zheng Zhongang Qi Ying Shan Xi Li VGen DiffM 24 26 0 23 Aug 2024
VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models Wentao Wu Fanghua Hong Xiao Wang Chenglong Li Jin Tang VLM 41 1 0 23 Aug 2024
OVA-Det: Open Vocabulary Aerial Object Detection with Image-Text Collaboration Guoting Wei Xia Yuan Yu Liu Zhenhao Shang Kelu Yao Peng Wang Qingsen Yan Chunxia Zhao Haokui Zhang Rong Xiao VLM ObjD 41 1 0 22 Aug 2024
On the Potential of Open-Vocabulary Models for Object Detection in Unusual Street Scenes Sadia Ilyas Ido Freeman Matthias Rottmann ObjD 43 3 0 20 Aug 2024
PADetBench: Towards Benchmarking Physical Attacks against Object Detection Jiawei Lian Jianhong Pan Lefan Wang Yi Wang Lap-Pui Chau Shaohui Mei AAML 31 0 0 17 Aug 2024
Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community Jiancheng Pan Yanxing Liu Yuqian Fu Muyuan Ma Jiaohao Li D. Paudel Luc Van Gool Xiaomeng Huang ObjD 61 7 0 17 Aug 2024
DPDETR: Decoupled Position Detection Transformer for Infrared-Visible Object Detection Junjie Guo Chenqiang Gao Fangcen Liu Deyu Meng ViT 32 1 0 12 Aug 2024
MV2DFusion: Leveraging Modality-Specific Object Semantics for Multi-Modal 3D Detection Zitian Wang Zehao Huang Yulu Gao Naiyan Wang Si Liu 3DPC 35 3 0 12 Aug 2024
U-DECN: End-to-End Underwater Object Detection ConvNet with Improved DeNoising Training Zhuoyan Liu Bo Wang Ye Li ViT 27 0 0 11 Aug 2024
Embodied Uncertainty-Aware Object Segmentation Xiaolin Fang Leslie Pack Kaelbling Tomás Lozano-Pérez 19 5 0 08 Aug 2024