ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.03605
  4. Cited By
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object
  Detection

DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection

7 March 2022
Hao Zhang
Feng Li
Shilong Liu
Lei Zhang
Hang Su
Jun Zhu
L. Ni
H. Shum
    ViT
ArXivPDFHTML

Papers citing "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"

50 / 716 papers shown
Title
ViMoE: An Empirical Study of Designing Vision Mixture-of-Experts
ViMoE: An Empirical Study of Designing Vision Mixture-of-Experts
Xumeng Han
Longhui Wei
Zhiyang Dou
Zipeng Wang
Chenhui Qiang
Xin He
Yingfei Sun
Zhenjun Han
Qi Tian
MoE
33
3
0
21 Oct 2024
A Survey of Hallucination in Large Visual Language Models
A Survey of Hallucination in Large Visual Language Models
Wei Lan
Wenyi Chen
Qingfeng Chen
Shirui Pan
Huiyu Zhou
Yi-Lun Pan
LRM
28
4
0
20 Oct 2024
Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object
  Detection Considering Text Describability
Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text Describability
Yusuke Hosoya
Masanori Suganuma
Takayuki Okatani
ObjD
16
0
0
20 Oct 2024
D-FINE: Redefine Regression Task in DETRs as Fine-grained Distribution
  Refinement
D-FINE: Redefine Regression Task in DETRs as Fine-grained Distribution Refinement
Yansong Peng
Hebei Li
Peixi Wu
Yueyi Zhang
X. Sun
Feng Wu
34
13
0
17 Oct 2024
OAH-Net: A Deep Neural Network for Hologram Reconstruction of Off-axis
  Digital Holographic Microscope
OAH-Net: A Deep Neural Network for Hologram Reconstruction of Off-axis Digital Holographic Microscope
Wei Liu
Kerem Delikoyun
Qianyu Chen
Alperen Yildiz
Si Ko Myo
Win Sen Kuan
John Tshon Yit Soong
Matthew Edward Cove
Oliver Hayden
Hweekuan Lee
21
0
0
17 Oct 2024
VividMed: Vision Language Model with Versatile Visual Grounding for
  Medicine
VividMed: Vision Language Model with Versatile Visual Grounding for Medicine
Lingxiao Luo
Bingda Tang
Xuanzhong Chen
Rong Han
Ting Chen
VLM
21
2
0
16 Oct 2024
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse
  Synthetic Data and Global-to-Local Adaptive Perception
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
Zhiyuan Zhao
Hengrui Kang
Bin Wang
Conghui He
25
9
0
16 Oct 2024
big.LITTLE Vision Transformer for Efficient Visual Recognition
big.LITTLE Vision Transformer for Efficient Visual Recognition
He Guo
Yulong Wang
Zixuan Ye
Jifeng Dai
Yuwen Xiong
ViT
50
0
0
14 Oct 2024
ET-Former: Efficient Triplane Deformable Attention for 3D Semantic Scene Completion From Monocular Camera
ET-Former: Efficient Triplane Deformable Attention for 3D Semantic Scene Completion From Monocular Camera
Jing Liang
He Yin
Xuewei Qi
Jong Jin Park
Min Sun
R. Madhivanan
Dinesh Manocha
3DPC
27
0
0
14 Oct 2024
UnSeg: One Universal Unlearnable Example Generator is Enough against All
  Image Segmentation
UnSeg: One Universal Unlearnable Example Generator is Enough against All Image Segmentation
Ye Sun
Hao Zhang
Tiehua Zhang
Xingjun Ma
Yu-Gang Jiang
VLM
32
3
0
13 Oct 2024
Ego3DT: Tracking Every 3D Object in Ego-centric Videos
Ego3DT: Tracking Every 3D Object in Ego-centric Videos
Shengyu Hao
Wenhao Chai
Zhonghan Zhao
Meiqi Sun
Wendi Hu
...
Yixian Zhao
Qi Li
Yizhou Wang
Xi Li
Gaoang Wang
29
1
0
11 Oct 2024
Multi-Scale Deformable Transformers for Student Learning Behavior
  Detection in Smart Classroom
Multi-Scale Deformable Transformers for Student Learning Behavior Detection in Smart Classroom
Zhifeng Wang
Minghui Wang
Chunyan Zeng
Longlong Li
24
1
0
10 Oct 2024
Boosting Few-Shot Detection with Large Language Models and
  Layout-to-Image Synthesis
Boosting Few-Shot Detection with Large Language Models and Layout-to-Image Synthesis
Ahmed Abdullah
Nikolas Ebert
Oliver Wasenmüller
ObjD
25
1
0
09 Oct 2024
Improving Object Detection via Local-global Contrastive Learning
Improving Object Detection via Local-global Contrastive Learning
Danai Triantafyllidou
Sarah Parisot
A. Leonardis
Steven G. McDonagh
24
0
0
07 Oct 2024
Cross Resolution Encoding-Decoding For Detection Transformers
Cross Resolution Encoding-Decoding For Detection Transformers
Ashish Kumar
Jaesik Park
ViT
21
0
0
05 Oct 2024
3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and
  Box-Focused Sampling for 3D Object Detection
3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection
Yang Cao
Yuanliang Jv
Dan Xu
3DGS
29
3
0
02 Oct 2024
Saliency-Guided DETR for Moment Retrieval and Highlight Detection
Saliency-Guided DETR for Moment Retrieval and Highlight Detection
Aleksandr Gordeev
Vladimir Dokholyan
Irina Tolstykh
Maksim Kuprashevich
21
4
0
02 Oct 2024
Pose Estimation of Buried Deep-Sea Objects using 3D Vision Deep Learning
  Models
Pose Estimation of Buried Deep-Sea Objects using 3D Vision Deep Learning Models
Jerry Yan
Chinmay Talegaonkar
Nicholas Antipa
Eric Terrill
Sophia Merrifield
3DPC
26
0
0
01 Oct 2024
Class-Agnostic Visio-Temporal Scene Sketch Semantic Segmentation
Class-Agnostic Visio-Temporal Scene Sketch Semantic Segmentation
Aleyna Kütük
Tevfik Metin Sezgin
18
1
0
30 Sep 2024
Intelligent Fish Detection System with Similarity-Aware Transformer
Intelligent Fish Detection System with Similarity-Aware Transformer
Shengchen Li
Haobo Zuo
Changhong Fu
Zhiyong Wang
Zhiqiang Xu
ViT
23
0
0
28 Sep 2024
Embed and Emulate: Contrastive representations for simulation-based
  inference
Embed and Emulate: Contrastive representations for simulation-based inference
Ruoxi Jiang
Peter Y. Lu
Rebecca Willett
24
0
0
27 Sep 2024
You Only Speak Once to See
You Only Speak Once to See
Wenhao Yang
Jianguo Wei
Wenhuan Lu
Lei Li
VOS
23
1
0
27 Sep 2024
ZALM3: Zero-Shot Enhancement of Vision-Language Alignment via In-Context
  Information in Multi-Turn Multimodal Medical Dialogue
ZALM3: Zero-Shot Enhancement of Vision-Language Alignment via In-Context Information in Multi-Turn Multimodal Medical Dialogue
Zhangpu Li
Changhong Zou
Suxue Ma
Zhicheng Yang
Chen Du
...
Xingzhi Sun
Jing Xiao
Kai Zhang
Mei Han
Mei Han
LM&MA
46
1
0
26 Sep 2024
Source-Free Domain Adaptation for YOLO Object Detection
Source-Free Domain Adaptation for YOLO Object Detection
Simon Varailhon
Masih Aminbeidokhti
M. Pedersoli
Eric Granger
TTA
ObjD
28
4
0
25 Sep 2024
General Detection-based Text Line Recognition
General Detection-based Text Line Recognition
Raphael Baena
Syrine Kalleli
Mathieu Aubry
83
0
0
25 Sep 2024
COCO-OLAC: A Benchmark for Occluded Panoptic Segmentation and Image Understanding
COCO-OLAC: A Benchmark for Occluded Panoptic Segmentation and Image Understanding
Wenbo Wei
Jun Wang
Abhir Bhalerao
66
0
0
19 Sep 2024
TopoMaskV2: Enhanced Instance-Mask-Based Formulation for the Road
  Topology Problem
TopoMaskV2: Enhanced Instance-Mask-Based Formulation for the Road Topology Problem
M. E. Kalfaoglu
H. Öztürk
Ozsel Kilinc
A. Temi̇zel
3DPC
27
2
0
17 Sep 2024
Hydra-SGG: Hybrid Relation Assignment for One-stage Scene Graph Generation
Hydra-SGG: Hybrid Relation Assignment for One-stage Scene Graph Generation
Minghan Chen
Guikun Chen
Wenguan Wang
Yi Yang
56
3
0
16 Sep 2024
OPUS: Occupancy Prediction Using a Sparse Set
OPUS: Occupancy Prediction Using a Sparse Set
Jiabao Wang
Zhaojiang Liu
Qiang Meng
Liujiang Yan
Ke Wang
Jie Yang
Wei Liu
Qibin Hou
Ming-Ming Cheng
28
9
0
14 Sep 2024
Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary
  Detection
Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection
Haoxuan Wang
Q. He
Jinlong Peng
Hao Yang
Mingmin Chi
Yabiao Wang
Mamba
34
1
0
13 Sep 2024
RT-DETRv3: Real-time End-to-End Object Detection with Hierarchical Dense
  Positive Supervision
RT-DETRv3: Real-time End-to-End Object Detection with Hierarchical Dense Positive Supervision
Shuo Wang
Chunlong Xia
Feng Lv
Yifeng Shi
PINN
ViT
MU
31
2
0
13 Sep 2024
From COCO to COCO-FP: A Deep Dive into Background False Positives for
  COCO Detectors
From COCO to COCO-FP: A Deep Dive into Background False Positives for COCO Detectors
Longfei Liu
Wen Guo
S. Huang
Cheng Li
Xi Shen
ObjD
25
0
0
12 Sep 2024
ENACT: Entropy-based Clustering of Attention Input for Improving the
  Computational Performance of Object Detection Transformers
ENACT: Entropy-based Clustering of Attention Input for Improving the Computational Performance of Object Detection Transformers
Giorgos Savathrakis
Antonis Argyros
ViT
13
0
0
11 Sep 2024
Advancing SEM Based Nano-Scale Defect Analysis in Semiconductor
  Manufacturing for Advanced IC Nodes
Advancing SEM Based Nano-Scale Defect Analysis in Semiconductor Manufacturing for Advanced IC Nodes
Bappaditya Dey
Matthias Monden
Victor Blanco
Sandip Halder
S. de Gendt
23
0
0
06 Sep 2024
PitVis-2023 Challenge: Workflow Recognition in videos of Endoscopic
  Pituitary Surgery
PitVis-2023 Challenge: Workflow Recognition in videos of Endoscopic Pituitary Surgery
Adrito Das
Danyal Z. Khan
Dimitrios Psychogyios
Yitong Zhang
John G. Hanrahan
...
Santiago Rodriguez
Pablo Arbelaez
Danail Stoyanov
Hani J. Marcus
Sophia Bano
29
5
0
02 Sep 2024
IAFI-FCOS: Intra- and across-layer feature interaction FCOS model for
  lesion detection of CT images
IAFI-FCOS: Intra- and across-layer feature interaction FCOS model for lesion detection of CT images
Q. Guan
Mengjie Pan
Feng Chen
Zhiqiang Yang
Zhongwen Yu
Qianwei Zhou
Haigen Hu
25
0
0
01 Sep 2024
COMOGen: A Controllable Text-to-3D Multi-object Generation Framework
COMOGen: A Controllable Text-to-3D Multi-object Generation Framework
Shaorong Sun
Shuchao Pang
Yazhou Yao
Xiaoshui Huang
16
1
0
01 Sep 2024
LLaVA-SG: Leveraging Scene Graphs as Visual Semantic Expression in
  Vision-Language Models
LLaVA-SG: Leveraging Scene Graphs as Visual Semantic Expression in Vision-Language Models
Jingyi Wang
Jianzhong Ju
Jian Luan
Zhidong Deng
VLM
25
1
0
29 Aug 2024
Center Direction Network for Grasping Point Localization on Cloths
Center Direction Network for Grasping Point Localization on Cloths
Domen Tabernik
Jon Muhovič
Matej Urbas
Danijel Skočaj
3DPC
16
1
0
26 Aug 2024
LSM-YOLO: A Compact and Effective ROI Detector for Medical Detection
LSM-YOLO: A Compact and Effective ROI Detector for Medical Detection
Zhongwen Yu
Q. Guan
Jianmin Yang
Zhiqiang Yang
Qianwei Zhou
Yang Chen
Feng Chen
30
0
0
26 Aug 2024
CustomCrafter: Customized Video Generation with Preserving Motion and
  Concept Composition Abilities
CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities
Tao Wu
Yong Zhang
Xintao Wang
Xianpan Zhou
Guangcong Zheng
Zhongang Qi
Ying Shan
Xi Li
VGen
DiffM
24
26
0
23 Aug 2024
VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation
  Models
VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models
Wentao Wu
Fanghua Hong
Xiao Wang
Chenglong Li
Jin Tang
VLM
41
1
0
23 Aug 2024
OVA-Det: Open Vocabulary Aerial Object Detection with Image-Text Collaboration
OVA-Det: Open Vocabulary Aerial Object Detection with Image-Text Collaboration
Guoting Wei
Xia Yuan
Yu Liu
Zhenhao Shang
Kelu Yao
Peng Wang
Qingsen Yan
Chunxia Zhao
Haokui Zhang
Rong Xiao
VLM
ObjD
41
1
0
22 Aug 2024
On the Potential of Open-Vocabulary Models for Object Detection in
  Unusual Street Scenes
On the Potential of Open-Vocabulary Models for Object Detection in Unusual Street Scenes
Sadia Ilyas
Ido Freeman
Matthias Rottmann
ObjD
43
3
0
20 Aug 2024
PADetBench: Towards Benchmarking Physical Attacks against Object
  Detection
PADetBench: Towards Benchmarking Physical Attacks against Object Detection
Jiawei Lian
Jianhong Pan
Lefan Wang
Yi Wang
Lap-Pui Chau
Shaohui Mei
AAML
31
0
0
17 Aug 2024
Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community
Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community
Jiancheng Pan
Yanxing Liu
Yuqian Fu
Muyuan Ma
Jiaohao Li
D. Paudel
Luc Van Gool
Xiaomeng Huang
ObjD
61
7
0
17 Aug 2024
DPDETR: Decoupled Position Detection Transformer for Infrared-Visible
  Object Detection
DPDETR: Decoupled Position Detection Transformer for Infrared-Visible Object Detection
Junjie Guo
Chenqiang Gao
Fangcen Liu
Deyu Meng
ViT
32
1
0
12 Aug 2024
MV2DFusion: Leveraging Modality-Specific Object Semantics for
  Multi-Modal 3D Detection
MV2DFusion: Leveraging Modality-Specific Object Semantics for Multi-Modal 3D Detection
Zitian Wang
Zehao Huang
Yulu Gao
Naiyan Wang
Si Liu
3DPC
35
3
0
12 Aug 2024
U-DECN: End-to-End Underwater Object Detection ConvNet with Improved
  DeNoising Training
U-DECN: End-to-End Underwater Object Detection ConvNet with Improved DeNoising Training
Zhuoyan Liu
Bo Wang
Ye Li
ViT
27
0
0
11 Aug 2024
Embodied Uncertainty-Aware Object Segmentation
Embodied Uncertainty-Aware Object Segmentation
Xiaolin Fang
Leslie Pack Kaelbling
Tomás Lozano-Pérez
19
5
0
08 Aug 2024
Previous
12345...131415
Next