Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2005.12872
Cited By
End-to-End Object Detection with Transformers
26 May 2020
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
ViT
3DV
PINN
Re-assign community
ArXiv
PDF
HTML
Papers citing
"End-to-End Object Detection with Transformers"
50 / 1,216 papers shown
Title
Semantic Line Combination Detector
Jinwon Ko
Dongkwon Jin
Chang-Su Kim
24
0
0
29 Apr 2024
Reliable Student: Addressing Noise in Semi-Supervised 3D Object Detection
Farzad Nozarian
Shashank Agarwal
Farzaneh Rezaeianaran
Danish Shahzad
Atanas Poibrenski
Christian Müller
P. Slusallek
NoLa
29
3
0
27 Apr 2024
Two in One Go: Single-stage Emotion Recognition with Decoupled Subject-context Transformer
Xinpeng Li
Teng Wang
Jian Zhao
Shuyi Mao
Jinbao Wang
Feng Zheng
Xiaojiang Peng
Xuelong Li
21
1
0
26 Apr 2024
Sparse Reconstruction of Optical Doppler Tomography with Alternative State Space Model and Attention
Zhenghong Li
Jiaxiang Ren
Wensheng Cheng
C. Du
Yingtian Pan
Haibin Ling
48
0
0
26 Apr 2024
Detection of Peri-Pancreatic Edema using Deep Learning and Radiomics Techniques
Ziliang Hong
Debesh Jha
Koushik Biswas
Zheyu Zhang
Yury Velichko
...
Amir Borhani
B. Turkbey
A. Medetalibeyoğlu
Gorkem Durak
Ulas Bagci
MedIm
20
0
0
25 Apr 2024
Constellation Dataset: Benchmarking High-Altitude Object Detection for an Urban Intersection
Mehmet Kerem Turkcan
Sanjeev Narasimhan
Chengbo Zang
Gyung Hyun Je
Bo Yu
Mahshid Ghasemi
Javad Ghaderi
Gil Zussman
Z. Kostić
31
2
0
25 Apr 2024
Progressive Token Length Scaling in Transformer Encoders for Efficient Universal Segmentation
Abhishek Aich
Yumin Suh
S. Schulter
Manmohan Chandraker
51
0
0
23 Apr 2024
Surgical-DeSAM: Decoupling SAM for Instrument Segmentation in Robotic Surgery
Yuyang Sheng
Sophia Bano
Matthew J. Clarkson
Mobarakol Islam
30
6
0
22 Apr 2024
Towards Robust Ferrous Scrap Material Classification with Deep Learning and Conformal Prediction
Paulo Henrique dos Santos
Valéria de Carvalho Santos
Eduardo José da Silva Luz
14
0
0
19 Apr 2024
TC-OCR: TableCraft OCR for Efficient Detection & Recognition of Table Structure & Content
Avinash Anand
Raj Jaiswal
Pijush Bhuyan
Mohit Gupta
Siddhesh Bangar
Md. Modassir Imam
R. Shah
Shiníchi Satoh
LMTD
27
3
0
16 Apr 2024
GazeHTA: End-to-end Gaze Target Detection with Head-Target Association
Zhi-Yi Lin
Jouh Yeong Chew
J. C. V. Gemert
Xucong Zhang
36
1
0
16 Apr 2024
Task-Driven Exploration: Decoupling and Inter-Task Feedback for Joint Moment Retrieval and Highlight Detection
Jin Yang
Ping Wei
Huan Li
Ziyang Ren
35
8
0
14 Apr 2024
Arena: A Patch-of-Interest ViT Inference Acceleration System for Edge-Assisted Video Analytics
Haosong Peng
Wei Feng
Hao Li
Yufeng Zhan
Qihua Zhou
Yuanqing Xia
21
2
0
14 Apr 2024
Enhancing Visual Question Answering through Question-Driven Image Captions as Prompts
Övgü Özdemir
Erdem Akagündüz
31
9
0
12 Apr 2024
AlignZeg: Mitigating Objective Misalignment for Zero-shot Semantic Segmentation
Jiannan Ge
Lingxi Xie
Hongtao Xie
Pandeng Li
Xiaopeng Zhang
Yongdong Zhang
Qi Tian
VLM
16
3
0
08 Apr 2024
DQ-DETR: DETR with Dynamic Query for Tiny Object Detection
Yi-Xin Huang
Hou-I Liu
Hong-Han Shuai
Wen-Huang Cheng
26
14
0
04 Apr 2024
Effective Lymph Nodes Detection in CT Scans Using Location Debiased Query Selection and Contrastive Query Representation in Transformer
Qinji Yu
Yirui Wang
K. Yan
Haoshen Li
Dazhou Guo
...
Na Shen
Qifeng Wang
Xiaowei Ding
X. Ye
Dakai Jin
MedIm
66
2
0
04 Apr 2024
Roadside Monocular 3D Detection via 2D Detection Prompting
Yechi Ma
Shuoquan Wei
Churun Zhang
Wei Hua
Yanan Li
Shu Kong
31
0
0
01 Apr 2024
Enhancing Efficiency in Vision Transformer Networks: Design Techniques and Insights
Moein Heidari
Reza Azad
Sina Ghorbani Kolahi
René Arimond
Leon Niggemeier
...
Afshin Bozorgpour
Ehsan Khodapanah Aghdam
A. Kazerouni
I. Hacihaliloglu
Dorit Merhof
41
7
0
28 Mar 2024
Tiny Models are the Computational Saver for Large Models
Qingyuan Wang
B. Cardiff
Antoine Frappé
Benoît Larras
Deepu John
24
2
0
26 Mar 2024
Multiple Object Tracking as ID Prediction
Ruopeng Gao
Yijun Zhang
Limin Wang
43
12
0
25 Mar 2024
Edit3K: Universal Representation Learning for Video Editing Components
Xin Gu
Libo Zhang
Fan Chen
Longyin Wen
Yufei Wang
Tiejian Luo
Sijie Zhu
35
4
0
24 Mar 2024
Cross-domain Multi-modal Few-shot Object Detection via Rich Text
Zeyu Shangguan
Daniel Seita
Mohammad Rostami
ObjD
45
1
0
24 Mar 2024
FaceXFormer: A Unified Transformer for Facial Analysis
Kartik Narayan
VS Vibashan
Rama Chellappa
Vishal M. Patel
ViT
57
12
0
19 Mar 2024
GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection
Ziying Song
Lei Yang
Shaoqing Xu
Lin Liu
Dongyang Xu
Caiyan Jia
Feiyang Jia
Li-e Wang
3DPC
60
13
0
18 Mar 2024
Align and Distill: Unifying and Improving Domain Adaptive Object Detection
Justin Kay
T. Haucke
Suzanne Stathatos
Siqi Deng
Erik Young
Pietro Perona
Sara Beery
Grant Van Horn
51
4
0
18 Mar 2024
Real-Time Image Segmentation via Hybrid Convolutional-Transformer Architecture Search
Hongyuan Yu
Cheng Wan
Mengchen Liu
Dongdong Chen
Bin Xiao
Xiyang Dai
Yan Huang
Yuan Lu
Liang Wang
71
5
0
15 Mar 2024
Unleashing HyDRa: Hybrid Fusion, Depth Consistency and Radar for Unified 3D Perception
Philipp Wolters
Johannes Gilg
Torben Teepe
Fabian Herzog
Anouar Laouichi
Martin Hofmann
Gerhard Rigoll
MDE
60
12
0
12 Mar 2024
Segmentation Guided Sparse Transformer for Under-Display Camera Image Restoration
Jingyun Xue
Tao Wang
Jun Wang
Kaihao Zhang
ViT
43
2
0
09 Mar 2024
Med3DInsight: Enhancing 3D Medical Image Understanding with 2D Multi-Modal Large Language Models
Qiuhui Chen
Huping Ye
Yi Hong
MedIm
30
1
0
08 Mar 2024
LORS: Low-rank Residual Structure for Parameter-Efficient Network Stacking
Jialin Li
Qiang Nie
Weifu Fu
Yuhuan Lin
Guangpin Tao
Yong-Jin Liu
Chengjie Wang
25
4
0
07 Mar 2024
Adversarial Infrared Geometry: Using Geometry to Perform Adversarial Attack against Infrared Pedestrian Detectors
Kalibinuer Tiliwalidi
AAML
43
0
0
06 Mar 2024
Continual Segmentation with Disentangled Objectness Learning and Class Recognition
Yizheng Gong
Siyue Yu
Xiaoyang Wang
Jimin Xiao
CLL
25
5
0
06 Mar 2024
Detecting Concrete Visual Tokens for Multimodal Machine Translation
Braeden Bowen
Vipin Vijayan
Scott Grigsby
Timothy Anderson
Jeremy Gwinnup
21
2
0
05 Mar 2024
Non-autoregressive Sequence-to-Sequence Vision-Language Models
Kunyu Shi
Qi Dong
Luis Goncalves
Zhuowen Tu
Stefano Soatto
VLM
35
3
0
04 Mar 2024
End-to-End Human Instance Matting
Qinglin Liu
Shengping Zhang
Quanling Meng
Bineng Zhong
Peiqiang Liu
H. Yao
3DH
33
5
0
03 Mar 2024
Genie: Smart ROS-based Caching for Connected Autonomous Robots
Zexin Li
Soroush Bateni
Cong Liu
27
1
0
29 Feb 2024
Generalizable Whole Slide Image Classification with Fine-Grained Visual-Semantic Interaction
Hao Li
Ying Chen
Yifei Chen
Wenxian Yang
Bowen Ding
Yuchen Han
Liansheng Wang
Rongshan Yu
31
15
0
29 Feb 2024
EAN-MapNet: Efficient Vectorized HD Map Construction with Anchor Neighborhoods
Huiyuan Xiong
Jun Shen
Taohong Zhu
Yuelong Pan
22
3
0
28 Feb 2024
UniVS: Unified and Universal Video Segmentation with Prompts as Queries
Ming-hui Li
Shuai Li
Xindong Zhang
Lei Zhang
VOS
33
16
0
28 Feb 2024
AutoMMLab: Automatically Generating Deployable Models from Language Instructions for Computer Vision Tasks
Zekang Yang
Wang Zeng
Sheng Jin
Chao Qian
Ping Luo
Wentao Liu
MLLM
VLM
46
8
0
23 Feb 2024
High-Speed Detector For Low-Powered Devices In Aerial Grasping
Ashish Kumar
Laxmidhar Behera
25
2
0
22 Feb 2024
NeRF-Det++: Incorporating Semantic Cues and Perspective-aware Depth Supervision for Indoor Multi-View 3D Detection
Chenxi Huang
Yuenan Hou
Weicai Ye
Di Huang
Xiaoshui Huang
Binbin Lin
Deng Cai
Wanli Ouyang
3DV
3DPC
MDE
29
12
0
22 Feb 2024
YOLO-TLA: An Efficient and Lightweight Small Object Detection Model based on YOLOv5
Peng Gao
Chun-Lin Ji
Tao Yu
Ruyue Yuan
ObjD
29
34
0
22 Feb 2024
Synthesizing Knowledge-enhanced Features for Real-world Zero-shot Food Detection
Pengfei Zhou
Weiqing Min
Jiajun Song
Yang Zhang
Shuqiang Jiang
22
10
0
14 Feb 2024
ClusterTabNet: Supervised clustering method for table detection and table structure recognition
Marek Polewczyk
Marco Spinaci
LMTD
19
0
0
12 Feb 2024
Domain Adaptable Fine-Tune Distillation Framework For Advancing Farm Surveillance
Raza Imam
Muhammad Huzaifa
Nabil Mansour
Shaher Bano Mirza
Fouad Lamghari
20
0
0
10 Feb 2024
CurveFormer++: 3D Lane Detection by Curve Propagation with Temporal Curve Queries and Attention
Yifeng Bai
Zhirong Chen
Pengpeng Liang
Erkang Cheng
Erkang Cheng
ViT
20
8
0
09 Feb 2024
Text-to-Code Generation with Modality-relative Pre-training
Fenia Christopoulou
Guchun Zhang
Gerasimos Lampouras
AI4TS
13
1
0
08 Feb 2024
RepQuant: Towards Accurate Post-Training Quantization of Large Transformer Models via Scale Reparameterization
Zhikai Li
Xuewen Liu
Jing Zhang
Qingyi Gu
MQ
32
7
0
08 Feb 2024
Previous
1
2
3
...
5
6
7
...
23
24
25
Next