ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.12872
  4. Cited By
End-to-End Object Detection with Transformers

End-to-End Object Detection with Transformers

26 May 2020
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
    ViT
    3DV
    PINN
ArXivPDFHTML

Papers citing "End-to-End Object Detection with Transformers"

50 / 1,194 papers shown
Title
PointSAM: Pointly-Supervised Segment Anything Model for Remote Sensing Images
PointSAM: Pointly-Supervised Segment Anything Model for Remote Sensing Images
Nanqing Liu
Xun Xu
Yongyi Su
Haojie Zhang
Heng-Chao Li
VLM
33
14
0
20 Sep 2024
Generating Visual Stories with Grounded and Coreferent Characters
Generating Visual Stories with Grounded and Coreferent Characters
Danyang Liu
Mirella Lapata
Frank Keller
15
2
0
20 Sep 2024
COCO-OLAC: A Benchmark for Occluded Panoptic Segmentation and Image Understanding
COCO-OLAC: A Benchmark for Occluded Panoptic Segmentation and Image Understanding
Wenbo Wei
Jun Wang
Abhir Bhalerao
48
0
0
19 Sep 2024
End-to-end Open-vocabulary Video Visual Relationship Detection using Multi-modal Prompting
End-to-end Open-vocabulary Video Visual Relationship Detection using Multi-modal Prompting
Yongqi Wang
Xinxiao Wu
Shuo Yang
Jiebo Luo
53
1
0
19 Sep 2024
Multi-OCT-SelfNet: Integrating Self-Supervised Learning with
  Multi-Source Data Fusion for Enhanced Multi-Class Retinal Disease
  Classification
Multi-OCT-SelfNet: Integrating Self-Supervised Learning with Multi-Source Data Fusion for Enhanced Multi-Class Retinal Disease Classification
Fatema Jannat
Sina Gholami
Jennifer I. Lim
Theodore Leng
Minhaj Nur Alam
Hamed Tabkhi
28
0
0
17 Sep 2024
Hydra-SGG: Hybrid Relation Assignment for One-stage Scene Graph Generation
Hydra-SGG: Hybrid Relation Assignment for One-stage Scene Graph Generation
Minghan Chen
Guikun Chen
Wenguan Wang
Yi Yang
56
3
0
16 Sep 2024
Locality-aware Cross-modal Correspondence Learning for Dense Audio-Visual Events Localization
Locality-aware Cross-modal Correspondence Learning for Dense Audio-Visual Events Localization
Ling Xing
Hongyu Qu
Rui Yan
Xiangbo Shu
Jinhui Tang
45
0
0
12 Sep 2024
Relevance for Human Robot Collaboration
Relevance for Human Robot Collaboration
Xiaotong Zhang
Dingcheng Huang
Kamal Youcef-Toumi
33
2
0
12 Sep 2024
When to Extract ReID Features: A Selective Approach for Improved
  Multiple Object Tracking
When to Extract ReID Features: A Selective Approach for Improved Multiple Object Tracking
Emirhan Bayar
Cemal Aker
VOT
40
0
0
10 Sep 2024
Driving with Prior Maps: Unified Vector Prior Encoding for Autonomous
  Vehicle Mapping
Driving with Prior Maps: Unified Vector Prior Encoding for Autonomous Vehicle Mapping
Shuang Zeng
Xinyuan Chang
Xinran Liu
Zheng Pan
Xing Wei
35
1
0
09 Sep 2024
PdfTable: A Unified Toolkit for Deep Learning-Based Table Extraction
PdfTable: A Unified Toolkit for Deep Learning-Based Table Extraction
Lei Sheng
Shuai-Shuai Xu
LMTD
27
0
0
08 Sep 2024
Unleashing the Power of Generic Segmentation Models: A Simple Baseline
  for Infrared Small Target Detection
Unleashing the Power of Generic Segmentation Models: A Simple Baseline for Infrared Small Target Detection
Mingjin Zhang
Chi Zhang
Qiming Zhang
Yunsong Li
Xinbo Gao
Jing Zhang
VLM
30
3
0
07 Sep 2024
Introducing Gating and Context into Temporal Action Detection
Introducing Gating and Context into Temporal Action Detection
Aglind Reka
Diana Laura Borza
Dominick Reilly
Michal Balazia
Francois Bremond
15
0
0
06 Sep 2024
Context is the Key: Backdoor Attacks for In-Context Learning with Vision
  Transformers
Context is the Key: Backdoor Attacks for In-Context Learning with Vision Transformers
Gorka Abad
S. Picek
Lorenzo Cavallaro
A. Urbieta
SILM
37
0
0
06 Sep 2024
TG-LMM: Enhancing Medical Image Segmentation Accuracy through
  Text-Guided Large Multi-Modal Model
TG-LMM: Enhancing Medical Image Segmentation Accuracy through Text-Guided Large Multi-Modal Model
Yihao Zhao
Enhao Zhong
Cuiyun Yuan
Yang Li
Man Zhao
Chunxia Li
Jun Hu
Chenbin Liu
VLM
MedIm
28
0
0
05 Sep 2024
iSeg: An Iterative Refinement-based Framework for Training-free
  Segmentation
iSeg: An Iterative Refinement-based Framework for Training-free Segmentation
Lin Sun
Jiale Cao
J. Xie
F. Khan
Yanwei Pang
DiffM
30
1
0
05 Sep 2024
Boundless: Generating Photorealistic Synthetic Data for Object Detection
  in Urban Streetscapes
Boundless: Generating Photorealistic Synthetic Data for Object Detection in Urban Streetscapes
Mehmet Kerem Turkcan
Yuyang Li
Chengbo Zang
Javad Ghaderi
Gil Zussman
Z. Kostić
27
1
0
04 Sep 2024
iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation
iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation
Hayeon Jo
Hyesong Choi
Minhee Cho
Dongbo Min
34
1
0
04 Sep 2024
A Simple and Generalist Approach for Panoptic Segmentation
A Simple and Generalist Approach for Panoptic Segmentation
Nedyalko Prisadnikov
Wouter Van Gansbeke
Danda Pani Paudel
Luc Van Gool
VLM
38
0
0
29 Aug 2024
FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text Spotting
FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text Spotting
Alloy Das
Sanket Biswas
Umapada Pal
Josep Lladós
Saumik Bhattacharya
52
2
0
27 Aug 2024
D&M: Enriching E-commerce Videos with Sound Effects by Key Moment Detection and SFX Matching
D&M: Enriching E-commerce Videos with Sound Effects by Key Moment Detection and SFX Matching
Jingyu Liu
Minquan Wang
Ye Ma
Bo Wang
Aozhu Chen
Quan Chen
Peng Jiang
Xirong Li
38
1
0
23 Aug 2024
Costal Cartilage Segmentation with Topology Guided Deformable Mamba:
  Method and Benchmark
Costal Cartilage Segmentation with Topology Guided Deformable Mamba: Method and Benchmark
Senmao Wang
Haifan Gong
Runmeng Cui
Boyao Wan
Yicheng Liu
...
Haiqing Yang
Jingyang Zhou
Bo Pan
Lin Lin
Haiyue Jiang
18
0
0
14 Aug 2024
Sampling Foundational Transformer: A Theoretical Perspective
Sampling Foundational Transformer: A Theoretical Perspective
Viet Anh Nguyen
Minh Lenhat
Khoa Nguyen
Duong Duc Hieu
Dao Huu Hung
Truong Son-Hy
42
0
0
11 Aug 2024
MacFormer: Semantic Segmentation with Fine Object Boundaries
MacFormer: Semantic Segmentation with Fine Object Boundaries
Guoan Xu
Wenfeng Huang
Tao Wu
Ligeng Chen
Wenjing Jia
Guangwei Gao
Xiatian Zhu
Stuart W. Perry
29
0
0
11 Aug 2024
Modeling Electromagnetic Signal Injection Attacks on Camera-based Smart
  Systems: Applications and Mitigation
Modeling Electromagnetic Signal Injection Attacks on Camera-based Smart Systems: Applications and Mitigation
Youqian Zhang
Michael Cheung
Chunxi Yang
Xinwei Zhai
Zitong Shen
Xinyu Ji
Eugene Y. Fu
Sze-Yiu Chau
Xiapu Luo
AAML
33
1
0
09 Aug 2024
SwinShadow: Shifted Window for Ambiguous Adjacent Shadow Detection
SwinShadow: Shifted Window for Ambiguous Adjacent Shadow Detection
Yonghui Wang
Shaokai Liu
Li Li
Wengang Zhou
Houqiang Li
ViT
32
1
0
07 Aug 2024
Attacks and Defenses for Generative Diffusion Models: A Comprehensive
  Survey
Attacks and Defenses for Generative Diffusion Models: A Comprehensive Survey
V. T. Truong
Luan Ba Dang
Long Bao Le
DiffM
MedIm
38
14
0
06 Aug 2024
Lighthouse: A User-Friendly Library for Reproducible Video Moment
  Retrieval and Highlight Detection
Lighthouse: A User-Friendly Library for Reproducible Video Moment Retrieval and Highlight Detection
Taichi Nishimura
Shota Nakada
Hokuto Munakata
Tatsuya Komatsu
VLM
14
1
0
06 Aug 2024
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining
Dongyang Liu
Shitian Zhao
Le Zhuo
Weifeng Lin
Yu Qiao
Xinyue Li
Qi Qin
Yu Qiao
Hongsheng Li
Peng Gao
MLLM
62
48
0
05 Aug 2024
Lifelong Person Search
Lifelong Person Search
Jae-Won Yang
Seungbin Hong
Jae-Young Sim
CLL
28
0
0
31 Jul 2024
HandDAGT: A Denoising Adaptive Graph Transformer for 3D Hand Pose
  Estimation
HandDAGT: A Denoising Adaptive Graph Transformer for 3D Hand Pose Estimation
Wencan Cheng
Eunji Kim
Jong Hwan Ko
3DH
ViT
27
0
0
30 Jul 2024
WeCromCL: Weakly Supervised Cross-Modality Contrastive Learning for Transcription-only Supervised Text Spotting
WeCromCL: Weakly Supervised Cross-Modality Contrastive Learning for Transcription-only Supervised Text Spotting
Jingjing Wu
Zhengyao Fang
Pengyuan Lyu
Chengquan Zhang
Fanglin Chen
Guangming Lu
Wenjie Pei
47
2
0
28 Jul 2024
BCTR: Bidirectional Conditioning Transformer for Scene Graph Generation
BCTR: Bidirectional Conditioning Transformer for Scene Graph Generation
Peng Hao
Xiaobing Wang
Yingying Jiang
Hanchao Jia
Xiaoshuai Hao
Shaowei Cui
Junhang Wei
Xiaoshuai Hao
47
3
0
26 Jul 2024
Strike a Balance in Continual Panoptic Segmentation
Strike a Balance in Continual Panoptic Segmentation
Jinpeng Chen
Runmin Cong
Yuxuan Luo
H. Ip
Sam Kwong
38
4
0
23 Jul 2024
HSVLT: Hierarchical Scale-Aware Vision-Language Transformer for
  Multi-Label Image Classification
HSVLT: Hierarchical Scale-Aware Vision-Language Transformer for Multi-Label Image Classification
Shuyi Ouyang
Hongyi Wang
Ziwei Niu
Zhenjia Bai
Shiao Xie
Yingying Xu
Ruofeng Tong
Yen-Wei Chen
Lanfen Lin
VLM
27
1
0
23 Jul 2024
SAM-CP: Marrying SAM with Composable Prompts for Versatile Segmentation
SAM-CP: Marrying SAM with Composable Prompts for Versatile Segmentation
Pengfei Chen
Lingxi Xie
Xinyue Huo
Xuehui Yu
Xiaopeng Zhang
Yingfei Sun
Zhenjun Han
Qi Tian
VLM
58
1
0
23 Jul 2024
Towards Open-World Object-based Anomaly Detection via Self-Supervised
  Outlier Synthesis
Towards Open-World Object-based Anomaly Detection via Self-Supervised Outlier Synthesis
Brian K. S. Isaac-Medina
Yona Falinie A. Gaus
Neelanjan Bhowmik
T. Breckon
23
2
0
22 Jul 2024
SwinSF: Image Reconstruction from Spatial-Temporal Spike Streams
SwinSF: Image Reconstruction from Spatial-Temporal Spike Streams
Liangyan Jiang
Chuang Zhu
Yanxu Chen
38
2
0
22 Jul 2024
Predicting the Best of N Visual Trackers
Predicting the Best of N Visual Trackers
B. Alawode
S. Javed
Arif Mahmood
Jirí Matas
37
1
0
22 Jul 2024
SS-SFR: Synthetic Scenes Spatial Frequency Response on Virtual KITTI and
  Degraded Automotive Simulations for Object Detection
SS-SFR: Synthetic Scenes Spatial Frequency Response on Virtual KITTI and Degraded Automotive Simulations for Object Detection
Daniel Jakab
Alexander Braun
Cathaoir Agnew
Reenu Mohandas
B. Deegan
Dara Molloy
Enda Ward
Anthony G. Scanlan
Ciarán Eising
15
0
0
22 Jul 2024
PolyR-CNN: R-CNN for end-to-end polygonal building outline extraction
PolyR-CNN: R-CNN for end-to-end polygonal building outline extraction
Weiqin Jiao
Claudio Persello
G. Vosselman
3DV
19
2
0
20 Jul 2024
Hierarchical Separable Video Transformer for Snapshot Compressive
  Imaging
Hierarchical Separable Video Transformer for Snapshot Compressive Imaging
Ping Wang
Yulun Zhang
Lishun Wang
Xin Yuan
ViT
26
1
0
16 Jul 2024
Continuity Preserving Online CenterLine Graph Learning
Continuity Preserving Online CenterLine Graph Learning
Yunhui Han
Kun Yu
Zhiwei Li
GNN
3DPC
35
2
0
16 Jul 2024
Joint-Embedding Predictive Architecture for Self-Supervised Learning of
  Mask Classification Architecture
Joint-Embedding Predictive Architecture for Self-Supervised Learning of Mask Classification Architecture
Donghee Kim
Sungduk Cho
Hyeonwoo Cho
Chanmin Park
Jinyoung Kim
Won Hwa Kim
33
0
0
15 Jul 2024
WPS-SAM: Towards Weakly-Supervised Part Segmentation with Foundation
  Models
WPS-SAM: Towards Weakly-Supervised Part Segmentation with Foundation Models
Xin-Jian Wu
Rui-Song Zhang
Jie Qin
Shijie Ma
Cheng-Lin Liu
VLM
22
1
0
14 Jul 2024
Part2Object: Hierarchical Unsupervised 3D Instance Segmentation
Part2Object: Hierarchical Unsupervised 3D Instance Segmentation
Cheng Shi
Yulin Zhang
Bin Yang
Jiajin Tang
Yuexin Ma
Sibei Yang
3DPC
34
1
0
14 Jul 2024
Neural-based Video Compression on Solar Dynamics Observatory Images
Neural-based Video Compression on Solar Dynamics Observatory Images
Atefeh Khoshkhahtinat
Ali Zafari
P. Mehta
Nasser M. Nasrabadi
Barbara J. Thompson
M. Kirk
D. D. Silva
39
0
0
12 Jul 2024
Mixed-View Panorama Synthesis using Geospatially Guided Diffusion
Mixed-View Panorama Synthesis using Geospatially Guided Diffusion
Zhexiao Xiong
Xin Xing
Scott Workman
Subash Khanal
Nathan Jacobs
DiffM
MDE
52
1
0
12 Jul 2024
Learning Spatial-Semantic Features for Robust Video Object Segmentation
Learning Spatial-Semantic Features for Robust Video Object Segmentation
Xin Li
Deshui Miao
Zhenyu He
Y. Wang
Huchuan Lu
Ming Yang
VOS
49
4
0
10 Jul 2024
iiANET: Inception Inspired Attention Hybrid Network for efficient Long-Range Dependency
iiANET: Inception Inspired Attention Hybrid Network for efficient Long-Range Dependency
Haruna Yunusa
Qin Shiyin
Abdulrahman Hamman Adama Chukkol
Isah Bello
A. Lawan
Isah Bello
39
4
0
10 Jul 2024
Previous
12345...222324
Next