ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.12872
  4. Cited By
End-to-End Object Detection with Transformers

End-to-End Object Detection with Transformers

26 May 2020
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
    ViT
    3DV
    PINN
ArXivPDFHTML

Papers citing "End-to-End Object Detection with Transformers"

50 / 1,194 papers shown
Title
Semantic-Based Self-Critical Training For Question Generation
Semantic-Based Self-Critical Training For Question Generation
Loïc Kwate Dassi
Kwate Dassi
10
0
0
26 Aug 2021
A Comparison of Deep Saliency Map Generators on Multispectral Data in
  Object Detection
A Comparison of Deep Saliency Map Generators on Multispectral Data in Object Detection
Jens Bayer
David Munch
Michael Arens
3DPC
17
3
0
26 Aug 2021
TransFER: Learning Relation-aware Facial Expression Representations with
  Transformers
TransFER: Learning Relation-aware Facial Expression Representations with Transformers
Fanglei Xue
Qiangchang Wang
G. Guo
ViT
14
182
0
25 Aug 2021
Deep neural networks approach to microbial colony detection -- a
  comparative analysis
Deep neural networks approach to microbial colony detection -- a comparative analysis
Sylwia Majchrowska
J. Pawlowski
Natalia Czerep
Aleksander Górecki
Jakub Kuciñski
Tomasz Golan
8
5
0
23 Aug 2021
From Two to One: A New Scene Text Recognizer with Visual Language
  Modeling Network
From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network
Yuxin Wang
Hongtao Xie
Shancheng Fang
Jing Wang
Shenggao Zhu
Yongdong Zhang
VLM
44
152
0
22 Aug 2021
MM-ViT: Multi-Modal Video Transformer for Compressed Video Action
  Recognition
MM-ViT: Multi-Modal Video Transformer for Compressed Video Action Recognition
Jiawei Chen
C. Ho
ViT
24
76
0
20 Aug 2021
Trans4Trans: Efficient Transformer for Transparent Object and Semantic
  Scene Segmentation in Real-World Navigation Assistance
Trans4Trans: Efficient Transformer for Transparent Object and Semantic Scene Segmentation in Real-World Navigation Assistance
Jiaming Zhang
Kailun Yang
Angela Constantinescu
Kunyu Peng
Karin Muller
Rainer Stiefelhagen
ViT
28
68
0
20 Aug 2021
End-to-End Dense Video Captioning with Parallel Decoding
End-to-End Dense Video Captioning with Parallel Decoding
Teng Wang
Ruimao Zhang
Zhichao Lu
Feng Zheng
Ran Cheng
Ping Luo
3DV
25
179
0
17 Aug 2021
WikiChurches: A Fine-Grained Dataset of Architectural Styles with
  Real-World Challenges
WikiChurches: A Fine-Grained Dataset of Architectural Styles with Real-World Challenges
Björn Barz
Joachim Denzler
17
7
0
16 Aug 2021
Track without Appearance: Learn Box and Tracklet Embedding with Local
  and Global Motion Patterns for Vehicle Tracking
Track without Appearance: Learn Box and Tracklet Embedding with Local and Global Motion Patterns for Vehicle Tracking
Gaoang Wang
Renshu Gu
Zuozhu Liu
Weijie Hu
Mingli Song
Jenq-Neng Hwang
VOT
13
50
0
13 Aug 2021
MUSIQ: Multi-scale Image Quality Transformer
MUSIQ: Multi-scale Image Quality Transformer
Junjie Ke
Qifei Wang
Yilin Wang
P. Milanfar
Feng Yang
151
608
0
12 Aug 2021
Vision-Language Transformer and Query Generation for Referring
  Segmentation
Vision-Language Transformer and Query Generation for Referring Segmentation
Henghui Ding
Chang-rui Liu
Suchen Wang
Xudong Jiang
40
250
0
12 Aug 2021
Video Transformer for Deepfake Detection with Incremental Learning
Video Transformer for Deepfake Detection with Incremental Learning
Sohail Ahmed Khan
Hang Dai
ViT
10
62
0
11 Aug 2021
FoodLogoDet-1500: A Dataset for Large-Scale Food Logo Detection via
  Multi-Scale Feature Decoupling Network
FoodLogoDet-1500: A Dataset for Large-Scale Food Logo Detection via Multi-Scale Feature Decoupling Network
Qiang Hou
Weiqing Min
Jing Wang
Sujuan Hou
Yuanjie Zheng
Shuqiang Jiang
22
23
0
10 Aug 2021
Monte Carlo DropBlock for Modelling Uncertainty in Object Detection
Monte Carlo DropBlock for Modelling Uncertainty in Object Detection
K. Deepshikha
Sai Harsha Yelleni
P. K. Srijith
C.Krishna Mohan
BDL
UQCV
19
87
0
08 Aug 2021
Understanding the computational demands underlying visual reasoning
Understanding the computational demands underlying visual reasoning
Mohit Vaishnav
Rémi Cadène
A. Alamia
Drew Linsley
Rufin VanRullen
Thomas Serre
GNN
CoGe
27
16
0
08 Aug 2021
Simpler is Better: Few-shot Semantic Segmentation with Classifier Weight
  Transformer
Simpler is Better: Few-shot Semantic Segmentation with Classifier Weight Transformer
Zhihe Lu
Sen He
Xiatian Zhu
Li Zhang
Yi-Zhe Song
Tao Xiang
ViT
164
171
0
06 Aug 2021
Unifying Global-Local Representations in Salient Object Detection with
  Transformer
Unifying Global-Local Representations in Salient Object Detection with Transformer
Sucheng Ren
Qiang Wen
Nanxuan Zhao
Guoqiang Han
Shengfeng He
ViT
21
25
0
05 Aug 2021
Automatic Rail Component Detection Based on AttnConv-Net
Automatic Rail Component Detection Based on AttnConv-Net
Tian-hu Wang
Zijun Zhang
Fangfang Yang
K. Tsui
13
11
0
05 Aug 2021
Armour: Generalizable Compact Self-Attention for Vision Transformers
Armour: Generalizable Compact Self-Attention for Vision Transformers
Lingchuan Meng
ViT
11
3
0
03 Aug 2021
Toward Spatially Unbiased Generative Models
Toward Spatially Unbiased Generative Models
Jooyoung Choi
Jungbeom Lee
Yonghyun Jeong
Sungroh Yoon
DiffM
17
17
0
03 Aug 2021
Congested Crowd Instance Localization with Dilated Convolutional Swin
  Transformer
Congested Crowd Instance Localization with Dilated Convolutional Swin Transformer
Junyuan Gao
Maoguo Gong
Xuelong Li
ViT
11
45
0
02 Aug 2021
Transformer-based deep imitation learning for dual-arm robot
  manipulation
Transformer-based deep imitation learning for dual-arm robot manipulation
Heecheol Kim
Y. Ohmura
Y. Kuniyoshi
8
48
0
01 Aug 2021
Applications of Artificial Neural Networks in Microorganism Image
  Analysis: A Comprehensive Review from Conventional Multilayer Perceptron to
  Popular Convolutional Neural Network and Potential Visual Transformer
Applications of Artificial Neural Networks in Microorganism Image Analysis: A Comprehensive Review from Conventional Multilayer Perceptron to Popular Convolutional Neural Network and Potential Visual Transformer
Jinghua Zhang
Chen Li
Yi Yin
Jiawei Zhang
M. Grzegorzek
11
138
0
01 Aug 2021
HiFT: Hierarchical Feature Transformer for Aerial Tracking
HiFT: Hierarchical Feature Transformer for Aerial Tracking
Ziang Cao
Changhong Fu
Junjie Ye
Bowen Li
Yiming Li
13
194
0
31 Jul 2021
DPT: Deformable Patch-based Transformer for Visual Recognition
DPT: Deformable Patch-based Transformer for Visual Recognition
Zhiyang Chen
Yousong Zhu
Chaoyang Zhao
Guosheng Hu
Wei Zeng
Jinqiao Wang
Ming Tang
ViT
12
98
0
30 Jul 2021
Rethinking and Improving Relative Position Encoding for Vision
  Transformer
Rethinking and Improving Relative Position Encoding for Vision Transformer
Kan Wu
Houwen Peng
Minghao Chen
Jianlong Fu
Hongyang Chao
ViT
11
324
0
29 Jul 2021
A Unified Efficient Pyramid Transformer for Semantic Segmentation
A Unified Efficient Pyramid Transformer for Semantic Segmentation
Fangrui Zhu
Yi Zhu
Li Zhang
Chongruo Wu
Yanwei Fu
Mu Li
ViT
11
29
0
29 Jul 2021
PPT Fusion: Pyramid Patch Transformerfor a Case Study in Image Fusion
PPT Fusion: Pyramid Patch Transformerfor a Case Study in Image Fusion
Yu Fu
Tianyang Xu
Xiaojun Wu
J. Kittler
ViT
17
37
0
29 Jul 2021
Cross-Camera Feature Prediction for Intra-Camera Supervised Person
  Re-identification across Distant Scenes
Cross-Camera Feature Prediction for Intra-Camera Supervised Person Re-identification across Distant Scenes
Wenhang Ge
Chunyan Pan
Ancong Wu
Hongwei Zheng
Weishi Zheng
11
25
0
29 Jul 2021
QVHighlights: Detecting Moments and Highlights in Videos via Natural
  Language Queries
QVHighlights: Detecting Moments and Highlights in Videos via Natural Language Queries
Jie Lei
Tamara L. Berg
Mohit Bansal
ViT
14
62
0
20 Jul 2021
Generative Video Transformer: Can Objects be the Words?
Generative Video Transformer: Can Objects be the Words?
Yi-Fu Wu
Jaesik Yoon
Sungjin Ahn
ViT
8
34
0
20 Jul 2021
Image Fusion Transformer
Image Fusion Transformer
VS Vibashan
Jeya Maria Jose Valanarasu
Poojan Oza
Vishal M. Patel
ViT
11
115
0
19 Jul 2021
Video Crowd Localization with Multi-focus Gaussian Neighborhood
  Attention and a Large-Scale Benchmark
Video Crowd Localization with Multi-focus Gaussian Neighborhood Attention and a Large-Scale Benchmark
Haopeng Li
Lingbo Liu
Kunlin Yang
Shinan Liu
Junyuan Gao
Bin Zhao
Rui Zhang
Jun Hou
37
14
0
19 Jul 2021
RAMS-Trans: Recurrent Attention Multi-scale Transformer forFine-grained
  Image Recognition
RAMS-Trans: Recurrent Attention Multi-scale Transformer forFine-grained Image Recognition
Yunqing Hu
Xuan Jin
Yin Zhang
Ha Hong
Jingfeng Zhang
Yuan He
Hui Xue
ViT
11
97
0
17 Jul 2021
Chimera: Efficiently Training Large-Scale Neural Networks with
  Bidirectional Pipelines
Chimera: Efficiently Training Large-Scale Neural Networks with Bidirectional Pipelines
Shigang Li
Torsten Hoefler
GNN
AI4CE
LRM
77
130
0
14 Jul 2021
Visual Parser: Representing Part-whole Hierarchies with Transformers
Visual Parser: Representing Part-whole Hierarchies with Transformers
Shuyang Sun
Xiaoyu Yue
S. Bai
Philip H. S. Torr
50
26
0
13 Jul 2021
CoBERL: Contrastive BERT for Reinforcement Learning
CoBERL: Contrastive BERT for Reinforcement Learning
Andrea Banino
Adria Puidomenech Badia
Jacob Walker
Tim Scholtes
Jovana Mitrović
Charles Blundell
OffRL
15
36
0
12 Jul 2021
Learning Vision-Guided Quadrupedal Locomotion End-to-End with
  Cross-Modal Transformers
Learning Vision-Guided Quadrupedal Locomotion End-to-End with Cross-Modal Transformers
Ruihan Yang
Minghao Zhang
Nicklas Hansen
Huazhe Xu
Xiaolong Wang
OffRL
11
99
0
08 Jul 2021
Learning Vision Transformer with Squeeze and Excitation for Facial
  Expression Recognition
Learning Vision Transformer with Squeeze and Excitation for Facial Expression Recognition
Mouath Aouayeb
W. Hamidouche
Catherine Soladié
K. Kpalma
Renaud Séguier
ViT
17
57
0
07 Jul 2021
GLiT: Neural Architecture Search for Global and Local Image Transformer
GLiT: Neural Architecture Search for Global and Local Image Transformer
Boyu Chen
Peixia Li
Chuming Li
Baopu Li
Lei Bai
Chen Lin
Ming-hui Sun
Junjie Yan
Wanli Ouyang
ViT
17
85
0
07 Jul 2021
TransformerFusion: Monocular RGB Scene Reconstruction using Transformers
TransformerFusion: Monocular RGB Scene Reconstruction using Transformers
Aljavz Bovzivc
Pablo Rodríguez Palafox
Justus Thies
Angela Dai
Matthias Nießner
ViT
20
133
0
05 Jul 2021
Physics-Guided Deep Learning for Dynamical Systems: A Survey
Physics-Guided Deep Learning for Dynamical Systems: A Survey
Rui Wang
Rose Yu
AI4CE
PINN
18
62
0
02 Jul 2021
AutoFormer: Searching Transformers for Visual Recognition
AutoFormer: Searching Transformers for Visual Recognition
Minghao Chen
Houwen Peng
Jianlong Fu
Haibin Ling
ViT
17
258
0
01 Jul 2021
Focal Self-attention for Local-Global Interactions in Vision
  Transformers
Focal Self-attention for Local-Global Interactions in Vision Transformers
Jianwei Yang
Chunyuan Li
Pengchuan Zhang
Xiyang Dai
Bin Xiao
Lu Yuan
Jianfeng Gao
ViT
17
422
0
01 Jul 2021
CBNet: A Composite Backbone Network Architecture for Object Detection
CBNet: A Composite Backbone Network Architecture for Object Detection
Tingting Liang
Xiao Chu
Yudong Liu
Yongtao Wang
Zhi Tang
Wei Chu
Jingdong Chen
Haibin Ling
ObjD
13
161
0
01 Jul 2021
Simple Training Strategies and Model Scaling for Object Detection
Simple Training Strategies and Model Scaling for Object Detection
Xianzhi Du
Barret Zoph
Wei-Chih Hung
Tsung-Yi Lin
ObjD
20
40
0
30 Jun 2021
Dual Aspect Self-Attention based on Transformer for Remaining Useful
  Life Prediction
Dual Aspect Self-Attention based on Transformer for Remaining Useful Life Prediction
Zhizheng Zhang
Wen Song
Qiqiang Li
AI4TS
28
162
0
30 Jun 2021
Looking Outside the Window: Wide-Context Transformer for the Semantic
  Segmentation of High-Resolution Remote Sensing Images
Looking Outside the Window: Wide-Context Transformer for the Semantic Segmentation of High-Resolution Remote Sensing Images
L. Ding
Dong Lin
Shaofu Lin
Jing Zhang
Xiaojie Cui
Yuebin Wang
H. Tang
Lorenzo Bruzzone
ViT
11
96
0
29 Jun 2021
K-Net: Towards Unified Image Segmentation
K-Net: Towards Unified Image Segmentation
Wenwei Zhang
Jiangmiao Pang
Kai-xiang Chen
Chen Change Loy
ISeg
11
356
0
28 Jun 2021
Previous
123...21222324
Next