1405.0312
Cited By
Microsoft COCO: Common Objects in Context
1 May 2014
Tsung-Yi Lin
Michael Maire
Serge J. Belongie
Lubomir Bourdev
Ross B. Girshick
James Hays
Pietro Perona
Deva Ramanan
C. L. Zitnick
Piotr Dollár
ObjD
Papers citing
"Microsoft COCO: Common Objects in Context"
50 / 652 papers shown
Title
Stable Diffusion for Data Augmentation in COCO and Weed Datasets
Boyang Deng
34
1
0
07 Dec 2023
Paragraph-to-Image Generation with Information-Enriched Diffusion Model
Weijia Wu
Zhuang Li
Yefei He
Mike Zheng Shou
Chunhua Shen
Lele Cheng
Yan Li
Di Zhang
VLM
158
25
0
24 Nov 2023
Two Views Are Better than One: Monocular 3D Pose Estimation with Multiview Consistency
C. K. Ingwersen
Anders Bjorholm Dahl
Rasmus Nylander
J. N. Jensen
Anders B. Dahl
M. Hannemose
91
2
0
21 Nov 2023
Progressive Feedback-Enhanced Transformer for Image Forgery Localization
Haochen Zhu
Gang Cao
Xianglin Huang
ViT
48
7
0
15 Nov 2023
Pretrain like Your Inference: Masked Tuning Improves Zero-Shot Composed Image Retrieval
Junyang Chen
Hanjiang Lai
VLM
62
15
0
13 Nov 2023
TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition
Meng Lou
Hong-Yu Zhou
Sibei Yang
Yizhou Yu
Chuan Wu
ViT
51
37
0
30 Oct 2023
Audio-Visual Instance Segmentation
Ruohao Guo
Yaru Chen
Yanyu Qi
Wenzhen Yue
Dantong Niu
...
Ji Shi
Qixun Wang
Peiliang Zhang
Buwen Liang
VLM
VOS
51
2
0
28 Oct 2023
Prompt-Driven Building Footprint Extraction in Aerial Images with Offset-Building Model
Kai Li
Yupeng Deng
Yun-long Kong
Diyou Liu
Jingbo Chen
Yu Meng
Junxian Ma
Chenhao Wang
111
1
0
25 Oct 2023
TextPSG: Panoptic Scene Graph Generation from Textual Descriptions
Chengyang Zhao
Songlin Yang
Zhenfang Chen
Mingyu Ding
Chuang Gan
80
16
0
10 Oct 2023
Treating Motion as Option with Output Selection for Unsupervised Video Object Segmentation
Suhwan Cho
Minhyeok Lee
Jungho Lee
Myeongah Cho
Seungwook Park
Jaeyeob Kim
Hyunsung Jang
Sangyoun Lee
VOS
68
2
0
26 Sep 2023
Multiple Different Black Box Explanations for Image Classifiers
Hana Chockler
D. A. Kelly
Daniel Kroening
FAtt
42
0
0
25 Sep 2023
A Novel Neural-symbolic System under Statistical Relational Learning
Dongran Yu
Xueyan Liu
Shirui Pan
Anchen Li
Bo Yang
AI4CE
NAI
21
1
0
16 Sep 2023
DEFormer: DCT-driven Enhancement Transformer for Low-light Image and Dark Vision
Xiangchen Yin
Zhenda Yu
Xin Gao
Ran Ju
ViT
38
1
0
13 Sep 2023
Language Prompt for Autonomous Driving
Dongming Wu
Wencheng Han
Tiancai Wang
Yingfei Liu
Cheng-zhong Xu
Jianbing Shen
VLM
64
81
0
08 Sep 2023
Language-Guided Diffusion Model for Visual Grounding
Sijia Chen
Baochun Li
52
5
0
18 Aug 2023
Linear Alignment of Vision-language Models for Image Captioning
Fabian Paischer
M. Hofmarcher
Sepp Hochreiter
Thomas Adler
CLIP
VLM
87
0
0
10 Jul 2023
All in One: Exploring Unified Vision-Language Tracking with Multi-Modal Alignment
Chunhui Zhang
Xin Sun
Li Liu
Yiqian Yang
Qiong Liu
Xiaoping Zhou
Yanfeng Wang
109
15
0
07 Jul 2023
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
Shilong Zhang
Pei Sun
Shoufa Chen
Min Xiao
Wenqi Shao
Wenwei Zhang
Yu Liu
Kai-xiang Chen
Ping Luo
VLM
MLLM
111
231
0
07 Jul 2023
UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding
Rui Sun
Zhecan Wang
Haoxuan You
Noel Codella
Kai-Wei Chang
Shih-Fu Chang
CLIP
55
3
0
03 Jul 2023
Review helps learn better: Temporal Supervised Knowledge Distillation
Dongwei Wang
Zhi Han
Yanmei Wang
Xi’ai Chen
Baichen Liu
Yandong Tang
90
1
0
03 Jul 2023
Spatial Re-parameterization for N:M Sparsity
Yuxin Zhang
Mingbao Lin
Mingliang Xu
Mengzhao Chen
Chia-Wen Lin
89
2
0
09 Jun 2023
DeepScribe: Localization and Classification of Elamite Cuneiform Signs Via Deep Learning
Edward C. Williams
Grace Su
Sandra R. Schloen
Miller C. Prosser
Susanne Paulus
S. Rohith Krishnan
176
3
0
02 Jun 2023
Lightweight Vision Transformer with Bidirectional Interaction
Qihang Fan
Huaibo Huang
Xiaoqiang Zhou
Ran He
ViT
71
28
0
01 Jun 2023
On the Importance of Backbone to the Adversarial Robustness of Object Detectors
Xiao-Li Li
Hang Chen
Xiaolin Hu
AAML
61
4
0
27 May 2023
VDD: Varied Drone Dataset for Semantic Segmentation
Wenxiao Cai
Ke Jin
Jinyan Hou
Cong Guo
Letian Wu
Wankou Yang
61
11
0
23 May 2023
Semantically Structured Image Compression via Irregular Group-Based Decoupling
V. Sheoran
Yixin Gao
Shreyansh Joshi
Tanisha R. Bhayani
Zhibo Chen
96
13
0
04 May 2023
Oil Spill Segmentation using Deep Encoder-Decoder models
Abhishek Ramanathapura Satyanarayana
Maruf A. Dhali
58
2
0
02 May 2023
NeuroBench: A Framework for Benchmarking Neuromorphic Computing Algorithms and Systems
Jason Yik
Korneel Van den Berghe
Douwe den Blanken
Younes Bouhadjar
Maxime Fabre
...
Fatima Tuz Zohora
Charlotte Frenkel
Vijay Janapa Reddi
61
20
0
10 Apr 2023
Deep Learning for Camera Calibration and Beyond: A Survey
K. Liao
Lang Nie
Shujuan Huang
Chunyu Lin
Jing Zhang
Yao Zhao
Moncef Gabbouj
Dacheng Tao
68
25
0
19 Mar 2023
Deep Learning for Cross-Domain Few-Shot Visual Recognition: A Survey
Huali Xu
Shuaifeng Zhi
Shuzhou Sun
Vishal M. Patel
Li Liu
76
13
0
15 Mar 2023
Oriented Object Detection in Optical Remote Sensing Images using Deep Learning: A Survey
Kunlin Wang
Zi Wang
Zhang Li
Ang Su
Xichao Teng
Minhao Liu
Qifeng Yu
ObjD
109
9
0
21 Feb 2023
Preconditioned Score-based Generative Models
He Ma
Xiatian Zhu
Jianfeng Feng
DiffM
59
6
0
13 Feb 2023
DDS: Decoupled Dynamic Scene-Graph Generation Network
A S M Iftekhar
Raphael Ruschel
Satish Kumar
Suya You
B. S. Manjunath
54
2
0
18 Jan 2023
FIND: An Unsupervised Implicit 3D Model of Articulated Human Feet
Oliver Boyne
James Charles
R. Cipolla
3DH
39
4
0
21 Oct 2022
A systematic review of the use of Deep Learning in Satellite Imagery for Agriculture
Brandon Victor
Zhen He
Aiden Nibali
55
9
0
03 Oct 2022
Deep Learning-Based Automatic Diagnosis System for Developmental Dysplasia of the Hip
Yang Li
Leo Yan Li-Han
H. Tian
34
1
0
07 Sep 2022
Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction
Jun Chen
Ming Hu
Boyang Albert Li
Mohamed Elhoseiny
79
36
0
01 Jun 2022
Human-Object Interaction Detection via Disentangled Transformer
Desen Zhou
Zhichao Liu
Jian Wang
Leshan Wang
T. Hu
Errui Ding
Jingdong Wang
ViT
53
57
0
20 Apr 2022
Can Visual Dialogue Models Do Scorekeeping? Exploring How Dialogue Representations Incrementally Encode Shared Knowledge
Brielen Madureira
David Schlangen
45
4
0
14 Apr 2022
MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection
Renrui Zhang
Han Qiu
Tai Wang
Ziyu Guo
Xuan Xu
Xuanzhuo Xu
Ziteng Cui
Peng Gao
Hongsheng Li
ViT
MDE
66
87
0
24 Mar 2022
GIAOTracker: A comprehensive framework for MCMOT with global information and optimizing strategies in VisDrone 2021
Yunhao Du
Jun-Jun Wan
Yanyun Zhao
Binyu Zhang
Zhihang Tong
Junhao Dong
45
100
0
24 Feb 2022
Vision-Language Pre-Training with Triple Contrastive Learning
Jinyu Yang
Jiali Duan
Son N. Tran
Yi Xu
Sampath Chanda
Liqun Chen
Belinda Zeng
Trishul Chilimbi
Junzhou Huang
VLM
74
290
0
21 Feb 2022
AlertTrap: A study on object detection in remote insects trap monitoring system using on-the-edge deep learning platform
A. D. Le
Duy A. Pham
Dong T. Pham
H. Vo
34
7
0
26 Dec 2021
High Quality Segmentation for Ultra High-resolution Images
Tiancheng Shen
Yuechen Zhang
Lu Qi
Jason Kuen
Xingyu Xie
Jianlong Wu
Zhe Lin
Jiaya Jia
113
42
0
29 Nov 2021
FBNetV5: Neural Architecture Search for Multiple Tasks in One Run
Bichen Wu
Chaojian Li
Hang Zhang
Xiaoliang Dai
Peizhao Zhang
Matthew Yu
Jialiang Wang
Yingyan Lin
Peter Vajda
ViT
64
24
0
19 Nov 2021
Benchmarking the Robustness of Instance Segmentation Models
Said Fahri Altindis
Yusuf Dalva
Hamza Pehlivan
Aysegül Dündar
VLM
OOD
84
12
0
02 Sep 2021
Few-Shot Segmentation with Global and Local Contrastive Learning
Weide Liu
Zhonghua Wu
Henghui Ding
Fayao Liu
Jie Lin
Guosheng Lin
Wei Zhou
61
24
0
11 Aug 2021
DANCE: DAta-Network Co-optimization for Efficient Segmentation Model Training and Inference
Chaojian Li
Wuyang Chen
Yuchen Gu
Tianlong Chen
Yonggan Fu
Zhangyang Wang
Yingyan Lin
80
0
0
16 Jul 2021
FSOCO: The Formula Student Objects in Context Dataset
Niclas Vodisch
David Dodel
Michael Schötz
49
12
0
13 Dec 2020
Simultaneously-Collected Multimodal Lying Pose Dataset: Towards In-Bed Human Pose Monitoring under Adverse Vision Conditions
Shuangjun Liu
Xiaofei Huang
Nihang Fu
Cheng Li
Zhongnan Su
Sarah Ostadabbas
3DH
27
20
0
20 Aug 2020