Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1908.03195
Cited By
v1
v2 (latest)
LVIS: A Dataset for Large Vocabulary Instance Segmentation
Computer Vision and Pattern Recognition (CVPR), 2019
8 August 2019
Agrim Gupta
Piotr Dollár
Ross B. Girshick
ISeg
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"LVIS: A Dataset for Large Vocabulary Instance Segmentation"
50 / 1,059 papers shown
DALL-E-Bot: Introducing Web-Scale Diffusion Models to Robotics
IEEE Robotics and Automation Letters (RA-L), 2022
Ivan Kapelyukh
Vitalis Vosylius
Edward Johns
LM&Ro
DiffM
536
176
0
05 Oct 2022
Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Weicong Liang
Yuhui Yuan
Henghui Ding
Xiao Luo
Weihong Lin
Ding Jia
Zheng Zhang
Chao Zhang
Hanhua Hu
259
39
0
03 Oct 2022
Learning Equivariant Segmentation with Instance-Unique Querying
Neural Information Processing Systems (NeurIPS), 2022
Wenguan Wang
James Liang
Dongfang Liu
ISeg
324
94
0
03 Oct 2022
F-VLM: Open-Vocabulary Object Detection upon Frozen Vision and Language Models
Weicheng Kuo
Huayu Chen
Xiuye Gu
A. Piergiovanni
A. Angelova
MLLM
VLM
ObjD
451
171
0
30 Sep 2022
EPIC-KITCHENS VISOR Benchmark: VIdeo Segmentations and Object Relations
Neural Information Processing Systems (NeurIPS), 2022
Ahmad Darkhalil
Dandan Shan
Bin Zhu
Jian Ma
Amlan Kar
Richard E. L. Higgins
Sanja Fidler
David Fouhey
Dima Damen
VOS
272
132
0
26 Sep 2022
BURST: A Benchmark for Unifying Object Recognition, Segmentation and Tracking in Video
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
A. Athar
Jonathon Luiten
P. Voigtlaender
Tarasha Khurana
Achal Dave
Bastian Leibe
Deva Ramanan
VOS
VLM
270
74
0
25 Sep 2022
DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world Detection
Neural Information Processing Systems (NeurIPS), 2022
Lewei Yao
Jianhua Han
Youpeng Wen
Xiaodan Liang
Dan Xu
Wei Zhang
Zhenguo Li
Chunjing Xu
Hang Xu
CLIP
VLM
349
220
0
20 Sep 2022
ComplETR: Reducing the cost of annotations for object detection in dense scenes with vision transformers
Achin Jain
Kibok Lee
Gurumurthy Swaminathan
Han Yang
Bernt Schiele
Avinash Ravichandran
Onkar Dabeer
ViT
294
1
0
13 Sep 2022
Inverse Image Frequency for Long-tailed Image Recognition
IEEE Transactions on Image Processing (IEEE TIP), 2022
Konstantinos Panagiotis Alexandridis
Shang Luo
Anh H. Nguyen
Jiankang Deng
Stefanos Zafeiriou
241
18
0
11 Sep 2022
OmDet: Large-scale vision-language multi-dataset pre-training with multimodal detection network
IET Computer Vision (ICV), 2022
Tiancheng Zhao
Peng Liu
Kyusong Lee
VLM
MLLM
ObjD
157
15
0
10 Sep 2022
Progressive Domain Adaptation with Contrastive Learning for Object Detection in the Satellite Imagery
Debojyoti Biswas
Jelena Tevsić
ObjD
267
4
0
06 Sep 2022
Injecting Image Details into CLIP's Feature Space
Zilun Zhang
Cuifeng Shen
Yuan-Chung Shen
Huixin Xiong
Xinyu Zhou
VLM
CLIP
234
0
0
31 Aug 2022
PanorAMS: Automatic Annotation for Detecting Objects in Urban Context
IEEE transactions on multimedia (IEEE TMM), 2022
Inske Groenen
Stevan Rudinac
Marcel Worring
193
7
0
30 Aug 2022
Towards Calibrated Hyper-Sphere Representation via Distribution Overlap Coefficient for Long-tailed Learning
European Conference on Computer Vision (ECCV), 2022
Hualiang Wang
Siming Fu
Xiaoxuan He
Han Fang
Zuozhu Liu
Haoji Hu
232
23
0
22 Aug 2022
Label-Noise Learning with Intrinsically Long-Tailed Data
IEEE International Conference on Computer Vision (ICCV), 2022
Yang Lu
Yiliang Zhang
Bo Han
Yiu-ming Cheung
Hanzi Wang
NoLa
177
29
0
21 Aug 2022
Single-Stage Open-world Instance Segmentation with Cross-task Consistency Regularization
Xizhe Xue
Dongdong Yu
Lingqiao Liu
Yu Liu
Satoshi Tsutsui
Ying Li
Zehuan Yuan
Ping Song
Mike Zheng Shou
ISeg
200
4
0
18 Aug 2022
Open-Vocabulary Universal Image Segmentation with MaskCLIP
International Conference on Machine Learning (ICML), 2022
Zheng Ding
Jieke Wang
Zhuowen Tu
CLIP
ISeg
VLM
296
127
0
18 Aug 2022
DeepSportradar-v1: Computer Vision Dataset for Sports Understanding with High Quality Annotations
Gabriel Van Zandycke
Vladimir Somers
M. Istasse
Carlo Del Don
Davide Zambrano
191
54
0
17 Aug 2022
Sample hardness based gradient loss for long-tailed cervical cell detection
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2022
Minmin Liu
Xuechen Li
Yantao Du
Junliang Chen
Linlin Shen
Huisi Wu
MedIm
145
6
0
07 Aug 2022
Constructing Balance from Imbalance for Long-tailed Image Recognition
European Conference on Computer Vision (ECCV), 2022
Yue Xu
Yong-Lu Li
Jiefeng Li
Cewu Lu
CVBM
206
39
0
04 Aug 2022
GPPF: A General Perception Pre-training Framework via Sparsely Activated Multi-Task Learning
Benyuan Sun
Jinqiao Dai
Zihao Liang
Cong Liu
Yi Yang
Bo Bai
MoE
219
4
0
03 Aug 2022
Class-Difficulty Based Methods for Long-Tailed Visual Recognition
International Journal of Computer Vision (IJCV), 2022
Saptarshi Sinha
Hiroki Ohashi
Katsuyuki Nakamura
262
42
0
29 Jul 2022
Iterative Scene Graph Generation
Neural Information Processing Systems (NeurIPS), 2022
Siddhesh Khandelwal
Leonid Sigal
OCL
221
37
0
27 Jul 2022
Uncertainty-based Visual Question Answering: Estimating Semantic Inconsistency between Image and Knowledge Base
IEEE International Joint Conference on Neural Network (IJCNN), 2022
Jinyeong Chae
Jihie Kim
161
5
0
27 Jul 2022
DETRs with Hybrid Matching
Computer Vision and Pattern Recognition (CVPR), 2022
Ding Jia
Yuhui Yuan
Hao He
Xiao-pei Wu
Haojun Yu
Weihong Lin
Lei-huan Sun
Chao Zhang
Hanhua Hu
465
271
0
26 Jul 2022
Tracking Every Thing in the Wild
European Conference on Computer Vision (ECCV), 2022
Siyuan Li
Martin Danelljan
Henghui Ding
Thomas E. Huang
Feng Yu
190
52
0
26 Jul 2022
Active Pointly-Supervised Instance Segmentation
European Conference on Computer Vision (ECCV), 2022
Chufeng Tang
Lingxi Xie
Qiang Chen
Xiaopeng Zhang
Qi Tian
Xiaolin Hu
ISeg
324
19
0
23 Jul 2022
Rethinking Few-Shot Object Detection on a Multi-Domain Benchmark
European Conference on Computer Vision (ECCV), 2022
Kibok Lee
Hao Yang
Satyaki Chakraborty
Zhaowei Cai
Gurumurthy Swaminathan
Avinash Ravichandran
Onkar Dabeer
314
27
0
22 Jul 2022
Few-shot Object Counting and Detection
European Conference on Computer Vision (ECCV), 2022
Trung Quoc Nguyen
Chau Pham
Khoi Duc Minh Nguyen
Minh Hoai
197
74
0
22 Jul 2022
Long-tailed Instance Segmentation using Gumbel Optimized Loss
European Conference on Computer Vision (ECCV), 2022
Konstantinos Panagiotis Alexandridis
Jiankang Deng
A. Nguyen
Shang Luo
196
27
0
22 Jul 2022
Exploring Fine-Grained Audiovisual Categorization with the SSW60 Dataset
European Conference on Computer Vision (ECCV), 2022
Grant Van Horn
Rui Qian
Kimberly Wilber
Hartwig Adam
Oisin Mac Aodha
Serge Belongie
210
14
0
21 Jul 2022
Omni3D: A Large Benchmark and Model for 3D Object Detection in the Wild
Computer Vision and Pattern Recognition (CVPR), 2022
Garrick Brazil
Abhinav Kumar
Julian Straub
Nikhila Ravi
Justin Johnson
Georgia Gkioxari
VLM
387
159
0
21 Jul 2022
Exploiting Unlabeled Data with Vision and Language Models for Object Detection
European Conference on Computer Vision (ECCV), 2022
Shiyu Zhao
Zhixing Zhang
S. Schulter
Long Zhao
Vijay Kumar B.G
Anastasis Stathopoulos
Manmohan Chandraker
Dimitris N. Metaxas
VLM
ObjD
204
121
0
18 Jul 2022
Open-world Semantic Segmentation via Contrasting and Clustering Vision-Language Embedding
European Conference on Computer Vision (ECCV), 2022
Quan Liu
Youpeng Wen
Jianhua Han
Chunjing Xu
Hang Xu
Xiaodan Liang
VLM
254
89
0
18 Jul 2022
Dual-branch Hybrid Learning Network for Unbiased Scene Graph Generation
Chao Zheng
Lianli Gao
Xinyu Lyu
Pengpeng Zeng
Abdulmotaleb El Saddik
Hengtao Shen
178
26
0
16 Jul 2022
PseudoClick: Interactive Image Segmentation with Click Imitation
European Conference on Computer Vision (ECCV), 2022
Qin Liu
Meng Zheng
Benjamin Planche
Srikrishna Karanam
Terrence Chen
Marc Niethammer
Ziyan Wu
VLM
256
70
0
12 Jul 2022
Scaling Novel Object Detection with Weakly Supervised Detection Transformers
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
T. LaBonte
Ya-heng Song
Xin Eric Wang
Vibhav Vineet
Neel Joshi
ViT
198
13
0
11 Jul 2022
Fine-grained Activities of People Worldwide
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
J. Byrne
Greg Castañón
Zhongheng Li
G. Ettinger
216
5
0
11 Jul 2022
Adaptive Fine-Grained Predicates Learning for Scene Graph Generation
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Xinyu Lyu
Lianli Gao
Pengpeng Zeng
Hengtao Shen
Jingkuan Song
224
21
0
11 Jul 2022
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection
Neural Information Processing Systems (NeurIPS), 2022
H. Rasheed
Muhammad Maaz
Muhammad Uzair Khattak
Salman Khan
Fahad Shahbaz Khan
ObjD
VLM
375
183
0
07 Jul 2022
Diagnosing and Remedying Shot Sensitivity with Cosine Few-Shot Learners
Davis Wertheimer
Luming Tang
Bharath Hariharan
317
0
0
07 Jul 2022
CLEAR: Improving Vision-Language Navigation with Cross-Lingual, Environment-Agnostic Representations
Jialu Li
Hao Tan
Joey Tianyi Zhou
LM&Ro
228
12
0
05 Jul 2022
InsMix: Towards Realistic Generative Data Augmentation for Nuclei Instance Segmentation
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2022
Yi Lin
Zeyu Wang
Kwang-Ting Cheng
Hao Chen
MedIm
121
32
0
30 Jun 2022
Towards Federated Long-Tailed Learning
Zihan Chen
Songshan Liu
Hualiang Wang
Howard H. Yang
Tony Q.S. Quek
Zuozhu Liu
FedML
182
14
0
30 Jun 2022
Learning To Generate Scene Graph from Head to Tail
IEEE International Conference on Multimedia and Expo (ICME), 2022
Chao Zheng
Xinyu Lyu
Yuyu Guo
Pengpeng Zeng
Jingkuan Song
Lianli Gao
190
12
0
23 Jun 2022
Open Vocabulary Object Detection with Proposal Mining and Prediction Equalization
Peixian Chen
Kekai Sheng
Mengdan Zhang
Mingbao Lin
Chunjiang Ge
Shaohui Lin
Bo Ren
Ke Li
VLM
ObjD
399
32
0
22 Jun 2022
Parallel Pre-trained Transformers (PPT) for Synthetic Data-based Instance Segmentation
Ming Li
Jie Wu
Jin Cai
J. Qin
Yuxi Ren
Xu Xiao
Min Zheng
Rui Wang
X. Pan
ViT
178
2
0
22 Jun 2022
Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks
International Conference on Learning Representations (ICLR), 2022
Jiasen Lu
Christopher Clark
Rowan Zellers
Roozbeh Mottaghi
Aniruddha Kembhavi
ObjD
VLM
MLLM
482
476
0
17 Jun 2022
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
Neural Information Processing Systems (NeurIPS), 2022
Zi-Yi Dou
Aishwarya Kamath
Zhe Gan
Pengchuan Zhang
Jianfeng Wang
...
Ce Liu
Yann LeCun
Nanyun Peng
Jianfeng Gao
Lijuan Wang
VLM
ObjD
296
152
0
15 Jun 2022
GLIPv2: Unifying Localization and Vision-Language Understanding
Haotian Zhang
Pengchuan Zhang
Xiaowei Hu
Yen-Chun Chen
Liunian Harold Li
Xiyang Dai
Lijuan Wang
Lu Yuan
Lei Li
Jianfeng Gao
ObjD
VLM
296
354
0
12 Jun 2022
Previous
1
2
3
...
15
16
17
...
20
21
22
Next
Page 16 of 22
Page
of 22
Go