ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1908.03195
  4. Cited By
LVIS: A Dataset for Large Vocabulary Instance Segmentation
v1v2 (latest)

LVIS: A Dataset for Large Vocabulary Instance Segmentation

Computer Vision and Pattern Recognition (CVPR), 2019
8 August 2019
Agrim Gupta
Piotr Dollár
Ross B. Girshick
    ISegVLM
ArXiv (abs)PDFHTML

Papers citing "LVIS: A Dataset for Large Vocabulary Instance Segmentation"

50 / 1,056 papers shown
Title
OVMR: Open-Vocabulary Recognition with Multi-Modal References
OVMR: Open-Vocabulary Recognition with Multi-Modal ReferencesComputer Vision and Pattern Recognition (CVPR), 2024
Zehong Ma
Shiliang Zhang
Longhui Wei
Qi Tian
VLM
266
3
0
07 Jun 2024
Matching Anything by Segmenting Anything
Matching Anything by Segmenting AnythingComputer Vision and Pattern Recognition (CVPR), 2024
Siyuan Li
Lei Ke
Martin Danelljan
Luigi Piccinelli
Mattia Segu
Luc Van Gool
Fisher Yu
VOS
241
47
0
06 Jun 2024
Frequency-based Matcher for Long-tailed Semantic Segmentation
Frequency-based Matcher for Long-tailed Semantic Segmentation
Shan Li
Pu Cao
Pu Cao
Liulei Li
Huadong Ma
206
3
0
06 Jun 2024
Generative Active Learning for Long-tailed Instance Segmentation
Generative Active Learning for Long-tailed Instance Segmentation
Huanyi Zheng
Chengxiang Fan
Hao Chen
Yongxu Liu
Weian Mao
Xiaogang Xu
Chunhua Shen
159
7
0
04 Jun 2024
SpatialRGPT: Grounded Spatial Reasoning in Vision Language Model
SpatialRGPT: Grounded Spatial Reasoning in Vision Language Model
An-Chieh Cheng
Hongxu Yin
Yang Fu
Qiushan Guo
Ruihan Yang
Jan Kautz
Xiaolong Wang
Sifei Liu
LRM
262
180
0
03 Jun 2024
Collaborative Novel Object Discovery and Box-Guided Cross-Modal Alignment for Open-Vocabulary 3D Object Detection
Collaborative Novel Object Discovery and Box-Guided Cross-Modal Alignment for Open-Vocabulary 3D Object Detection
Yang Cao
Yihan Zeng
Hang Xu
Dan Xu
3DPCObjD
276
14
0
02 Jun 2024
Learning Background Prompts to Discover Implicit Knowledge for Open
  Vocabulary Object Detection
Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection
Jiaming Li
Jiacheng Zhang
Jichang Li
Ge Li
Si Liu
Liang Lin
Guanbin Li
ObjDVLM
310
27
0
01 Jun 2024
Extreme Point Supervised Instance Segmentation
Extreme Point Supervised Instance Segmentation
Hyeonjun Lee
S. Hwang
Suha Kwak
278
7
0
31 May 2024
On Calibration of Object Detectors: Pitfalls, Evaluation and Baselines
On Calibration of Object Detectors: Pitfalls, Evaluation and Baselines
Selim Kuzucu
Kemal Oksuz
Jonathan Sadeghi
P. Dokania
224
8
0
30 May 2024
RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection
RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection
Fangyi Chen
Han Zhang
Zhantao Yang
Hao Chen
Kai Hu
Marios Savvides
ObjDVLM
194
7
0
30 May 2024
Instruction-Guided Visual Masking
Instruction-Guided Visual Masking
Jinliang Zheng
Jianxiong Li
Si Cheng
Yinan Zheng
Jiaming Li
Jihao Liu
Yu Liu
Jingjing Liu
Xianyuan Zhan
244
17
0
30 May 2024
Enhancing Vision-Language Model with Unmasked Token Alignment
Enhancing Vision-Language Model with Unmasked Token Alignment
Jihao Liu
Jinliang Zheng
Boxiao Liu
Yu Liu
Jiaming Song
CLIP
178
0
0
29 May 2024
FocSAM: Delving Deeply into Focused Objects in Segmenting Anything
FocSAM: Delving Deeply into Focused Objects in Segmenting Anything
You Huang
Zongyu Lan
Liujuan Cao
Xianming Lin
Shengchuan Zhang
Guannan Jiang
Rongrong Ji
VLM
159
6
0
29 May 2024
OV-DQUO: Open-Vocabulary DETR with Denoising Text Query Training and
  Open-World Unknown Objects Supervision
OV-DQUO: Open-Vocabulary DETR with Denoising Text Query Training and Open-World Unknown Objects Supervision
Junjie Wang
Bin Chen
Bin Kang
Yulin Li
Yichi Chen
Weizhi Xian
Huifeng Chang
VLMObjD
205
15
0
28 May 2024
VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models
VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models
Zejun Li
Ruipu Luo
Jiwen Zhang
Minghui Qiu
Zhongyu Wei
Zhongyu Wei
LRMMLLM
613
32
0
27 May 2024
Free Performance Gain from Mixing Multiple Partially Labeled Samples in
  Multi-label Image Classification
Free Performance Gain from Mixing Multiple Partially Labeled Samples in Multi-label Image Classification
Chak Fong Chong
Jielong Guo
Xu Yang
Wei Ke
Yapeng Wang
VLM
219
0
0
24 May 2024
PerSense: Training-Free Personalized Instance Segmentation in Dense Images
PerSense: Training-Free Personalized Instance Segmentation in Dense Images
Muhammad Ibraheem Siddiqui
Muhammad Umer Sheikh
Hassan Abid
M. H. Khan
VLM
444
0
0
22 May 2024
NubbleDrop: A Simple Way to Improve Matching Strategy for Prompted
  One-Shot Segmentation
NubbleDrop: A Simple Way to Improve Matching Strategy for Prompted One-Shot Segmentation
Zhiyu Xu
Qingliang Chen
126
0
0
19 May 2024
Efficient Multimodal Large Language Models: A Survey
Efficient Multimodal Large Language Models: A Survey
Yizhang Jin
Jian Li
Yexin Liu
Tianjun Gu
Kai Wu
...
Xin Tan
Zhenye Gan
Yabiao Wang
Chengjie Wang
Lizhuang Ma
LRM
269
84
0
17 May 2024
DiverGen: Improving Instance Segmentation by Learning Wider Data
  Distribution with More Diverse Generative Data
DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative DataComputer Vision and Pattern Recognition (CVPR), 2024
Chengxiang Fan
Huanyi Zheng
Hao Chen
Yang Liu
Weijia Wu
Huaqi Zhang
Chunhua Shen
DiffM
270
14
0
16 May 2024
SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection
SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object DetectionComputer Vision and Pattern Recognition (CVPR), 2024
Mingxuan Liu
Tyler L. Hayes
Elisa Ricci
G. Csurka
Riccardo Volpi
ObjD
257
9
0
16 May 2024
Open-Vocabulary Object Detection via Neighboring Region Attention
  Alignment
Open-Vocabulary Object Detection via Neighboring Region Attention AlignmentEngineering applications of artificial intelligence (EAAI), 2024
Sunyuan Qiang
Xianfei Li
Yanyan Liang
Wenlong Liao
Tao He
Pai Peng
ObjD
185
0
0
14 May 2024
Structured Click Control in Transformer-based Interactive Segmentation
Structured Click Control in Transformer-based Interactive Segmentation
Long Xu
Yong-Xiang Chen
Rui Huang
Feng Wu
Shiwu Lai
107
2
0
07 May 2024
Mapping the Unseen: Unified Promptable Panoptic Mapping with Dynamic Labeling using Foundation Models
Mapping the Unseen: Unified Promptable Panoptic Mapping with Dynamic Labeling using Foundation Models
Mohamad Al Al Mdfaa
Raghad Salameh
Geesara Kulathunga
Sergey Zagoruyko
Gonzalo Ferrer
274
3
0
03 May 2024
ASAM: Boosting Segment Anything Model with Adversarial Tuning
ASAM: Boosting Segment Anything Model with Adversarial Tuning
Bo Li
Haoke Xiao
Lv Tang
242
15
0
01 May 2024
MFP: Making Full Use of Probability Maps for Interactive Image
  Segmentation
MFP: Making Full Use of Probability Maps for Interactive Image Segmentation
Chaewon Lee
Seon-Ho Lee
Chang-Su Kim
149
10
0
29 Apr 2024
Paint by Inpaint: Learning to Add Image Objects by Removing Them First
Paint by Inpaint: Learning to Add Image Objects by Removing Them First
Navve Wasserman
Noam Rotstein
Roy Ganz
Ron Kimmel
DiffM
393
28
0
28 Apr 2024
DAVE -- A Detect-and-Verify Paradigm for Low-Shot Counting
DAVE -- A Detect-and-Verify Paradigm for Low-Shot Counting
Jer Pelhan
A. Lukežič
Vitjan Zavrtanik
Matej Kristan
256
42
0
25 Apr 2024
Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings
Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings
Olivia Wiles
Chuhan Zhang
Isabela Albuquerque
Ivana Kajić
Su Wang
...
Jordi Pont-Tuset
Aida Nematzadeh
Anant Nawalgaria
Jordi Pont-Tuset
Aida Nematzadeh
EGVM
939
31
0
25 Apr 2024
A Survey of Deep Long-Tail Classification Advancements
A Survey of Deep Long-Tail Classification Advancements
Charika De Alvis
Suranga Seneviratne
215
9
0
24 Apr 2024
Groma: Localized Visual Tokenization for Grounding Multimodal Large
  Language Models
Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models
Chuofan Ma
Yi Jiang
Jiannan Wu
Zehuan Yuan
Xiaojuan Qi
VLMObjD
209
104
0
19 Apr 2024
BLINK: Multimodal Large Language Models Can See but Not Perceive
BLINK: Multimodal Large Language Models Can See but Not Perceive
Xingyu Fu
Yushi Hu
Bangzheng Li
Yu Feng
Haoyu Wang
Xudong Lin
Dan Roth
Noah A. Smith
Wei-Chiu Ma
Ranjay Krishna
VLMLRMMLLM
524
295
0
18 Apr 2024
Performance Evaluation of Segment Anything Model with Variational
  Prompting for Application to Non-Visible Spectrum Imagery
Performance Evaluation of Segment Anything Model with Variational Prompting for Application to Non-Visible Spectrum Imagery
Yona Falinie A. Gaus
Neelanjan Bhowmik
Brian K. S. Isaac-Medina
T. Breckon
VLM
168
5
0
18 Apr 2024
SOHES: Self-supervised Open-world Hierarchical Entity Segmentation
SOHES: Self-supervised Open-world Hierarchical Entity SegmentationInternational Conference on Learning Representations (ICLR), 2024
Shengcao Cao
J. Gu
Jason Kuen
Hao Tan
Ruiyi Zhang
Handong Zhao
A. Nenkova
Liangyan Gui
Tong Sun
Yu Wang
VLMOCL
336
3
0
18 Apr 2024
DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection
DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection
Lewei Yao
Renjie Pi
Jianhua Han
Xiaodan Liang
Hang Xu
Wei Zhang
Zhenguo Li
Dan Xu
VLMObjD
240
43
0
14 Apr 2024
LLM-Seg: Bridging Image Segmentation and Large Language Model Reasoning
LLM-Seg: Bridging Image Segmentation and Large Language Model Reasoning
Junchi Wang
Lei Ke
MLLMLRMVLM
215
59
0
12 Apr 2024
COCONut: Modernizing COCO Segmentation
COCONut: Modernizing COCO Segmentation
XueQing Deng
Qihang Yu
Peng Wang
Xiaohui Shen
Liang-Chieh Chen
182
20
0
12 Apr 2024
Training-free Boost for Open-Vocabulary Object Detection with Confidence
  Aggregation
Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation
Yanhao Zheng
Kai Liu
ObjD
180
3
0
12 Apr 2024
FashionFail: Addressing Failure Cases in Fashion Object Detection and
  Segmentation
FashionFail: Addressing Failure Cases in Fashion Object Detection and Segmentation
Riza Velioglu
Robin Chan
Barbara Hammer
182
1
0
12 Apr 2024
Ferret-v2: An Improved Baseline for Referring and Grounding with Large
  Language Models
Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models
Haotian Zhang
Haoxuan You
Philipp Dufter
Bowen Zhang
Chen Chen
...
Tsu-Jui Fu
William Y. Wang
Shih-Fu Chang
Zhe Gan
Yinfei Yang
ObjDMLLM
241
82
0
11 Apr 2024
ConsistencyDet: A Few-step Denoising Framework for Object Detection Using the Consistency Model
ConsistencyDet: A Few-step Denoising Framework for Object Detection Using the Consistency Model
Lifan Jiang
Zhihui Wang
Changmiao Wang
Ming Li
Jiaxu Leng
DiffM
292
0
0
11 Apr 2024
Retrieval-Augmented Open-Vocabulary Object Detection
Retrieval-Augmented Open-Vocabulary Object Detection
Jooyeon Kim
Eulrang Cho
Sehyung Kim
Hyunwoo J. Kim
VLMObjD
224
20
0
08 Apr 2024
PAT: Pixel-wise Adaptive Training for Long-tailed Segmentation
PAT: Pixel-wise Adaptive Training for Long-tailed Segmentation
Khoi Do
Duong Nguyen
Nguyen-Hoang Tran
Viet Dung Nguyen
219
1
0
08 Apr 2024
Hyperbolic Learning with Synthetic Captions for Open-World Detection
Hyperbolic Learning with Synthetic Captions for Open-World Detection
Fanjie Kong
Yanbei Chen
Jiarui Cai
Davide Modolo
VLMObjD
206
14
0
07 Apr 2024
Inference-Time Rule Eraser: Fair Recognition via Distilling and Removing
  Biased Rules
Inference-Time Rule Eraser: Fair Recognition via Distilling and Removing Biased Rules
Yi Zhang
Dongyuan Lu
Jitao Sang
FaML
298
2
0
07 Apr 2024
Mixed-Query Transformer: A Unified Image Segmentation Architecture
Mixed-Query Transformer: A Unified Image Segmentation Architecture
Pei Wang
Zhaowei Cai
Hao Yang
Ashwin Swaminathan
R. Manmatha
Stefano Soatto
244
3
0
06 Apr 2024
Gen3DSR: Generalizable 3D Scene Reconstruction via Divide and Conquer from a Single View
Gen3DSR: Generalizable 3D Scene Reconstruction via Divide and Conquer from a Single ViewInternational Conference on 3D Vision (3DV), 2024
Andreea Dogaru
M. Ozer
Bernhard Egger
3DGS
333
22
0
04 Apr 2024
DeiT-LT Distillation Strikes Back for Vision Transformer Training on
  Long-Tailed Datasets
DeiT-LT Distillation Strikes Back for Vision Transformer Training on Long-Tailed DatasetsComputer Vision and Pattern Recognition (CVPR), 2024
Harsh Rangwani
Pradipto Mondal
Mayank Mishra
Ashish Ramayee Asokan
R. V. Babu
188
16
0
03 Apr 2024
MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image
  Generation
MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image GenerationComputer Vision and Pattern Recognition (CVPR), 2024
Petru-Daniel Tudosiu
Yongxin Yang
Shifeng Zhang
Fei Chen
Jingyu Sun
Gerasimos Lampouras
Ignacio Iacobacci
Sarah Parisot
226
24
0
03 Apr 2024
ViTamin: Designing Scalable Vision Models in the Vision-Language Era
ViTamin: Designing Scalable Vision Models in the Vision-Language EraComputer Vision and Pattern Recognition (CVPR), 2024
Jienneg Chen
Qihang Yu
Xiaohui Shen
Yaoyao Liu
Liang-Chieh Chen
3DVVLM
385
48
0
02 Apr 2024
Previous
123...678...202122
Next