ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1908.03195
  4. Cited By
LVIS: A Dataset for Large Vocabulary Instance Segmentation
v1v2 (latest)

LVIS: A Dataset for Large Vocabulary Instance Segmentation

Computer Vision and Pattern Recognition (CVPR), 2019
8 August 2019
Agrim Gupta
Piotr Dollár
Ross B. Girshick
    ISegVLM
ArXiv (abs)PDFHTML

Papers citing "LVIS: A Dataset for Large Vocabulary Instance Segmentation"

50 / 1,056 papers shown
Title
SUGAR: Pre-training 3D Visual Representations for Robotics
SUGAR: Pre-training 3D Visual Representations for RoboticsComputer Vision and Pattern Recognition (CVPR), 2024
Shizhe Chen
Ricardo Garcia Pinel
Ivan Laptev
Cordelia Schmid
209
32
0
01 Apr 2024
Rethinking Interactive Image Segmentation with Low Latency, High
  Quality, and Diverse Prompts
Rethinking Interactive Image Segmentation with Low Latency, High Quality, and Diverse Prompts
Qin Liu
Jaemin Cho
Mohit Bansal
Marc Niethammer
VLM
167
26
0
31 Mar 2024
Transformer based Pluralistic Image Completion with Reduced Information
  Loss
Transformer based Pluralistic Image Completion with Reduced Information Loss
Qiankun Liu
Yuqi Jiang
Zhentao Tan
DongDong Chen
Ying Fu
Qi Chu
Gang Hua
Nenghai Yu
ViT
238
22
0
31 Mar 2024
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
Weifeng Lin
Xinyu Wei
Ruichuan An
Shiyang Feng
Bocheng Zou
Yulin Luo
Siyuan Huang
Shanghang Zhang
Jiaming Song
VLM
343
84
0
29 Mar 2024
OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via
  Cycle-Modality Propagation
OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via Cycle-Modality Propagation
Zhenyu Wang
Yali Li
Taichi Liu
Hengshuang Zhao
Shengjin Wang
3DPCObjD
261
15
0
28 Mar 2024
J-CRe3: A Japanese Conversation Dataset for Real-world Reference
  Resolution
J-CRe3: A Japanese Conversation Dataset for Real-world Reference Resolution
Nobuhiro Ueda
Hideko Habe
Yoko Matsui
Akishige Yuguchi
Seiya Kawano
Yasutomo Kawanishi
Sadao Kurohashi
Koichiro Yoshino
150
7
0
28 Mar 2024
Benchmarking Object Detectors with COCO: A New Path Forward
Benchmarking Object Detectors with COCO: A New Path Forward
Shweta Singh
Aayan Yadav
Jitesh Jain
Humphrey Shi
Justin Johnson
Karan Desai
169
21
0
27 Mar 2024
AIDE: An Automatic Data Engine for Object Detection in Autonomous
  Driving
AIDE: An Automatic Data Engine for Object Detection in Autonomous Driving
Mingfu Liang
Jong-Chyi Su
S. Schulter
Sparsh Garg
Shiyu Zhao
Ying Nian Wu
Manmohan Chandraker
VLM
175
29
0
26 Mar 2024
Gradient-based Sampling for Class Imbalanced Semi-supervised Object
  Detection
Gradient-based Sampling for Class Imbalanced Semi-supervised Object DetectionIEEE International Conference on Computer Vision (ICCV), 2023
Jiaming Li
Xiangru Lin
Wei Zhang
Xiao Tan
Yingying Li
Junyu Han
Errui Ding
Jingdong Wang
Guanbin Li
229
9
0
22 Mar 2024
Preventing Catastrophic Forgetting through Memory Networks in Continuous
  Detection
Preventing Catastrophic Forgetting through Memory Networks in Continuous Detection
Gaurav Bhatt
James Ross
Leonid Sigal
CLLVLM
232
7
0
21 Mar 2024
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
Qing Jiang
Feng Li
Zhaoyang Zeng
Tianhe Ren
Shilong Liu
Lei Zhang
VLM
283
78
0
21 Mar 2024
PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model
PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model
Zheng Zhang
Yeyao Ma
Enming Zhang
Xiang Bai
VLMMLLM
251
77
0
21 Mar 2024
Scene-Graph ViT: End-to-End Open-Vocabulary Visual Relationship
  Detection
Scene-Graph ViT: End-to-End Open-Vocabulary Visual Relationship Detection
Tim Salzmann
Markus Ryll
Alex Bewley
Matthias Minderer
265
7
0
21 Mar 2024
CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation
CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation
Wenqi Zhu
Jiale Cao
Jin Xie
Shuangming Yang
Yanwei Pang
VLMCLIP
258
10
0
19 Mar 2024
Fusion Transformer with Object Mask Guidance for Image Forgery Analysis
Fusion Transformer with Object Mask Guidance for Image Forgery Analysis
Dimitrios Karageorgiou
Giorgos Kordopatis-Zilos
Symeon Papadopoulos
ViT
165
11
0
18 Mar 2024
Better (pseudo-)labels for semi-supervised instance segmentation
Better (pseudo-)labels for semi-supervised instance segmentation
Franccois Porcher
Camille Couprie
Marc Szafraniec
Jakob Verbeek
ISeg
150
3
0
18 Mar 2024
NetTrack: Tracking Highly Dynamic Objects with a Net
NetTrack: Tracking Highly Dynamic Objects with a Net
Guang-Zheng Zheng
Shijie Lin
Haobo Zuo
Changhong Fu
Jia Pan
231
23
0
17 Mar 2024
Generative Region-Language Pretraining for Open-Ended Object Detection
Generative Region-Language Pretraining for Open-Ended Object DetectionComputer Vision and Pattern Recognition (CVPR), 2024
Chuang Lin
Yi Jiang
Zhuang Li
Zehuan Yuan
Jianfei Cai
ObjDVLM
194
27
0
15 Mar 2024
Revisiting Adversarial Training under Long-Tailed Distributions
Revisiting Adversarial Training under Long-Tailed DistributionsComputer Vision and Pattern Recognition (CVPR), 2024
Xinli Yue
Ningping Mou
Qian Wang
Lingchen Zhao
AAML
239
13
0
15 Mar 2024
Open-Vocabulary Object Detection with Meta Prompt Representation and
  Instance Contrastive Optimization
Open-Vocabulary Object Detection with Meta Prompt Representation and Instance Contrastive OptimizationBritish Machine Vision Conference (BMVC), 2024
Zhao Wang
Aoxue Li
Fengwei Zhou
Zhenguo Li
Qi Dou
ObjDVLM
180
4
0
14 Mar 2024
Efficient Transferability Assessment for Selection of Pre-trained
  Detectors
Efficient Transferability Assessment for Selection of Pre-trained DetectorsIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Zhao Wang
Aoxue Li
Zhenguo Li
Qi Dou
162
0
0
14 Mar 2024
GiT: Towards Generalist Vision Transformer through Universal Language
  Interface
GiT: Towards Generalist Vision Transformer through Universal Language InterfaceEuropean Conference on Computer Vision (ECCV), 2024
Haiyang Wang
Hao Tang
Li Jiang
Shaoshuai Shi
Muhammad Ferjad Naeem
Jiaming Song
Bernt Schiele
Liwei Wang
VLM
254
22
0
14 Mar 2024
SAM-Lightening: A Lightweight Segment Anything Model with Dilated Flash
  Attention to Achieve 30 times Acceleration
SAM-Lightening: A Lightweight Segment Anything Model with Dilated Flash Attention to Achieve 30 times Acceleration
Yanfei Song
Bangzheng Pu
Peng Wang
Hongxu Jiang
Dong Dong
Yongxiang Cao
Yiqing Shen
VLM
236
16
0
14 Mar 2024
CuVLER: Enhanced Unsupervised Object Discoveries through Exhaustive
  Self-Supervised Transformers
CuVLER: Enhanced Unsupervised Object Discoveries through Exhaustive Self-Supervised TransformersComputer Vision and Pattern Recognition (CVPR), 2024
Shahaf Arica
Or Rubin
Sapir Gershov
S. Laufer
137
12
0
12 Mar 2024
Textual Knowledge Matters: Cross-Modality Co-Teaching for Generalized
  Visual Class Discovery
Textual Knowledge Matters: Cross-Modality Co-Teaching for Generalized Visual Class DiscoveryEuropean Conference on Computer Vision (ECCV), 2024
Haiyang Zheng
Nan Pu
Wenjing Li
Andrii Zadaianchuk
Zhun Zhong
213
15
0
12 Mar 2024
Class Imbalance in Object Detection: An Experimental Diagnosis and Study
  of Mitigation Strategies
Class Imbalance in Object Detection: An Experimental Diagnosis and Study of Mitigation Strategies
Nieves Crasto
ObjD
147
14
0
11 Mar 2024
Real-time Transformer-based Open-Vocabulary Detection with Efficient
  Fusion Head
Real-time Transformer-based Open-Vocabulary Detection with Efficient Fusion Head
Tiancheng Zhao
Peng Liu
Xuan He
Lu Zhang
Kyusong Lee
ObjD
153
19
0
11 Mar 2024
Probabilistic Contrastive Learning for Long-Tailed Visual Recognition
Probabilistic Contrastive Learning for Long-Tailed Visual RecognitionIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Chaoqun Du
Yulin Wang
Shiji Song
Gao Huang
221
66
0
11 Mar 2024
ClickVOS: Click Video Object Segmentation
ClickVOS: Click Video Object Segmentation
Pinxue Guo
Lingyi Hong
Xinyu Zhou
Shuyong Gao
Wanyun Li
Jinglun Li
Zhaoyu Chen
Xiaoqiang Li
Wei Zhang
Wenqiang Zhang
VOS
235
2
0
10 Mar 2024
VastTrack: Vast Category Visual Object Tracking
VastTrack: Vast Category Visual Object Tracking
Liang Peng
Junyuan Gao
Hengrong Du
Weihong Li
Shaohua Dong
Zhipeng Zhang
Heng Fan
Libo Zhang
VLM
282
19
0
06 Mar 2024
RegionGPT: Towards Region Understanding Vision Language Model
RegionGPT: Towards Region Understanding Vision Language Model
Qiushan Guo
Shalini De Mello
Hongxu Yin
Wonmin Byeon
Ka Chun Cheung
Yizhou Yu
Ping Luo
Sifei Liu
VLM
182
68
0
04 Mar 2024
Boosting Box-supervised Instance Segmentation with Pseudo Depth
Boosting Box-supervised Instance Segmentation with Pseudo Depth
Xinyi Yu
Ling Yan
Peng-Tao Jiang
Hao Chen
Bo Li
Lin Yuanbo Wu
Linlin Ou
MDE
246
1
0
02 Mar 2024
When ControlNet Meets Inexplicit Masks: A Case Study of ControlNet on
  its Contour-following Ability
When ControlNet Meets Inexplicit Masks: A Case Study of ControlNet on its Contour-following Ability
Wenjie Xuan
Yufei Xu
Shanshan Zhao
Chaoyue Wang
Juhua Liu
Bo Du
Dacheng Tao
193
10
0
01 Mar 2024
Comparing Importance Sampling Based Methods for Mitigating the Effect of
  Class Imbalance
Comparing Importance Sampling Based Methods for Mitigating the Effect of Class Imbalance
Indu Panigrahi
Richard Zhu
159
1
0
28 Feb 2024
TAMM: TriAdapter Multi-Modal Learning for 3D Shape Understanding
TAMM: TriAdapter Multi-Modal Learning for 3D Shape Understanding
Zhihao Zhang
Shengcao Cao
Yu Wang
202
18
0
28 Feb 2024
UniVS: Unified and Universal Video Segmentation with Prompts as Queries
UniVS: Unified and Universal Video Segmentation with Prompts as Queries
Ming-hui Li
Shuai Li
Xindong Zhang
Lei Zhang
VOS
220
29
0
28 Feb 2024
ShapeLLM: Universal 3D Object Understanding for Embodied Interaction
ShapeLLM: Universal 3D Object Understanding for Embodied Interaction
Zekun Qi
Runpei Dong
Shaochen Zhang
Haoran Geng
Chunrui Han
Zheng Ge
Li Yi
Kaisheng Ma
454
108
0
27 Feb 2024
PANDAS: Prototype-based Novel Class Discovery and Detection
PANDAS: Prototype-based Novel Class Discovery and Detection
Tyler L. Hayes
César R. de Souza
Namil Kim
Jiwon Kim
Riccardo Volpi
Diane Larlus
ObjD
288
4
0
27 Feb 2024
GROUNDHOG: Grounding Large Language Models to Holistic Segmentation
GROUNDHOG: Grounding Large Language Models to Holistic Segmentation
Yichi Zhang
Ziqiao Ma
Xiaofeng Gao
Suhaila Shakiah
Qiaozi Gao
Joyce Chai
MLLMVLM
343
74
0
26 Feb 2024
Selective "Selective Prediction": Reducing Unnecessary Abstention in
  Vision-Language Reasoning
Selective "Selective Prediction": Reducing Unnecessary Abstention in Vision-Language Reasoning
Tejas Srinivasan
Jack Hessel
Tanmay Gupta
Bill Yuchen Lin
Yejin Choi
Jesse Thomason
Khyathi Chandu
259
14
0
23 Feb 2024
The Revolution of Multimodal Large Language Models: A Survey
The Revolution of Multimodal Large Language Models: A Survey
Davide Caffagni
Federico Cocchi
Luca Barsellotti
Nicholas Moratelli
Sara Sarto
Lorenzo Baraldi
Lorenzo Baraldi
Marcella Cornia
Rita Cucchiara
LRMVLM
324
119
0
19 Feb 2024
PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs
PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs
Michael Dorkenwald
Nimrod Barazani
Cees G. M. Snoek
Yuki M. Asano
VLMMLLM
162
14
0
13 Feb 2024
Prismatic VLMs: Investigating the Design Space of Visually-Conditioned
  Language Models
Prismatic VLMs: Investigating the Design Space of Visually-Conditioned Language Models
Siddharth Karamcheti
Suraj Nair
Ashwin Balakrishna
Percy Liang
Thomas Kollar
Dorsa Sadigh
MLLMVLM
256
232
0
12 Feb 2024
InstaGen: Enhancing Object Detection by Training on Synthetic Dataset
InstaGen: Enhancing Object Detection by Training on Synthetic Dataset
Chengjian Feng
Yujie Zhong
Zequn Jie
Weidi Xie
Lin Ma
ObjD
287
33
0
08 Feb 2024
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models
Chris Liu
Renrui Zhang
Longtian Qiu
Siyuan Huang
Weifeng Lin
...
Hao Shao
Pan Lu
Jiaming Song
Yu Qiao
Shiyang Feng
MLLM
449
135
0
08 Feb 2024
EfficientViT-SAM: Accelerated Segment Anything Model Without Accuracy
  Loss
EfficientViT-SAM: Accelerated Segment Anything Model Without Accuracy Loss
Zhuoyang Zhang
Han Cai
Song Han
VLM
235
5
0
07 Feb 2024
LLMs Meet VLMs: Boost Open Vocabulary Object Detection with Fine-grained
  Descriptors
LLMs Meet VLMs: Boost Open Vocabulary Object Detection with Fine-grained Descriptors
Sheng Jin
Xue-Qiu Jiang
Jiaxing Huang
Lewei Lu
Shijian Lu
VLMObjD
152
38
0
07 Feb 2024
Enhancing Embodied Object Detection through Language-Image Pre-training
  and Implicit Object Memory
Enhancing Embodied Object Detection through Language-Image Pre-training and Implicit Object Memory
N. H. Chapman
Feras Dayoub
Will N. Browne
Chris Lehnert
ObjDVLMLM&Ro
184
2
0
06 Feb 2024
VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language
  Navigation
VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language NavigationAAAI Conference on Artificial Intelligence (AAAI), 2024
Jialu Li
Aishwarya Padmakumar
Gaurav Sukhatme
Mohit Bansal
269
10
0
05 Feb 2024
HASSOD: Hierarchical Adaptive Self-Supervised Object Detection
HASSOD: Hierarchical Adaptive Self-Supervised Object DetectionNeural Information Processing Systems (NeurIPS), 2024
Shengcao Cao
Dhiraj Joshi
Liangyan Gui
Yu Wang
197
17
0
05 Feb 2024
Previous
123...789...202122
Next