ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1908.03195
  4. Cited By
LVIS: A Dataset for Large Vocabulary Instance Segmentation
v1v2 (latest)

LVIS: A Dataset for Large Vocabulary Instance Segmentation

Computer Vision and Pattern Recognition (CVPR), 2019
8 August 2019
Agrim Gupta
Piotr Dollár
Ross B. Girshick
    ISegVLM
ArXiv (abs)PDFHTML

Papers citing "LVIS: A Dataset for Large Vocabulary Instance Segmentation"

50 / 1,058 papers shown
Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5%
  Parameters and 90% Performance
Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance
Zhangwei Gao
Zhe Chen
Erfei Cui
Yiming Ren
Weiyun Wang
...
Lewei Lu
Tong Lu
Yu Qiao
Jifeng Dai
Wenhai Wang
VLM
402
87
0
21 Oct 2024
Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object
  Detection Considering Text Describability
Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text Describability
Yusuke Hosoya
Masanori Suganuma
Takayuki Okatani
ObjD
281
0
0
20 Oct 2024
LocateBench: Evaluating the Locating Ability of Vision Language Models
LocateBench: Evaluating the Locating Ability of Vision Language Models
Ting-Rui Chiang
Joshua Robinson
Xinyan Velocity Yu
Dani Yogatama
VLMELM
243
0
0
17 Oct 2024
Configurable Embodied Data Generation for Class-Agnostic RGB-D Video
  Segmentation
Configurable Embodied Data Generation for Class-Agnostic RGB-D Video SegmentationIEEE Robotics and Automation Letters (RA-L), 2024
Anthony Opipari
Aravindhan K. Krishnan
Shreekant Gayaka
Min Sun
Cheng-Hao Kuo
Arnie Sen
Odest Chadwicke Jenkins
VOS
255
1
0
16 Oct 2024
LocoMotion: Learning Motion-Focused Video-Language Representations
LocoMotion: Learning Motion-Focused Video-Language RepresentationsAsian Conference on Computer Vision (ACCV), 2024
Hazel Doughty
Fida Mohammad Thoker
Cees G. M. Snoek
373
3
0
15 Oct 2024
OVS Meets Continual Learning: Towards Sustainable Open-Vocabulary Segmentation
OVS Meets Continual Learning: Towards Sustainable Open-Vocabulary Segmentation
Dongjun Hwang
Yejin Kim
Junsuk Choe
Seong Joon Oh
Junsuk Choe
VLM
723
0
0
15 Oct 2024
Fractal Calibration for long-tailed object detection
Fractal Calibration for long-tailed object detectionComputer Vision and Pattern Recognition (CVPR), 2024
Konstantinos Panagiotis Alexandridis
Ismail Elezi
Jiankang Deng
Anh H. Nguyen
Shan Luo
1.0K
3
0
15 Oct 2024
AutoTurb: Using Large Language Models for Automatic Algebraic Model
  Discovery of Turbulence Closure
AutoTurb: Using Large Language Models for Automatic Algebraic Model Discovery of Turbulence Closure
Yu Zhang
Kefeng Zheng
Fei Liu
Qingfu Zhang
Zhenkun Wang
250
9
0
14 Oct 2024
big.LITTLE Vision Transformer for Efficient Visual Recognition
big.LITTLE Vision Transformer for Efficient Visual Recognition
He Guo
Yulong Wang
Zixuan Ye
Jifeng Dai
Yuwen Xiong
ViT
262
1
0
14 Oct 2024
Locality Alignment Improves Vision-Language Models
Locality Alignment Improves Vision-Language ModelsInternational Conference on Learning Representations (ICLR), 2024
Ian Covert
Tony Sun
James Zou
Tatsunori Hashimoto
VLM
592
11
0
14 Oct 2024
Boosting Open-Vocabulary Object Detection by Handling Background Samples
Boosting Open-Vocabulary Object Detection by Handling Background Samples
Ruizhe Zeng
Lu Zhang
Xu Yang
Zhiyong Liu
VLMObjD
194
1
0
11 Oct 2024
DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing
  Attention
DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing AttentionAsian Conference on Computer Vision (ACCV), 2024
Nguyen Huu Bao Long
Chenyu Zhang
Yuzhi Shi
Tsubasa Hirakawa
Takayoshi Yamashita
Tohgoroh Matsui
H. Fujiyoshi
221
10
0
11 Oct 2024
Interactive4D: Interactive 4D LiDAR Segmentation
Interactive4D: Interactive 4D LiDAR SegmentationIEEE International Conference on Robotics and Automation (ICRA), 2024
Ilya Fradlin
Idil Esen Zulfikar
Kadir Yilmaz
Theodora Kontogianni
Bastian Leibe
305
4
0
10 Oct 2024
Training-Free Open-Ended Object Detection and Segmentation via Attention
  as Prompts
Training-Free Open-Ended Object Detection and Segmentation via Attention as PromptsNeural Information Processing Systems (NeurIPS), 2024
Zhiwei Lin
Yongtao Wang
Zhi Tang
ObjDVLM
207
15
0
08 Oct 2024
A Simple Image Segmentation Framework via In-Context Examples
A Simple Image Segmentation Framework via In-Context ExamplesNeural Information Processing Systems (NeurIPS), 2024
Yang Liu
Chenchen Jing
Hengtao Li
Huanyi Zheng
Hao Chen
Xinlong Wang
Chunhua Shen
178
12
0
07 Oct 2024
On Efficient Variants of Segment Anything Model: A Survey
On Efficient Variants of Segment Anything Model: A SurveyInternational Journal of Computer Vision (IJCV), 2024
Xiaorui Sun
Jing Liu
Mengqi Li
Xiaofeng Zhu
Ping Hu
VLM
530
19
0
07 Oct 2024
AHA: A Vision-Language-Model for Detecting and Reasoning Over Failures
  in Robotic Manipulation
AHA: A Vision-Language-Model for Detecting and Reasoning Over Failures in Robotic ManipulationInternational Conference on Learning Representations (ICLR), 2024
Jiafei Duan
Wilbert Pumacay
Nishanth Kumar
Yi Ru Wang
Shulin Tian
Wentao Yuan
Ranjay Krishna
Dieter Fox
Ajay Mandlekar
Yijie Guo
VLMLRM
268
80
0
01 Oct 2024
MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning
MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning
Haotian Zhang
Mingfei Gao
Zhe Gan
Philipp Dufter
Nina Wenzel
...
Haoxuan You
Zirui Wang
Afshin Dehghan
Peter Grasch
Yinfei Yang
VLMMLLM
303
66
1
30 Sep 2024
ProMerge: Prompt and Merge for Unsupervised Instance Segmentation
ProMerge: Prompt and Merge for Unsupervised Instance SegmentationEuropean Conference on Computer Vision (ECCV), 2024
Dylan Li
Gyungin Shin
218
9
0
27 Sep 2024
A Novel Unified Architecture for Low-Shot Counting by Detection and
  Segmentation
A Novel Unified Architecture for Low-Shot Counting by Detection and SegmentationNeural Information Processing Systems (NeurIPS), 2024
Jer Pelhan
A. Lukežič
Vitjan Zavrtanik
Matej Kristan
ObjD
292
15
0
27 Sep 2024
You Only Speak Once to See
You Only Speak Once to SeeIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Wenhao Yang
Jianguo Wei
Wenhuan Lu
Lei Li
VOS
228
4
0
27 Sep 2024
Visual Concept Networks: A Graph-Based Approach to Detecting Anomalous
  Data in Deep Neural Networks
Visual Concept Networks: A Graph-Based Approach to Detecting Anomalous Data in Deep Neural NetworksInternational Conferences on Pattern Recognition and Artificial Intelligence (ICCPRAI), 2024
Debargha Ganguly
Debayan Gupta
Vipin Chaudhary
GNN
181
1
0
26 Sep 2024
Search and Detect: Training-Free Long Tail Object Detection via
  Web-Image Retrieval
Search and Detect: Training-Free Long Tail Object Detection via Web-Image RetrievalComputer Vision and Pattern Recognition (CVPR), 2024
Mankeerat Sidhu
Hetarth Chopra
Ansel Blume
Jeonghwan Kim
Revanth Gangi Reddy
Heng Ji
ObjDVLM
184
2
0
26 Sep 2024
DARE: Diverse Visual Question Answering with Robustness Evaluation
DARE: Diverse Visual Question Answering with Robustness EvaluationTransactions of the Association for Computational Linguistics (TACL), 2024
Hannah Sterz
Jonas Pfeiffer
Ivan Vulić
OODVLM
346
4
0
26 Sep 2024
Episodic Memory Verbalization using Hierarchical Representations of Life-Long Robot Experience
Episodic Memory Verbalization using Hierarchical Representations of Life-Long Robot Experience
Leonard Barmann
Chad DeChant
Joana Plewnia
Fabian Peller-Konrad
Daniel Bauer
Tamim Asfour
Alex Waibel
LM&Ro
427
4
0
26 Sep 2024
SSE: Multimodal Semantic Data Selection and Enrichment for
  Industrial-scale Data Assimilation
SSE: Multimodal Semantic Data Selection and Enrichment for Industrial-scale Data AssimilationKnowledge Discovery and Data Mining (KDD), 2024
Maying Shen
Nadine Chang
Sifei Liu
Jose M. Alvarez
247
4
0
20 Sep 2024
GraspSAM: When Segment Anything Model Meets Grasp Detection
GraspSAM: When Segment Anything Model Meets Grasp DetectionIEEE International Conference on Robotics and Automation (ICRA), 2024
Sangjun Noh
Jongwon Kim
Dongwoo Nam
Seunghyeok Back
Raeyoung Kang
Kyoobin Lee
VLM
354
11
0
19 Sep 2024
LPT++: Efficient Training on Mixture of Long-tailed Experts
LPT++: Efficient Training on Mixture of Long-tailed Experts
Bowen Dong
Pan Zhou
W. Zuo
VLM
232
0
0
17 Sep 2024
SLAck: Semantic, Location, and Appearance Aware Open-Vocabulary Tracking
SLAck: Semantic, Location, and Appearance Aware Open-Vocabulary TrackingEuropean Conference on Computer Vision (ECCV), 2024
Siyuan Li
Lei Ke
Yung-Hsu Yang
Luigi Piccinelli
Mattia Segu
Martin Danelljan
Luc Van Gool
VLM
215
8
0
17 Sep 2024
Label Convergence: Defining an Upper Performance Bound in Object Recognition through Contradictory Annotations
Label Convergence: Defining an Upper Performance Bound in Object Recognition through Contradictory AnnotationsIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
David Tschirschwitz
Volker Rodehorst
289
2
0
14 Sep 2024
Associate Everything Detected: Facilitating Tracking-by-Detection to the Unknown
Associate Everything Detected: Facilitating Tracking-by-Detection to the UnknownIEEE Transactions on Image Processing (TIP), 2024
Zimeng Fang
Chao Liang
Xue Zhou
Shuyuan Zhu
Xi Li
277
3
0
14 Sep 2024
Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary
  Detection
Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary DetectionIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Haoxuan Wang
Qu He
Jinlong Peng
Hao Yang
Mingmin Chi
Yabiao Wang
Mamba
278
9
0
13 Sep 2024
GroundingBooth: Grounding Text-to-Image Customization
GroundingBooth: Grounding Text-to-Image Customization
Zhexiao Xiong
Wei Xiong
Jing Shi
Chentao Song
Yizhi Song
Nathan Jacobs
DiffM
434
12
0
13 Sep 2024
From COCO to COCO-FP: A Deep Dive into Background False Positives for
  COCO Detectors
From COCO to COCO-FP: A Deep Dive into Background False Positives for COCO Detectors
Longfei Liu
Wen Guo
Shijie Huang
Cheng Li
Xi Shen
ObjD
247
1
0
12 Sep 2024
Rethinking The Training And Evaluation of Rich-Context Layout-to-Image Generation
Rethinking The Training And Evaluation of Rich-Context Layout-to-Image GenerationNeural Information Processing Systems (NeurIPS), 2024
Jiaxin Cheng
Zixu Zhao
Tong He
Tianjun Xiao
Yicong Zhou
Zheng Zhang
DiffM
382
5
0
07 Sep 2024
FrozenSeg: Harmonizing Frozen Foundation Models for Open-Vocabulary
  Segmentation
FrozenSeg: Harmonizing Frozen Foundation Models for Open-Vocabulary Segmentation
Xi Chen
Haosen Yang
Sheng Jin
Xiatian Zhu
Huanjin Yao
VLM
244
7
0
05 Sep 2024
Semantically Controllable Augmentations for Generalizable Robot Learning
Semantically Controllable Augmentations for Generalizable Robot Learning
Zoey Chen
Zhao Mandi
Homanga Bharadhwaj
Mohit Sharma
Shuran Song
Abhishek Gupta
Vikash Kumar
LM&Ro
339
19
0
02 Sep 2024
Anno-incomplete Multi-dataset Detection
Anno-incomplete Multi-dataset Detection
Yiran Xu
Haoxiang Zhong
Kai Wu
Jialin Li
Yong Liu
Chengjie Wang
Shu-Tao Xia
Hongen Liao
ObjD
163
0
0
29 Aug 2024
More Pictures Say More: Visual Intersection Network for Open Set Object
  Detection
More Pictures Say More: Visual Intersection Network for Open Set Object Detection
Bingcheng Dong
Yuning Ding
Jinrong Zhang
Sifan Zhang
Shenglan Liu
ObjD
169
0
0
26 Aug 2024
A Survey of Embodied Learning for Object-Centric Robotic Manipulation
A Survey of Embodied Learning for Object-Centric Robotic ManipulationMachine Intelligence Research (MIR), 2024
Ying Zheng
Lei Yao
Yuejiao Su
Yi Zhang
Yi Wang
Sicheng Zhao
Yiyi Zhang
Lap-Pui Chau
LM&Ro
243
23
0
21 Aug 2024
Reflex-Based Open-Vocabulary Navigation without Prior Knowledge Using
  Omnidirectional Camera and Multiple Vision-Language Models
Reflex-Based Open-Vocabulary Navigation without Prior Knowledge Using Omnidirectional Camera and Multiple Vision-Language Models
Kento Kawaharazuka
Yoshiki Obinata
Naoaki Kanazawa
Naoto Tsukamoto
Kei Okada
Masayuki Inaba
LM&Ro
152
3
0
21 Aug 2024
SAM-REF: Introducing Image-Prompt Synergy during Interaction for Detail Enhancement in the Segment Anything Model
SAM-REF: Introducing Image-Prompt Synergy during Interaction for Detail Enhancement in the Segment Anything ModelComputer Vision and Pattern Recognition (CVPR), 2024
Chongkai Yu
Anqi Li
Xiaochao Qu
Xiaochao Qu
Chengjing Wu
Luoqi Liu
Xiaolin Hu
VLM
279
2
0
21 Aug 2024
OE3DIS: Open-Ended 3D Point Cloud Instance Segmentation
OE3DIS: Open-Ended 3D Point Cloud Instance Segmentation
P. Nguyen
Minh Luu
Anh Tran
Cuong Pham
Khoi Duc Minh Nguyen
3DPC
286
1
0
21 Aug 2024
OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding
OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding
Youjun Zhao
Jiaying Lin
Shuquan Ye
Qianshi Pang
Rynson W. H. Lau
424
4
0
20 Aug 2024
Image-Based Leopard Seal Recognition: Approaches and Challenges in
  Current Automated Systems
Image-Based Leopard Seal Recognition: Approaches and Challenges in Current Automated Systems
Jorge Yero Salazar
Pablo Rivas
Renato Borras-Chavez
Sarah Kienle
138
0
0
14 Aug 2024
ClickAttention: Click Region Similarity Guided Interactive Segmentation
ClickAttention: Click Region Similarity Guided Interactive SegmentationNeural Networks (NN), 2024
Long Xu
Shanghong Li
Yongquan Chen
Junkang Chen
Rui Huang
Feng Wu
254
0
0
12 Aug 2024
In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic
  Segmentation
In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic SegmentationEuropean Conference on Computer Vision (ECCV), 2024
Dahyun Kang
Minsu Cho
ObjDVLM
390
24
0
09 Aug 2024
Cross-Domain Semantic Segmentation on Inconsistent Taxonomy using VLMs
Cross-Domain Semantic Segmentation on Inconsistent Taxonomy using VLMsEuropean Conference on Computer Vision (ECCV), 2024
Jeongkee Lim
Yusung Kim
307
5
0
05 Aug 2024
SAM 2: Segment Anything in Images and Videos
SAM 2: Segment Anything in Images and VideosInternational Conference on Learning Representations (ICLR), 2024
Nikhila Ravi
Valentin Gabeur
Yuan-Ting Hu
Ronghang Hu
Chaitanya K. Ryali
...
Nicolas Carion
Chao-Yuan Wu
Ross B. Girshick
Piotr Dollár
Christoph Feichtenhofer
VLMMLLM
502
2,234
0
01 Aug 2024
A Systematic Review on Long-Tailed Learning
A Systematic Review on Long-Tailed LearningIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024
Chongsheng Zhang
G. Almpanidis
Gaojuan Fan
Binquan Deng
Yanbo Zhang
Ji Liu
Aouaidjia Kamel
Paolo Soda
Joao Gama
406
26
0
01 Aug 2024
Previous
123456...202122
Next