Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1908.03195
Cited By
v1
v2 (latest)
LVIS: A Dataset for Large Vocabulary Instance Segmentation
Computer Vision and Pattern Recognition (CVPR), 2019
8 August 2019
Agrim Gupta
Piotr Dollár
Ross B. Girshick
ISeg
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"LVIS: A Dataset for Large Vocabulary Instance Segmentation"
50 / 1,058 papers shown
Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance
Zhangwei Gao
Zhe Chen
Erfei Cui
Yiming Ren
Weiyun Wang
...
Lewei Lu
Tong Lu
Yu Qiao
Jifeng Dai
Wenhai Wang
VLM
402
87
0
21 Oct 2024
Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text Describability
Yusuke Hosoya
Masanori Suganuma
Takayuki Okatani
ObjD
281
0
0
20 Oct 2024
LocateBench: Evaluating the Locating Ability of Vision Language Models
Ting-Rui Chiang
Joshua Robinson
Xinyan Velocity Yu
Dani Yogatama
VLM
ELM
243
0
0
17 Oct 2024
Configurable Embodied Data Generation for Class-Agnostic RGB-D Video Segmentation
IEEE Robotics and Automation Letters (RA-L), 2024
Anthony Opipari
Aravindhan K. Krishnan
Shreekant Gayaka
Min Sun
Cheng-Hao Kuo
Arnie Sen
Odest Chadwicke Jenkins
VOS
255
1
0
16 Oct 2024
LocoMotion: Learning Motion-Focused Video-Language Representations
Asian Conference on Computer Vision (ACCV), 2024
Hazel Doughty
Fida Mohammad Thoker
Cees G. M. Snoek
373
3
0
15 Oct 2024
OVS Meets Continual Learning: Towards Sustainable Open-Vocabulary Segmentation
Dongjun Hwang
Yejin Kim
Junsuk Choe
Seong Joon Oh
Junsuk Choe
VLM
723
0
0
15 Oct 2024
Fractal Calibration for long-tailed object detection
Computer Vision and Pattern Recognition (CVPR), 2024
Konstantinos Panagiotis Alexandridis
Ismail Elezi
Jiankang Deng
Anh H. Nguyen
Shan Luo
1.0K
3
0
15 Oct 2024
AutoTurb: Using Large Language Models for Automatic Algebraic Model Discovery of Turbulence Closure
Yu Zhang
Kefeng Zheng
Fei Liu
Qingfu Zhang
Zhenkun Wang
250
9
0
14 Oct 2024
big.LITTLE Vision Transformer for Efficient Visual Recognition
He Guo
Yulong Wang
Zixuan Ye
Jifeng Dai
Yuwen Xiong
ViT
262
1
0
14 Oct 2024
Locality Alignment Improves Vision-Language Models
International Conference on Learning Representations (ICLR), 2024
Ian Covert
Tony Sun
James Zou
Tatsunori Hashimoto
VLM
592
11
0
14 Oct 2024
Boosting Open-Vocabulary Object Detection by Handling Background Samples
Ruizhe Zeng
Lu Zhang
Xu Yang
Zhiyong Liu
VLM
ObjD
194
1
0
11 Oct 2024
DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention
Asian Conference on Computer Vision (ACCV), 2024
Nguyen Huu Bao Long
Chenyu Zhang
Yuzhi Shi
Tsubasa Hirakawa
Takayoshi Yamashita
Tohgoroh Matsui
H. Fujiyoshi
221
10
0
11 Oct 2024
Interactive4D: Interactive 4D LiDAR Segmentation
IEEE International Conference on Robotics and Automation (ICRA), 2024
Ilya Fradlin
Idil Esen Zulfikar
Kadir Yilmaz
Theodora Kontogianni
Bastian Leibe
305
4
0
10 Oct 2024
Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts
Neural Information Processing Systems (NeurIPS), 2024
Zhiwei Lin
Yongtao Wang
Zhi Tang
ObjD
VLM
207
15
0
08 Oct 2024
A Simple Image Segmentation Framework via In-Context Examples
Neural Information Processing Systems (NeurIPS), 2024
Yang Liu
Chenchen Jing
Hengtao Li
Huanyi Zheng
Hao Chen
Xinlong Wang
Chunhua Shen
178
12
0
07 Oct 2024
On Efficient Variants of Segment Anything Model: A Survey
International Journal of Computer Vision (IJCV), 2024
Xiaorui Sun
Jing Liu
Mengqi Li
Xiaofeng Zhu
Ping Hu
VLM
530
19
0
07 Oct 2024
AHA: A Vision-Language-Model for Detecting and Reasoning Over Failures in Robotic Manipulation
International Conference on Learning Representations (ICLR), 2024
Jiafei Duan
Wilbert Pumacay
Nishanth Kumar
Yi Ru Wang
Shulin Tian
Wentao Yuan
Ranjay Krishna
Dieter Fox
Ajay Mandlekar
Yijie Guo
VLM
LRM
268
80
0
01 Oct 2024
MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning
Haotian Zhang
Mingfei Gao
Zhe Gan
Philipp Dufter
Nina Wenzel
...
Haoxuan You
Zirui Wang
Afshin Dehghan
Peter Grasch
Yinfei Yang
VLM
MLLM
303
66
1
30 Sep 2024
ProMerge: Prompt and Merge for Unsupervised Instance Segmentation
European Conference on Computer Vision (ECCV), 2024
Dylan Li
Gyungin Shin
218
9
0
27 Sep 2024
A Novel Unified Architecture for Low-Shot Counting by Detection and Segmentation
Neural Information Processing Systems (NeurIPS), 2024
Jer Pelhan
A. Lukežič
Vitjan Zavrtanik
Matej Kristan
ObjD
292
15
0
27 Sep 2024
You Only Speak Once to See
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Wenhao Yang
Jianguo Wei
Wenhuan Lu
Lei Li
VOS
228
4
0
27 Sep 2024
Visual Concept Networks: A Graph-Based Approach to Detecting Anomalous Data in Deep Neural Networks
International Conferences on Pattern Recognition and Artificial Intelligence (ICCPRAI), 2024
Debargha Ganguly
Debayan Gupta
Vipin Chaudhary
GNN
181
1
0
26 Sep 2024
Search and Detect: Training-Free Long Tail Object Detection via Web-Image Retrieval
Computer Vision and Pattern Recognition (CVPR), 2024
Mankeerat Sidhu
Hetarth Chopra
Ansel Blume
Jeonghwan Kim
Revanth Gangi Reddy
Heng Ji
ObjD
VLM
184
2
0
26 Sep 2024
DARE: Diverse Visual Question Answering with Robustness Evaluation
Transactions of the Association for Computational Linguistics (TACL), 2024
Hannah Sterz
Jonas Pfeiffer
Ivan Vulić
OOD
VLM
346
4
0
26 Sep 2024
Episodic Memory Verbalization using Hierarchical Representations of Life-Long Robot Experience
Leonard Barmann
Chad DeChant
Joana Plewnia
Fabian Peller-Konrad
Daniel Bauer
Tamim Asfour
Alex Waibel
LM&Ro
427
4
0
26 Sep 2024
SSE: Multimodal Semantic Data Selection and Enrichment for Industrial-scale Data Assimilation
Knowledge Discovery and Data Mining (KDD), 2024
Maying Shen
Nadine Chang
Sifei Liu
Jose M. Alvarez
247
4
0
20 Sep 2024
GraspSAM: When Segment Anything Model Meets Grasp Detection
IEEE International Conference on Robotics and Automation (ICRA), 2024
Sangjun Noh
Jongwon Kim
Dongwoo Nam
Seunghyeok Back
Raeyoung Kang
Kyoobin Lee
VLM
354
11
0
19 Sep 2024
LPT++: Efficient Training on Mixture of Long-tailed Experts
Bowen Dong
Pan Zhou
W. Zuo
VLM
232
0
0
17 Sep 2024
SLAck: Semantic, Location, and Appearance Aware Open-Vocabulary Tracking
European Conference on Computer Vision (ECCV), 2024
Siyuan Li
Lei Ke
Yung-Hsu Yang
Luigi Piccinelli
Mattia Segu
Martin Danelljan
Luc Van Gool
VLM
215
8
0
17 Sep 2024
Label Convergence: Defining an Upper Performance Bound in Object Recognition through Contradictory Annotations
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
David Tschirschwitz
Volker Rodehorst
289
2
0
14 Sep 2024
Associate Everything Detected: Facilitating Tracking-by-Detection to the Unknown
IEEE Transactions on Image Processing (TIP), 2024
Zimeng Fang
Chao Liang
Xue Zhou
Shuyuan Zhu
Xi Li
277
3
0
14 Sep 2024
Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Haoxuan Wang
Qu He
Jinlong Peng
Hao Yang
Mingmin Chi
Yabiao Wang
Mamba
278
9
0
13 Sep 2024
GroundingBooth: Grounding Text-to-Image Customization
Zhexiao Xiong
Wei Xiong
Jing Shi
Chentao Song
Yizhi Song
Nathan Jacobs
DiffM
434
12
0
13 Sep 2024
From COCO to COCO-FP: A Deep Dive into Background False Positives for COCO Detectors
Longfei Liu
Wen Guo
Shijie Huang
Cheng Li
Xi Shen
ObjD
247
1
0
12 Sep 2024
Rethinking The Training And Evaluation of Rich-Context Layout-to-Image Generation
Neural Information Processing Systems (NeurIPS), 2024
Jiaxin Cheng
Zixu Zhao
Tong He
Tianjun Xiao
Yicong Zhou
Zheng Zhang
DiffM
382
5
0
07 Sep 2024
FrozenSeg: Harmonizing Frozen Foundation Models for Open-Vocabulary Segmentation
Xi Chen
Haosen Yang
Sheng Jin
Xiatian Zhu
Huanjin Yao
VLM
244
7
0
05 Sep 2024
Semantically Controllable Augmentations for Generalizable Robot Learning
Zoey Chen
Zhao Mandi
Homanga Bharadhwaj
Mohit Sharma
Shuran Song
Abhishek Gupta
Vikash Kumar
LM&Ro
339
19
0
02 Sep 2024
Anno-incomplete Multi-dataset Detection
Yiran Xu
Haoxiang Zhong
Kai Wu
Jialin Li
Yong Liu
Chengjie Wang
Shu-Tao Xia
Hongen Liao
ObjD
163
0
0
29 Aug 2024
More Pictures Say More: Visual Intersection Network for Open Set Object Detection
Bingcheng Dong
Yuning Ding
Jinrong Zhang
Sifan Zhang
Shenglan Liu
ObjD
169
0
0
26 Aug 2024
A Survey of Embodied Learning for Object-Centric Robotic Manipulation
Machine Intelligence Research (MIR), 2024
Ying Zheng
Lei Yao
Yuejiao Su
Yi Zhang
Yi Wang
Sicheng Zhao
Yiyi Zhang
Lap-Pui Chau
LM&Ro
243
23
0
21 Aug 2024
Reflex-Based Open-Vocabulary Navigation without Prior Knowledge Using Omnidirectional Camera and Multiple Vision-Language Models
Kento Kawaharazuka
Yoshiki Obinata
Naoaki Kanazawa
Naoto Tsukamoto
Kei Okada
Masayuki Inaba
LM&Ro
152
3
0
21 Aug 2024
SAM-REF: Introducing Image-Prompt Synergy during Interaction for Detail Enhancement in the Segment Anything Model
Computer Vision and Pattern Recognition (CVPR), 2024
Chongkai Yu
Anqi Li
Xiaochao Qu
Xiaochao Qu
Chengjing Wu
Luoqi Liu
Xiaolin Hu
VLM
279
2
0
21 Aug 2024
OE3DIS: Open-Ended 3D Point Cloud Instance Segmentation
P. Nguyen
Minh Luu
Anh Tran
Cuong Pham
Khoi Duc Minh Nguyen
3DPC
286
1
0
21 Aug 2024
OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding
Youjun Zhao
Jiaying Lin
Shuquan Ye
Qianshi Pang
Rynson W. H. Lau
424
4
0
20 Aug 2024
Image-Based Leopard Seal Recognition: Approaches and Challenges in Current Automated Systems
Jorge Yero Salazar
Pablo Rivas
Renato Borras-Chavez
Sarah Kienle
138
0
0
14 Aug 2024
ClickAttention: Click Region Similarity Guided Interactive Segmentation
Neural Networks (NN), 2024
Long Xu
Shanghong Li
Yongquan Chen
Junkang Chen
Rui Huang
Feng Wu
254
0
0
12 Aug 2024
In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation
European Conference on Computer Vision (ECCV), 2024
Dahyun Kang
Minsu Cho
ObjD
VLM
390
24
0
09 Aug 2024
Cross-Domain Semantic Segmentation on Inconsistent Taxonomy using VLMs
European Conference on Computer Vision (ECCV), 2024
Jeongkee Lim
Yusung Kim
307
5
0
05 Aug 2024
SAM 2: Segment Anything in Images and Videos
International Conference on Learning Representations (ICLR), 2024
Nikhila Ravi
Valentin Gabeur
Yuan-Ting Hu
Ronghang Hu
Chaitanya K. Ryali
...
Nicolas Carion
Chao-Yuan Wu
Ross B. Girshick
Piotr Dollár
Christoph Feichtenhofer
VLM
MLLM
502
2,234
0
01 Aug 2024
A Systematic Review on Long-Tailed Learning
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024
Chongsheng Zhang
G. Almpanidis
Gaojuan Fan
Binquan Deng
Yanbo Zhang
Ji Liu
Aouaidjia Kamel
Paolo Soda
Joao Gama
406
26
0
01 Aug 2024
Previous
1
2
3
4
5
6
...
20
21
22
Next