1908.03195
LVIS: A Dataset for Large Vocabulary Instance Segmentation
Computer Vision and Pattern Recognition (CVPR), 2019
8 August 2019
Agrim Gupta
Piotr Dollár
Ross B. Girshick
ISeg
VLM
Papers citing
"LVIS: A Dataset for Large Vocabulary Instance Segmentation"
50 / 1,058 papers shown
SqueezeSAM: User friendly mobile interactive segmentation
Bala Varadarajan
Bilge Soran
Forrest N. Iandola
Xiaoyu Xiang
Yunyang Xiong
Lemeng Wu
Chenchen Zhu
Raghuraman Krishnamoorthi
Vikas Chandra
VLM
281
5
0
11 Dec 2023
EdgeSAM: Prompt-In-the-Loop Distillation for SAM
Chong Zhou
Xiangtai Li
Chen Change Loy
Bo Dai
VLM
294
56
0
11 Dec 2023
Localized Symbolic Knowledge Distillation for Visual Commonsense Models
Neural Information Processing Systems (NeurIPS), 2023
Jinho Park
Jack Hessel
Khyathi Chandu
Paul Pu Liang
Ximing Lu
...
Youngjae Yu
Qiuyuan Huang
Jianfeng Gao
Ali Farhadi
Yejin Choi
VLM
268
13
0
08 Dec 2023
Gen2Det: Generate to Detect
Saksham Suri
Fanyi Xiao
Animesh Sinha
Sean Culatana
Raghuraman Krishnamoorthi
Chenchen Zhu
Abhinav Shrivastava
VLM
DiffM
313
12
0
07 Dec 2023
GPT4SGG: Synthesizing Scene Graphs from Holistic and Region-specific Narratives
Zuyao Chen
Jinlin Wu
Zhen Lei
Zhaoxiang Zhang
Changwen Chen
289
5
0
07 Dec 2023
Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
Zeyi Sun
Ye Fang
Tong Wu
Pan Zhang
Yuhang Zang
Shu Kong
Yuanjun Xiong
Dahua Lin
Yuan Liu
VLM
CLIP
351
162
0
06 Dec 2023
SO-NeRF: Active View Planning for NeRF using Surrogate Objectives
Keifer Lee
Shubham Gupta
Sunglyoung Kim
Bhargav Makwana
Chao-Yeh Chen
Chen Feng
196
8
0
06 Dec 2023
GPT4Point: A Unified Framework for Point-Language Understanding and Generation
Computer Vision and Pattern Recognition (CVPR), 2023
Zhangyang Qi
Ye Fang
Zeyi Sun
Xiaoyang Wu
Tong Wu
Yuan Liu
Dahua Lin
Hengshuang Zhao
MLLM
453
36
0
05 Dec 2023
Aligning and Prompting Everything All at Once for Universal Visual Perception
Computer Vision and Pattern Recognition (CVPR), 2023
Chunjiang Ge
Chaoyou Fu
Peixian Chen
Mengdan Zhang
Ke Li
Xing Sun
Yunsheng Wu
Shaohui Lin
Rongrong Ji
VLM
ObjD
287
64
0
04 Dec 2023
Learning Pseudo-Labeler beyond Noun Concepts for Open-Vocabulary Object Detection
Sunghun Kang
Junbum Cha
Jonghwan Mun
Byungseok Roh
Chang D. Yoo
VLM
ObjD
191
2
0
04 Dec 2023
Behind the Magic, MERLIM: Multi-modal Evaluation Benchmark for Large Image-Language Models
Andrés Villa
Juan Carlos León Alcázar
Alvaro Soto
Bernard Ghanem
MLLM
VLM
292
18
0
03 Dec 2023
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
Computer Vision and Pattern Recognition (CVPR), 2023
Yunyang Xiong
Bala Varadarajan
Lemeng Wu
Xiaoyu Xiang
Fanyi Xiao
...
Dilin Wang
Fei Sun
Forrest N. Iandola
Raghuraman Krishnamoorthi
Vikas Chandra
VLM
368
235
0
01 Dec 2023
TrackDiffusion: Tracklet-Conditioned Video Generation via Diffusion Models
Pengxiang Li
Kai Chen
Zhili Liu
Ruiyuan Gao
Lanqing Hong
Guo Zhou
Hua Yao
Dit-Yan Yeung
Huchuan Lu
Xu Jia
VGen
DiffM
186
0
0
01 Dec 2023
Language-conditioned Detection Transformer
Computer Vision and Pattern Recognition (CVPR), 2023
Jang Hyun Cho
Philipp Krahenbuhl
VLM
ObjD
187
5
0
29 Nov 2023
Leveraging VLM-Based Pipelines to Annotate 3D Objects
International Conference on Machine Learning (ICML), 2023
Rishabh Kabra
Loic Matthey
Alexander Lerchner
Niloy J. Mitra
274
9
0
29 Nov 2023
The devil is in the fine-grained details: Evaluating open-vocabulary object detectors for fine-grained understanding
Computer Vision and Pattern Recognition (CVPR), 2023
Lorenzo Bianchi
F. Carrara
Nicola Messina
Claudio Gennaro
Fabrizio Falchi
ObjD
349
24
0
29 Nov 2023
ViT-Lens: Towards Omni-modal Representations
Computer Vision and Pattern Recognition (CVPR), 2023
Weixian Lei
Yixiao Ge
Kun Yi
Jianfeng Zhang
Difei Gao
Dylan Sun
Yuying Ge
Ying Shan
Mike Zheng Shou
200
32
0
27 Nov 2023
EVCap: Retrieval-Augmented Image Captioning with External Visual-Name Memory for Open-World Comprehension
Computer Vision and Pattern Recognition (CVPR), 2023
Jiaxuan Li
D. Vo
Akihiro Sugimoto
Hideki Nakayama
KELM
VLM
249
43
0
27 Nov 2023
SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation
European Conference on Computer Vision (ECCV), 2023
Lingchen Meng
Shiyi Lan
Hengduo Li
Jose M. Alvarez
Zuxuan Wu
Yu-Gang Jiang
VLM
ISeg
MLLM
273
15
0
24 Nov 2023
Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models
European Conference on Computer Vision (ECCV), 2023
Yufei Zhan
Yousong Zhu
Zhiyang Chen
Fan Yang
Ming Tang
Jinqiao Wang
ObjD
240
30
0
24 Nov 2023
Point, Segment and Count: A Generalized Framework for Object Counting
Computer Vision and Pattern Recognition (CVPR), 2023
Zhizhong Huang
Mingliang Dai
Yi Zhang
Junping Zhang
Hongming Shan
307
44
0
21 Nov 2023
Toward Open Vocabulary Aerial Object Detection with CLIP-Activated Student-Teacher Learning
Yan Li
Weiwei Guo
Xue Yang
Ning Liao
Dunyun He
Jiaqi Zhou
Wenxian Yu
ObjD
VLM
204
20
0
20 Nov 2023
Labeling Indoor Scenes with Fusion of Out-of-the-Box Perception Models
Yimeng Li
Navid Rajabi
Sulabh Shrestha
Md. Alimoor Reza
Jana Kosecka
132
3
0
17 Nov 2023
Towards Open-Ended Visual Recognition with Large Language Model
Qihang Yu
Xiaohui Shen
Liang-Chieh Chen
VLM
238
8
0
14 Nov 2023
SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models
Ziyi Lin
Chris Liu
Renrui Zhang
Shiyang Feng
Longtian Qiu
...
Siyuan Huang
Yichi Zhang
Xuming He
Jiaming Song
Yu Qiao
MLLM
VLM
300
275
0
13 Nov 2023
To See is to Believe: Prompting GPT-4V for Better Visual Instruction Tuning
Junke Wang
Lingchen Meng
Zejia Weng
Bo He
Zuxuan Wu
Yu-Gang Jiang
MLLM
VLM
268
133
0
13 Nov 2023
CrashCar101: Procedural Generation for Damage Assessment
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Jens Parslov
Erik Riise
Dim P. Papadopoulos
277
5
0
11 Nov 2023
Window Attention is Bugged: How not to Interpolate Position Embeddings
International Conference on Learning Representations (ICLR), 2023
Daniel Bolya
Chaitanya K. Ryali
Judy Hoffman
Christoph Feichtenhofer
225
17
0
09 Nov 2023
Learning the What and How of Annotation in Video Object Segmentation
Thanos Delatolas
Vicky S. Kalogeiton
Dim P. Papadopoulos
VOS
195
18
0
08 Nov 2023
Meta-Adapter: An Online Few-shot Learner for Vision-Language Model
Neural Information Processing Systems (NeurIPS), 2023
Cheng Cheng
Lin Song
Ruoyi Xue
Hang Wang
Hongbin Sun
Yixiao Ge
Ying Shan
VLM
ObjD
423
48
0
07 Nov 2023
GLaMM: Pixel Grounding Large Multimodal Model
Computer Vision and Pattern Recognition (CVPR), 2023
H. Rasheed
Muhammad Maaz
Sahal Shaji Mullappilly
Abdelrahman M. Shaker
Salman Khan
Hisham Cholakkal
Rao M. Anwer
Eric Xing
Ming-Hsuan Yang
Fahad S. Khan
MLLM
VLM
433
396
0
06 Nov 2023
SegGen: Supercharging Segmentation Models with Text2Mask and Mask2Img Synthesis
European Conference on Computer Vision (ECCV), 2023
Hanrong Ye
Jason Kuen
Qing Liu
Zhe Lin
Brian L. Price
Dan Xu
VLM
366
16
0
06 Nov 2023
OVIR-3D: Open-Vocabulary 3D Instance Retrieval Without Training on 3D Data
Conference on Robot Learning (CoRL), 2023
Shiyang Lu
Haonan Chang
E. Jing
Abdeslam Boularias
Kostas Bekris
250
89
0
06 Nov 2023
Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training
Computer Vision and Pattern Recognition (CVPR), 2023
Yipeng Gao
Zeyu Wang
Wei-Shi Zheng
Cihang Xie
Yuyin Zhou
3DPC
296
15
0
03 Nov 2023
Recognize Any Regions
Neural Information Processing Systems (NeurIPS), 2023
Haosen Yang
Chuofan Ma
Bin Wen
Yi Jiang
Zehuan Yuan
Xiatian Zhu
ObjD
VLM
359
3
0
02 Nov 2023
Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision Tasks
Neural Information Processing Systems (NeurIPS), 2023
Micah Goldblum
Hossein Souri
Renkun Ni
Manli Shu
Viraj Prabhu
...
Adrien Bardes
Judy Hoffman
Ramalingam Chellappa
Andrew Gordon Wilson
Tom Goldstein
VLM
456
94
0
30 Oct 2023
Uncovering Prototypical Knowledge for Weakly Open-Vocabulary Semantic Segmentation
Neural Information Processing Systems (NeurIPS), 2023
Fei Zhang
Tianfei Zhou
Boyang Li
Hao He
Chaofan Ma
Tianjiao Zhang
Jiangchao Yao
Ya Zhang
Yanfeng Wang
VLM
301
34
0
29 Oct 2023
Exploring Data Augmentations on Self-/Semi-/Fully- Supervised Pre-trained Models
Shentong Mo
Zhun Sun
Chao Li
123
2
0
28 Oct 2023
PrObeD: Proactive Object Detection Wrapper
Neural Information Processing Systems (NeurIPS), 2023
Vishal Asnani
Abhinav Kumar
Suya You
Xiaoming Liu
298
10
0
28 Oct 2023
LP-OVOD: Open-Vocabulary Object Detection by Linear Probing
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Chau Pham
Truong Vu
Khoi Duc Minh Nguyen
ObjD
310
27
0
26 Oct 2023
CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
Neural Information Processing Systems (NeurIPS), 2023
Chuofan Ma
Yi Jiang
Xin Wen
Zehuan Yuan
Xiaojuan Qi
ObjD
VLM
246
68
0
25 Oct 2023
SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding
Haoxiang Wang
Pavan Kumar Anasosalu Vasu
Fartash Faghri
Raviteja Vemulapalli
Mehrdad Farajtabar
Sachin Mehta
Mohammad Rastegari
Oncel Tuzel
Hadi Pouransari
VLM
540
125
0
23 Oct 2023
OV-VG: A Benchmark for Open-Vocabulary Visual Grounding
Chunlei Wang
Wenquan Feng
Xiangtai Li
Guangliang Cheng
Shuchang Lyu
Binghao Liu
Lijiang Chen
Qi Zhao
ObjD
VLM
268
14
0
22 Oct 2023
Unsupervised Object Localization in the Era of Self-Supervised ViTs: A Survey
Oriane Siméoni
Éloi Zablocki
Spyros Gidaris
Gilles Puy
Patrick Pérez
316
17
0
19 Oct 2023
Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection
Neural Information Processing Systems (NeurIPS), 2023
Lingchen Meng
Xiyang Dai
Jianwei Yang
Dongdong Chen
Yinpeng Chen
Mengchen Liu
Yi-Ling Chen
Zuxuan Wu
Lu Yuan
Yu-Gang Jiang
149
12
0
18 Oct 2023
Panoptic Out-of-Distribution Segmentation
IEEE Robotics and Automation Letters (RA-L), 2023
Rohit Mohan
Kiran Kumaraswamy
Juana Valeria Hurtado
Kürsat Petek
Abhinav Valada
217
10
0
18 Oct 2023
Towards Training-free Open-world Segmentation via Image Prompt Foundation Models
International Journal of Computer Vision (IJCV), 2023
Lv Tang
Peng-Tao Jiang
Haoke Xiao
Bo Li
VLM
352
21
0
17 Oct 2023
Recursive Segmentation Living Image: An eXplainable AI (XAI) Approach for Computing Structural Beauty of Images or the Livingness of Space
Qianxiang Yao
Jiang Bin
136
0
0
16 Oct 2023
Ferret: Refer and Ground Anything Anywhere at Any Granularity
International Conference on Learning Representations (ICLR), 2023
Haoxuan You
Haotian Zhang
Zhe Gan
Xianzhi Du
Bowen Zhang
Zirui Wang
Liangliang Cao
Shih-Fu Chang
Yinfei Yang
ObjD
MLLM
VLM
411
451
0
11 Oct 2023
Zero-Shot Open-Vocabulary Tracking with Large Pre-Trained Models
IEEE International Conference on Robotics and Automation (ICRA), 2023
Wen-Hsuan Chu
Adam W. Harley
P. Tokmakov
Achal Dave
Leonidas Guibas
Katerina Fragkiadaki
VLM
321
11
0
10 Oct 2023