v1v2 (latest)

LVIS: A Dataset for Large Vocabulary Instance Segmentation

Computer Vision and Pattern Recognition (CVPR), 2019

8 August 2019

Piotr Dollár

Papers citing "LVIS: A Dataset for Large Vocabulary Instance Segmentation"

50 / 1,058 papers shown

SqueezeSAM: User friendly mobile interactive segmentation

Raghuraman Krishnamoorthi

Vikas Chandra

VLM

281

11 Dec 2023

EdgeSAM: Prompt-In-the-Loop Distillation for SAM

294

11 Dec 2023

Localized Symbolic Knowledge Distillation for Visual Commonsense ModelsNeural Information Processing Systems (NeurIPS), 2023

...

Yejin Choi

268

08 Dec 2023

Gen2Det: Generate to Detect

Raghuraman Krishnamoorthi

Chenchen Zhu

Abhinav Shrivastava

VLM DiffM

313

07 Dec 2023

GPT4SGG: Synthesizing Scene Graphs from Holistic and Region-specific Narratives

Zuyao Chen

Jinlin Wu

Zhen Lei

Zhaoxiang Zhang

Changwen Chen

289

07 Dec 2023

Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

Dahua Lin

351

162

06 Dec 2023

SO-NeRF: Active View Planning for NeRF using Surrogate Objectives

196

06 Dec 2023

GPT4Point: A Unified Framework for Point-Language Understanding and GenerationComputer Vision and Pattern Recognition (CVPR), 2023

453

05 Dec 2023

Aligning and Prompting Everything All at Once for Universal Visual PerceptionComputer Vision and Pattern Recognition (CVPR), 2023

Rongrong Ji

287

04 Dec 2023

Learning Pseudo-Labeler beyond Noun Concepts for Open-Vocabulary Object Detection

191

04 Dec 2023

Behind the Magic, MERLIM: Multi-modal Evaluation Benchmark for Large Image-Language Models

Andrés Villa

Juan Carlos León Alcázar

Alvaro Soto

Bernard Ghanem

MLLM VLM

292

03 Dec 2023

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment AnythingComputer Vision and Pattern Recognition (CVPR), 2023

...

Raghuraman Krishnamoorthi

Vikas Chandra

VLM

368

235

01 Dec 2023

TrackDiffusion: Tracklet-Conditioned Video Generation via Diffusion Models

Lanqing Hong

Huchuan Lu

186

01 Dec 2023

Language-conditioned Detection TransformerComputer Vision and Pattern Recognition (CVPR), 2023

Jang Hyun Cho

Philipp Krahenbuhl

VLM ObjD

187

29 Nov 2023

Leveraging VLM-Based Pipelines to Annotate 3D ObjectsInternational Conference on Machine Learning (ICML), 2023

Rishabh Kabra

Loic Matthey

Alexander Lerchner

Niloy J. Mitra

274

29 Nov 2023

The devil is in the fine-grained details: Evaluating open-vocabulary object detectors for fine-grained understandingComputer Vision and Pattern Recognition (CVPR), 2023

349

29 Nov 2023

ViT-Lens: Towards Omni-modal RepresentationsComputer Vision and Pattern Recognition (CVPR), 2023

Ying Shan

200

27 Nov 2023

EVCap: Retrieval-Augmented Image Captioning with External Visual-Name Memory for Open-World ComprehensionComputer Vision and Pattern Recognition (CVPR), 2023

249

27 Nov 2023

SEGIC: Unleashing the Emergent Correspondence for In-Context SegmentationEuropean Conference on Computer Vision (ECCV), 2023

Zuxuan Wu

273

24 Nov 2023

Griffon: Spelling out All Object Locations at Any Granularity with Large Language ModelsEuropean Conference on Computer Vision (ECCV), 2023

240

24 Nov 2023

Point, Segment and Count: A Generalized Framework for Object CountingComputer Vision and Pattern Recognition (CVPR), 2023

307

21 Nov 2023

Toward Open Vocabulary Aerial Object Detection with CLIP-Activated Student-Teacher Learning

Yan Li

Weiwei Guo

Xue Yang

204

20 Nov 2023

Labeling Indoor Scenes with Fusion of Out-of-the-Box Perception Models

132

17 Nov 2023

Towards Open-Ended Visual Recognition with Large Language Model

Qihang Yu

Xiaohui Shen

Liang-Chieh Chen

VLM

238

14 Nov 2023

SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models

...

Yu Qiao

300

275

13 Nov 2023

To See is to Believe: Prompting GPT-4V for Better Visual Instruction Tuning

Zuxuan Wu

268

133

13 Nov 2023

CrashCar101: Procedural Generation for Damage AssessmentIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023

Jens Parslov

Erik Riise

Dim P. Papadopoulos

277

11 Nov 2023

Window Attention is Bugged: How not to Interpolate Position EmbeddingsInternational Conference on Learning Representations (ICLR), 2023

Daniel Bolya

Chaitanya K. Ryali

Judy Hoffman

Christoph Feichtenhofer

225

09 Nov 2023

Learning the What and How of Annotation in Video Object Segmentation

195

08 Nov 2023

Meta-Adapter: An Online Few-shot Learner for Vision-Language ModelNeural Information Processing Systems (NeurIPS), 2023

Ying Shan

423

07 Nov 2023

GLaMM: Pixel Grounding Large Multimodal ModelComputer Vision and Pattern Recognition (CVPR), 2023

H. Rasheed

Muhammad Maaz

Sahal Shaji Mullappilly

Abdelrahman M. Shaker

Salman Khan

Hisham Cholakkal

Rao M. Anwer

Erix Xing

Ming-Hsuan Yang

Fahad S. Khan

MLLM VLM

433

396

06 Nov 2023

SegGen: Supercharging Segmentation Models with Text2Mask and Mask2Img SynthesisEuropean Conference on Computer Vision (ECCV), 2023

Dan Xu

366

06 Nov 2023

OVIR-3D: Open-Vocabulary 3D Instance Retrieval Without Training on 3D DataConference on Robot Learning (CoRL), 2023

Shiyang Lu

Haonan Chang

E. Jing

Abdeslam Boularias

Kostas Bekris

250

06 Nov 2023

Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-trainingComputer Vision and Pattern Recognition (CVPR), 2023

Cihang Xie

296

03 Nov 2023

Recognize Any RegionsNeural Information Processing Systems (NeurIPS), 2023

359

02 Nov 2023

Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision TasksNeural Information Processing Systems (NeurIPS), 2023

...

456

30 Oct 2023

Uncovering Prototypical Knowledge for Weakly Open-Vocabulary Semantic SegmentationNeural Information Processing Systems (NeurIPS), 2023

Jiangchao Yao

301

29 Oct 2023

Exploring Data Augmentations on Self-/Semi-/Fully- Supervised Pre-trained Models

Shentong Mo

Zhun Sun

Chao Li

123

28 Oct 2023

PrObeD: Proactive Object Detection WrapperNeural Information Processing Systems (NeurIPS), 2023

Vishal Asnani

Abhinav Kumar

Suya You

Xiaoming Liu

298

28 Oct 2023

LP-OVOD: Open-Vocabulary Object Detection by Linear ProbingIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023

310

26 Oct 2023

CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object DetectionNeural Information Processing Systems (NeurIPS), 2023

Xin Wen

Xiaojuan Qi

246

25 Oct 2023

SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding

Haoxiang Wang

Pavan Kumar Anasosalu Vasu

540

125

23 Oct 2023

OV-VG: A Benchmark for Open-Vocabulary Visual Grounding

Xiangtai Li

268

22 Oct 2023

Unsupervised Object Localization in the Era of Self-Supervised ViTs: A Survey

316

19 Oct 2023

Learning from Rich Semantics and Coarse Locations for Long-tailed Object DetectionNeural Information Processing Systems (NeurIPS), 2023

Jianwei Yang

Zuxuan Wu

Lu Yuan

Yu-Gang Jiang

149

18 Oct 2023

Panoptic Out-of-Distribution SegmentationIEEE Robotics and Automation Letters (RA-L), 2023

Rohit Mohan

Kiran Kumaraswamy

Juana Valeria Hurtado

Kürsat Petek

Abhinav Valada

217

18 Oct 2023

Towards Training-free Open-world Segmentation via Image Prompt Foundation ModelsInternational Journal of Computer Vision (IJCV), 2023

352

17 Oct 2023

Recursive Segmentation Living Image: An eXplainable AI (XAI) Approach for Computing Structural Beauty of Images or the Livingness of Space

Qianxiang Yao

Jiang Bin

136

16 Oct 2023

Ferret: Refer and Ground Anything Anywhere at Any GranularityInternational Conference on Learning Representations (ICLR), 2023

Xianzhi Du

411

451

11 Oct 2023

Zero-Shot Open-Vocabulary Tracking with Large Pre-Trained ModelsIEEE International Conference on Robotics and Automation (ICRA), 2023

321

10 Oct 2023

All Papers

LVIS: A Dataset for Large Vocabulary Instance Segmentation

Papers citing "LVIS: A Dataset for Large Vocabulary Instance Segmentation"