Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2405.02162
Cited By
Mapping the Unseen: Unified Promptable Panoptic Mapping with Dynamic Labeling using Foundation Models
3 May 2024
Mohamad Al Al Mdfaa
Raghad Salameh
Sergey Zagoruyko
Gonzalo Ferrer
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Mapping the Unseen: Unified Promptable Panoptic Mapping with Dynamic Labeling using Foundation Models"
5 / 5 papers shown
Title
Tag2Text: Guiding Vision-Language Model via Image Tagging
Xinyu Huang
Youcai Zhang
Jinyu Ma
Weiwei Tian
Rui Feng
Yuejie Zhang
Yaqian Li
Yandong Guo
Lei Zhang
CLIP
MLLM
VLM
3DV
61
74
0
10 Mar 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
253
4,223
0
30 Jan 2023
Feature-Realistic Neural Fusion for Real-Time, Open Set Scene Understanding
Kirill Mazur
Edgar Sucar
Andrew J. Davison
3DPC
AI4CE
88
44
0
06 Oct 2022
Panoptic Multi-TSDFs: a Flexible Representation for Online Multi-resolution Volumetric Mapping and Long-term Dynamic Scene Consistency
L. Schmid
J. Delmerico
Johannes L. Schonberger
Juan I. Nieto
Marc Pollefeys
Roland Siegwart
César Cadena
114
58
0
21 Sep 2021
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
253
4,764
0
24 Feb 2021
1