2407.19719
Cited By
Revolutionizing Urban Safety Perception Assessments: Integrating Multimodal Large Language Models with Street View Images
29 July 2024
Jiaxin Zhang
Yunqin Li
Tomohiro Fukuda
Bowen Wang
Papers citing
"Revolutionizing Urban Safety Perception Assessments: Integrating Multimodal Large Language Models with Street View Images"
2 / 2 papers shown
Title
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
253
4,223
0
30 Jan 2023
ImageNet Large Scale Visual Recognition Challenge
Olga Russakovsky
Jia Deng
Hao Su
J. Krause
S. Satheesh
...
A. Karpathy
A. Khosla
Michael S. Bernstein
Alexander C. Berg
Li Fei-Fei
VLM
ObjD
282
39,170
0
01 Sep 2014