Viewpoint Integration and Registration with Vision Language Foundation Model for Image Change Understanding

15 September 2023

Fan Wang

Papers citing "Viewpoint Integration and Registration with Vision Language Foundation Model for Image Change Understanding"

Title
mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality | Qinghao Ye, Haiyang Xu, Guohai Xu, Jiabo Ye, Ming Yan, ..., Junfeng Tian, Qiang Qi, Ji Zhang, Feiyan Huang, Jingren Zhou | VLM, MLLM | 206 | 899 | 0 | 27 Apr 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models | Junnan Li, Dongxu Li, Silvio Savarese, Steven C. H. Hoi | VLM, MLLM | 256 | 4,223 | 0 | 30 Jan 2023
Feature Pyramid Networks for Object Detection | Tsung-Yi Lin, Piotr Dollár, Ross B. Girshick, Kaiming He, Bharath Hariharan, Serge J. Belongie | ObjD | 166 | 21,643 | 0 | 09 Dec 2016
Spatial Transformer Networks | Max Jaderberg, Karen Simonyan, Andrew Zisserman, Koray Kavukcuoglu | | 124 | 7,319 | 0 | 05 Jun 2015