Visual Chronicles: Using Multimodal LLMs to Analyze Massive Collections of Images
v1v2v3 (latest)

Visual Chronicles: Using Multimodal LLMs to Analyze Massive Collections of Images

    HAI

Papers citing "Visual Chronicles: Using Multimodal LLMs to Analyze Massive Collections of Images"

31 / 31 papers shown
Title
Explaining Datasets in Words: Statistical Models with Natural Language Parameters
Explaining Datasets in Words: Statistical Models with Natural Language ParametersNeural Information Processing Systems (NeurIPS), 2024
Ruiqi Zhong
Heng Wang
Dan Klein
Jacob Steinhardt
163
10
0
13 Sep 2024
Prototype-based Dataset Comparison
Prototype-based Dataset ComparisonIEEE International Conference on Computer Vision (ICCV), 2023
151
10
0
05 Sep 2023
Changes to Captions: An Attentive Network for Remote Sensing Change
  Captioning
Changes to Captions: An Attentive Network for Remote Sensing Change CaptioningIEEE Transactions on Image Processing (IEEE TIP), 2023
123
63
0
03 Apr 2023
Image Difference Captioning with Pre-training and Contrastive Learning
Image Difference Captioning with Pre-training and Contrastive LearningAAAI Conference on Artificial Intelligence (AAAI), 2022
137
50
0
09 Feb 2022
Using Deep Learning and Google Street View to Estimate the Demographic
  Makeup of the US
Using Deep Learning and Google Street View to Estimate the Demographic Makeup of the USProceedings of the National Academy of Sciences of the United States of America (PNAS), 2017
184
439
0
22 Feb 2017
Deep Visual-Semantic Alignments for Generating Image Descriptions
Deep Visual-Semantic Alignments for Generating Image DescriptionsComputer Vision and Pattern Recognition (CVPR), 2014
A. Karpathy
Li Fei-Fei
418
5,808
0
07 Dec 2014
Show and Tell: A Neural Image Caption Generator
Show and Tell: A Neural Image Caption GeneratorComputer Vision and Pattern Recognition (CVPR), 2014
528
6,288
0
17 Nov 2014

We use cookies and other tracking technologies to improve your browsing experience on our website, to show you personalized content and targeted ads, to analyze our website traffic, and to understand where our visitors are coming from. See our policy.