EXIF as Language: Learning Cross-Modal Associations Between Images and Camera Metadata
arXiv:2301.04647

11 January 2023
Chenhao Zheng
Ayush Shrivastava
Andrew Owens
    VLM

Papers citing "EXIF as Language: Learning Cross-Modal Associations Between Images and Camera Metadata"

10 / 10 papers shown
FontGuard: A Robust Font Watermarking Approach Leveraging Deep Font Knowledge
Kahim Wong
Jicheng Zhou
Kemou Li
Yain-Whar Si
Xiaowei Wu
Jiantao Zhou
94
0
0
04 Apr 2025
Forensic Self-Descriptions Are All You Need for Zero-Shot Detection, Open-Set Source Attribution, and Clustering of AI-generated Images
Tai D. Nguyen
Aref Azizpour
Matthew C. Stamm
46
1
0
26 Mar 2025
RANGE: Retrieval Augmented Neural Fields for Multi-Resolution Geo-Embeddings
A. Dhakal
S. Sastry
Subash Khanal
Adeel Ahmad
Eric Xing
Nathan Jacobs
48
0
0
27 Feb 2025
CoAPT: Context Attribute words for Prompt Tuning
Gun Lee
Subin An
Sungyong Baik
Soochahn Lee
VPVLM
VLM
28
1
0
18 Jul 2024
Iterated Learning Improves Compositionality in Large Vision-Language Models
Chenhao Zheng
Jieyu Zhang
Aniruddha Kembhavi
Ranjay Krishna
VLM
CoGe
41
9
0
02 Apr 2024
Binding Touch to Everything: Learning Unified Multimodal Tactile Representations
Fengyu Yang
Chao Feng
Ziyang Chen
Hyoungseob Park
Daniel Wang
...
Ziyao Zeng
Xien Chen
Rit Gangopadhyay
Andrew Owens
Alex Wong
36
53
0
31 Jan 2024
Generating Visual Scenes from Touch
Fengyu Yang
Jiacheng Zhang
Andrew Owens
DiffM
22
24
0
26 Sep 2023
Pretrained Language Models as Visual Planners for Human Assistance
Dhruvesh Patel
H. Eghbalzadeh
Nitin Kamra
Michael L. Iuzzolino
Unnat Jain
Ruta Desai
LM&Ro
13
24
0
17 Apr 2023
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
253
4,764
0
24 Feb 2021
ImageNet Large Scale Visual Recognition Challenge
Olga Russakovsky
Jia Deng
Hao Su
J. Krause
S. Satheesh
...
A. Karpathy
A. Khosla
Michael S. Bernstein
Alexander C. Berg
Li Fei-Fei
VLM
ObjD
282
39,170
0
01 Sep 2014