Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2204.14244
Cited By
CLIP-Art: Contrastive Pre-training for Fine-Grained Art Classification
29 April 2022
Marcos V. Conde
Kerem Turgutlu
CLIP
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"CLIP-Art: Contrastive Pre-training for Fine-Grained Art Classification"
50 / 53 papers shown
Title
ArtRAG: Retrieval-Augmented Generation with Structured Context for Visual Art Understanding
Shuai Wang
Ivona Najdenkoska
Hongyi Zhu
S. Rudinac
Monika Kackovic
N. Wijnberg
M. Worring
35
0
0
09 May 2025
Fine-grained Textual Inversion Network for Zero-Shot Composed Image Retrieval
Haoqiang Lin
Haokun Wen
Xuemeng Song
Meng Liu
Yupeng Hu
Liqiang Nie
52
14
0
25 Mar 2025
CausalCLIPSeg: Unlocking CLIP's Potential in Referring Medical Image Segmentation with Causal Intervention
Yaxiong Chen
Minghong Wei
Zixuan Zheng
Jingliang Hu
Yilei Shi
Shengwu Xiong
Xiao Xiang Zhu
Lichao Mou
MedIm
41
0
0
20 Mar 2025
AFANet: Adaptive Frequency-Aware Network for Weakly-Supervised Few-Shot Semantic Segmentation
Jiaqi Ma
Guo-Sen Xie
Fang Zhao
Zechao Li
32
0
0
23 Dec 2024
ARTeFACT: Benchmarking Segmentation Models on Diverse Analogue Media Damage
D. Ivanova
Marco Aversa
Paul Henderson
John Williamson
76
0
0
05 Dec 2024
CLSP: High-Fidelity Contrastive Language-State Pre-training for Agent State Representation
Fuxian Huang
Qi Zhang
Shaopeng Zhai
Jie Wang
Tianyi Zhang
Haoran Zhang
Ming Zhou
Yu Liu
Yu Qiao
CLIP
AI4TS
34
0
0
24 Sep 2024
Have Large Vision-Language Models Mastered Art History?
Ombretta Strafforello
Derya Soydaner
Michiel Willems
Anne-Sofie Maerten
Stefanie De Winter
CoGe
VLM
MLLM
26
0
0
05 Sep 2024
SOOD-ImageNet: a Large-Scale Dataset for Semantic Out-Of-Distribution Image Classification and Semantic Segmentation
Alberto Bacchin
Davide Allegro
Stefano Ghidoni
Emanuele Menegatti
37
1
0
02 Sep 2024
State-of-the-Art Fails in the Art of Damage Detection
D. Ivanova
Marco Aversa
Paul Henderson
John Williamson
13
0
0
23 Aug 2024
Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization
Geuntaek Lim
Hyunwoo Kim
Joonsoo Kim
Yukyung Choi
20
0
0
12 Aug 2024
ST-SACLF: Style Transfer Informed Self-Attention Classifier for Bias-Aware Painting Classification
Mridula Vijendran
Frederick W. B. Li
Jingjing Deng
Hubert P. H. Shum
48
0
0
03 Aug 2024
GalleryGPT: Analyzing Paintings with Large Multimodal Models
Yi Bin
Wenhao Shi
Yujuan Ding
Zhiqiang Hu
Zheng Wang
Yang Yang
See-Kiong Ng
H. Shen
MLLM
30
11
0
01 Aug 2024
LEMoN: Label Error Detection using Multimodal Neighbors
Haoran Zhang
Aparna Balagopalan
Nassim Oufattole
Hyewon Jeong
Yan Wu
Jiacheng Zhu
Marzyeh Ghassemi
42
0
0
10 Jul 2024
MATE: Meet At The Embedding -- Connecting Images with Long Texts
Young Kyun Jang
Junmo Kang
Yong Jae Lee
Donghyun Kim
VLM
31
5
0
26 Jun 2024
Multimodal Metadata Assignment for Cultural Heritage Artifacts
Luis Rei
Dunja Mladenić
M. Dorozynski
Franz Rottensteiner
Thomas Schleider
Raphael Troncy
J. Lozano
Mar Gaitán Salvatella
27
6
0
01 Jun 2024
Dual-Modal Prompting for Sketch-Based Image Retrieval
Liying Gao
Bingliang Jiao
Peng Wang
Shizhou Zhang
Hanwang Zhang
Yanning Zhang
VLM
53
0
0
29 Apr 2024
Task2Box: Box Embeddings for Modeling Asymmetric Task Relationships
Rangel Daroya
Aaron Sun
Subhransu Maji
27
0
0
25 Mar 2024
Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval
Yuchen Suo
Fan Ma
Linchao Zhu
Yi Yang
27
19
0
24 Mar 2024
Not just Birds and Cars: Generic, Scalable and Explainable Models for Professional Visual Recognition
Junde Wu
Jiayuan Zhu
Min Xu
Yueming Jin
27
0
0
08 Mar 2024
A
3
^{3}
3
lign-DFER: Pioneering Comprehensive Dynamic Affective Alignment for Dynamic Facial Expression Recognition with CLIP
Zeng Tao
Yan Wang
Junxiong Lin
Haoran Wang
Xinji Mai
...
Ziheng Zhou
Shaoqi Yan
Qing Zhao
Liyuan Han
Wenqiang Zhang
33
13
0
07 Mar 2024
Scene Depth Estimation from Traditional Oriental Landscape Paintings
Sungho Kang
Yeonghyeon Park
H. Park
Juneho Yi
30
0
0
06 Mar 2024
Spurious Feature Eraser: Stabilizing Test-Time Adaptation for Vision-Language Foundation Model
Huan Ma
Yan Zhu
Changqing Zhang
Peilin Zhao
Baoyuan Wu
Long-Kai Huang
Qinghua Hu
Bing Wu
VLM
64
1
0
01 Mar 2024
SeD: Semantic-Aware Discriminator for Image Super-Resolution
Bingchen Li
Xin Li
Hanxin Zhu
Yeying Jin
Ruoyu Feng
Zhizheng Zhang
Zhibo Chen
SupR
35
22
0
29 Feb 2024
Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Model
Hao-Ran Cheng
Erjia Xiao
Jindong Gu
Le Yang
Jinhao Duan
Jize Zhang
Jiahang Cao
Kaidi Xu
Renjing Xu
29
6
0
29 Feb 2024
CARZero: Cross-Attention Alignment for Radiology Zero-Shot Classification
Haoran Lai
Qingsong Yao
Zihang Jiang
Rongsheng Wang
Zhiyang He
Xiaodong Tao
S. Kevin Zhou
MedIm
31
12
0
27 Feb 2024
Impression-CLIP: Contrastive Shape-Impression Embedding for Fonts
Yugo Kubota
Daichi Haraguchi
Seiichi Uchida
CLIP
VLM
27
1
0
26 Feb 2024
Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning
Kaibin Tian
Yanhua Cheng
Yi Liu
Xinglin Hou
Quan Chen
Han Li
22
3
0
01 Jan 2024
FD-Align: Feature Discrimination Alignment for Fine-tuning Pre-Trained Models in Few-Shot Learning
Kun Song
Huimin Ma
Bochao Zou
Huishuai Zhang
Weiran Huang
18
10
0
23 Oct 2023
Domain-Controlled Prompt Learning
Qinglong Cao
Zhengqin Xu
Yuantian Chen
Chao Ma
Xiaokang Yang
VLM
16
15
0
30 Sep 2023
Practical Membership Inference Attacks Against Large-Scale Multi-Modal Models: A Pilot Study
Myeongseob Ko
Ming Jin
Chenguang Wang
Ruoxi Jia
31
27
0
29 Sep 2023
ARTxAI: Explainable Artificial Intelligence Curates Deep Representation Learning for Artistic Images using Fuzzy Techniques
Javier Fumanal-Idocin
Javier Andreu-Perez
O. Cordón
H. Hagras
H. Bustince
25
7
0
29 Aug 2023
Extending Cross-Modal Retrieval with Interactive Learning to Improve Image Retrieval Performance in Forensics
Nils Böhne
Mark Berger
Ronald van Velzen
11
0
0
28 Aug 2023
Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features
Alberto Baldrati
Marco Bertini
Tiberio Uricchio
A. Bimbo
CLIP
CoGe
11
29
0
22 Aug 2023
SCRAPS: Speech Contrastive Representations of Acoustic and Phonetic Spaces
Iván Vallés-Pérez
Grzegorz Beringer
Piotr Bilinski
G. Cook
Roberto Barra-Chicote
11
1
0
23 Jul 2023
MMSD2.0: Towards a Reliable Multi-modal Sarcasm Detection System
Libo Qin
Shijue Huang
Qiguang Chen
Chenran Cai
Yudi Zhang
Bin Liang
Wanxiang Che
Ruifeng Xu
6
29
0
14 Jul 2023
Leveraging Vision-Language Foundation Models for Fine-Grained Downstream Tasks
Denis Coquenet
Clément Rambour
Emanuele Dalsasso
Nicolas Thome
MLLM
CLIP
VLM
19
1
0
13 Jul 2023
LM-CPPF: Paraphrasing-Guided Data Augmentation for Contrastive Prompt-Based Few-Shot Fine-Tuning
Amirhossein Abaskohi
S. Rothe
Yadollah Yaghoobzadeh
VLM
21
16
0
29 May 2023
Progressive Visual Prompt Learning with Contrastive Feature Re-formation
C. Xu
Yuhan Zhu
Haocheng Shen
Fengyuan Shi
Boheng Chen
Yixuan Liao
Xiaoxin Chen
Limin Wang
VLM
25
20
0
17 Apr 2023
Defense-Prefix for Preventing Typographic Attacks on CLIP
Hiroki Azuma
Yusuke Matsui
VLM
AAML
18
16
0
10 Apr 2023
Multi-modal Fake News Detection on Social Media via Multi-grained Information Fusion
Yangming Zhou
Yuzhou Yang
Qichao Ying
Zhenxing Qian
Xinpeng Zhang
11
37
0
03 Apr 2023
FER-former: Multi-modal Transformer for Facial Expression Recognition
Yande Li
Mingjie Wang
Minglun Gong
Y. Lu
Li Liu
21
7
0
23 Mar 2023
Towards Generalisable Video Moment Retrieval: Visual-Dynamic Injection to Image-Text Pre-Training
Dezhao Luo
Jiabo Huang
S. Gong
Hailin Jin
Yang Liu
VGen
21
28
0
28 Feb 2023
Mixed Hierarchy Network for Image Restoration
Huiyu Gao
Depeng Dang
19
14
0
19 Feb 2023
ExpNet: A unified network for Expert-Level Classification
Junde Wu
Huihui Fang
Yehui Yang
Yu Zhang
Haoyi Xiong
H. Fu
Yanwu Xu
13
0
0
29 Nov 2022
A Brief Overview of AI Governance for Responsible Machine Learning Systems
Navdeep Gill
Abhishek Mathur
Marcos V. Conde
11
5
0
21 Nov 2022
General Image Descriptors for Open World Image Retrieval using ViT CLIP
Marcos V. Conde
Ivan Aerlic
Simon Jégou
CLIP
11
2
0
20 Oct 2022
Swin2SR: SwinV2 Transformer for Compressed Image Super-Resolution and Restoration
Marcos V. Conde
Ui-Jin Choi
Maxime Burchi
Radu Timofte
ViT
46
134
0
22 Sep 2022
GAMA: Generative Adversarial Multi-Object Scene Attacks
Abhishek Aich
Calvin-Khang Ta
Akash Gupta
Chengyu Song
S. Krishnamurthy
M. Salman Asif
A. Roy-Chowdhury
AAML
36
17
0
20 Sep 2022
Bootstrapping Multi-view Representations for Fake News Detection
Qichao Ying
Xiaoxiao Hu
Yangming Zhou
Zhenxing Qian
Dan Zeng
Shiming Ge
16
45
0
12 Jun 2022
Multimodal Fake News Detection via CLIP-Guided Learning
Yangming Zhou
Qichao Ying
Zhenxing Qian
Sheng Li
Xinpeng Zhang
6
52
0
28 May 2022
1
2
Next