2303.02437
ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing
Computer Vision and Pattern Recognition (CVPR), 2023
4 March 2023
Zequn Zeng
Hao Zhang
Zhengjue Wang
Ruiying Lu
Dongsheng Wang
Bo Chen
BDL
DiffM
Papers citing
"ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing"
28 / 28 papers shown
Text-Only Training for Image Captioning with Retrieval Augmentation and Modality Gap Correction
Rui Fonseca
Bruno Martins
Gil Rocha
VLM
156
0
0
03 Dec 2025
Parallel Tokenizers: Rethinking Vocabulary Design for Cross-Lingual Transfer
Muhammad Dehan Al Kautsar
Fajri Koto
274
1
0
07 Oct 2025
PriorRG: Prior-Guided Contrastive Pre-training and Coarse-to-Fine Decoding for Chest X-ray Report Generation
Kang Liu
Zhuoqi Ma
Zikang Fang
Yunan Li
Kun Xie
Qiguang Miao
ViT
MedIm
231
3
0
07 Aug 2025
AnyCap Project: A Unified Framework, Dataset, and Benchmark for Controllable Omni-modal Captioning
Yiming Ren
Zhiqiang Lin
Yu Li
Gao Meng
Weiyun Wang
...
Zicheng Lin
Jifeng Dai
Yujiu Yang
Wenhai Wang
Ruihang Chu
244
3
0
17 Jul 2025
Discovering Fine-Grained Visual-Concept Relations by Disentangled Optimal Transport Concept Bottleneck Models
Computer Vision and Pattern Recognition (CVPR), 2025
Yan Xie
Zequn Zeng
Hao Zhang
Yucheng Ding
Yun Wang
Zhengjue Wang
Bo Chen
Hongwei Liu
OT
391
9
0
12 May 2025
Group-based Distinctive Image Captioning with Memory Difference Encoding and Attention
International Journal of Computer Vision (IJCV), 2024
Jiuniu Wang
Wenjia Xu
Qingzhong Wang
Antoni B. Chan
500
3
0
03 Apr 2025
ANNEXE: Unified Analyzing, Answering, and Pixel Grounding for Egocentric Interaction
Computer Vision and Pattern Recognition (CVPR), 2025
Yuejiao Su
Yi Wang
Qiongyang Hu
Chuang Yang
Lap-Pui Chau
377
6
0
02 Apr 2025
The Devil is in the Distributions: Explicit Modeling of Scene Content is Key in Zero-Shot Video Captioning
Mingkai Tian
Guorong Li
Yuankai Qi
Amin Beheshti
Javen Qinfeng Shi
Anton van den Hengel
Qingming Huang
VGen
322
0
0
31 Mar 2025
Explaining Domain Shifts in Language: Concept erasing for Interpretable Image Classification
Computer Vision and Pattern Recognition (CVPR), 2025
Zequn Zeng
Yudi Su
Jianqiao Sun
Tiansheng Wen
Hao Zhang
Zhengjue Wang
Bo Chen
Hongwei Liu
Jiawei Ma
VLM
457
3
0
24 Mar 2025
Enhanced Contrastive Learning with Multi-view Longitudinal Data for Chest X-ray Report Generation
Computer Vision and Pattern Recognition (CVPR), 2025
Kang Liu
Zhuoqi Ma
Xiaolu Kang
Yunan Li
Kun Xie
Zhicheng Jiao
Qiguang Miao
305
36
0
27 Feb 2025
Visual Zero-Shot E-Commerce Product Attribute Value Extraction
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Jiaying Gong
Ming Cheng
Hongda Shen
Pierre-Yves Vandenbussche
Janet Jenq
Hoda Eldardiry
241
5
0
21 Feb 2025
Semantically Guided Dynamic Visual Prototype Refinement for Compositional Zero-Shot Learning
Zhong Peng
Yishi Xu
Gerong Wang
Wenchao Chen
Bo Chen
Jing Zhang
Hongwei Liu
CoGe
311
0
0
13 Jan 2025
TROPE: TRaining-Free Object-Part Enhancement for Seamlessly Improving Fine-Grained Zero-Shot Image Captioning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Joshua Forster Feinglass
Yezhou Yang
255
0
0
30 Sep 2024
Fine-grained length controllable video captioning with ordinal embeddings
IEEE Access (IEEE Access), 2024
Tomoya Nitta
Takumi Fukuzawa
Toru Tamaki
391
1
0
27 Aug 2024
EditScribe: Non-Visual Image Editing with Natural Language Verification Loops
International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS), 2024
Ruei-Che Chang
Yuxuan Liu
Lotus Zhang
Anhong Guo
DiffM
231
13
0
13 Aug 2024
Surveying the Landscape of Image Captioning Evaluation: A Comprehensive Taxonomy, Trends and Metrics Analysis
Uri Berger
Gabriel Stanovsky
Omri Abend
Lea Frermann
524
0
0
09 Aug 2024
HICEScore: A Hierarchical Metric for Image Captioning Evaluation
Zequn Zeng
Jianqiao Sun
Hao Zhang
Tiansheng Wen
Yudi Su
Yan Xie
Zhengjue Wang
Boli Chen
238
9
0
26 Jul 2024
MemBench: Memorized Image Trigger Prompt Dataset for Diffusion Models
Chunsan Hong
Tae-Hyun Oh
Minhyuk Sung
VLM
EGVM
380
3
0
24 Jul 2024
Controllable Contextualized Image Captioning: Directing the Visual Narrative through User-Defined Highlights
Shunqi Mao
Chaoyi Zhang
Hang Su
Hwanjun Song
Igor Shalyminov
Weidong Cai
347
4
0
16 Jul 2024
A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes
Ting Yu
Xiaojun Lin
Shuhui Wang
Weiguo Sheng
Qingming Huang
Jun-chen Yu
3DV
251
19
0
12 Mar 2024
MeaCap: Memory-Augmented Zero-shot Image Captioning
Zequn Zeng
Yan Xie
Hao Zhang
Chiyu Chen
Zhengjue Wang
Boli Chen
VLM
352
58
0
06 Mar 2024
NExT-GPT: Any-to-Any Multimodal LLM
International Conference on Machine Learning (ICML), 2023
Shengqiong Wu
Hao Fei
Leigang Qu
Wei Ji
Tat-Seng Chua
MLLM
479
761
0
11 Sep 2023
Improving Generalization of Image Captioning with Unsupervised Prompt Learning
Hongchen Wei
Zhenzhong Chen
VLM
207
4
0
05 Aug 2023
Visual Captioning at Will: Describing Images and Videos Guided by a Few Stylized Sentences
ACM Multimedia (ACM MM), 2023
Di Yang
Hongyu Chen
Xinglin Hou
Bo Xiao
Yuning Jiang
Qin Jin
294
8
0
31 Jul 2023
ZeroGen: Zero-shot Multimodal Controllable Text Generation with Multiple Oracles
Natural Language Processing and Chinese Computing (NLPCC), 2023
Haoqin Tu
Bowen Yang
Xianfeng Zhao
241
7
0
29 Jun 2023
LMCap: Few-shot Multilingual Image Captioning by Retrieval Augmented Language Model Prompting
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
R. Ramos
Bruno Martins
Desmond Elliott
VLM
223
26
0
31 May 2023
Image Captioning with Multi-Context Synthetic Data
AAAI Conference on Artificial Intelligence (AAAI), 2023
Feipeng Ma
Y. Zhou
Fengyun Rao
Yueyi Zhang
Xiaoyan Sun
DiffM
299
21
0
29 May 2023
Caption Anything: Interactive Image Description with Diverse Multimodal Controls
Teng Wang
Jinrui Zhang
Junjie Fei
Hao Zheng
Yunlong Tang
Zhe Li
Mingqi Gao
Shanshan Zhao
MLLM
564
130
0
04 May 2023