Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2306.16649
Cited By
ZeroGen: Zero-shot Multimodal Controllable Text Generation with Multiple Oracles
29 June 2023
Haoqin Tu
Bowen Yang
Xianfeng Zhao
Re-assign community
ArXiv
PDF
HTML
Papers citing
"ZeroGen: Zero-shot Multimodal Controllable Text Generation with Multiple Oracles"
10 / 10 papers shown
Title
Language Models Can See Better: Visual Contrastive Decoding For LLM Multimodal Reasoning
Yuqi Pang
Bowen Yang
Haoqin Tu
Yun Cao
Zeyu Zhang
LRM
MLLM
62
0
0
17 Feb 2025
TROPE: TRaining-Free Object-Part Enhancement for Seamlessly Improving Fine-Grained Zero-Shot Image Captioning
Joshua Forster Feinglass
Yezhou Yang
23
0
0
30 Sep 2024
MeaCap: Memory-Augmented Zero-shot Image Captioning
Zequn Zeng
Yan Xie
Hao Zhang
Chiyu Chen
Zhengjue Wang
Boli Chen
VLM
18
13
0
06 Mar 2024
How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs
Haoqin Tu
Chenhang Cui
Zijun Wang
Yiyang Zhou
Bingchen Zhao
Junlin Han
Wangchunshu Zhou
Huaxiu Yao
Cihang Xie
MLLM
36
70
0
27 Nov 2023
Air-Decoding: Attribute Distribution Reconstruction for Decoding-Time Controllable Text Generation
Tianqi Zhong
Quan Wang
Jingxuan Han
Yongdong Zhang
Zhendong Mao
12
7
0
23 Oct 2023
VLIS: Unimodal Language Models Guide Multimodal Language Generation
Jiwan Chung
Youngjae Yu
VLM
22
1
0
15 Oct 2023
DeCap: Decoding CLIP Latents for Zero-Shot Captioning via Text-Only Training
Wei Li
Linchao Zhu
Longyin Wen
Yi Yang
VLM
40
81
0
06 Mar 2023
Text-Only Training for Image Captioning using Noise-Injected CLIP
David Nukrai
Ron Mokady
Amir Globerson
VLM
CLIP
41
69
0
01 Nov 2022
FAST: Improving Controllability for Text Generation with Feedback Aware Self-Training
Junyi Chai
Reid Pryzant
Victor Ye Dong
Konstantin Golobokov
Chenguang Zhu
Yi Liu
22
5
0
06 Oct 2022
The Power of Scale for Parameter-Efficient Prompt Tuning
Brian Lester
Rami Al-Rfou
Noah Constant
VPVLM
278
3,784
0
18 Apr 2021
1