ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2212.01803
  4. Cited By
Controllable Image Captioning via Prompting

Controllable Image Captioning via Prompting

AAAI Conference on Artificial Intelligence (AAAI), 2022
4 December 2022
Ning Wang
Jiahao Xie
Jihao Wu
Mingbo Jia
Linlin Li
ArXiv (abs)PDFHTML

Papers citing "Controllable Image Captioning via Prompting"

12 / 12 papers shown
Modeling Thousands of Human Annotators for Generalizable Text-to-Image Person Re-identificationComputer Vision and Pattern Recognition (CVPR), 2025
Jiayu Jiang
Changxing Ding
Wentao Tan
Junhong Wang
Jin Tao
Xiangmin Xu
330
5
0
13 Mar 2025
Continual Panoptic Perception: Towards Multi-modal Incremental
  Interpretation of Remote Sensing Images
Continual Panoptic Perception: Towards Multi-modal Incremental Interpretation of Remote Sensing Images
Bo Yuan
Danpei Zhao
Zhuoran Liu
Wentao Li
Tian Li
CLLVLM
391
4
0
19 Jul 2024
Controllable Contextualized Image Captioning: Directing the Visual
  Narrative through User-Defined Highlights
Controllable Contextualized Image Captioning: Directing the Visual Narrative through User-Defined Highlights
Shunqi Mao
Chaoyi Zhang
Hang Su
Hwanjun Song
Igor Shalyminov
Weidong Cai
308
4
0
16 Jul 2024
Prompt Learning for Generalized Vehicle Routing
Prompt Learning for Generalized Vehicle Routing
Fei Liu
Xi Lin
Weiduo Liao
Zhenkun Wang
Qingfu Zhang
Xialiang Tong
Mingxuan Yuan
VLM
213
7
0
20 May 2024
HistGen: Histopathology Report Generation via Local-Global Feature
  Encoding and Cross-modal Context Interaction
HistGen: Histopathology Report Generation via Local-Global Feature Encoding and Cross-modal Context InteractionInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2024
Zhengrui Guo
Jiabo Ma
Ying Xu
Yihui Wang
Liansheng Wang
Hao Chen
464
39
0
08 Mar 2024
Open-Vocabulary Calibration for Fine-tuned CLIP
Open-Vocabulary Calibration for Fine-tuned CLIP
Shuoyuan Wang
Yongfeng Zhang
Guoqing Wang
Bob Zhang
Kaiyang Zhou
Jianguo Huang
VLM
450
12
0
07 Feb 2024
Can Prompt Learning Benefit Radiology Report Generation?
Can Prompt Learning Benefit Radiology Report Generation?
Jun Wang
Lixing Zhu
A. Bhalerao
Yulan He
MedIm
169
2
0
30 Aug 2023
Caption Anything: Interactive Image Description with Diverse Multimodal
  Controls
Caption Anything: Interactive Image Description with Diverse Multimodal Controls
Teng Wang
Jinrui Zhang
Junjie Fei
Hao Zheng
Yunlong Tang
Zhe Li
Mingqi Gao
Shanshan Zhao
MLLM
470
124
0
04 May 2023
Learning Combinatorial Prompts for Universal Controllable Image
  Captioning
Learning Combinatorial Prompts for Universal Controllable Image CaptioningInternational Journal of Computer Vision (IJCV), 2023
Zhen Wang
Jun Xiao
Yueting Zhuang
Fei Gao
Jian Shao
Long Chen
196
11
0
11 Mar 2023
DEVICE: Depth and Visual Concepts Aware Transformer for OCR-based Image Captioning
DEVICE: Depth and Visual Concepts Aware Transformer for OCR-based Image CaptioningPattern Recognition (Pattern Recogn.), 2023
Dongsheng Xu
Qingbao Huang
Shuang Feng
Yiru Cai
Feng Shuang
Yi Cai
ViTVLM
475
1
0
03 Feb 2023
OSIC: A New One-Stage Image Captioner Coined
OSIC: A New One-Stage Image Captioner CoinedInternational Joint Conference on Artificial Intelligence (IJCAI), 2022
Bo Wang
Zhao Zhang
Ming Zhao
Xiaojie Jin
Mingliang Xu
Meng Wang
VLM
218
6
0
04 Nov 2022
PreSTU: Pre-Training for Scene-Text Understanding
PreSTU: Pre-Training for Scene-Text UnderstandingIEEE International Conference on Computer Vision (ICCV), 2022
Jihyung Kil
Soravit Changpinyo
Xi Chen
Hexiang Hu
Sebastian Goodman
Wei-Lun Chao
Radu Soricut
VLM
337
38
0
12 Sep 2022
1