Towards Unique and Informative Captioning of Images

European Conference on Computer Vision (ECCV), 2020
8 September 2020
Zeyu Wang
Berthy Feng
Karthik Narasimhan
Olga Russakovsky

Papers citing "Towards Unique and Informative Captioning of Images"

23 / 23 papers shown
Mammo-CLIP Dissect: A Framework for Analysing Mammography Concepts in Vision-Language Models
Suaiba Amina Salahuddin
Teresa Dorszewski
Marit Almenning Martiniussen
Tone Hovda
Antonio Portaluri
Solveig Thrun
Michael C. Kampffmeyer
Elisabeth Wetzer
Kristoffer Wickstrøm
Robert Jenssen
VLM
167
0
0
25 Sep 2025
It's Just Another Day: Unique Video Captioning by Discriminative Prompting
Asian Conference on Computer Vision (ACCV), 2024
Toby Perrett
Tengda Han
Dima Damen
Andrew Zisserman
289
3
0
15 Oct 2024
No Detail Left Behind: Revisiting Self-Retrieval for Fine-Grained Image Captioning
Manu Gaur
Darshan Singh
Makarand Tapaswi
993
3
0
04 Sep 2024
Surveying the Landscape of Image Captioning Evaluation: A Comprehensive Taxonomy, Trends and Metrics Analysis
Uri Berger
Gabriel Stanovsky
Omri Abend
Lea Frermann
541
0
0
09 Aug 2024
Identifying Interpretable Subspaces in Image Representations
International Conference on Machine Learning (ICML), 2023
Neha Kalibhat
S. Bhardwaj
Bayan Bruss
Hamed Firooz
Maziar Sanjabi
Soheil Feizi
FAtt
417
39
0
20 Jul 2023
Improving Reference-based Distinctive Image Captioning with Contrastive Rewards
Yangjun Mao
Jun Xiao
Dong Zhang
Meng Cao
Jian Shao
Yueting Zhuang
Long Chen
EGVM
284
10
0
25 Jun 2023
Revisiting the Role of Language Priors in Vision-Language Models
International Conference on Machine Learning (ICML), 2023
Zhiqiu Lin
Xinyue Chen
Deepak Pathak
Pengchuan Zhang
Deva Ramanan
VLM
554
44
0
02 Jun 2023
Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation
Computer Vision and Pattern Recognition (CVPR), 2023
Sara Sarto
Manuele Barraco
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
399
93
0
21 Mar 2023
Switching to Discriminative Image Captioning by Relieving a Bottleneck of Reinforcement Learning
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Ukyo Honda
Taro Watanabe
Yuji Matsumoto
289
13
0
06 Dec 2022
Distinctive Image Captioning via CLIP Guided Group Optimization
Youyuan Zhang
Jiuniu Wang
Hao Wu
Wenjia Xu
VLM
460
9
0
08 Aug 2022
Rethinking the Reference-based Distinctive Image Captioning
ACM Multimedia (ACM MM), 2022
Yangjun Mao
Long Chen
Zhihong Jiang
Dong Zhang
Zhimeng Zhang
Jian Shao
Jun Xiao
DiffM
304
23
0
22 Jul 2022
Controllable Image Captioning
Luka Maxwell
443
0
0
28 Apr 2022
CLIP-Dissect: Automatic Description of Neuron Representations in Deep Vision Networks
International Conference on Learning Representations (ICLR), 2022
Tuomas P. Oikarinen
Tsui-Wei Weng
VLM
522
143
1
23 Apr 2022
Linking Emergent and Natural Languages via Corpus Transfer
International Conference on Learning Representations (ICLR), 2022
Shunyu Yao
Mo Yu
Yang Zhang
Karthik Narasimhan
J. Tenenbaum
Chuang Gan
351
20
0
24 Mar 2022
Natural Language Descriptions of Deep Visual Features
International Conference on Learning Representations (ICLR), 2022
Evan Hernandez
Sarah Schwettmann
David Bau
Teona Bagashvili
Antonio Torralba
Jacob Andreas
MILM
1.1K
156
0
26 Jan 2022
Transparent Human Evaluation for Image Captioning
Jungo Kasai
Keisuke Sakaguchi
Lavinia Dunagan
Jacob Morrison
Ronan Le Bras
Yejin Choi
Noah A. Smith
234
64
0
17 Nov 2021
Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching
Ali Furkan Biten
Andrés Mafla
Lluís Gómez
Dimosthenis Karatzas
511
20
0
06 Oct 2021
Let there be a clock on the beach: Reducing Object Hallucination in Image Captioning
Ali Furkan Biten
L. G. I. Bigorda
Dimosthenis Karatzas
506
91
0
04 Oct 2021
Journalistic Guidelines Aware News Image Captioning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Xuewen Yang
Svebor Karaman
Joel R. Tetreault
Alex Jaimes
354
32
0
07 Sep 2021
Caption Generation on Scenes with Seen and Unseen Object Categories
Image and Vision Computing (IVC), 2021
B. Demirel
R. G. Cinbis
VLM
389
2
0
13 Aug 2021
ReFormer: The Relational Transformer for Image Captioning
ACM Multimedia (ACM MM), 2021
Xuewen Yang
Yingru Liu
Xin Wang
ViT
271
67
0
29 Jul 2021
From Show to Tell: A Survey on Deep Learning-based Image Captioning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Matteo Stefanini
Marcella Cornia
Lorenzo Baraldi
S. Cascianelli
G. Fiameni
Rita Cucchiara
3DV VLM MLLM
585
373
0
14 Jul 2021
Human-like Controllable Image Captioning with Verb-specific Semantic Roles
Computer Vision and Pattern Recognition (CVPR), 2021
Long Chen
Zhihong Jiang
Jun Xiao
Wei Liu
331
85
0
22 Mar 2021