ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1608.05203
  4. Cited By
Seeing with Humans: Gaze-Assisted Neural Image Captioning

Seeing with Humans: Gaze-Assisted Neural Image Captioning

18 August 2016
Yusuke Sugano
Andreas Bulling
ArXiv (abs)PDFHTML

Papers citing "Seeing with Humans: Gaze-Assisted Neural Image Captioning"

26 / 26 papers shown
Title
Look and Tell: A Dataset for Multimodal Grounding Across Egocentric and Exocentric Views
Look and Tell: A Dataset for Multimodal Grounding Across Egocentric and Exocentric Views
Anna Deichler
Jonas Beskow
VGen
140
0
0
26 Oct 2025
Gaze on the Prize: Shaping Visual Attention with Return-Guided Contrastive Learning
Gaze on the Prize: Shaping Visual Attention with Return-Guided Contrastive Learning
Andrew Lee
Ian Chuang
D. Gao
Kai Fukazawa
Iman Soltani
124
0
0
09 Oct 2025
Artificial Human Intelligence: The role of Humans in the Development of Next Generation AI
Artificial Human Intelligence: The role of Humans in the Development of Next Generation AI
Suayb S. Arslan
377
2
0
24 Sep 2024
A look under the hood of the Interactive Deep Learning Enterprise
  (No-IDLE)
A look under the hood of the Interactive Deep Learning Enterprise (No-IDLE)
Daniel Sonntag
Michael Barz
Thiago S. Gouvêa
VLM
232
6
0
27 Jun 2024
Trends, Applications, and Challenges in Human Attention Modelling
Trends, Applications, and Challenges in Human Attention Modelling
Giuseppe Cartella
Marcella Cornia
Vittorio Cuculo
Alessandro D’Amelio
Dario Zanca
Giuseppe Boccignone
Rita Cucchiara
254
11
0
28 Feb 2024
Neglected Free Lunch -- Learning Image Classifiers Using Annotation
  Byproducts
Neglected Free Lunch -- Learning Image Classifiers Using Annotation ByproductsIEEE International Conference on Computer Vision (ICCV), 2023
Dongyoon Han
Junsuk Choe
Dante Chun
John Joon Young Chung
Minsuk Chang
Sangdoo Yun
Jean Y. Song
Seong Joon Oh
OOD
1.3K
4
1
30 Mar 2023
What You See is What You Grasp: User-Friendly Grasping Guided by
  Near-eye-tracking
What You See is What You Grasp: User-Friendly Grasping Guided by Near-eye-trackingInternational Conference on Development and Learning (ICDL), 2022
Shaochen Wang
Wei Zhang
Zhangli Zhou
Jiaxi Cao
Ziyang Chen
Kang Chen
Bin Li
Z. Kan
198
8
0
13 Sep 2022
Are metrics measuring what they should? An evaluation of image
  captioning task metrics
Are metrics measuring what they should? An evaluation of image captioning task metricsSignal processing. Image communication (SPIC), 2022
Othón González-Chávez
Guillermo Ruiz
Daniela Moctezuma
Tania A. Ramirez-delreal
199
9
0
04 Jul 2022
Do Transformer Models Show Similar Attention Patterns to Task-Specific
  Human Gaze?
Do Transformer Models Show Similar Attention Patterns to Task-Specific Human Gaze?Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Stephanie Brandl
Oliver Eberle
Jonas Pilot
Anders Søgaard
178
42
0
25 Apr 2022
Visual Attention Methods in Deep Learning: An In-Depth Survey
Visual Attention Methods in Deep Learning: An In-Depth SurveyInformation Fusion (Inf. Fusion), 2022
Mohammed Hassanin
Saeed Anwar
Ibrahim Radwan
Fahad Shahbaz Khan
Lin Wang
297
236
0
16 Apr 2022
Multimodal Integration of Human-Like Attention in Visual Question
  Answering
Multimodal Integration of Human-Like Attention in Visual Question Answering
Ekta Sood
Fabian Kögel
Philippe Muller
Dominike Thomas
Mihai Bâce
Andreas Bulling
162
22
0
27 Sep 2021
Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods
  in Natural Language Processing
Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language ProcessingACM Computing Surveys (CSUR), 2021
Pengfei Liu
Weizhe Yuan
Jinlan Fu
Zhengbao Jiang
Hiroaki Hayashi
Graham Neubig
VLMSyDa
753
4,777
0
28 Jul 2021
From Show to Tell: A Survey on Deep Learning-based Image Captioning
From Show to Tell: A Survey on Deep Learning-based Image CaptioningIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Matteo Stefanini
Marcella Cornia
Lorenzo Baraldi
S. Cascianelli
G. Fiameni
Rita Cucchiara
3DVVLMMLLM
379
342
0
14 Jul 2021
Generating Image Descriptions via Sequential Cross-Modal Alignment
  Guided by Human Gaze
Generating Image Descriptions via Sequential Cross-Modal Alignment Guided by Human Gaze
Ece Takmaz
Sandro Pezzelle
Lisa Beinborn
Raquel Fernández
204
25
0
09 Nov 2020
Improving Natural Language Processing Tasks with Human Gaze-Guided
  Neural Attention
Improving Natural Language Processing Tasks with Human Gaze-Guided Neural Attention
Ekta Sood
Simon Tannert
Philipp Mueller
Andreas Bulling
223
79
0
15 Oct 2020
Interpreting Attention Models with Human Visual Attention in Machine
  Reading Comprehension
Interpreting Attention Models with Human Visual Attention in Machine Reading Comprehension
Ekta Sood
Simon Tannert
Diego Frassinelli
Andreas Bulling
Ngoc Thang Vu
HAI
183
60
0
13 Oct 2020
Image Captioning with Attention for Smart Local Tourism using
  EfficientNet
Image Captioning with Attention for Smart Local Tourism using EfficientNet
D. H. Fudholi
Yurio Windiatmoko
Nurdi Afrianto
Prastyo Eko Susanto
Magfirah Suyuti
A. Hidayatullah
R. Rahmadi
3DH
109
13
0
18 Sep 2020
The Fluidity of Concept Representations in Human Brain Signals
The Fluidity of Concept Representations in Human Brain Signals
E. Hendrikx
Lisa Beinborn
77
0
0
20 Feb 2020
Gaze360: Physically Unconstrained Gaze Estimation in the Wild
Gaze360: Physically Unconstrained Gaze Estimation in the WildIEEE International Conference on Computer Vision (ICCV), 2019
Petr Kellnhofer
Adrià Recasens
Simon Stent
Wojciech Matusik
Antonio Torralba
248
410
0
22 Oct 2019
Boosted Attention: Leveraging Human Attention for Image Captioning
Boosted Attention: Leveraging Human Attention for Image CaptioningEuropean Conference on Computer Vision (ECCV), 2018
Shi Chen
Qi Zhao
169
49
0
18 Mar 2019
A Comprehensive Survey of Deep Learning for Image Captioning
A Comprehensive Survey of Deep Learning for Image Captioning
Md Zakir Hossain
Ferdous Sohel
M. Shiratuddin
Hamid Laga
VLM3DV
294
839
0
06 Oct 2018
Object Referring in Videos with Language and Human Gaze
Object Referring in Videos with Language and Human Gaze
A. Vasudevan
Dengxin Dai
Luc Van Gool
VOS
197
82
0
04 Jan 2018
Paying Attention to Descriptions Generated by Image Captioning Models
Paying Attention to Descriptions Generated by Image Captioning Models
Hamed R. Tavakoli
Rakshith Shetty
Ali Borji
Jorma T. Laaksonen
256
80
0
24 Apr 2017
Predicting Human Eye Fixations via an LSTM-based Saliency Attentive
  Model
Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model
Marcella Cornia
Lorenzo Baraldi
G. Serra
Rita Cucchiara
234
584
0
29 Nov 2016
Gaze Embeddings for Zero-Shot Image Classification
Gaze Embeddings for Zero-Shot Image Classification
Nour Karessli
Zeynep Akata
Bernt Schiele
Andreas Bulling
220
113
0
28 Nov 2016
Predicting the Category and Attributes of Visual Search Targets Using
  Deep Gaze Pooling
Predicting the Category and Attributes of Visual Search Targets Using Deep Gaze Pooling
Hosnieh Sattar
Andreas Bulling
Mario Fritz
142
0
0
27 Nov 2016
1