2107.00789
Cited By
Case Relation Transformer: A Crossmodal Language Generation Model for Fetching Instructions
2 July 2021
Motonari Kambara
Komei Sugiura
ViT
Papers citing
"Case Relation Transformer: A Crossmodal Language Generation Model for Fetching Instructions"
5 / 5 papers shown
Title
Polos: Multimodal Metric Learning from Human Feedback for Image Captioning
Yuiga Wada
Kanta Kaneda
Daichi Saito
Komei Sugiura
34
24
0
28 Feb 2024
JaSPICE: Automatic Evaluation Metric Using Predicate-Argument Structures for Image Captioning Models
Yuiga Wada
Kanta Kaneda
Komei Sugiura
23
4
0
07 Nov 2023
Fully Automated Task Management for Generation, Execution, and Evaluation: A Framework for Fetch-and-Carry Tasks with Natural Language Instructions in Continuous Space
Motonari Kambara
Komei Sugiura
LM&Ro
24
0
0
07 Nov 2023
Relational Future Captioning Model for Explaining Likely Collisions in Daily Tasks
Motonari Kambara
Komei Sugiura
22
6
0
19 Jul 2022
On the Evaluation of Vision-and-Language Navigation Instructions
Mingde Zhao
Peter Anderson
Vihan Jain
Su Wang
Alexander Ku
Jason Baldridge
Eugene Ie
231
51
0
26 Jan 2021