ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2108.02180
  4. Cited By
Ordered Attention for Coherent Visual Storytelling

Ordered Attention for Coherent Visual Storytelling

4 August 2021
Tom Braude
Idan Schwartz
A. Schwing
Ariel Shamir
ArXivPDFHTML

Papers citing "Ordered Attention for Coherent Visual Storytelling"

8 / 8 papers shown
Title
Semantic Alignment for Multimodal Large Language Models
Semantic Alignment for Multimodal Large Language Models
Tao Wu
Mengze Li
Jingyuan Chen
Wei Ji
Wang Lin
Jinyang Gao
Kun Kuang
Zhou Zhao
Fei Wu
33
3
0
23 Aug 2024
Diffusion-Based Visual Art Creation: A Survey and New Perspectives
Diffusion-Based Visual Art Creation: A Survey and New Perspectives
Bingyuan Wang
Qifeng Chen
Zeyu Wang
44
7
0
22 Aug 2024
DataNarrative: Automated Data-Driven Storytelling with Visualizations
  and Texts
DataNarrative: Automated Data-Driven Storytelling with Visualizations and Texts
Mohammed Saidul Islam
Md Tahmid Rahman Laskar
Md. Rizwan Parvez
Enamul Hoque
Shafiq R. Joty
DiffM
37
6
0
09 Aug 2024
TARN-VIST: Topic Aware Reinforcement Network for Visual Storytelling
TARN-VIST: Topic Aware Reinforcement Network for Visual Storytelling
Weiran Chen
Xin Li
Jiaqi Su
Guiqian Zhu
Ying Li
Yi Ji
Chunping Liu
21
0
0
18 Mar 2024
Text-Only Training for Visual Storytelling
Text-Only Training for Visual Storytelling
Yuechen Wang
Wen-gang Zhou
Zhenbo Lu
Houqiang Li
DiffM
24
2
0
17 Aug 2023
Describing Sets of Images with Textual-PCA
Describing Sets of Images with Textual-PCA
Oded Hupert
Idan Schwartz
Lior Wolf
CoGe
29
1
0
21 Oct 2022
Normalized and Geometry-Aware Self-Attention Network for Image
  Captioning
Normalized and Geometry-Aware Self-Attention Network for Image Captioning
Longteng Guo
Jing Liu
Xinxin Zhu
Peng Yao
Shichen Lu
Hanqing Lu
ViT
112
189
0
19 Mar 2020
Multimodal Compact Bilinear Pooling for Visual Question Answering and
  Visual Grounding
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Akira Fukui
Dong Huk Park
Daylen Yang
Anna Rohrbach
Trevor Darrell
Marcus Rohrbach
147
1,465
0
06 Jun 2016
1