ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2101.00419
  4. Cited By
KM-BART: Knowledge Enhanced Multimodal BART for Visual Commonsense
  Generation
v1v2 (latest)

KM-BART: Knowledge Enhanced Multimodal BART for Visual Commonsense Generation

Annual Meeting of the Association for Computational Linguistics (ACL), 2021
2 January 2021
Yiran Xing
Z. Shi
Zhao Meng
Gerhard Lakemeyer
Yunpu Ma
Roger Wattenhofer
    VLM
ArXiv (abs)PDFHTML

Papers citing "KM-BART: Knowledge Enhanced Multimodal BART for Visual Commonsense Generation"

19 / 19 papers shown
Title
DIVE: Towards Descriptive and Diverse Visual Commonsense Generation
DIVE: Towards Descriptive and Diverse Visual Commonsense GenerationConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Jun-Hyung Park
Hyuntae Park
Youjin Kang
Eojin Jeon
SangKeun Lee
138
0
0
15 Aug 2024
Sumotosima: A Framework and Dataset for Classifying and Summarizing
  Otoscopic Images
Sumotosima: A Framework and Dataset for Classifying and Summarizing Otoscopic ImagesICON (ICON), 2024
Eram Anwarul Khan
Anas Anwarul Haq Khan
86
0
0
13 Aug 2024
Few-shot Adaptation of Multi-modal Foundation Models: A Survey
Few-shot Adaptation of Multi-modal Foundation Models: A SurveyArtificial Intelligence Review (Artif Intell Rev), 2024
Fan Liu
Tianshu Zhang
Wenwen Dai
Wenwen Cai
Wenwen Cai Xiaocong Zhou
Delong Chen
VLMOffRL
221
48
0
03 Jan 2024
Improving Cross-modal Alignment with Synthetic Pairs for Text-only Image
  Captioning
Improving Cross-modal Alignment with Synthetic Pairs for Text-only Image CaptioningAAAI Conference on Artificial Intelligence (AAAI), 2023
Zhiyue Liu
Jinyuan Liu
Fanrong Ma
CLIPVLM
204
19
0
14 Dec 2023
UniSA: Unified Generative Framework for Sentiment Analysis
UniSA: Unified Generative Framework for Sentiment AnalysisACM Multimedia (ACM MM), 2023
Zaijing Li
Ting-En Lin
Yuchuan Wu
Meng Liu
Zhiqi Guo
Mingde Zhao
Yongbin Li
242
25
0
04 Sep 2023
Multi-source Semantic Graph-based Multimodal Sarcasm Explanation
  Generation
Multi-source Semantic Graph-based Multimodal Sarcasm Explanation GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Liqiang Jing
Xuemeng Song
Kun Ouyang
Mengzhao Jia
Liqiang Nie
152
27
0
29 Jun 2023
A Comprehensive Survey on Applications of Transformers for Deep Learning
  Tasks
A Comprehensive Survey on Applications of Transformers for Deep Learning TasksExpert systems with applications (ESWA), 2023
Saidul Islam
Hanae Elmekki
Ahmed Elsebai
Jamal Bentahar
Najat Drawel
Gaith Rjoub
Witold Pedrycz
ViTMedIm
199
351
0
11 Jun 2023
The Contribution of Knowledge in Visiolinguistic Learning: A Survey on
  Tasks and Challenges
The Contribution of Knowledge in Visiolinguistic Learning: A Survey on Tasks and Challenges
Maria Lymperaiou
Giorgos Stamou
VLM
212
5
0
04 Mar 2023
A Marker-based Neural Network System for Extracting Social Determinants
  of Health
A Marker-based Neural Network System for Extracting Social Determinants of Health
Xingmeng Zhao
Anthony Rios
176
1
0
24 Dec 2022
Summary-Oriented Vision Modeling for Multimodal Abstractive
  Summarization
Summary-Oriented Vision Modeling for Multimodal Abstractive SummarizationAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Yunlong Liang
Fandong Meng
Jinan Xu
Jiaan Wang
Jinan Xu
Jie Zhou
211
23
0
15 Dec 2022
A survey on knowledge-enhanced multimodal learning
A survey on knowledge-enhanced multimodal learningArtificial Intelligence Review (Artif Intell Rev), 2022
Maria Lymperaiou
Giorgos Stamou
417
21
0
19 Nov 2022
DiMBERT: Learning Vision-Language Grounded Representations with
  Disentangled Multimodal-Attention
DiMBERT: Learning Vision-Language Grounded Representations with Disentangled Multimodal-AttentionACM Transactions on Knowledge Discovery from Data (TKDD), 2021
Fenglin Liu
Xian Wu
Shen Ge
Xuancheng Ren
Wei Fan
Xu Sun
Yuexian Zou
VLM
175
13
0
28 Oct 2022
COFAR: Commonsense and Factual Reasoning in Image Search
COFAR: Commonsense and Factual Reasoning in Image Search
Prajwal Gatti
A. S. Penamakuri
Revant Teotia
Anand Mishra
Shubhashis Sengupta
Roshni Ramnani
ReLMLRM
123
4
0
16 Oct 2022
Vision-Language Pre-Training for Multimodal Aspect-Based Sentiment
  Analysis
Vision-Language Pre-Training for Multimodal Aspect-Based Sentiment AnalysisAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Yan Ling
Jianfei Yu
Rui Xia
131
107
0
17 Apr 2022
Attention Mechanism based Cognition-level Scene Understanding
Attention Mechanism based Cognition-level Scene Understanding
Xuejiao Tang
Tai Le Quy
LRM
272
0
0
17 Apr 2022
UTSA NLP at SemEval-2022 Task 4: An Exploration of Simple Ensembles of
  Transformers, Convolutional, and Recurrent Neural Networks
UTSA NLP at SemEval-2022 Task 4: An Exploration of Simple Ensembles of Transformers, Convolutional, and Recurrent Neural NetworksInternational Workshop on Semantic Evaluation (SemEval), 2022
Xingmeng Zhao
Anthony Rios
106
1
0
28 Mar 2022
Recent Advances in Neural Text Generation: A Task-Agnostic Survey
Recent Advances in Neural Text Generation: A Task-Agnostic Survey
Chen Tang
Frank Guerin
Chenghua Lin
AI4CEOOD
320
20
0
06 Mar 2022
Knowledge Graph Augmented Network Towards Multiview Representation
  Learning for Aspect-based Sentiment Analysis
Knowledge Graph Augmented Network Towards Multiview Representation Learning for Aspect-based Sentiment AnalysisIEEE Transactions on Knowledge and Data Engineering (TKDE), 2022
Qihuang Zhong
Liang Ding
Juhua Liu
Bo Du
Hua Jin
Dacheng Tao
162
98
0
13 Jan 2022
A Simple But Powerful Graph Encoder for Temporal Knowledge Graph
  Completion
A Simple But Powerful Graph Encoder for Temporal Knowledge Graph Completion
Zifeng Ding
Yunpu Ma
Bailan He
Volker Tresp
179
23
0
14 Dec 2021
1