2109.02401
Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization
6 September 2021
Tiezheng Yu
Wenliang Dai
Zihan Liu
Pascale Fung
Papers citing
"Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization"
10 / 10 papers shown
Title
Grafting Pre-trained Models for Multimodal Headline Generation
Lingfeng Qiao
Chen Wu
Ye Liu
Haoyuan Peng
Di Yin
Bo Ren
17
5
0
14 Nov 2022
Interpreting Song Lyrics with an Audio-Informed Pre-trained Language Model
Yixiao Zhang
Junyan Jiang
Gus Xia
S. Dixon
17
9
0
24 Aug 2022
Kaggle Competition: Cantonese Audio-Visual Speech Recognition for In-car Commands
Wenliang Dai
Samuel Cahyawijaya
Tiezheng Yu
Elham J. Barezi
Pascale Fung
11
1
0
06 Jul 2022
An Empirical Survey on Long Document Summarization: Datasets, Models and Metrics
Huan Yee Koh
Jiaxin Ju
Ming Liu
Shirui Pan
62
120
0
03 Jul 2022
Learning Cluster Patterns for Abstractive Summarization
Sung-Guk Jo
Jeong-Jae Kim
Byung-Won On
9
3
0
22 Feb 2022
Speech Summarization using Restricted Self-Attention
Roshan S. Sharma
Shruti Palaskar
A. Black
Florian Metze
4
33
0
12 Oct 2021
Unifying Vision-and-Language Tasks via Text Generation
Jaemin Cho
Jie Lei
Hao Tan
Mohit Bansal
MLLM
249
518
0
04 Feb 2021
VinVL: Revisiting Visual Representations in Vision-Language Models
Pengchuan Zhang
Xiujun Li
Xiaowei Hu
Jianwei Yang
Lei Zhang
Lijuan Wang
Yejin Choi
Jianfeng Gao
ObjD
VLM
252
157
0
02 Jan 2021
Unified Vision-Language Pre-Training for Image Captioning and VQA
Luowei Zhou
Hamid Palangi
Lei Zhang
Houdong Hu
Jason J. Corso
Jianfeng Gao
MLLM
VLM
250
922
0
24 Sep 2019
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
208
7,687
0
17 Aug 2015