Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1801.10198
Cited By
Generating Wikipedia by Summarizing Long Sequences
30 January 2018
Peter J. Liu
Mohammad Saleh
Etienne Pot
Ben Goodrich
Ryan Sepassi
Lukasz Kaiser
Noam M. Shazeer
CVBM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Generating Wikipedia by Summarizing Long Sequences"
50 / 174 papers shown
Title
Small Clips, Big Gains: Learning Long-Range Refocused Temporal Information for Video Super-Resolution
Xingyu Zhou
Wei Long
Jingbo Lu
Shiyin Jiang
Weiyi You
Haifeng Wu
Shuhang Gu
41
0
0
04 May 2025
Advancements in Natural Language Processing for Automatic Text Summarization
Nevidu Jayatilleke
Ruvan Weerasinghe
Nipuna Senanayake
162
1
0
27 Feb 2025
OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking
Zekun Xi
Wenbiao Yin
Jizhan Fang
Jialong Wu
Runnan Fang
N. Zhang
Jiang Yong
Pengjun Xie
Fei Huang
H. Chen
SyDa
LRM
106
6
0
21 Feb 2025
EventSum: A Large-Scale Event-Centric Summarization Dataset for Chinese Multi-News Documents
Mengna Zhu
Kaisheng Zeng
Mao Wang
Kaiming Xiao
Lei Hou
Hongbin Huang
Juanzi Li
188
1
0
16 Dec 2024
All or None: Identifiable Linear Properties of Next-token Predictors in Language Modeling
Emanuele Marconato
Sébastien Lachapelle
Sebastian Weichwald
Luigi Gresele
69
3
0
30 Oct 2024
Leveraging Entailment Judgements in Cross-Lingual Summarisation
Huajian Zhang
Laura Perez-Beltrachini
HILM
41
0
0
01 Aug 2024
VoCo-LLaMA: Towards Vision Compression with Large Language Models
Xubing Ye
Yukang Gan
Xiaoke Huang
Yixiao Ge
Yansong Tang
MLLM
VLM
43
22
0
18 Jun 2024
Scientific Hypothesis Generation by a Large Language Model: Laboratory Validation in Breast Cancer Treatment
A. Abdel-Rehim
Hector Zenil
Oghenejokpeme I. Orhobor
Marie Fisher
Ross J. Collins
...
Gareth W. Fearnley
Emma Tate
Holly X. Smith
Larisa B. Soldatova
Ross D. King
LM&MA
63
4
0
20 May 2024
Transformers as Transducers
Lena Strobl
Dana Angluin
David Chiang
Jonathan Rawski
Ashish Sabharwal
29
5
0
02 Apr 2024
Assisting in Writing Wikipedia-like Articles From Scratch with Large Language Models
Yijia Shao
Yucheng Jiang
Theodore A. Kanell
Peter Xu
Omar Khattab
Monica S. Lam
LLMAG
KELM
44
35
0
22 Feb 2024
How Smooth Is Attention?
Valérie Castin
Pierre Ablin
Gabriel Peyré
AAML
40
9
0
22 Dec 2023
Object Recognition as Next Token Prediction
Kaiyu Yue
Borchun Chen
Jonas Geiping
Hengduo Li
Tom Goldstein
Ser-Nam Lim
40
9
0
04 Dec 2023
PELMS: Pre-training for Effective Low-Shot Multi-Document Summarization
Joseph Peper
Wenzhao Qiu
Lu Wang
23
0
0
16 Nov 2023
Surveying the Landscape of Text Summarization with Deep Learning: A Comprehensive Review
Guanghua Wang
Weili Wu
AI4TS
AILaw
38
3
0
13 Oct 2023
Sweeping Heterogeneity with Smart MoPs: Mixture of Prompts for LLM Task Adaptation
Chen Dun
Mirian Hipolito Garcia
Guoqing Zheng
Ahmed Hassan Awadallah
Anastasios Kyrillidis
Robert Sim
81
6
0
04 Oct 2023
Transformer-VQ: Linear-Time Transformers via Vector Quantization
Albert Mohwald
31
15
0
28 Sep 2023
Small-scale proxies for large-scale Transformer training instabilities
Mitchell Wortsman
Peter J. Liu
Lechao Xiao
Katie Everett
A. Alemi
...
Jascha Narain Sohl-Dickstein
Kelvin Xu
Jaehoon Lee
Justin Gilmer
Simon Kornblith
37
81
0
25 Sep 2023
Multi-document Summarization: A Comparative Evaluation
Kushan Hewapathirana
Nisansa de Silva
Sri Lanka
ELM
16
3
0
10 Sep 2023
RAVEN: In-Context Learning with Retrieval-Augmented Encoder-Decoder Language Models
Jie Huang
Ming-Yu Liu
Peng-Tao Xu
M. Shoeybi
Kevin Chen-Chuan Chang
Bryan Catanzaro
RALM
35
33
0
15 Aug 2023
Arithmetic with Language Models: from Memorization to Computation
Davide Maltoni
Matteo Ferrara
KELM
LRM
38
5
0
02 Aug 2023
In-Context Learning Learns Label Relationships but Is Not Conventional Learning
Jannik Kossen
Y. Gal
Tom Rainforth
37
27
0
23 Jul 2023
Training-free Diffusion Model Adaptation for Variable-Sized Text-to-Image Synthesis
Zhiyu Jin
Xuli Shen
Bin Li
Xiangyang Xue
24
36
0
14 Jun 2023
Generating EDU Extracts for Plan-Guided Summary Re-Ranking
Griffin Adams
Alexander R. Fabbri
Faisal Ladhak
Kathleen McKeown
Noémie Elhadad
18
10
0
28 May 2023
A Multi-Scale Attentive Transformer for Multi-Instrument Symbolic Music Generation
Xipin Wei
Junhui Chen
Zirui Zheng
Li Guo
Lantian Li
Dong Wang
17
3
0
26 May 2023
Large Language Models are Not Yet Human-Level Evaluators for Abstractive Summarization
Chenhui Shen
Liying Cheng
Xuan-Phi Nguyen
Yang You
Lidong Bing
ELM
ALM
47
64
0
22 May 2023
A Hierarchical Encoding-Decoding Scheme for Abstractive Multi-document Summarization
Chenhui Shen
Liying Cheng
Xuan-Phi Nguyen
Yang You
Lidong Bing
22
10
0
15 May 2023
Reconstruct Before Summarize: An Efficient Two-Step Framework for Condensing and Summarizing Meeting Transcripts
Haochen Tan
Han Wu
Wei Shao
Xinyun Zhang
Mingjie Zhan
Zhaohui Hou
Ding Liang
Linqi Song
41
0
0
13 May 2023
The Current State of Summarization
Fabian Retkowski
23
6
0
08 May 2023
Learning to Compress Prompts with Gist Tokens
Jesse Mu
Xiang Lisa Li
Noah D. Goodman
VLM
44
206
0
17 Apr 2023
Improving Autoregressive NLP Tasks via Modular Linearized Attention
Victor Agostinelli
Lizhong Chen
24
1
0
17 Apr 2023
An Iterative Optimizing Framework for Radiology Report Summarization with ChatGPT
Chong Ma
Zihao Wu
Jiaqi Wang
Shaochen Xu
Yaonai Wei
...
Tuo Zhang
Dajiang Zhu
Dinggang Shen
Tianming Liu
Xiang Li
MedIm
LM&MA
23
97
0
17 Apr 2023
MaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasks
Weicheng Kuo
A. Piergiovanni
Dahun Kim
Xiyang Luo
Benjamin Caine
...
Luowei Zhou
Andrew M. Dai
Zhifeng Chen
Claire Cui
A. Angelova
MLLM
VLM
29
23
0
29 Mar 2023
Energy-efficient Task Adaptation for NLP Edge Inference Leveraging Heterogeneous Memory Architectures
Zirui Fu
Aleksandre Avaliani
M. Donato
44
1
0
25 Mar 2023
XWikiGen: Cross-lingual Summarization for Encyclopedic Text Generation in Low Resource Languages
Dhaval Taunk
Shivprasad Sagare
Anupam Patil
Shivansh Subramanian
Manish Gupta
Vasudeva Varma
22
3
0
22 Mar 2023
RIOT: Recursive Inertial Odometry Transformer for Localisation from Low-Cost IMU Measurements
James Brotchie
Wenchao Li
A. Greentree
A. Kealy
22
8
0
03 Mar 2023
Elementwise Language Representation
Du-Yeong Kim
Jeeeun Kim
33
0
0
27 Feb 2023
ChatGPT: Jack of all trades, master of none
Jan Kocoñ
Igor Cichecki
Oliwier Kaszyca
Mateusz Kochanek
Dominika Szydło
...
Maciej Piasecki
Lukasz Radliñski
Konrad Wojtasik
Stanislaw Wo'zniak
Przemyslaw Kazienko
AI4MH
37
527
0
21 Feb 2023
Generating a Structured Summary of Numerous Academic Papers: Dataset and Method
Shuaiqi Liu
Jiannong Cao
Ruosong Yang
Zhiyuan Wen
51
16
0
09 Feb 2023
Efficient Attention via Control Variates
Lin Zheng
Jianbo Yuan
Chong-Jun Wang
Lingpeng Kong
34
18
0
09 Feb 2023
A Survey of Deep Learning: From Activations to Transformers
Johannes Schneider
Michalis Vlachos
ViT
MedIm
AI4TS
AI4CE
50
9
0
01 Feb 2023
Do Multi-Document Summarization Models Synthesize?
Jay DeYoung
Stephanie C. Martinez
Iain J. Marshall
Byron C. Wallace
22
8
0
31 Jan 2023
OASum: Large-Scale Open Domain Aspect-based Summarization
Xianjun Yang
Kaiqiang Song
Sangwoo Cho
Xiaoyang Wang
Xiaoman Pan
Linda R. Petzold
Dong Yu
RALM
26
24
0
19 Dec 2022
Multi-embodiment Legged Robot Control as a Sequence Modeling Problem
Chenyi Yu
Weinan Zhang
H. Lai
Zheng Tian
L. Kneip
Jun Wang
31
15
0
18 Dec 2022
SumREN: Summarizing Reported Speech about Events in News
R. Reddy
Heba Elfardy
Hou Pong Chan
Kevin Small
Heng Ji
24
5
0
02 Dec 2022
Automatic Generation of Socratic Subquestions for Teaching Math Word Problems
Kumar Shridhar
Jakub Macina
Mennatallah El-Assady
Tanmay Sinha
Manu Kapur
Mrinmaya Sachan
AIMat
28
45
0
23 Nov 2022
Exploring the Efficacy of Pre-trained Checkpoints in Text-to-Music Generation Task
Shangda Wu
Maosong Sun
17
20
0
21 Nov 2022
CAB: Comprehensive Attention Benchmarking on Long Sequence Modeling
Jinchao Zhang
Shuyang Jiang
Jiangtao Feng
Lin Zheng
Lingpeng Kong
3DV
43
9
0
14 Oct 2022
Bird-Eye Transformers for Text Generation Models
Lei Sha
Yuhang Song
Yordan Yordanov
Tommaso Salvatori
Thomas Lukasiewicz
30
0
0
08 Oct 2022
GLM-130B: An Open Bilingual Pre-trained Model
Aohan Zeng
Xiao Liu
Zhengxiao Du
Zihan Wang
Hanyu Lai
...
Jidong Zhai
Wenguang Chen
Peng-Zhen Zhang
Yuxiao Dong
Jie Tang
BDL
LRM
253
1,073
0
05 Oct 2022
Out-of-Distribution Detection and Selective Generation for Conditional Language Models
Jie Jessie Ren
Jiaming Luo
Yao-Min Zhao
Kundan Krishna
Mohammad Saleh
Balaji Lakshminarayanan
Peter J. Liu
OODD
72
94
0
30 Sep 2022
1
2
3
4
Next