Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.00292
Cited By
Generalization in Generation: A closer look at Exposure Bias
1 October 2019
Florian Schmidt
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Generalization in Generation: A closer look at Exposure Bias"
27 / 27 papers shown
Title
Sequence-level Large Language Model Training with Contrastive Preference Optimization
Zhili Feng
Dhananjay Ram
Cole Hawkins
Aditya Rawal
Jinman Zhao
Sheng Zha
62
0
0
23 Feb 2025
Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens
Zhepeng Cen
Yao Liu
Siliang Zeng
Pratik Chaudhar
Huzefa Rangwala
George Karypis
Rasool Fakoor
SyDa
AIFin
34
3
0
18 Oct 2024
Transducers with Pronunciation-aware Embeddings for Automatic Speech Recognition
Hainan Xu
Zhehuai Chen
Fei Jia
Boris Ginsburg
35
0
0
04 Apr 2024
DemoSG: Demonstration-enhanced Schema-guided Generation for Low-resource Event Extraction
Gang Zhao
Xiaocheng Gong
Xinjie Yang
Guanting Dong
Shudong Lu
Si Li
26
11
0
16 Oct 2023
Exploiting the Signal-Leak Bias in Diffusion Models
Martin Nicolas Everaert
Athanasios Fitsios
Marco Bocchio
Sami Arpa
Sabine Süsstrunk
R. Achanta
DiffM
27
25
0
27 Sep 2023
Elucidating the Exposure Bias in Diffusion Models
Mang Ning
Mingxiao Li
Jianlin Su
A. A. Salah
Itir Onal Ertugrul
DiffM
119
35
0
29 Aug 2023
A Systematic Study of Knowledge Distillation for Natural Language Generation with Pseudo-Target Training
Nitay Calderon
Subhabrata Mukherjee
Roi Reichart
Amir Kantor
31
17
0
03 May 2023
Learning from Predictions: Fusing Training and Autoregressive Inference for Long-Term Spatiotemporal Forecasts
Pantelis R. Vlachas
Petros Koumoutsakos
AI4TS
AI4CE
18
7
0
22 Feb 2023
Tuning computer vision models with task rewards
André Susano Pinto
Alexander Kolesnikov
Yuge Shi
Lucas Beyer
Xiaohua Zhai
VLM
27
40
0
16 Feb 2023
An Comparative Analysis of Different Pitch and Metrical Grid Encoding Methods in the Task of Sequential Music Generation
Yuqiang Li
Shengchen Li
Georgy Fazekas
42
2
0
31 Jan 2023
Can Knowledge of End-to-End Text-to-Speech Models Improve Neural MIDI-to-Audio Synthesis Systems?
Xuan Shi
Erica Cooper
Xin Wang
Junichi Yamagishi
Shrikanth Narayanan
27
1
0
25 Nov 2022
Reward Gaming in Conditional Text Generation
Richard Yuanzhe Pang
Vishakh Padmakumar
Thibault Sellam
Ankur P. Parikh
He He
32
24
0
16 Nov 2022
Flipped Classroom: Effective Teaching for Time Series Forecasting
P. Teutsch
Patrick Mäder
AI4TS
21
8
0
17 Oct 2022
State-of-the-art generalisation research in NLP: A taxonomy and review
Dieuwke Hupkes
Mario Giulianelli
Verna Dankers
Mikel Artetxe
Yanai Elazar
...
Leila Khalatbari
Maria Ryskina
Rita Frieske
Ryan Cotterell
Zhijing Jin
114
93
0
06 Oct 2022
G2P-DDM: Generating Sign Pose Sequence from Gloss Sequence with Discrete Diffusion Model
Pan Xie
Qipeng Zhang
Zexian Li
Hao Tang
Yao Du
Xiaohui Hu
DiffM
38
12
0
19 Aug 2022
Auto-regressive Image Synthesis with Integrated Quantization
Fangneng Zhan
Yingchen Yu
Rongliang Wu
Jiahui Zhang
Kai Cui
Changgong Zhang
Shijian Lu
35
10
0
21 Jul 2022
Keywords and Instances: A Hierarchical Contrastive Learning Framework Unifying Hybrid Granularities for Text Generation
Li Mingzhe
Xiexiong Lin
Xiuying Chen
Jinxiong Chang
Qishen Zhang
...
Taifeng Wang
Zhongyi Liu
Wei Chu
Dongyan Zhao
Rui Yan
46
11
0
26 May 2022
Why Exposure Bias Matters: An Imitation Learning Perspective of Error Accumulation in Language Generation
Kushal Arora
Layla El Asri
Hareesh Bahuleyan
Jackie C.K. Cheung
29
79
0
03 Apr 2022
PreTR: Spatio-Temporal Non-Autoregressive Trajectory Prediction Transformer
Lina Achaji
Thierno Barry
Thibault Fouqueray
Julien Moreau
François Aioun
François Charpillet
18
15
0
17 Mar 2022
Robust Probabilistic Time Series Forecasting
Taeho Yoon
Youngsuk Park
Ernest K. Ryu
Yuyang Wang
AAML
AI4TS
20
18
0
24 Feb 2022
Vector Quantized Diffusion Model for Text-to-Image Synthesis
Shuyang Gu
Dong Chen
Jianmin Bao
Fang Wen
Bo Zhang
Dongdong Chen
Lu Yuan
B. Guo
DiffM
62
757
0
29 Nov 2021
Relating Neural Text Degeneration to Exposure Bias
Ting-Rui Chiang
Yun-Nung Chen
50
17
0
17 Sep 2021
Are Training Resources Insufficient? Predict First Then Explain!
Myeongjun Jang
Thomas Lukasiewicz
LRM
21
7
0
29 Aug 2021
ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis
Patrick Esser
Robin Rombach
A. Blattmann
Bjorn Ommer
DiffM
24
156
0
19 Aug 2021
Learning to summarize from human feedback
Nisan Stiennon
Long Ouyang
Jeff Wu
Daniel M. Ziegler
Ryan J. Lowe
Chelsea Voss
Alec Radford
Dario Amodei
Paul Christiano
ALM
19
1,978
0
02 Sep 2020
Teacher-Student Training for Robust Tacotron-based TTS
Rui Liu
Berrak Sisman
Jingdong Li
F. Bao
Guanglai Gao
Haizhou Li
19
38
0
07 Nov 2019
Language GANs Falling Short
Massimo Caccia
Lucas Caccia
W. Fedus
Hugo Larochelle
Joelle Pineau
Laurent Charlin
124
215
0
06 Nov 2018
1