Representation Degeneration Problem in Training Natural Language Generation Models

International Conference on Learning Representations (ICLR), 2019
28 July 2019
Jun Gao
Di He
Xu Tan
Tao Qin
Liwei Wang
Tie-Yan Liu
ArXiv (abs) | PDF | HTML

Papers citing "Representation Degeneration Problem in Training Natural Language Generation Models"

50 / 162 papers shown
On Isotropy, Contextualization and Learning Dynamics of Contrastive-based Sentence Representation Learning
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Chenghao Xiao
Yang Long
Noura Al Moubayed
206
15
0
18 Dec 2022
HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Hongyi Yuan
Zheng Yuan
Chuanqi Tan
Fei Huang
Songfang Huang
235
19
0
17 Dec 2022
Reliable Measures of Spread in High Dimensional Latent Spaces
International Conference on Machine Learning (ICML), 2022
Anna C. Marbut
Katy McKinney-Bock
Travis J. Wheeler
289
3
0
15 Dec 2022
Self-supervised Trajectory Representation Learning with Temporal Regularities and Travel Semantics
IEEE International Conference on Data Engineering (ICDE), 2022
Jiawei Jiang
Dayan Pan
Houxing Ren
Xiaohan Jiang
Chao Li
Jingyuan Wang
AI4TS
281
116
0
17 Nov 2022
Evade the Trap of Mediocrity: Promoting Diversity and Novelty in Text Generation via Concentrating Attention
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Wenhao Li
Xiaoyuan Yi
Jinyi Hu
Maosong Sun
Xing Xie
241
2
0
14 Nov 2022
Reconciliation of Pre-trained Models and Prototypical Neural Networks in Few-shot Named Entity Recognition
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Youcheng Huang
Wenqiang Lei
Jie Fu
Jiancheng Lv
169
3
0
07 Nov 2022
Optimizing text representations to capture (dis)similarity between political parties
Conference on Computational Natural Language Learning (CoNLL), 2022
Tanise Ceron
Nico Blokker
Sebastian Padó
135
7
0
21 Oct 2022
Kernel-Whitening: Overcome Dataset Bias with Isotropic Sentence Embedding
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Songyang Gao
Jiajun Sun
Tao Gui
Xuanjing Huang
154
11
0
14 Oct 2022
ContraCLM: Contrastive Learning For Causal Language Model
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Nihal Jain
Dejiao Zhang
Wasi Uddin Ahmad
Zijian Wang
Feng Nan
...
Ramesh Nallapati
Baishakhi Ray
Parminder Bhatia
Xiaofei Ma
Bing Xiang
259
22
0
03 Oct 2022
Prompt Combines Paraphrase: Teaching Pre-trained Models to Understand Rare Biomedical Words
International Conference on Computational Linguistics (COLING), 2022
Hao Wang
Chi-Liang Liu
Nuwa Xi
Sendong Zhao
Meizhi Ju
Shiwei Zhang
Ziheng Zhang
Yefeng Zheng
Bing Qin
Ting Liu
VLM, AAML, LLM&MA
209
7
0
14 Sep 2022
Pre-Training a Graph Recurrent Network for Language Representation
Yile Wang
Linyi Yang
Zhiyang Teng
M. Zhou
Yue Zhang
GNN
241
1
0
08 Sep 2022
Analyzing Transformers in Embedding Space
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Guy Dar
Mor Geva
Ankit Gupta
Jonathan Berant
335
124
0
06 Sep 2022
SimCLF: A Simple Contrastive Learning Framework for Function-level Binary Embeddings
Ruijin Sun
Guo Shize
Jinhong Guo
Li Wei
Zhan Dazhi
Sun Meng
Zhisong Pan
177
0
0
06 Sep 2022
RLIP: Relational Language-Image Pre-training for Human-Object Interaction Detection
Neural Information Processing Systems (NeurIPS), 2022
Hangjie Yuan
Jianwen Jiang
Samuel Albanie
Tao Feng
Ziyuan Huang
Dong Ni
Mingqian Tang
VLM
374
76
0
05 Sep 2022
Isotropic Representation Can Improve Dense Retrieval
Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), 2022
Euna Jung
J. Park
Jaekeol Choi
Sungyoon Kim
Wonjong Rhee
OOD
235
7
0
01 Sep 2022
Addressing Token Uniformity in Transformers via Singular Value Transformation
Conference on Uncertainty in Artificial Intelligence (UAI), 2022
Hanqi Yan
Lin Gui
Wenjie Li
Yulan He
194
16
0
24 Aug 2022
Mere Contrastive Learning for Cross-Domain Sentiment Analysis
International Conference on Computational Linguistics (COLING), 2022
Yun Luo
Fang Guo
Zihan Liu
Yue Zhang
155
18
0
18 Aug 2022
LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale
Tim Dettmers
M. Lewis
Younes Belkada
Luke Zettlemoyer
MQ
478
843
0
15 Aug 2022
Outlier Dimensions that Disrupt Transformers Are Driven by Frequency
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Giovanni Puccetti
Anna Rogers
Aleksandr Drozd
F. Dell’Orletta
540
55
0
23 May 2022
Fine-tuning Pre-trained Language Models for Few-shot Intent Detection: Supervised Pre-training and Isotropization
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Haode Zhang
Haowen Liang
Yuwei Zhang
Li-Ming Zhan
Xiao-Ming Wu
Xiaolei Lu
Albert Y. S. Lam
232
34
0
15 May 2022
Label Anchored Contrastive Learning for Language Understanding
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Zhenyu Zhang
Yuming Zhao
Meng Chen
Xiaodong He
185
17
0
26 Apr 2022
Reprint: a randomized extrapolation based on principal components for data augmentation
Social Science Research Network (SSRN), 2022
Jiale Wei
Qiyuan Chen
Pai Peng
Benjamin Guedj
Le Li
192
3
0
26 Apr 2022
A Token-level Contrastive Framework for Sign Language Translation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2022
Biao Fu
Peigen Ye
Liang Zhang
Pei-Ju Yu
Cong Hu
Yidong Chen
X. Shi
SLR
208
17
0
11 Apr 2022
CoCoSoDa: Effective Contrastive Learning for Code Search
International Conference on Software Engineering (ICSE), 2022
Ensheng Shi
Yanlin Wang
Wenchao Gu
Lun Du
Hongyu Zhang
Shi Han
Dongmei Zhang
Hongbin Sun
322
65
0
07 Apr 2022
The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy
Computer Vision and Pattern Recognition (CVPR), 2022
Tianlong Chen
Zhenyu Zhang
Yu Cheng
Ahmed Hassan Awadallah
Zinan Lin
ViT
260
49
0
12 Mar 2022
Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning
Neural Information Processing Systems (NeurIPS), 2022
Weixin Liang
Yuhui Zhang
Yongchan Kwon
Serena Yeung
James Zou
VLM
455
600
0
03 Mar 2022
A Simple but Effective Pluggable Entity Lookup Table for Pre-trained Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Deming Ye
Yankai Lin
Peng Li
Maosong Sun
Zhiyuan Liu
KELM
214
11
0
27 Feb 2022
Exploring the Impact of Negative Samples of Contrastive Learning: A Case Study of Sentence Embedding
Findings, 2022
Rui Cao
Yihao Wang
Y. Liang
Ling Gao
Jie Zheng
Jie Ren
Zheng Wang
307
41
0
26 Feb 2022
PromptBERT: Improving BERT Sentence Embeddings with Prompts
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Ting Jiang
Jian Jiao
Shaohan Huang
Zi-qiang Zhang
Deqing Wang
Fuzhen Zhuang
Furu Wei
Haizhen Huang
Liangjie Zhang
Qi Zhang
233
149
0
12 Jan 2022
Frequency-Aware Contrastive Learning for Neural Machine Translation
AAAI Conference on Artificial Intelligence (AAAI), 2021
Tong Zhang
Wei Ye
Baosong Yang
Long Zhang
Xingzhang Ren
Dayiheng Liu
Jinan Sun
Shikun Zhang
Haibo Zhang
Wen Zhao
183
34
0
29 Dec 2021
A Survey of Visual Transformers
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021
Yang Liu
Yao Zhang
Yixin Wang
Feng Hou
Jin Yuan
Jiang Tian
Yang Zhang
Peng Wang
Jianping Fan
Zhiqiang He
3DGS, ViT
473
487
0
11 Nov 2021
Leveraging Advantages of Interactive and Non-Interactive Models for Vector-Based Cross-Lingual Information Retrieval
Linlong Xu
Baosong Yang
Xiaoyu Lv
Tianchi Bi
Dayiheng Liu
Haibo Zhang
151
7
0
03 Nov 2021
Dict-BERT: Enhancing Language Model Pre-training with Dictionary
Wenhao Yu
Chenguang Zhu
Yuwei Fang
Donghan Yu
Shuohang Wang
Yichong Xu
Michael Zeng
Meng Jiang
383
68
0
13 Oct 2021
An Isotropy Analysis in the Multilingual BERT Embedding Space
Findings, 2021
S. Rajaee
Mohammad Taher Pilehvar
246
36
0
09 Oct 2021
Text analysis and deep learning: A network approach
Ingo Marquart
171
0
0
08 Oct 2021
On Isotropy Calibration of Transformers
First Workshop on Insights from Negative Results in NLP (Insights), 2021
Yue Ding
Karolis Martinkus
Damian Pascual
Simon Clematide
Roger Wattenhofer
131
1
0
27 Sep 2021
How Does Fine-tuning Affect the Geometry of Embedding Space: A Case Study on Isotropy
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
S. Rajaee
Mohammad Taher Pilehvar
261
27
0
10 Sep 2021
All Bark and No Bite: Rogue Dimensions in Transformer Language Models Obscure Representational Quality
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
William Timkey
Marten van Schijndel
504
133
0
09 Sep 2021
Rare Tokens Degenerate All Tokens: Improving Neural Text Generation via Adaptive Gradient Gating for Rare Token Embeddings
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Sangwon Yu
Jongyoon Song
Heeseung Kim
SeongEun Lee
Woo-Jong Ryu
Sung-Hoon Yoon
395
39
0
07 Sep 2021
IsoScore: Measuring the Uniformity of Embedding Space Utilization
William Rudman
Nate Gillman
T. Rayne
Carsten Eickhoff
223
36
0
16 Aug 2021
Language Models as Zero-shot Visual Semantic Learners
Yue Jiao
Jonathon S. Hare
Adam Prugel-Bennett
VLM
109
1
0
26 Jul 2021
Noisy Training Improves E2E ASR for the Edge
Dilin Wang
Yuan Shangguan
Haichuan Yang
P. Chuang
Jiatong Zhou
Meng Li
Ganesh Venkatesh
Ozlem Kalinli
Vikas Chandra
231
4
0
09 Jul 2021
A Cluster-based Approach for Improving Isotropy in Contextual Embedding Space
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
S. Rajaee
Mohammad Taher Pilehvar
162
44
0
02 Jun 2021
Rejuvenating Low-Frequency Words: Making the Most of Parallel Data in Non-Autoregressive Translation
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Liang Ding
Longyue Wang
Xuebo Liu
Yang Li
Dacheng Tao
Zhaopeng Tu
214
50
0
02 Jun 2021
ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer
Annual Meeting of the Association for Computational Linguistics (ACL), 2021
Yuanmeng Yan
Rumei Li
Sirui Wang
Fuzheng Zhang
Wei Wu
Weiran Xu
SSL
282
617
0
25 May 2021
Vision Transformers with Patch Diversification
Chengyue Gong
Dilin Wang
Meng Li
Vikas Chandra
Qiang Liu
ViT
257
68
0
26 Apr 2021
Low Anisotropy Sense Retrofitting (LASeR): Towards Isotropic and Sense Enriched Representations
Workshop on Knowledge Extraction and Integration for Deep Learning Architectures; Deep Learning Inside Out (DEELIO), 2021
Geetanjali Bihani
Julia Taylor Rayz
172
13
0
22 Apr 2021
SimCSE: Simple Contrastive Learning of Sentence Embeddings
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Tianyu Gao
Xingcheng Yao
Danqi Chen
AILaw, SSL
829
4,055
0
18 Apr 2021
Learning to Remove: Towards Isotropic Pre-trained BERT Embedding
International Conference on Artificial Neural Networks (ICANN), 2021
Y. Liang
Rui Cao
Jie Zheng
Jie Ren
Ling Gao
SSL
433
32
0
12 Apr 2021
Whitening Sentence Representations for Better Semantics and Faster Retrieval
Jianlin Su
Jiarun Cao
Weijie Liu
Yangyiwen Ou
300
338
0
29 Mar 2021
Page 3 of 4