ResearchTrend.AI
Rare Tokens Degenerate All Tokens: Improving Neural Text Generation via Adaptive Gradient Gating for Rare Token Embeddings

7 September 2021
Sangwon Yu
Jongyoon Song
Heeseung Kim
SeongEun Lee
Woo-Jong Ryu
Sung-Hoon Yoon

Papers citing "Rare Tokens Degenerate All Tokens: Improving Neural Text Generation via Adaptive Gradient Gating for Rare Token Embeddings"

7 / 7 papers shown
Norm of Mean Contextualized Embeddings Determines their Variance
Hiroaki Yamagiwa
Hidetoshi Shimodaira
27
0
0
17 Sep 2024
Addressing the Rank Degeneration in Sequential Recommendation via Singular Spectrum Smoothing
Ziwei Fan
Zhiwei Liu
Hao Peng
Philip S. Yu
32
1
0
21 Jun 2023
Exploring Anisotropy and Outliers in Multilingual Language Models for Cross-Lingual Semantic Sentence Similarity
Katharina Hämmerl
Alina Fastowski
Jindřich Libovický
Alexander M. Fraser
20
6
0
01 Jun 2023
Token-Level Fitting Issues of Seq2seq Models
Guangsheng Bao
Zhiyang Teng
Yue Zhang
16
0
0
08 May 2023
Token Imbalance Adaptation for Radiology Report Generation
Yuexin Wu
I. Huang
Xiaolei Huang
MedIm
21
7
0
18 Apr 2023
Evade the Trap of Mediocrity: Promoting Diversity and Novelty in Text Generation via Concentrating Attention
Wenhao Li
Xiaoyuan Yi
Jinyi Hu
Maosong Sun
Xing Xie
23
0
0
14 Nov 2022
No Word Embedding Model Is Perfect: Evaluating the Representation Accuracy for Social Bias in the Media
Maximilian Spliethöver
Maximilian Keiff
Henning Wachsmuth
21
4
0
07 Nov 2022