Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2310.19531
Cited By
MiLe Loss: a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models
30 October 2023
Zhenpeng Su
Xing Wu
Xue Bai
Zijia Lin
Hui Chen
Guiguang Ding
Wei Zhou
Songlin Hu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"MiLe Loss: a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models"
2 / 2 papers shown
Title
CartesianMoE: Boosting Knowledge Sharing among Experts via Cartesian Product Routing in Mixture-of-Experts
Zhenpeng Su
Xing Wu
Zijia Lin
Yizhe Xiong
Minxuan Lv
Guangyuan Ma
Hui Chen
Songlin Hu
Guiguang Ding
MoE
26
2
0
21 Oct 2024
The Pile: An 800GB Dataset of Diverse Text for Language Modeling
Leo Gao
Stella Biderman
Sid Black
Laurence Golding
Travis Hoppe
...
Horace He
Anish Thite
Noa Nabeshima
Shawn Presser
Connor Leahy
AIMat
245
1,977
0
31 Dec 2020
1