Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2109.06096
Cited By
The Grammar-Learning Trajectories of Neural Language Models
13 September 2021
Leshem Choshen
Guy Hacohen
D. Weinshall
Omri Abend
Re-assign community
ArXiv
PDF
HTML
Papers citing
"The Grammar-Learning Trajectories of Neural Language Models"
8 / 8 papers shown
Title
Bigram Subnetworks: Mapping to Next Tokens in Transformer Language Models
Tyler A. Chang
Benjamin Bergen
46
0
0
21 Apr 2025
A distributional simplicity bias in the learning dynamics of transformers
Riccardo Rende
Federica Gerace
A. Laio
Sebastian Goldt
71
8
0
17 Feb 2025
ColD Fusion: Collaborative Descent for Distributed Multitask Finetuning
Shachar Don-Yehiya
Elad Venezian
Colin Raffel
Noam Slonim
Yoav Katz
Leshem Choshen
MoMe
16
52
0
02 Dec 2022
RuCoLA: Russian Corpus of Linguistic Acceptability
Vladislav Mikhailov
T. Shamardina
Max Ryabinin
A. Pestova
I. Smurov
Ekaterina Artemova
17
28
0
23 Oct 2022
Fusing finetuned models for better pretraining
Leshem Choshen
Elad Venezian
Noam Slonim
Yoav Katz
FedML
AI4CE
MoMe
31
86
0
06 Apr 2022
Active Learning on a Budget: Opposite Strategies Suit High and Low Budgets
Guy Hacohen
Avihu Dekel
D. Weinshall
119
116
0
06 Feb 2022
The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics
Sebastian Gehrmann
Tosin P. Adewumi
Karmanya Aggarwal
Pawan Sasanka Ammanamanchi
Aremu Anuoluwapo
...
Nishant Subramani
Wei-ping Xu
Diyi Yang
Akhila Yerukola
Jiawei Zhou
VLM
246
283
0
02 Feb 2021
A Theoretical Analysis of the Repetition Problem in Text Generation
Z. Fu
Wai Lam
Anthony Man-Cho So
Bei Shi
69
89
0
29 Dec 2020
1