Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2004.14975
Cited By
v1
v2 (latest)
Investigating Transferability in Pretrained Language Models
Findings (Findings), 2020
30 April 2020
Alex Tamkin
Trisha Singh
D. Giovanardi
Noah D. Goodman
MILM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Investigating Transferability in Pretrained Language Models"
29 / 29 papers shown
What do Transformers Know about Government?
Jue Hou
Anisia Katinskaia
Lari Kotilainen
Sathianpong Trangcasanchai
Anh Vu
R. Yangarber
315
2
0
22 Apr 2024
Scaling Laws for Downstream Task Performance of Large Language Models
International Conference on Learning Representations (ICLR), 2024
Berivan Isik
Natalia Ponomareva
Hussein Hazimeh
Dimitris Paparas
Sergei Vassilvitskii
Sanmi Koyejo
383
53
0
06 Feb 2024
The Effect of Masking Strategies on Knowledge Retention by Language Models
Jonas Wallat
Tianyi Zhang
Avishek Anand
KELM
CLL
152
0
0
12 Jun 2023
Hidden State Variability of Pretrained Language Models Can Guide Computation Reduction for Transfer Learning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Shuo Xie
Jiahao Qiu
Ankita Pasad
Li Du
Qing Qu
Hongyuan Mei
276
16
0
18 Oct 2022
Toward Transparent AI: A Survey on Interpreting the Inner Structures of Deep Neural Networks
Tilman Raukur
A. Ho
Stephen Casper
Dylan Hadfield-Menell
AAML
AI4CE
886
183
0
27 Jul 2022
ReFine: Re-randomization before Fine-tuning for Cross-domain Few-shot Learning
International Conference on Information and Knowledge Management (CIKM), 2022
Jaehoon Oh
Sungnyun Kim
Namgyu Ho
Jin-Hwa Kim
Hwanjun Song
Se-Young Yun
301
13
0
11 May 2022
Oolong: Investigating What Makes Transfer Learning Hard with Controlled Studies
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Zhengxuan Wu
Alex Tamkin
Isabel Papadimitriou
301
15
0
24 Feb 2022
Diagnosing AI Explanation Methods with Folk Concepts of Behavior
Conference on Fairness, Accountability and Transparency (FAccT), 2022
Alon Jacovi
Jasmijn Bastings
Sebastian Gehrmann
Yoav Goldberg
Katja Filippova
555
22
0
27 Jan 2022
Sparse Interventions in Language Models with Differentiable Masking
Nicola De Cao
Leon Schmid
Dieuwke Hupkes
Ivan Titov
285
34
0
13 Dec 2021
Adapting to the Long Tail: A Meta-Analysis of Transfer Learning Research for Language Understanding Tasks
Transactions of the Association for Computational Linguistics (TACL), 2021
Aakanksha Naik
J. Lehman
Carolyn Rose
408
9
0
02 Nov 2021
Rethinking Why Intermediate-Task Fine-Tuning Works
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Ting-Yun Chang
Chi-Jen Lu
LRM
252
32
0
26 Aug 2021
Grounding Representation Similarity with Statistical Testing
Frances Ding
Jean-Stanislas Denain
Jacob Steinhardt
252
34
0
03 Aug 2021
Counterfactual Interventions Reveal the Causal Effect of Relative Clause Representations on Agreement Prediction
Conference on Computational Natural Language Learning (CoNLL), 2021
Shauli Ravfogel
Grusha Prasad
Tal Linzen
Yoav Goldberg
445
72
0
14 May 2021
REPT: Bridging Language Models and Machine Reading Comprehension via Retrieval-Based Pre-training
Findings (Findings), 2021
Fangkai Jiao
Yangyang Guo
Yilin Niu
Feng Ji
Feng-Lin Li
Liqiang Nie
LRM
233
12
0
10 May 2021
Identifying the Limits of Cross-Domain Knowledge Transfer for Pretrained Models
Workshop on Representation Learning for NLP (RepL4NLP), 2021
Zhengxuan Wu
Nelson F. Liu
Christopher Potts
156
5
0
17 Apr 2021
What's in your Head? Emergent Behaviour in Multi-Task Transformer Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Mor Geva
Uri Katz
Aviv Ben-Arie
Jonathan Berant
LRM
473
11
0
13 Apr 2021
On the Inductive Bias of Masked Language Modeling: From Statistical to Syntactic Dependencies
North American Chapter of the Association for Computational Linguistics (NAACL), 2021
Tianyi Zhang
Tatsunori Hashimoto
AI4CE
262
30
0
12 Apr 2021
The Rediscovery Hypothesis: Language Models Need to Meet Linguistics
Journal of Artificial Intelligence Research (JAIR), 2021
Vassilina Nikoulina
Maxat Tezekbayev
Nuradil Kozhakhmet
Madina Babazhanova
Matthias Gallé
Z. Assylbekov
345
9
0
02 Mar 2021
Contrastive Explanations for Model Interpretability
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Alon Jacovi
Swabha Swayamdipta
Shauli Ravfogel
Yanai Elazar
Yejin Choi
Yoav Goldberg
539
115
0
02 Mar 2021
Probing Classifiers: Promises, Shortcomings, and Advances
International Conference on Computational Logic (ICCL), 2021
Yonatan Belinkov
981
697
0
24 Feb 2021
Damage detection using in-domain and cross-domain transfer learning
Zaharah Bukhsh
N. Jansen
Aaqib Saeed
261
50
0
07 Feb 2021
Meta-learning Transferable Representations with a Single Target Domain
Hong Liu
Jeff Z. HaoChen
Colin Wei
Tengyu Ma
AAML
267
5
0
03 Nov 2020
Rethinking embedding coupling in pre-trained language models
International Conference on Learning Representations (ICLR), 2020
Hyung Won Chung
Thibault Févry
Henry Tsai
Melvin Johnson
Sebastian Ruder
363
174
0
24 Oct 2020
What is being transferred in transfer learning?
Neural Information Processing Systems (NeurIPS), 2020
Behnam Neyshabur
Hanie Sedghi
Chiyuan Zhang
581
609
0
26 Aug 2020
Discovering Useful Sentence Representations from Large Pretrained Language Models
Nishant Subramani
Nivedita Suresh
198
9
0
20 Aug 2020
Revisiting Few-sample BERT Fine-tuning
International Conference on Learning Representations (ICLR), 2020
Tianyi Zhang
Felix Wu
Arzoo Katiyar
Kilian Q. Weinberger
Yoav Artzi
650
496
0
10 Jun 2020
Amnesic Probing: Behavioral Explanation with Amnesic Counterfactuals
Yanai Elazar
Shauli Ravfogel
Alon Jacovi
Yoav Goldberg
448
25
0
01 Jun 2020
What Happens To BERT Embeddings During Fine-tuning?
BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2020
Amil Merchant
Elahe Rahimtoroghi
Ellie Pavlick
Ian Tenney
312
216
0
29 Apr 2020
On the Effect of Dropping Layers of Pre-trained Transformer Models
Computer Speech and Language (CSL), 2020
Hassan Sajjad
Fahim Dalvi
Nadir Durrani
Preslav Nakov
485
185
0
08 Apr 2020
1
Page 1 of 1