Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1905.06316
Cited By
What do you learn from context? Probing for sentence structure in contextualized word representations
15 May 2019
Ian Tenney
Patrick Xia
Berlin Chen
Alex Jinpeng Wang
Adam Poliak
R. Thomas McCoy
Najoung Kim
Benjamin Van Durme
Samuel R. Bowman
Dipanjan Das
Ellie Pavlick
Re-assign community
ArXiv
PDF
HTML
Papers citing
"What do you learn from context? Probing for sentence structure in contextualized word representations"
50 / 532 papers shown
Title
Probing Neural Language Models for Human Tacit Assumptions
Nathaniel Weir
Adam Poliak
Benjamin Van Durme
8
6
0
10 Apr 2020
On the Effect of Dropping Layers of Pre-trained Transformer Models
Hassan Sajjad
Fahim Dalvi
Nadir Durrani
Preslav Nakov
23
131
0
08 Apr 2020
Understanding Cross-Lingual Syntactic Transfer in Multilingual Recurrent Neural Networks
Prajit Dhar
Arianna Bisazza
4
10
0
31 Mar 2020
Information-Theoretic Probing with Minimum Description Length
Elena Voita
Ivan Titov
19
269
0
27 Mar 2020
Pre-trained Models for Natural Language Processing: A Survey
Xipeng Qiu
Tianxiang Sun
Yige Xu
Yunfan Shao
Ning Dai
Xuanjing Huang
LM&MA
VLM
243
1,450
0
18 Mar 2020
A Survey on Contextual Embeddings
Qi Liu
Matt J. Kusner
Phil Blunsom
214
146
0
16 Mar 2020
jiant: A Software Toolkit for Research on General-Purpose Text Understanding Models
Yada Pruksachatkun
Philip Yeres
Haokun Liu
Jason Phang
Phu Mon Htut
Alex Jinpeng Wang
Ian Tenney
Samuel R. Bowman
SSeg
6
94
0
04 Mar 2020
A Primer in BERTology: What we know about how BERT works
Anna Rogers
Olga Kovaleva
Anna Rumshisky
OffRL
30
1,455
0
27 Feb 2020
Echo State Neural Machine Translation
Ankush Garg
Yuan Cao
Qi Ge
18
5
0
27 Feb 2020
Differentiable Reasoning over a Virtual Knowledge Base
Bhuwan Dhingra
Manzil Zaheer
Vidhisha Balachandran
Graham Neubig
Ruslan Salakhutdinov
William W. Cohen
10
88
0
25 Feb 2020
A Hierarchy of Limitations in Machine Learning
M. Malik
13
55
0
12 Feb 2020
Parsing as Pretraining
David Vilares
Michalina Strzyz
Anders Søgaard
Carlos Gómez-Rodríguez
28
31
0
05 Feb 2020
BERT's output layer recognizes all hidden layers? Some Intriguing Phenomena and a simple way to boost BERT
Wei-Tsung Kao
Tsung-Han Wu
Po-Han Chi
Chun-Cheng Hsieh
Hung-yi Lee
SSL
10
5
0
25 Jan 2020
oLMpics -- On what Language Model Pre-training Captures
Alon Talmor
Yanai Elazar
Yoav Goldberg
Jonathan Berant
LRM
17
300
0
31 Dec 2019
The performance evaluation of Multi-representation in the Deep Learning models for Relation Extraction Task
Jefferson A. Peña Torres
R. Gutierrez
Víctor A. Bucheli
Fabio Gonzalez
8
0
0
17 Dec 2019
Dynamic Prosody Generation for Speech Synthesis using Linguistics-Driven Acoustic Embedding Selection
Shubhi Tyagi
M. Nicolis
Jonas Rohnke
Thomas Drugman
Jaime Lorenzo-Trueba
16
32
0
02 Dec 2019
BLiMP: The Benchmark of Linguistic Minimal Pairs for English
Alex Warstadt
Alicia Parrish
Haokun Liu
Anhad Mohananey
Wei Peng
Sheng-Fu Wang
Samuel R. Bowman
18
464
0
02 Dec 2019
How Can We Know What Language Models Know?
Zhengbao Jiang
Frank F. Xu
Jun Araki
Graham Neubig
KELM
4
1,368
0
28 Nov 2019
Do Attention Heads in BERT Track Syntactic Dependencies?
Phu Mon Htut
Jason Phang
Shikha Bordia
Samuel R. Bowman
19
135
0
27 Nov 2019
Evaluating Commonsense in Pre-trained Language Models
Xuhui Zhou
Yue Zhang
Leyang Cui
Dandan Huang
AI4MH
LRM
20
181
0
27 Nov 2019
Attending to Entities for Better Text Understanding
Pengxiang Cheng
K. Erk
LRM
19
36
0
11 Nov 2019
Deep Contextualized Self-training for Low Resource Dependency Parsing
Guy Rotman
Roi Reichart
11
50
0
11 Nov 2019
Generalizing Natural Language Analysis through Span-relation Representations
Zhengbao Jiang
W. Xu
Jun Araki
Graham Neubig
16
60
0
10 Nov 2019
Multi-Sentence Argument Linking
Seth Ebner
Patrick Xia
Ryan Culkin
Kyle Rawlins
Benjamin Van Durme
HAI
13
156
0
09 Nov 2019
Why Do Masked Neural Language Models Still Need Common Sense Knowledge?
Sunjae Kwon
Cheongwoong Kang
Jiyeon Han
Jaesik Choi
11
16
0
08 Nov 2019
BERTs of a feather do not generalize together: Large variability in generalization across models with similar test set performance
R. Thomas McCoy
Junghyun Min
Tal Linzen
16
147
0
07 Nov 2019
On the Linguistic Representational Power of Neural Machine Translation Models
Yonatan Belinkov
Nadir Durrani
Fahim Dalvi
Hassan Sajjad
James R. Glass
MILM
22
68
0
01 Nov 2019
A Neural Entity Coreference Resolution Review
Nikolaos Stylianou
I. Vlahavas
8
38
0
21 Oct 2019
Diversify Your Datasets: Analyzing Generalization via Controlled Variance in Adversarial Datasets
Ohad Rozen
Vered Shwartz
Roee Aharoni
Ido Dagan
AAML
19
37
0
21 Oct 2019
Discovering the Compositional Structure of Vector Representations with Role Learning Networks
Paul Soulos
R. Thomas McCoy
Tal Linzen
P. Smolensky
CoGe
21
43
0
21 Oct 2019
exBERT: A Visual Analysis Tool to Explore Learned Representations in Transformers Models
Benjamin Hoover
Hendrik Strobelt
Sebastian Gehrmann
19
86
0
11 Oct 2019
Specializing Word Embeddings (for Parsing) by Information Bottleneck
Xiang Lisa Li
Jason Eisner
28
65
0
01 Oct 2019
Graph Convolutions over Constituent Trees for Syntax-Aware Semantic Role Labeling
Diego Marcheggiani
Ivan Titov
GNN
14
39
0
21 Sep 2019
How Does BERT Answer Questions? A Layer-Wise Analysis of Transformer Representations
Betty van Aken
B. Winter
Alexander Loser
Felix Alexander Gers
11
152
0
11 Sep 2019
Designing and Interpreting Probes with Control Tasks
John Hewitt
Percy Liang
19
522
0
08 Sep 2019
Effective Use of Transformer Networks for Entity Tracking
Aditya Gupta
Greg Durrett
14
20
0
05 Sep 2019
MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance
Wei-Ye Zhao
Maxime Peyrard
Fei Liu
Yang Gao
Christian M. Meyer
Steffen Eger
22
582
0
05 Sep 2019
Investigating BERT's Knowledge of Language: Five Analysis Methods with NPIs
Alex Warstadt
Yuning Cao
Ioana Grosu
Wei Peng
Hagen Blix
...
Jason Phang
Anhad Mohananey
Phu Mon Htut
Paloma Jeretic
Samuel R. Bowman
13
122
0
05 Sep 2019
Investigating Multilingual NMT Representations at Scale
Sneha Kudugunta
Ankur Bapna
Isaac Caswell
N. Arivazhagan
Orhan Firat
LRM
136
120
0
05 Sep 2019
The Bottom-up Evolution of Representations in the Transformer: A Study with Machine Translation and Language Modeling Objectives
Elena Voita
Rico Sennrich
Ivan Titov
190
181
0
03 Sep 2019
Language Models as Knowledge Bases?
Fabio Petroni
Tim Rocktaschel
Patrick Lewis
A. Bakhtin
Yuxiang Wu
Alexander H. Miller
Sebastian Riedel
KELM
AI4MH
408
2,584
0
03 Sep 2019
How Contextual are Contextualized Word Representations? Comparing the Geometry of BERT, ELMo, and GPT-2 Embeddings
Kawin Ethayarajh
11
842
0
02 Sep 2019
QuASE: Question-Answer Driven Sentence Encoding
Hangfeng He
Qiang Ning
Dan Roth
14
1
0
01 Sep 2019
Higher-order Comparisons of Sentence Encoder Representations
Mostafa Abdou
Artur Kulmizev
Felix Hill
D. Low
Anders Søgaard
12
16
0
01 Sep 2019
Evaluation Benchmarks and Learning Criteria for Discourse-Aware Sentence Representations
Mingda Chen
Zewei Chu
Kevin Gimpel
17
46
0
31 Aug 2019
Learning Latent Parameters without Human Response Patterns: Item Response Theory with Artificial Crowds
John P. Lalor
Hao Wu
Hong-ye Yu
11
42
0
29 Aug 2019
Shallow Syntax in Deep Water
Swabha Swayamdipta
Matthew E. Peters
Brendan Roof
Chris Dyer
Noah A. Smith
12
10
0
29 Aug 2019
Compositionality decomposed: how do neural networks generalise?
Dieuwke Hupkes
Verna Dankers
Mathijs Mul
Elia Bruni
CoGe
17
320
0
22 Aug 2019
Deep Contextualized Word Embeddings in Transition-Based and Graph-Based Dependency Parsing -- A Tale of Two Parsers Revisited
Artur Kulmizev
Miryam de Lhoneux
Johannes Gontrum
Elena Fano
Joakim Nivre
22
56
0
20 Aug 2019
Visualizing and Understanding the Effectiveness of BERT
Y. Hao
Li Dong
Furu Wei
Ke Xu
22
181
0
15 Aug 2019
Previous
1
2
3
...
10
11
9
Next