Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1906.08237
Cited By
v1
v2 (latest)
XLNet: Generalized Autoregressive Pretraining for Language Understanding
Neural Information Processing Systems (NeurIPS), 2019
19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"XLNet: Generalized Autoregressive Pretraining for Language Understanding"
50 / 3,737 papers shown
Structured Pruning of a BERT-based Question Answering Model
J. Scott McCarley
Rishav Chakravarti
Avirup Sil
302
54
0
14 Oct 2019
Q8BERT: Quantized 8Bit BERT
Ofir Zafrir
Guy Boudoukh
Peter Izsak
Moshe Wasserblat
MQ
493
564
0
14 Oct 2019
Stabilizing Transformers for Reinforcement Learning
International Conference on Machine Learning (ICML), 2019
Emilio Parisotto
H. F. Song
Jack W. Rae
Razvan Pascanu
Çağlar Gülçehre
...
Aidan Clark
Seb Noury
M. Botvinick
N. Heess
R. Hadsell
OffRL
356
446
0
13 Oct 2019
vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations
International Conference on Learning Representations (ICLR), 2019
Alexei Baevski
Steffen Schneider
Michael Auli
SSL
684
723
0
12 Oct 2019
Conversational Transfer Learning for Emotion Recognition
Devamanyu Hazarika
Soujanya Poria
Roger Zimmermann
Amélie Reymond
265
17
0
11 Oct 2019
Multilingual Question Answering from Formatted Text applied to Conversational Agents
W. Siblini
Charlotte Pasqual
Axel Lavielle
Mohamed Challal
Cyril Cauchois
219
21
0
10 Oct 2019
PipeMare: Asynchronous Pipeline Parallel DNN Training
Conference on Machine Learning and Systems (MLSys), 2019
Bowen Yang
Jian Zhang
Jonathan Li
Christopher Ré
Christopher R. Aberger
Christopher De Sa
344
127
0
09 Oct 2019
Domain-Relevant Embeddings for Medical Question Similarity
Clara H. McCreery
Namit Katariya
A. Kannan
Manish Chablani
X. Amatriain
187
9
0
09 Oct 2019
HuggingFace's Transformers: State-of-the-art Natural Language Processing
Thomas Wolf
Lysandre Debut
Victor Sanh
Julien Chaumond
Clement Delangue
...
Teven Le Scao
Sylvain Gugger
Mariama Drame
Quentin Lhoest
Alexander M. Rush
AI4CE
527
3,638
0
09 Oct 2019
Knowledge Distillation from Internal Representations
AAAI Conference on Artificial Intelligence (AAAI), 2019
Gustavo Aguilar
Yuan Ling
Yu Zhang
Benjamin Yao
Xing Fan
Edward Guo
432
198
0
08 Oct 2019
Read, Highlight and Summarize: A Hierarchical Neural Semantic Encoder-based Approach
Rajeev Bhatt Ambati
Saptarashmi Bandyopadhyay
P. Mitra
62
0
0
08 Oct 2019
BERT for Evidence Retrieval and Claim Verification
European Conference on Information Retrieval (ECIR), 2019
Shrishti Saha Shetu
Christof Monz
E. Mabande
RALM
162
144
0
07 Oct 2019
MASTER: Multi-Aspect Non-local Network for Scene Text Recognition
Pattern Recognition (Pattern Recognit.), 2019
Ning Lu
Wenwen Yu
Xianbiao Qi
Yihao Chen
Ping Gong
Rong Xiao
Xiang Bai
250
178
0
07 Oct 2019
Distilling BERT into Simple Neural Networks with Unlabeled Transfer Data
Subhabrata Mukherjee
Ahmed Hassan Awadallah
196
25
0
04 Oct 2019
Cracking the Contextual Commonsense Code: Understanding Commonsense Reasoning Aptitude of Deep Contextual Representations
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Jeff Da
Jungo Kasai
LRM
183
41
0
02 Oct 2019
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
Victor Sanh
Lysandre Debut
Julien Chaumond
Thomas Wolf
3.1K
9,292
0
02 Oct 2019
SummAE: Zero-Shot Abstractive Text Summarization using Length-Agnostic Auto-Encoders
Peter J. Liu
Yu-An Chung
Jie Jessie Ren
270
20
0
02 Oct 2019
Exploiting BERT for End-to-End Aspect-based Sentiment Analysis
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Xin Li
Lidong Bing
Wenxuan Zhang
W. Lam
267
312
0
02 Oct 2019
State-of-the-Art Speech Recognition Using Multi-Stream Self-Attention With Dilated 1D Convolutions
Automatic Speech Recognition & Understanding (ASRU), 2019
Kyu Jeong Han
R. Prieto
Kaixing(Kai) Wu
T. Ma
316
77
0
01 Oct 2019
Better Document-Level Machine Translation with Bayes' Rule
Lei Yu
Laurent Sartran
Wojciech Stokowiec
Wang Ling
Lingpeng Kong
Phil Blunsom
Chris Dyer
203
7
0
01 Oct 2019
MMM: Multi-stage Multi-task Learning for Multi-choice Reading Comprehension
AAAI Conference on Artificial Intelligence (AAAI), 2019
Di Jin
Shuyang Gao
Jiun-Yu Kao
Tagyoung Chung
Dilek Z. Hakkani-Tür
268
72
0
01 Oct 2019
TMLab: Generative Enhanced Model (GEM) for adversarial attacks
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
P. Niewinski
M. Pszona
M. Janicka
VLM
GAN
156
19
0
01 Oct 2019
Biomedical relation extraction with pre-trained language representations and minimal task-specific architecture
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Ashok Thillaisundaram
Theodosia Togia
120
18
0
26 Sep 2019
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
International Conference on Learning Representations (ICLR), 2019
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
1.4K
7,308
0
26 Sep 2019
Aspect and Opinion Term Extraction for Hotel Reviews using Transfer Learning and Auxiliary Labels
Yosef Ardhito Winatmoko
Ali Akbar Septiandri
Arie Pratama Sutiono
221
4
0
26 Sep 2019
Pre-train, Interact, Fine-tune: A Novel Interaction Representation for Text Classification
Information Processing & Management (IPM), 2019
Jianming Zheng
Fei Cai
Honghui Chen
Maarten de Rijke
106
23
0
26 Sep 2019
FreeLB: Enhanced Adversarial Training for Natural Language Understanding
International Conference on Learning Representations (ICLR), 2019
Chen Zhu
Yu Cheng
Zhe Gan
S. Sun
Tom Goldstein
Jingjing Liu
AAML
733
494
0
25 Sep 2019
UNITER: UNiversal Image-TExt Representation Learning
European Conference on Computer Vision (ECCV), 2019
Yen-Chun Chen
Linjie Li
Licheng Yu
Ahmed El Kholy
Faisal Ahmed
Zhe Gan
Yu Cheng
Jingjing Liu
VLM
OT
422
469
0
25 Sep 2019
Extremely Small BERT Models from Mixed-Vocabulary Training
Sanqiang Zhao
Raghav Gupta
Yang Song
Denny Zhou
VLM
256
54
0
25 Sep 2019
Reducing Transformer Depth on Demand with Structured Dropout
International Conference on Learning Representations (ICLR), 2019
Angela Fan
Edouard Grave
Armand Joulin
728
675
0
25 Sep 2019
Multi-task Batch Reinforcement Learning with Metric Learning
Jiachen Li
Q. Vuong
Shuang Liu
Minghua Liu
K. Ciosek
George Andriopoulos
Henrik I. Christensen
H. Su
OffRL
344
2
0
25 Sep 2019
Mixout: Effective Regularization to Finetune Large-scale Pretrained Language Models
International Conference on Learning Representations (ICLR), 2019
Cheolhyoung Lee
Dong Wang
Wanmo Kang
MoE
526
232
0
25 Sep 2019
Understanding Semantics from Speech Through Pre-training
P. Wang
Liangchen Wei
Yong Cao
Jinghui Xie
Yuji Cao
Zaiqing Nie
SSL
VLM
142
6
0
24 Sep 2019
Technical report on Conversational Question Answering
Yingnan Ju
Fubang Zhao
Shijie Chen
Bowen Zheng
Xuefeng Yang
Yunfeng Liu
120
50
0
24 Sep 2019
Portuguese Named Entity Recognition using BERT-CRF
Fábio Souza
Rodrigo Nogueira
R. Lotufo
323
283
0
23 Sep 2019
Cross-Lingual Natural Language Generation via Pre-Training
AAAI Conference on Artificial Intelligence (AAAI), 2019
Zewen Chi
Li Dong
Furu Wei
Wenhui Wang
Xian-Ling Mao
Heyan Huang
340
141
0
23 Sep 2019
Does BERT Make Any Sense? Interpretable Word Sense Disambiguation with Contextualized Embeddings
Conference on Natural Language Processing (NLP), 2019
Gregor Wiedemann
Steffen Remus
Avi Chawla
Chris Biemann
335
195
0
23 Sep 2019
TinyBERT: Distilling BERT for Natural Language Understanding
Findings (Findings), 2019
Xiaoqi Jiao
Yichun Yin
Lifeng Shang
Xin Jiang
Xiao Chen
Linlin Li
F. Wang
Qun Liu
VLM
714
2,236
0
23 Sep 2019
Teaching Pretrained Models with Commonsense Reasoning: A Preliminary KB-Based Approach
Shiyang Li
Jianshu Chen
Dian Yu
ReLM
LRM
186
21
0
20 Sep 2019
Representation Learning for Electronic Health Records
W. Weng
Peter Szolovits
181
22
0
19 Sep 2019
ASU at TextGraphs 2019 Shared Task: Explanation ReGeneration using Language Models and Iterative Re-Ranking
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Pratyay Banerjee
LRM
154
21
0
19 Sep 2019
Summary Level Training of Sentence Rewriting for Abstractive Summarization
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Sanghwan Bae
Taeuk Kim
Jihoon Kim
Sang-goo Lee
170
73
0
19 Sep 2019
Cross-Lingual Contextual Word Embeddings Mapping With Multi-Sense Words In Mind
Zheng Zhang
Ruiqing Yin
Jun Zhu
Pierre Zweigenbaum
117
4
0
18 Sep 2019
Language models and Automated Essay Scoring
Pedro Uría Rodríguez
Amir Jafari
C. Ormerod
288
109
0
18 Sep 2019
Extractive Summarization of Long Documents by Combining Global and Local Context
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019
Wen Xiao
Giuseppe Carenini
250
162
0
17 Sep 2019
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
Mohammad Shoeybi
M. Patwary
Raul Puri
P. LeGresley
Jared Casper
Bryan Catanzaro
MoE
2.0K
2,581
0
17 Sep 2019
K-BERT: Enabling Language Representation with Knowledge Graph
AAAI Conference on Artificial Intelligence (AAAI), 2019
Weijie Liu
Peng Zhou
Zhe Zhao
Zhiruo Wang
Qi Ju
Haotang Deng
Ping Wang
689
883
0
17 Sep 2019
I-MAD: Interpretable Malware Detector Using Galaxy Transformer
Computers & security (Comput. Secur.), 2019
Miles Q. Li
Benjamin C. M. Fung
P. Charland
Steven H. H. Ding
446
39
0
15 Sep 2019
Temporal FiLM: Capturing Long-Range Sequence Dependencies with Feature-Wise Modulations
Neural Information Processing Systems (NeurIPS), 2019
Sawyer Birnbaum
Volodymyr Kuleshov
S. Enam
Pang Wei Koh
Stefano Ermon
AI4TS
263
89
0
14 Sep 2019
SANVis: Visual Analytics for Understanding Self-Attention Networks
Visual .. (VISUAL), 2019
Cheonbok Park
Inyoup Na
Yongjang Jo
Sungbok Shin
J. Yoo
Bum Chul Kwon
Jian Zhao
Hyungjong Noh
Yeonsoo Lee
Jaegul Choo
HAI
213
42
0
13 Sep 2019
Previous
1
2
3
...
72
73
74
75
Next
Page 73 of 75
Page
of 75
Go