v1v2 (latest)

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Neural Information Processing Systems (NeurIPS), 2019

19 June 2019

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,737 papers shown

Structured Pruning of a BERT-based Question Answering Model

J. Scott McCarley

Rishav Chakravarti

Avirup Sil

302

14 Oct 2019

Q8BERT: Quantized 8Bit BERT

493

564

14 Oct 2019

Stabilizing Transformers for Reinforcement LearningInternational Conference on Machine Learning (ICML), 2019

...

356

446

13 Oct 2019

vq-wav2vec: Self-Supervised Learning of Discrete Speech RepresentationsInternational Conference on Learning Representations (ICLR), 2019

684

723

12 Oct 2019

Conversational Transfer Learning for Emotion Recognition

Devamanyu Hazarika

Soujanya Poria

Roger Zimmermann

Amélie Reymond

265

11 Oct 2019

Multilingual Question Answering from Formatted Text applied to Conversational Agents

219

10 Oct 2019

PipeMare: Asynchronous Pipeline Parallel DNN TrainingConference on Machine Learning and Systems (MLSys), 2019

Christopher R. Aberger

Christopher De Sa

344

127

09 Oct 2019

Domain-Relevant Embeddings for Medical Question Similarity

187

09 Oct 2019

HuggingFace's Transformers: State-of-the-art Natural Language Processing

...

527

3,638

09 Oct 2019

Knowledge Distillation from Internal RepresentationsAAAI Conference on Artificial Intelligence (AAAI), 2019

432

198

08 Oct 2019

Read, Highlight and Summarize: A Hierarchical Neural Semantic Encoder-based Approach

Rajeev Bhatt Ambati

Saptarashmi Bandyopadhyay

P. Mitra

08 Oct 2019

BERT for Evidence Retrieval and Claim VerificationEuropean Conference on Information Retrieval (ECIR), 2019

162

144

07 Oct 2019

MASTER: Multi-Aspect Non-local Network for Scene Text RecognitionPattern Recognition (Pattern Recognit.), 2019

Yihao Chen

250

178

07 Oct 2019

Distilling BERT into Simple Neural Networks with Unlabeled Transfer Data

Subhabrata Mukherjee

Ahmed Hassan Awadallah

196

04 Oct 2019

Cracking the Contextual Commonsense Code: Understanding Commonsense Reasoning Aptitude of Deep Contextual RepresentationsConference on Empirical Methods in Natural Language Processing (EMNLP), 2019

Jeff Da

Jungo Kasai

LRM

183

02 Oct 2019

DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter

3.1K

9,292

02 Oct 2019

SummAE: Zero-Shot Abstractive Text Summarization using Length-Agnostic Auto-Encoders

Peter J. Liu

Yu-An Chung

Jie Jessie Ren

270

02 Oct 2019

Exploiting BERT for End-to-End Aspect-based Sentiment AnalysisConference on Empirical Methods in Natural Language Processing (EMNLP), 2019

Xin Li

Lidong Bing

Wenxuan Zhang

W. Lam

267

312

02 Oct 2019

State-of-the-Art Speech Recognition Using Multi-Stream Self-Attention With Dilated 1D ConvolutionsAutomatic Speech Recognition & Understanding (ASRU), 2019

316

01 Oct 2019

Better Document-Level Machine Translation with Bayes' Rule

Wojciech Stokowiec

Lingpeng Kong

203

01 Oct 2019

MMM: Multi-stage Multi-task Learning for Multi-choice Reading ComprehensionAAAI Conference on Artificial Intelligence (AAAI), 2019

268

01 Oct 2019

TMLab: Generative Enhanced Model (GEM) for adversarial attacksConference on Empirical Methods in Natural Language Processing (EMNLP), 2019

156

01 Oct 2019

Biomedical relation extraction with pre-trained language representations and minimal task-specific architectureConference on Empirical Methods in Natural Language Processing (EMNLP), 2019

Ashok Thillaisundaram

Theodosia Togia

120

26 Sep 2019

ALBERT: A Lite BERT for Self-supervised Learning of Language RepresentationsInternational Conference on Learning Representations (ICLR), 2019

1.4K

7,308

26 Sep 2019

Aspect and Opinion Term Extraction for Hotel Reviews using Transfer Learning and Auxiliary Labels

Yosef Ardhito Winatmoko

Ali Akbar Septiandri

Arie Pratama Sutiono

221

26 Sep 2019

Pre-train, Interact, Fine-tune: A Novel Interaction Representation for Text ClassificationInformation Processing & Management (IPM), 2019

Jianming Zheng

Fei Cai

Honghui Chen

Maarten de Rijke

106

26 Sep 2019

FreeLB: Enhanced Adversarial Training for Natural Language UnderstandingInternational Conference on Learning Representations (ICLR), 2019

733

494

25 Sep 2019

UNITER: UNiversal Image-TExt Representation LearningEuropean Conference on Computer Vision (ECCV), 2019

422

469

25 Sep 2019

Extremely Small BERT Models from Mixed-Vocabulary Training

256

25 Sep 2019

Reducing Transformer Depth on Demand with Structured DropoutInternational Conference on Learning Representations (ICLR), 2019

Angela Fan

Edouard Grave

Armand Joulin

728

675

25 Sep 2019

Multi-task Batch Reinforcement Learning with Metric Learning

Henrik I. Christensen

H. Su

OffRL

344

25 Sep 2019

Mixout: Effective Regularization to Finetune Large-scale Pretrained Language ModelsInternational Conference on Learning Representations (ICLR), 2019

526

232

25 Sep 2019

Understanding Semantics from Speech Through Pre-training

142

24 Sep 2019

Technical report on Conversational Question Answering

Fubang Zhao

120

24 Sep 2019

Portuguese Named Entity Recognition using BERT-CRF

Fábio Souza

Rodrigo Nogueira

R. Lotufo

323

283

23 Sep 2019

Cross-Lingual Natural Language Generation via Pre-TrainingAAAI Conference on Artificial Intelligence (AAAI), 2019

340

141

23 Sep 2019

Does BERT Make Any Sense? Interpretable Word Sense Disambiguation with Contextualized EmbeddingsConference on Natural Language Processing (NLP), 2019

335

195

23 Sep 2019

TinyBERT: Distilling BERT for Natural Language UnderstandingFindings (Findings), 2019

Xiaoqi Jiao

Yichun Yin

Lifeng Shang

Xin Jiang

Xiao Chen

Linlin Li

F. Wang

Qun Liu

VLM

714

2,236

23 Sep 2019

Teaching Pretrained Models with Commonsense Reasoning: A Preliminary KB-Based Approach

Shiyang Li

Jianshu Chen

Dian Yu

ReLM LRM

186

20 Sep 2019

Representation Learning for Electronic Health Records

W. Weng

Peter Szolovits

181

19 Sep 2019

ASU at TextGraphs 2019 Shared Task: Explanation ReGeneration using Language Models and Iterative Re-RankingConference on Empirical Methods in Natural Language Processing (EMNLP), 2019

Pratyay Banerjee

LRM

154

19 Sep 2019

Summary Level Training of Sentence Rewriting for Abstractive SummarizationConference on Empirical Methods in Natural Language Processing (EMNLP), 2019

170

19 Sep 2019

Cross-Lingual Contextual Word Embeddings Mapping With Multi-Sense Words In Mind

Zheng Zhang

Ruiqing Yin

Jun Zhu

Pierre Zweigenbaum

117

18 Sep 2019

Language models and Automated Essay Scoring

Pedro Uría Rodríguez

Amir Jafari

C. Ormerod

288

109

18 Sep 2019

Extractive Summarization of Long Documents by Combining Global and Local ContextConference on Empirical Methods in Natural Language Processing (EMNLP), 2019

Wen Xiao

Giuseppe Carenini

250

162

17 Sep 2019

Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism

2.0K

2,581

17 Sep 2019

K-BERT: Enabling Language Representation with Knowledge GraphAAAI Conference on Artificial Intelligence (AAAI), 2019

689

883

17 Sep 2019

I-MAD: Interpretable Malware Detector Using Galaxy TransformerComputers & security (Comput. Secur.), 2019

446

15 Sep 2019

Temporal FiLM: Capturing Long-Range Sequence Dependencies with Feature-Wise ModulationsNeural Information Processing Systems (NeurIPS), 2019

Pang Wei Koh

263

14 Sep 2019

SANVis: Visual Analytics for Understanding Self-Attention NetworksVisual .. (VISUAL), 2019

213

13 Sep 2019