ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.08237
  4. Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding
v1v2 (latest)

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Neural Information Processing Systems (NeurIPS), 2019
19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

32 / 3,732 papers shown
DLGNet: A Transformer-based Model for Dialogue Response Generation
DLGNet: A Transformer-based Model for Dialogue Response Generation
O. Olabiyi
Erik T. Mueller
191
13
0
26 Jul 2019
RoBERTa: A Robustly Optimized BERT Pretraining Approach
RoBERTa: A Robustly Optimized BERT Pretraining Approach
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
4.0K
28,007
0
26 Jul 2019
SpanBERT: Improving Pre-training by Representing and Predicting Spans
SpanBERT: Improving Pre-training by Representing and Predicting SpansTransactions of the Association for Computational Linguistics (TACL), 2019
Mandar Joshi
Danqi Chen
Yinhan Liu
Daniel S. Weld
Luke Zettlemoyer
Omer Levy
622
2,106
0
24 Jul 2019
Green AI
Green AI
Roy Schwartz
Jesse Dodge
Noah A. Smith
Oren Etzioni
857
1,429
0
22 Jul 2019
Introduction to Neural Network based Approaches for Question Answering
  over Knowledge Graphs
Introduction to Neural Network based Approaches for Question Answering over Knowledge Graphs
Nilesh Chakraborty
Denis Lukovnikov
Gaurav Maheshwari
Priyansh Trivedi
Jens Lehmann
Asja Fischer
GNNLMTD
152
56
0
22 Jul 2019
What is this Article about? Extreme Summarization with Topic-aware
  Convolutional Neural Networks
What is this Article about? Extreme Summarization with Topic-aware Convolutional Neural NetworksJournal of Artificial Intelligence Research (JAIR), 2019
Shashi Narayan
Shay B. Cohen
Mirella Lapata
AILaw
231
18
0
19 Jul 2019
A Pragmatics-Centered Evaluation Framework for Natural Language
  Understanding
A Pragmatics-Centered Evaluation Framework for Natural Language UnderstandingInternational Conference on Language Resources and Evaluation (LREC), 2019
Damien Sileo
Tim Van de Cruys
Camille Pradel
Philippe Muller
ELM
140
3
0
19 Jul 2019
Low-Shot Classification: A Comparison of Classical and Deep Transfer
  Machine Learning Approaches
Low-Shot Classification: A Comparison of Classical and Deep Transfer Machine Learning Approaches
Peter Usherwood
S. Smit
VLM
86
12
0
17 Jul 2019
DeepTrax: Embedding Graphs of Financial Transactions
DeepTrax: Embedding Graphs of Financial TransactionsInternational Conference on Machine Learning and Applications (ICMLA), 2019
C. Bayan Bruss
Anish Khazane
Jonathan Rider
R. Serpe
Antonia Gogoglou
Keegan E. Hines
AIFinGNN
179
53
0
16 Jul 2019
Can Unconditional Language Models Recover Arbitrary Sentences?
Can Unconditional Language Models Recover Arbitrary Sentences?Neural Information Processing Systems (NeurIPS), 2019
Nishant Subramani
Samuel R. Bowman
Dong Wang
171
25
0
10 Jul 2019
Head-Driven Phrase Structure Grammar Parsing on Penn Treebank
Head-Driven Phrase Structure Grammar Parsing on Penn TreebankAnnual Meeting of the Association for Computational Linguistics (ACL), 2019
Junru Zhou
Zhao Hai
362
150
0
05 Jul 2019
Katecheo: A Portable and Modular System for Multi-Topic Question
  Answering
Katecheo: A Portable and Modular System for Multi-Topic Question Answering
S. Hirekodi
Seban Sunny
Leonard Topno
Alwin Daniel
Daniel Whitenack
Reuben Skewes
Stuart Cranney
KELM
128
1
0
01 Jul 2019
Pre-Training with Whole Word Masking for Chinese BERT
Pre-Training with Whole Word Masking for Chinese BERTIEEE/ACM Transactions on Audio Speech and Language Processing (TASLP), 2019
Yiming Cui
Wanxiang Che
Ting Liu
Bing Qin
Ziqing Yang
265
233
0
19 Jun 2019
Calibration, Entropy Rates, and Memory in Language Models
Calibration, Entropy Rates, and Memory in Language ModelsInternational Conference on Machine Learning (ICML), 2019
M. Braverman
Xinyi Chen
Sham Kakade
Karthik Narasimhan
Cyril Zhang
Yi Zhang
228
44
0
11 Jun 2019
Better Long-Range Dependency By Bootstrapping A Mutual Information
  Regularizer
Better Long-Range Dependency By Bootstrapping A Mutual Information RegularizerInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2019
Yanshuai Cao
Peng Xu
131
2
0
28 May 2019
A Review of Keyphrase Extraction
A Review of Keyphrase Extraction
Eirini Papagiannopoulou
Grigorios Tsoumakas
220
184
0
13 May 2019
Deep Unsupervised Cardinality Estimation
Deep Unsupervised Cardinality EstimationProceedings of the VLDB Endowment (PVLDB), 2019
Zongheng Yang
Eric Liang
Amog Kamsetty
Chenggang Wu
Yan Duan
Peter Chen
Pieter Abbeel
J. M. Hellerstein
S. Krishnan
Ion Stoica
234
230
0
10 May 2019
Survey on Evaluation Methods for Dialogue Systems
Survey on Evaluation Methods for Dialogue SystemsArtificial Intelligence Review (AIR), 2019
Jan Deriu
Álvaro Rodrigo
Arantxa Otegi
Guillermo Echegoyen
S. Rosset
Eneko Agirre
Mark Cieliebak
278
322
0
10 May 2019
Taming Pretrained Transformers for Extreme Multi-label Text
  Classification
Taming Pretrained Transformers for Extreme Multi-label Text Classification
Wei-Cheng Chang
Hsiang-Fu Yu
Kai Zhong
Yiming Yang
Inderjit Dhillon
272
20
0
07 May 2019
SuperGLUE: A Stickier Benchmark for General-Purpose Language
  Understanding Systems
SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding SystemsNeural Information Processing Systems (NeurIPS), 2019
Alex Jinpeng Wang
Yada Pruksachatkun
Nikita Nangia
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
714
2,629
0
02 May 2019
Terminologies augmented recurrent neural network model for clinical
  named entity recognition
Terminologies augmented recurrent neural network model for clinical named entity recognition
Ivan Lerner
N. Paris
Xavier Tannier
151
38
0
25 Apr 2019
BERTScore: Evaluating Text Generation with BERT
BERTScore: Evaluating Text Generation with BERT
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi
2.4K
7,563
0
21 Apr 2019
DocBERT: BERT for Document Classification
DocBERT: BERT for Document Classification
Ashutosh Adhikari
Achyudh Ram
Raphael Tang
Jimmy J. Lin
LLMAGVLM
286
320
0
17 Apr 2019
An Attentive Survey of Attention Models
An Attentive Survey of Attention Models
S. Chaudhari
Varun Mithal
Gungor Polatkan
R. Ramanath
444
723
0
05 Apr 2019
Recent Advances in Natural Language Inference: A Survey of Benchmarks,
  Resources, and Approaches
Recent Advances in Natural Language Inference: A Survey of Benchmarks, Resources, and Approaches
Shane Storks
Qiaozi Gao
J. Chai
476
142
0
02 Apr 2019
Contextual Word Representations: A Contextual Introduction
Contextual Word Representations: A Contextual Introduction
Noah A. Smith
242
35
0
15 Feb 2019
Dual Co-Matching Network for Multi-choice Reading Comprehension
Dual Co-Matching Network for Multi-choice Reading Comprehension
Shuailiang Zhang
Zhao Hai
Yuwei Wu
Zhuosheng Zhang
Xi Zhou
Xiaoping Zhou
316
135
0
27 Jan 2019
AccUDNN: A GPU Memory Efficient Accelerator for Training Ultra-deep
  Neural Networks
AccUDNN: A GPU Memory Efficient Accelerator for Training Ultra-deep Neural Networks
Jinrong Guo
Wantao Liu
Wang Wang
Q. Lu
Songlin Hu
Jizhong Han
Ruixuan Li
165
11
0
21 Jan 2019
Sentence transition matrix: An efficient approach that preserves
  sentence semantics
Sentence transition matrix: An efficient approach that preserves sentence semantics
Myeongjun Jang
Pilsung Kang
103
3
0
16 Jan 2019
Neural Abstractive Text Summarization with Sequence-to-Sequence Models
Neural Abstractive Text Summarization with Sequence-to-Sequence Models
Tian Shi
Yaser Keneshloo
Naren Ramakrishnan
Chandan K. Reddy
451
252
0
05 Dec 2018
Efficient Attention: Attention with Linear Complexities
Efficient Attention: Attention with Linear Complexities
Zhuoran Shen
Mingyuan Zhang
Haiyu Zhao
Shuai Yi
Jiaming Song
682
669
0
04 Dec 2018
Compositional Coding Capsule Network with K-Means Routing for Text
  Classification
Compositional Coding Capsule Network with K-Means Routing for Text Classification
Hao Ren
Hong-wei Lu
257
56
0
22 Oct 2018
Previous
123...737475
Page 75 of 75
Pageof 75