1511.01432
Semi-supervised Sequence Learning
4 November 2015
Andrew M. Dai
Quoc V. Le
SSL
Papers citing
"Semi-supervised Sequence Learning"
50 / 197 papers shown
Title
PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization
Jingqing Zhang
Yao-Min Zhao
Mohammad Saleh
Peter J. Liu
RALM
3DGS
45
2,018
0
18 Dec 2019
FlauBERT: Unsupervised Language Model Pre-training for French
Hang Le
Loïc Vial
Jibril Frej
Vincent Segonne
Maximin Coavoux
Benjamin Lecouteux
A. Allauzen
Benoît Crabbé
Laurent Besacier
D. Schwab
AI4CE
49
395
0
11 Dec 2019
A Transformer-based approach to Irony and Sarcasm detection
Rolandos Alexandros Potamias
Georgios Siolas
A. Stafylopatis
33
206
0
23 Nov 2019
CamemBERT: a Tasty French Language Model
Louis Martin
Benjamin Muller
Pedro Ortiz Suarez
Yoann Dupont
Laurent Romary
Eric Villemonte de la Clergerie
Djamé Seddah
Benoît Sagot
42
956
0
10 Nov 2019
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
129
19,529
0
23 Oct 2019
A Mutual Information Maximization Perspective of Language Representation Learning
Lingpeng Kong
Cyprien de Masson d'Autume
Wang Ling
Lei Yu
Zihang Dai
Dani Yogatama
SSL
226
166
0
18 Oct 2019
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
109
6,377
0
26 Sep 2019
Fine-Tuning Language Models from Human Preferences
Daniel M. Ziegler
Nisan Stiennon
Jeff Wu
Tom B. Brown
Alec Radford
Dario Amodei
Paul Christiano
G. Irving
ALM
301
1,616
0
18 Sep 2019
UER: An Open-Source Toolkit for Pre-training Models
Zhe Zhao
Hui Chen
Jinbin Zhang
Xin Zhao
Tao Liu
Wei Lu
Xi Chen
Haotang Deng
Qi Ju
Xiaoyong Du
28
112
0
12 Sep 2019
CTRL: A Conditional Transformer Language Model for Controllable Generation
N. Keskar
Bryan McCann
L. Varshney
Caiming Xiong
R. Socher
AI4CE
57
1,237
0
11 Sep 2019
Knowledge Enhanced Contextual Word Representations
Matthew E. Peters
Mark Neumann
Robert L. Logan IV
Roy Schwartz
Vidur Joshi
Sameer Singh
Noah A. Smith
234
656
0
09 Sep 2019
Supervised Multimodal Bitransformers for Classifying Images and Text
Douwe Kiela
Suvrat Bhooshan
Hamed Firooz
Ethan Perez
Davide Testuggine
59
242
0
06 Sep 2019
Neural Attentive Bag-of-Entities Model for Text Classification
Ikuya Yamada
Hiroyuki Shindo
15
33
0
03 Sep 2019
Non-local Recurrent Neural Memory for Supervised Sequence Modeling
Canmiao Fu
Wenjie Pei
Qiong Cao
Chaopeng Zhang
Yong Zhao
Xiaoyong Shen
Yu-Wing Tai
24
11
0
26 Aug 2019
Patient Knowledge Distillation for BERT Model Compression
S. Sun
Yu Cheng
Zhe Gan
Jingjing Liu
78
831
0
25 Aug 2019
Denoising based Sequence-to-Sequence Pre-training for Text Generation
Liang Wang
Wei-Ye Zhao
Ruoyu Jia
Sujian Li
Jingming Liu
VLM
AI4CE
42
37
0
22 Aug 2019
"Mask and Infill" : Applying Masked Language Model to Sentiment Transfer
Xing Wu
Tao Zhang
Liangjun Zang
Jizhong Han
Songlin Hu
38
109
0
21 Aug 2019
Similarity Learning for Authorship Verification in Social Media
Benedikt T. Boenninghoff
R. M. Nickel
Steffen Zeiler
D. Kolossa
25
42
0
20 Aug 2019
Scalable Attentive Sentence-Pair Modeling via Distilled Sentence Embedding
Oren Barkan
Noam Razin
Itzik Malkiel
Ori Katz
Avi Caciularu
Noam Koenigstein
FedML
23
37
0
14 Aug 2019
Leveraging Pre-trained Checkpoints for Sequence Generation Tasks
S. Rothe
Shashi Narayan
Aliaksei Severyn
SILM
71
433
0
29 Jul 2019
BAM! Born-Again Multi-Task Networks for Natural Language Understanding
Kevin Clark
Minh-Thang Luong
Urvashi Khandelwal
Christopher D. Manning
Quoc V. Le
24
228
0
10 Jul 2019
Learning Compressed Sentence Representations for On-Device Text Processing
Dinghan Shen
Pengyu Cheng
Dhanasekar Sundararaman
Xinyuan Zhang
Qian Yang
Meng Tang
Asli Celikyilmaz
Lawrence Carin
18
22
0
19 Jun 2019
What Does BERT Look At? An Analysis of BERT's Attention
Kevin Clark
Urvashi Khandelwal
Omer Levy
Christopher D. Manning
MILM
114
1,581
0
11 Jun 2019
Sequence Tagging with Contextual and Non-Contextual Subword Representations: A Multilingual Evaluation
Benjamin Heinzerling
Michael Strube
10
35
0
04 Jun 2019
Pretraining Methods for Dialog Context Representation Learning
Shikib Mehri
E. Razumovskaia
Tiancheng Zhao
M. Eskénazi
22
84
0
02 Jun 2019
Adaptation of Deep Bidirectional Multilingual Transformers for Russian Language
Yuri Kuratov
M. Arkhipov
11
274
0
17 May 2019
Gmail Smart Compose: Real-Time Assisted Writing
Mengzhao Chen
Benjamin Lee
G. Bansal
Yuan Cao
Shuyuan Zhang
...
Yinan Wang
Andrew M. Dai
Zhiwen Chen
Timothy Sohn
Yonghui Wu
18
203
0
17 May 2019
Unified Language Model Pre-training for Natural Language Understanding and Generation
Li Dong
Nan Yang
Wenhui Wang
Furu Wei
Xiaodong Liu
Yu-Chiang Frank Wang
Jianfeng Gao
M. Zhou
H. Hon
ELM
AI4CE
80
1,551
0
08 May 2019
Unsupervised Domain Adaptation of Contextualized Embeddings for Sequence Labeling
Xiaochuang Han
Jacob Eisenstein
19
20
0
04 Apr 2019
To Tune or Not to Tune? Adapting Pretrained Representations to Diverse Tasks
Matthew E. Peters
Sebastian Ruder
Noah A. Smith
39
433
0
14 Mar 2019
An Embarrassingly Simple Approach for Transfer Learning from Pretrained Language Models
Alexandra Chronopoulou
Christos Baziotis
Alexandros Potamianos
CLL
25
129
0
27 Feb 2019
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context
Zihang Dai
Zhilin Yang
Yiming Yang
J. Carbonell
Quoc V. Le
Ruslan Salakhutdinov
VLM
38
3,679
0
09 Jan 2019
Unsupervised Transfer Learning for Spoken Language Understanding in Intelligent Agents
Aditya Siddhant
Anuj Kumar Goyal
A. Metallinou
19
50
0
13 Nov 2018
Learning to Explicitate Connectives with Seq2Seq Network for Implicit Discourse Relation Classification
Junjie Zeng
Yue Hu
19
32
0
05 Nov 2018
Picking Apart Story Salads
Su Wang
Eric Holgate
Greg Durrett
K. Erk
17
6
0
31 Oct 2018
Learning to Represent Edits
Pengcheng Yin
Graham Neubig
Miltiadis Allamanis
Marc Brockschmidt
Alexander L. Gaunt
KELM
26
112
0
31 Oct 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
83
93,140
0
11 Oct 2018
Unsupervised Learning via Meta-Learning
Kyle Hsu
Sergey Levine
Chelsea Finn
SSL
OffRL
37
229
0
04 Oct 2018
Language Modeling Teaches You More Syntax than Translation Does: Lessons Learned Through Auxiliary Task Analysis
Kelly W. Zhang
Samuel R. Bowman
32
70
0
26 Sep 2018
Semi-Supervised Sequence Modeling with Cross-View Training
Kevin Clark
Minh-Thang Luong
Christopher D. Manning
Quoc V. Le
SSL
11
333
0
22 Sep 2018
Improved Semantic-Aware Network Embedding with Fine-Grained Word Alignment
Dinghan Shen
Xinyuan Zhang
Ricardo Henao
Lawrence Carin
AI4TS
22
25
0
29 Aug 2018
Dissecting Contextual Word Embeddings: Architecture and Representation
Matthew E. Peters
Mark Neumann
Luke Zettlemoyer
Wen-tau Yih
35
426
0
27 Aug 2018
Exploiting Deep Learning for Persian Sentiment Analysis
K. Dashtipour
M. Gogate
Ahsan Adeel
C. Ieracitano
H. Larijani
Amir Hussain
26
54
0
15 Aug 2018
Text Classification using Capsules
Jaeyoung Kim
Sion Jang
Sungchul Choi
Eunjeong Lucy Park
19
160
0
12 Aug 2018
Large Scale Language Modeling: Converging on 40GB of Text in Four Hours
Raul Puri
Robert M. Kirby
Nikolai Yakovenko
Bryan Catanzaro
19
29
0
03 Aug 2018
A Simple Method for Commonsense Reasoning
Trieu H. Trinh
Quoc V. Le
LRM
ReLM
31
432
0
07 Jun 2018
Interpretable Adversarial Perturbation in Input Embedding Space for Text
Motoki Sato
Jun Suzuki
Hiroyuki Shindo
Yuji Matsumoto
21
188
0
08 May 2018
Efficient Contextualized Representation: Language Model Pruning for Sequence Labeling
Liyuan Liu
Xiang Ren
Jingbo Shang
Jian-wei Peng
Jiawei Han
25
44
0
20 Apr 2018
Reinforced Co-Training
Jiawei Wu
Lei Li
William Yang Wang
OffRL
25
51
0
17 Apr 2018
A Hierarchical Latent Vector Model for Learning Long-Term Structure in Music
Adam Roberts
Jesse Engel
Colin Raffel
Curtis Hawthorne
Douglas Eck
BDL
41
474
0
13 Mar 2018