ResearchTrend.AI
Semi-supervised Sequence Learning

4 November 2015
Andrew M. Dai
Quoc V. Le
    SSL

Papers citing "Semi-supervised Sequence Learning"

50 / 197 papers shown
PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization
Jingqing Zhang
Yao-Min Zhao
Mohammad Saleh
Peter J. Liu
RALM
3DGS
45
2,018
0
18 Dec 2019
FlauBERT: Unsupervised Language Model Pre-training for French
Hang Le
Loïc Vial
Jibril Frej
Vincent Segonne
Maximin Coavoux
Benjamin Lecouteux
A. Allauzen
Benoît Crabbé
Laurent Besacier
D. Schwab
AI4CE
49
395
0
11 Dec 2019
A Transformer-based approach to Irony and Sarcasm detection
Rolandos Alexandros Potamias
Georgios Siolas
A. Stafylopatis
33
206
0
23 Nov 2019
CamemBERT: a Tasty French Language Model
Louis Martin
Benjamin Muller
Pedro Ortiz Suarez
Yoann Dupont
Laurent Romary
Eric Villemonte de la Clergerie
Djamé Seddah
Benoît Sagot
42
956
0
10 Nov 2019
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
129
19,529
0
23 Oct 2019
A Mutual Information Maximization Perspective of Language Representation Learning
Lingpeng Kong
Cyprien de Masson d'Autume
Wang Ling
Lei Yu
Zihang Dai
Dani Yogatama
SSL
226
166
0
18 Oct 2019
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
109
6,377
0
26 Sep 2019
Fine-Tuning Language Models from Human Preferences
Daniel M. Ziegler
Nisan Stiennon
Jeff Wu
Tom B. Brown
Alec Radford
Dario Amodei
Paul Christiano
G. Irving
ALM
301
1,616
0
18 Sep 2019
UER: An Open-Source Toolkit for Pre-training Models
Zhe Zhao
Hui Chen
Jinbin Zhang
Xin Zhao
Tao Liu
Wei Lu
Xi Chen
Haotang Deng
Qi Ju
Xiaoyong Du
28
112
0
12 Sep 2019
CTRL: A Conditional Transformer Language Model for Controllable Generation
N. Keskar
Bryan McCann
L. Varshney
Caiming Xiong
R. Socher
AI4CE
57
1,237
0
11 Sep 2019
Knowledge Enhanced Contextual Word Representations
Matthew E. Peters
Mark Neumann
Robert L. Logan IV
Roy Schwartz
Vidur Joshi
Sameer Singh
Noah A. Smith
234
656
0
09 Sep 2019
Supervised Multimodal Bitransformers for Classifying Images and Text
Douwe Kiela
Suvrat Bhooshan
Hamed Firooz
Ethan Perez
Davide Testuggine
59
242
0
06 Sep 2019
Neural Attentive Bag-of-Entities Model for Text Classification
Ikuya Yamada
Hiroyuki Shindo
15
33
0
03 Sep 2019
Non-local Recurrent Neural Memory for Supervised Sequence Modeling
Canmiao Fu
Wenjie Pei
Qiong Cao
Chaopeng Zhang
Yong Zhao
Xiaoyong Shen
Yu-Wing Tai
24
11
0
26 Aug 2019
Patient Knowledge Distillation for BERT Model Compression
S. Sun
Yu Cheng
Zhe Gan
Jingjing Liu
78
831
0
25 Aug 2019
Denoising based Sequence-to-Sequence Pre-training for Text Generation
Liang Wang
Wei-Ye Zhao
Ruoyu Jia
Sujian Li
Jingming Liu
VLM
AI4CE
42
37
0
22 Aug 2019
"Mask and Infill" : Applying Masked Language Model to Sentiment Transfer
Xing Wu
Tao Zhang
Liangjun Zang
Jizhong Han
Songlin Hu
38
109
0
21 Aug 2019
Similarity Learning for Authorship Verification in Social Media
Benedikt T. Boenninghoff
R. M. Nickel
Steffen Zeiler
D. Kolossa
25
42
0
20 Aug 2019
Scalable Attentive Sentence-Pair Modeling via Distilled Sentence Embedding
Oren Barkan
Noam Razin
Itzik Malkiel
Ori Katz
Avi Caciularu
Noam Koenigstein
FedML
23
37
0
14 Aug 2019
Leveraging Pre-trained Checkpoints for Sequence Generation Tasks
S. Rothe
Shashi Narayan
Aliaksei Severyn
SILM
71
433
0
29 Jul 2019
BAM! Born-Again Multi-Task Networks for Natural Language Understanding
Kevin Clark
Minh-Thang Luong
Urvashi Khandelwal
Christopher D. Manning
Quoc V. Le
24
228
0
10 Jul 2019
Learning Compressed Sentence Representations for On-Device Text Processing
Dinghan Shen
Pengyu Cheng
Dhanasekar Sundararaman
Xinyuan Zhang
Qian Yang
Meng Tang
Asli Celikyilmaz
Lawrence Carin
18
22
0
19 Jun 2019
What Does BERT Look At? An Analysis of BERT's Attention
Kevin Clark
Urvashi Khandelwal
Omer Levy
Christopher D. Manning
MILM
114
1,581
0
11 Jun 2019
Sequence Tagging with Contextual and Non-Contextual Subword Representations: A Multilingual Evaluation
Benjamin Heinzerling
Michael Strube
10
35
0
04 Jun 2019
Pretraining Methods for Dialog Context Representation Learning
Shikib Mehri
E. Razumovskaia
Tiancheng Zhao
M. Eskénazi
22
84
0
02 Jun 2019
Adaptation of Deep Bidirectional Multilingual Transformers for Russian Language
Yuri Kuratov
M. Arkhipov
11
274
0
17 May 2019
Gmail Smart Compose: Real-Time Assisted Writing
Mengzhao Chen
Benjamin Lee
G. Bansal
Yuan Cao
Shuyuan Zhang
...
Yinan Wang
Andrew M. Dai
Zhiwen Chen
Timothy Sohn
Yonghui Wu
18
203
0
17 May 2019
Unified Language Model Pre-training for Natural Language Understanding and Generation
Li Dong
Nan Yang
Wenhui Wang
Furu Wei
Xiaodong Liu
Yu-Chiang Frank Wang
Jianfeng Gao
M. Zhou
H. Hon
ELM
AI4CE
80
1,551
0
08 May 2019
Unsupervised Domain Adaptation of Contextualized Embeddings for Sequence Labeling
Xiaochuang Han
Jacob Eisenstein
19
20
0
04 Apr 2019
To Tune or Not to Tune? Adapting Pretrained Representations to Diverse Tasks
Matthew E. Peters
Sebastian Ruder
Noah A. Smith
39
433
0
14 Mar 2019
An Embarrassingly Simple Approach for Transfer Learning from Pretrained Language Models
Alexandra Chronopoulou
Christos Baziotis
Alexandros Potamianos
CLL
25
129
0
27 Feb 2019
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context
Zihang Dai
Zhilin Yang
Yiming Yang
J. Carbonell
Quoc V. Le
Ruslan Salakhutdinov
VLM
38
3,679
0
09 Jan 2019
Unsupervised Transfer Learning for Spoken Language Understanding in Intelligent Agents
Aditya Siddhant
Anuj Kumar Goyal
A. Metallinou
19
50
0
13 Nov 2018
Learning to Explicitate Connectives with Seq2Seq Network for Implicit Discourse Relation Classification
Junjie Zeng
Yue Hu
19
32
0
05 Nov 2018
Picking Apart Story Salads
Su Wang
Eric Holgate
Greg Durrett
K. Erk
17
6
0
31 Oct 2018
Learning to Represent Edits
Pengcheng Yin
Graham Neubig
Miltiadis Allamanis
Marc Brockschmidt
Alexander L. Gaunt
KELM
26
112
0
31 Oct 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
83
93,140
0
11 Oct 2018
Unsupervised Learning via Meta-Learning
Kyle Hsu
Sergey Levine
Chelsea Finn
SSL
OffRL
37
229
0
04 Oct 2018
Language Modeling Teaches You More Syntax than Translation Does: Lessons Learned Through Auxiliary Task Analysis
Kelly W. Zhang
Samuel R. Bowman
32
70
0
26 Sep 2018
Semi-Supervised Sequence Modeling with Cross-View Training
Kevin Clark
Minh-Thang Luong
Christopher D. Manning
Quoc V. Le
SSL
11
333
0
22 Sep 2018
Improved Semantic-Aware Network Embedding with Fine-Grained Word Alignment
Dinghan Shen
Xinyuan Zhang
Ricardo Henao
Lawrence Carin
AI4TS
22
25
0
29 Aug 2018
Dissecting Contextual Word Embeddings: Architecture and Representation
Matthew E. Peters
Mark Neumann
Luke Zettlemoyer
Wen-tau Yih
35
426
0
27 Aug 2018
Exploiting Deep Learning for Persian Sentiment Analysis
K. Dashtipour
M. Gogate
Ahsan Adeel
C. Ieracitano
H. Larijani
Amir Hussain
26
54
0
15 Aug 2018
Text Classification using Capsules
Jaeyoung Kim
Sion Jang
Sungchul Choi
Eunjeong Lucy Park
19
160
0
12 Aug 2018
Large Scale Language Modeling: Converging on 40GB of Text in Four Hours
Raul Puri
Robert M. Kirby
Nikolai Yakovenko
Bryan Catanzaro
19
29
0
03 Aug 2018
A Simple Method for Commonsense Reasoning
Trieu H. Trinh
Quoc V. Le
LRM
ReLM
31
432
0
07 Jun 2018
Interpretable Adversarial Perturbation in Input Embedding Space for Text
Motoki Sato
Jun Suzuki
Hiroyuki Shindo
Yuji Matsumoto
21
188
0
08 May 2018
Efficient Contextualized Representation: Language Model Pruning for Sequence Labeling
Liyuan Liu
Xiang Ren
Jingbo Shang
Jian-wei Peng
Jiawei Han
25
44
0
20 Apr 2018
Reinforced Co-Training
Jiawei Wu
Lei Li
William Yang Wang
OffRL
25
51
0
17 Apr 2018
A Hierarchical Latent Vector Model for Learning Long-Term Structure in Music
Adam Roberts
Jesse Engel
Colin Raffel
Curtis Hawthorne
Douglas Eck
BDL
41
474
0
13 Mar 2018