ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1708.00107
  4. Cited By
Learned in Translation: Contextualized Word Vectors

Learned in Translation: Contextualized Word Vectors

1 August 2017
Bryan McCann
James Bradbury
Caiming Xiong
R. Socher
ArXivPDFHTML

Papers citing "Learned in Translation: Contextualized Word Vectors"

50 / 395 papers shown
Title
Simple and Effective Paraphrastic Similarity from Parallel Translations
Simple and Effective Paraphrastic Similarity from Parallel Translations
John Wieting
Kevin Gimpel
Graham Neubig
Taylor Berg-Kirkpatrick
14
49
0
30 Sep 2019
Integrated Triaging for Fast Reading Comprehension
Integrated Triaging for Fast Reading Comprehension
Felix Wu
Boyi Li
Lequn Wang
Ni Lao
John Blitzer
Kilian Q. Weinberger
11
3
0
28 Sep 2019
ALBERT: A Lite BERT for Self-supervised Learning of Language
  Representations
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
Zhenzhong Lan
Mingda Chen
Sebastian Goodman
Kevin Gimpel
Piyush Sharma
Radu Soricut
SSL
AIMat
53
6,370
0
26 Sep 2019
Pre-train, Interact, Fine-tune: A Novel Interaction Representation for
  Text Classification
Pre-train, Interact, Fine-tune: A Novel Interaction Representation for Text Classification
Jianming Zheng
Fei Cai
Honghui Chen
Maarten de Rijke
17
21
0
26 Sep 2019
Cross-Lingual Natural Language Generation via Pre-Training
Cross-Lingual Natural Language Generation via Pre-Training
Zewen Chi
Li Dong
Furu Wei
Wenhui Wang
Xian-Ling Mao
Heyan Huang
19
136
0
23 Sep 2019
Improving Natural Language Inference with a Pretrained Parser
Improving Natural Language Inference with a Pretrained Parser
D. Pang
Lucy H. Lin
Noah A. Smith
17
14
0
18 Sep 2019
Megatron-LM: Training Multi-Billion Parameter Language Models Using
  Model Parallelism
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
M. Shoeybi
M. Patwary
Raul Puri
P. LeGresley
Jared Casper
Bryan Catanzaro
MoE
245
1,817
0
17 Sep 2019
Cross-Lingual BERT Transformation for Zero-Shot Dependency Parsing
Cross-Lingual BERT Transformation for Zero-Shot Dependency Parsing
Yuxuan Wang
Wanxiang Che
Jiang Guo
Yijia Liu
Ting Liu
11
118
0
15 Sep 2019
Sequence-to-sequence Pre-training with Data Augmentation for Sentence
  Rewriting
Sequence-to-sequence Pre-training with Data Augmentation for Sentence Rewriting
Yi Zhang
Tao Ge
Furu Wei
Ming Zhou
Xu Sun
12
19
0
13 Sep 2019
Retrofitting Contextualized Word Embeddings with Paraphrases
Retrofitting Contextualized Word Embeddings with Paraphrases
Weijia Shi
Muhao Chen
Pei Zhou
Kai-Wei Chang
6
28
0
12 Sep 2019
UER: An Open-Source Toolkit for Pre-training Models
UER: An Open-Source Toolkit for Pre-training Models
Zhe Zhao
Hui Chen
Jinbin Zhang
Xin Zhao
Tao Liu
Wei Lu
Xi Chen
Haotang Deng
Qi Ju
Xiaoyong Du
23
112
0
12 Sep 2019
CTRL: A Conditional Transformer Language Model for Controllable
  Generation
CTRL: A Conditional Transformer Language Model for Controllable Generation
N. Keskar
Bryan McCann
L. Varshney
Caiming Xiong
R. Socher
AI4CE
46
1,232
0
11 Sep 2019
Knowledge Enhanced Contextual Word Representations
Knowledge Enhanced Contextual Word Representations
Matthew E. Peters
Mark Neumann
IV RobertL.Logan
Roy Schwartz
Vidur Joshi
Sameer Singh
Noah A. Smith
226
656
0
09 Sep 2019
Conditional Text Generation for Harmonious Human-Machine Interaction
Conditional Text Generation for Harmonious Human-Machine Interaction
Bin Guo
Hao Wang
Yasan Ding
Wei Wu
Shaoyang Hao
Yueqi Sun
Zhiwen Yu
21
4
0
08 Sep 2019
Pretrained AI Models: Performativity, Mobility, and Change
Pretrained AI Models: Performativity, Mobility, and Change
L. Varshney
N. Keskar
R. Socher
16
20
0
07 Sep 2019
Show Your Work: Improved Reporting of Experimental Results
Show Your Work: Improved Reporting of Experimental Results
Jesse Dodge
Suchin Gururangan
Dallas Card
Roy Schwartz
Noah A. Smith
14
249
0
06 Sep 2019
Distributionally Robust Language Modeling
Distributionally Robust Language Modeling
Yonatan Oren
Shiori Sagawa
Tatsunori B. Hashimoto
Percy Liang
OOD
6
167
0
04 Sep 2019
Trouble on the Horizon: Forecasting the Derailment of Online
  Conversations as they Develop
Trouble on the Horizon: Forecasting the Derailment of Online Conversations as they Develop
Jonathan P. Chang
Cristian Danescu-Niculescu-Mizil
8
59
0
03 Sep 2019
Unicoder: A Universal Language Encoder by Pre-training with Multiple
  Cross-lingual Tasks
Unicoder: A Universal Language Encoder by Pre-training with Multiple Cross-lingual Tasks
Haoyang Huang
Yaobo Liang
Nan Duan
Ming Gong
Linjun Shou
Daxin Jiang
M. Zhou
23
230
0
03 Sep 2019
Evaluating the Cross-Lingual Effectiveness of Massively Multilingual
  Neural Machine Translation
Evaluating the Cross-Lingual Effectiveness of Massively Multilingual Neural Machine Translation
Aditya Siddhant
Melvin Johnson
Henry Tsai
N. Arivazhagan
Jason Riesa
Ankur Bapna
Orhan Firat
Karthik Raman
24
70
0
01 Sep 2019
QuASE: Question-Answer Driven Sentence Encoding
QuASE: Question-Answer Driven Sentence Encoding
Hangfeng He
Qiang Ning
Dan Roth
14
1
0
01 Sep 2019
Evaluation Benchmarks and Learning Criteria for Discourse-Aware Sentence
  Representations
Evaluation Benchmarks and Learning Criteria for Discourse-Aware Sentence Representations
Mingda Chen
Zewei Chu
Kevin Gimpel
17
46
0
31 Aug 2019
EntEval: A Holistic Evaluation Benchmark for Entity Representations
EntEval: A Holistic Evaluation Benchmark for Entity Representations
Mingda Chen
Zewei Chu
Yang Chen
K. Stratos
Kevin Gimpel
11
12
0
31 Aug 2019
Memorizing All for Implicit Discourse Relation Recognition
Memorizing All for Implicit Discourse Relation Recognition
Hongxiao Bai
Hai Zhao
Junhan Zhao
11
10
0
29 Aug 2019
Shallow Syntax in Deep Water
Shallow Syntax in Deep Water
Swabha Swayamdipta
Matthew E. Peters
Brendan Roof
Chris Dyer
Noah A. Smith
12
10
0
29 Aug 2019
Investigating Meta-Learning Algorithms for Low-Resource Natural Language
  Understanding Tasks
Investigating Meta-Learning Algorithms for Low-Resource Natural Language Understanding Tasks
Zi-Yi Dou
Keyi Yu
Antonios Anastasopoulos
6
126
0
27 Aug 2019
FinBERT: Financial Sentiment Analysis with Pre-trained Language Models
FinBERT: Financial Sentiment Analysis with Pre-trained Language Models
Dogu Araci
AIFin
10
623
0
27 Aug 2019
On the Effectiveness of Low-Rank Matrix Factorization for LSTM Model
  Compression
On the Effectiveness of Low-Rank Matrix Factorization for LSTM Model Compression
Genta Indra Winata
Andrea Madotto
Jamin Shin
Elham J. Barezi
Pascale Fung
16
28
0
27 Aug 2019
Non-local Recurrent Neural Memory for Supervised Sequence Modeling
Non-local Recurrent Neural Memory for Supervised Sequence Modeling
Canmiao Fu
Wenjie Pei
Qiong Cao
Chaopeng Zhang
Yong Zhao
Xiaoyong Shen
Yu-Wing Tai
16
11
0
26 Aug 2019
Patient Knowledge Distillation for BERT Model Compression
Patient Knowledge Distillation for BERT Model Compression
S. Sun
Yu Cheng
Zhe Gan
Jingjing Liu
28
826
0
25 Aug 2019
Adversarial Domain Adaptation for Machine Reading Comprehension
Adversarial Domain Adaptation for Machine Reading Comprehension
Huazheng Wang
Zhe Gan
Xiaodong Liu
Jingjing Liu
Jianfeng Gao
Hongning Wang
23
63
0
24 Aug 2019
BERT for Coreference Resolution: Baselines and Analysis
BERT for Coreference Resolution: Baselines and Analysis
Mandar Joshi
Omer Levy
Daniel S. Weld
Luke Zettlemoyer
7
320
0
24 Aug 2019
Denoising based Sequence-to-Sequence Pre-training for Text Generation
Denoising based Sequence-to-Sequence Pre-training for Text Generation
Liang Wang
Wei-Ye Zhao
Ruoyu Jia
Sujian Li
Jingming Liu
VLM
AI4CE
34
37
0
22 Aug 2019
Encoder-Agnostic Adaptation for Conditional Language Generation
Encoder-Agnostic Adaptation for Conditional Language Generation
Zachary M. Ziegler
Luke Melas-Kyriazi
Sebastian Gehrmann
Alexander M. Rush
AI4CE
11
57
0
19 Aug 2019
Visualizing and Understanding the Effectiveness of BERT
Visualizing and Understanding the Effectiveness of BERT
Y. Hao
Li Dong
Furu Wei
Ke Xu
22
181
0
15 Aug 2019
Raw-to-End Name Entity Recognition in Social Media
Raw-to-End Name Entity Recognition in Social Media
Liyuan Liu
Zihan Wang
Jingbo Shang
Dandong Yin
Heng Ji
Xiang Ren
Shaowen Wang
Jiawei Han
8
3
0
14 Aug 2019
StructBERT: Incorporating Language Structures into Pre-training for Deep
  Language Understanding
StructBERT: Incorporating Language Structures into Pre-training for Deep Language Understanding
Wei Wang
Bin Bi
Ming Yan
Chen Henry Wu
Zuyi Bao
Jiangnan Xia
Liwei Peng
Luo Si
12
260
0
13 Aug 2019
Semi-supervised Thai Sentence Segmentation Using Local and Distant Word
  Representations
Semi-supervised Thai Sentence Segmentation Using Local and Distant Word Representations
Chanatip Saetia
E. Chuangsuwanich
Tawunrat Chalothorn
P. Vateekul
15
5
0
04 Aug 2019
Representation Degeneration Problem in Training Natural Language
  Generation Models
Representation Degeneration Problem in Training Natural Language Generation Models
Jun Gao
Di He
Xu Tan
Tao Qin
Liwei Wang
Tie-Yan Liu
8
263
0
28 Jul 2019
RoBERTa: A Robustly Optimized BERT Pretraining Approach
RoBERTa: A Robustly Optimized BERT Pretraining Approach
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
82
23,856
0
26 Jul 2019
MacNet: Transferring Knowledge from Machine Comprehension to
  Sequence-to-Sequence Models
MacNet: Transferring Knowledge from Machine Comprehension to Sequence-to-Sequence Models
Boyuan Pan
Yazheng Yang
Hao Li
Zhou Zhao
Yueting Zhuang
Deng Cai
Xiaofei He
11
18
0
23 Jul 2019
Discourse Marker Augmented Network with Reinforcement Learning for
  Natural Language Inference
Discourse Marker Augmented Network with Reinforcement Learning for Natural Language Inference
Boyuan Pan
Yazheng Yang
Zhou Zhao
Yueting Zhuang
Deng Cai
Xiaofei He
11
33
0
23 Jul 2019
To Tune or Not To Tune? How About the Best of Both Worlds?
To Tune or Not To Tune? How About the Best of Both Worlds?
Ran A. Wang
Haibo Su
Chunye Wang
Kailin Ji
J. Ding
VLM
25
17
0
09 Jul 2019
A Comparative Analysis of Knowledge-Intensive and Data-Intensive
  Semantic Parsers
A Comparative Analysis of Knowledge-Intensive and Data-Intensive Semantic Parsers
Junjie Cao
Zi-yu Lin
Weiwei SUN
Xiaojun Wan
6
1
0
04 Jul 2019
Neural Machine Reading Comprehension: Methods and Trends
Neural Machine Reading Comprehension: Methods and Trends
Kaixuan Li
Xiujuan Xian
Sheng Zhang
Jiafu Wang
N. Yu
FaML
17
12
0
02 Jul 2019
Latent Variable Sentiment Grammar
Latent Variable Sentiment Grammar
Liwen Zhang
Kewei Tu
Yue Zhang
17
5
0
29 Jun 2019
Supervised Contextual Embeddings for Transfer Learning in Natural
  Language Processing Tasks
Supervised Contextual Embeddings for Transfer Learning in Natural Language Processing Tasks
Mihir Kale
Aditya Siddhant
Sreyashi Nag
R. Parik
Matthias Grabmair
A. Tomasic
SSL
13
5
0
28 Jun 2019
A Neural-based Program Decompiler
A Neural-based Program Decompiler
Cheng Fu
Huili Chen
Haolan Liu
Xinyun Chen
Yuandong Tian
F. Koushanfar
Jishen Zhao
14
3
0
28 Jun 2019
XLNet: Generalized Autoregressive Pretraining for Language Understanding
XLNet: Generalized Autoregressive Pretraining for Language Understanding
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
8
8,340
0
19 Jun 2019
A Simple and Effective Approach to Automatic Post-Editing with Transfer
  Learning
A Simple and Effective Approach to Automatic Post-Editing with Transfer Learning
Gonçalo M. Correia
André F. T. Martins
11
42
0
14 Jun 2019
Previous
12345678
Next