BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 15,098 papers shown

Title
Linking Social Media Posts to News with Siamese Transformers Jacob Danovitch 18 2 0 10 Jan 2020
Multiplex Word Embeddings for Selectional Preference Acquisition Hongming Zhang Jiaxin Bai Yan Song Kun Xu Changlong Yu Yangqiu Song Wilfred Ng Dong Yu 11 17 0 09 Jan 2020
On Interpretability of Artificial Neural Networks: A Survey Fenglei Fan Jinjun Xiong Mengzhou Li Ge Wang AAML AI4CE 38 300 0 08 Jan 2020
An Exploration of Embodied Visual Exploration Santhosh Kumar Ramakrishnan Dinesh Jayaraman Kristen Grauman LM&Ro 27 98 0 07 Jan 2020
Attention over Parameters for Dialogue Systems Andrea Madotto Zhaojiang Lin Chien-Sheng Wu Jamin Shin Pascale Fung 22 20 0 07 Jan 2020
Language Models Are An Effective Patient Representation Learning Technique For Electronic Health Record Data E. Steinberg Kenneth Jung Jason Alan Fries Conor K. Corbin Stephen R. Pfohl N. Shah 18 103 0 06 Jan 2020
Social Media Attributions in the Context of Water Crisis Rupak Sarkar Hirak Sarkar Sayantan Mahinder Ashiqur R. KhudaBukhsh 11 10 0 06 Jan 2020
A Survey on Machine Reading Comprehension Systems Razieh Baradaran Razieh Ghiasi Hossein Amirkhani FaML 11 85 0 06 Jan 2020
Stance Detection Benchmark: How Robust Is Your Stance Detection? Benjamin Schiller Johannes Daxenberger Iryna Gurevych 11 95 0 06 Jan 2020
Computationally Efficient NER Taggers with Combined Embeddings and Constrained Decoding Brian Lester Daniel Pressel Amy Hemmeter Sagnik Ray Choudhury 14 3 0 05 Jan 2020
Empirical Studies on the Properties of Linear Regions in Deep Neural Networks Xiao Zhang Dongrui Wu 8 38 0 04 Jan 2020
Two-Level Transformer and Auxiliary Coherence Modeling for Improved Text Segmentation Goran Glavas Swapna Somasundaran VLM 16 55 0 03 Jan 2020
TED: A Pretrained Unsupervised Summarization Model with Theme Modeling and Denoising Ziyi Yang Chenguang Zhu R. Gmyr Michael Zeng Xuedong Huang Eric Darve 15 61 0 03 Jan 2020
A Deep Learning Approach to Diagnosing Multiple Sclerosis from Smartphone Data Patrick Schwab W. Karlen 28 24 0 02 Jan 2020
Stacked DeBERT: All Attention in Incomplete Data for Text Classification Gwenaelle Cunha Sergio Minho Lee 19 30 0 01 Jan 2020
Deep Attentive Ranking Networks for Learning to Order Sentences Pawan Kumar Dhanajit Brahma H. Karnick Piyush Rai 8 45 0 31 Dec 2019
oLMpics -- On what Language Model Pre-training Captures Alon Talmor Yanai Elazar Yoav Goldberg Jonathan Berant LRM 17 300 0 31 Dec 2019
AraNet: A Deep Learning Toolkit for Arabic Social Media Muhammad Abdul-Mageed Chiyu Zhang A. Hashemi El Moatez Billah Nagoudi GNN 8 32 0 30 Dec 2019
Semi-Supervised Learning with Normalizing Flows Pavel Izmailov Polina Kirichenko Marc Finzi A. Wilson DRL BDL 25 111 0 30 Dec 2019
AutoDiscern: Rating the Quality of Online Health Information with Hierarchical Encoder Attention-based Neural Networks Laura Kinkead Ahmed Allam Michael Krauthammer 17 19 0 30 Dec 2019
Machine Learning from a Continuous Viewpoint E. Weinan Chao Ma Lei Wu 23 102 0 30 Dec 2019
Towards Deep Federated Defenses Against Malware in Cloud Ecosystems Josh Payne A. Kundu FedML 15 10 0 27 Dec 2019
Encoding word order in complex embeddings Benyou Wang Donghao Zhao Christina Lioma Qiuchi Li Peng Zhang J. Simonsen 11 111 0 27 Dec 2019
Multi-Graph Transformer for Free-Hand Sketch Recognition Peng-Tao Xu Chaitanya K. Joshi Xavier Bresson ViT 17 85 0 24 Dec 2019
Probing the phonetic and phonological knowledge of tones in Mandarin TTS models Jian Zhu 16 8 0 23 Dec 2019
A Multimodal Target-Source Classifier with Attention Branches to Understand Ambiguous Instructions for Fetching Daily Objects A. Magassouba K. Sugiura Hisashi Kawai 38 9 0 23 Dec 2019
Harnessing Evolution of Multi-Turn Conversations for Effective Answer Retrieval Mohammad Aliannejadi Manajit Chakraborty E. A. Ríssola Fabio Crestani 14 48 0 22 Dec 2019
Pretrained Encyclopedia: Weakly Supervised Knowledge-Pretrained Language Model Wenhan Xiong Jingfei Du William Yang Wang Veselin Stoyanov SSL KELM 38 201 0 20 Dec 2019
BERTje: A Dutch BERT Model Wietse de Vries Andreas van Cranenburgh Arianna Bisazza Tommaso Caselli Gertjan van Noord Malvina Nissim VLM SSeg 11 291 0 19 Dec 2019
Fashion Outfit Complementary Item Retrieval Yen-Liang Lin Son N. Tran Larry S. Davis 8 84 0 19 Dec 2019
PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization Jingqing Zhang Yao-Min Zhao Mohammad Saleh Peter J. Liu RALM 3DGS 43 2,011 0 18 Dec 2019
Transfer learning in hybrid classical-quantum neural networks A. Mari T. Bromley J. Izaac Maria Schuld N. Killoran 19 282 0 17 Dec 2019
Meshed-Memory Transformer for Image Captioning Marcella Cornia Matteo Stefanini Lorenzo Baraldi Rita Cucchiara 14 868 0 17 Dec 2019
A Multi-task Learning Model for Chinese-oriented Aspect Polarity Classification and Aspect Term Extraction Heng Yang Biqing Zeng Jianhao Yang Youwei Song Ruyang Xu 32 132 0 17 Dec 2019
Cross-Lingual Ability of Multilingual BERT: An Empirical Study Karthikeyan K Zihan Wang Stephen D. Mayhew Dan Roth LRM 25 334 0 17 Dec 2019
Multilingual is not enough: BERT for Finnish Antti Virtanen Jenna Kanerva Rami Ilo Jouni Luoma Juhani Luotolahti T. Salakoski Filip Ginter S. Pyysalo 25 277 0 15 Dec 2019
FlauBERT: Unsupervised Language Model Pre-training for French Hang Le Loïc Vial Jibril Frej Vincent Segonne Maximin Coavoux Benjamin Lecouteux A. Allauzen Benoît Crabbé Laurent Besacier D. Schwab AI4CE 35 395 0 11 Dec 2019
CoSimLex: A Resource for Evaluating Graded Word Similarity in Context C. S. Armendariz Matthew Purver Matej Ulčar Senja Pollak Nikola Ljubesic Marko Robnik-Šikonja Mark Granroth-Wilding Kristiina Vaik 14 35 0 11 Dec 2019
Improving Neural Protein-Protein Interaction Extraction with Knowledge Selection Huiwei Zhou Xuefei Li Weihong Yao Zhuang Liu Shixian Ning Chengkun Lang Lei Du 17 7 0 11 Dec 2019
Adversarial Analysis of Natural Language Inference Systems Tiffany Chien Jugal Kalita AAML 34 12 0 07 Dec 2019
Weak Supervision helps Emergence of Word-Object Alignment and improves Vision-Language Tasks Corentin Kervadec G. Antipov M. Baccouche Christian Wolf 21 15 0 06 Dec 2019
Reading the Manual: Event Extraction as Definition Comprehension Yunmo Chen Tongfei Chen Seth Ebner Aaron Steven White Benjamin Van Durme 21 63 0 03 Dec 2019
An Annotated Dataset of Coreference in English Literature David Bamman Olivia Lewke A. Mansoor 6 105 0 03 Dec 2019
Dynamic Prosody Generation for Speech Synthesis using Linguistics-Driven Acoustic Embedding Selection Shubhi Tyagi M. Nicolis Jonas Rohnke Thomas Drugman Jaime Lorenzo-Trueba 26 32 0 02 Dec 2019
Knowledge Infused Learning (K-IL): Towards Deep Incorporation of Knowledge in Deep Learning Ugur Kursuncu Manas Gaur A. Sheth NAI 18 57 0 01 Dec 2019
Deconstructing and reconstructing word embedding algorithms Edward Newell Kian Kenyon-Dean Jackie C.K. Cheung 31 4 0 29 Nov 2019
Blockwisely Supervised Neural Architecture Search with Knowledge Distillation Changlin Li Jiefeng Peng Liuchun Yuan Guangrun Wang Xiaodan Liang Liang Lin Xiaojun Chang 23 179 0 29 Nov 2019
Inducing Relational Knowledge from BERT Zied Bouraoui Jose Camacho-Collados Steven Schockaert 21 166 0 28 Nov 2019
End-to-End Trainable Non-Collaborative Dialog System Yu Li Kun Qian Weiyan Shi Zhou Yu 21 45 0 25 Nov 2019
Who did They Respond to? Conversation Structure Modeling using Masked Hierarchical Transformer Henghui Zhu Feng Nan Zhiguo Wang Ramesh Nallapati Bing Xiang 22 39 0 25 Nov 2019