v1v2v3v4v5v6 (latest)

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

International Conference on Learning Representations (ICLR), 2019

26 September 2019

ArXiv (abs)PDF HTML Github (3271★)

Papers citing "ALBERT: A Lite BERT for Self-supervised Learning of Language Representations"

50 / 3,044 papers shown

Title
GMAT: Global Memory Augmentation for Transformers Ankit Gupta Jonathan Berant RALM 145 52 0 05 Jun 2020
Understanding Self-Attention of Self-Supervised Audio Transformers Shu-Wen Yang Andy T. Liu Hung-yi Lee 124 31 0 05 Jun 2020
Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing Zihang Dai Guokun Lai Yiming Yang Quoc V. Le 228 251 0 05 Jun 2020
Position Masking for Language Models Andy Wagner T. Mitra Mrinal Iyer Godfrey Da Costa Marc Tremblay 36 5 0 02 Jun 2020
Subjective Question Answering: Deciphering the inner workings of Transformers in the realm of subjectivity Lukas Muttenthaler 147 3 0 02 Jun 2020
WikiBERT models: deep transfer learning for many languagesNordic Conference of Computational Linguistics (NODALIDA), 2020 S. Pyysalo Jenna Kanerva Antti Virtanen Filip Ginter KELM 151 39 0 02 Jun 2020
Question Answering on Scholarly Knowledge GraphsInternational Conference on Theory and Practice of Digital Libraries (TPDL), 2020 M. Y. Jaradeh M. Stocker Sören Auer LMTD RALM 94 15 0 02 Jun 2020
Careful analysis of XRD patterns with Attention Koichi Kano T. Segi H. Ozono 66 0 0 02 Jun 2020
A Pairwise Probe for Understanding BERT Fine-Tuning on Machine Reading ComprehensionAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2020 Jie Cai Zhengzhou Zhu Ping Nie Qian Liu AAML 87 7 0 02 Jun 2020
BERT-based Ensembles for Modeling Disclosure and Support in Conversational Social Media Text Tanvi Dadu Kartikey Pant R. Mamidi 79 9 0 01 Jun 2020
Emergence of Separable Manifolds in Deep Language RepresentationsInternational Conference on Machine Learning (ICML), 2020 Jonathan Mamou Hang Le Miguel Angel del Rio Cory Stephenson Hanlin Tang Yoon Kim SueYeon Chung AAML AI4CE 260 44 0 01 Jun 2020
Conversational Machine Comprehension: a Literature ReviewInternational Conference on Computational Linguistics (COLING), 2020 Somil Gupta Bhanu Pratap Singh Rawat Hong Yu 172 22 0 01 Jun 2020
A Survey on Transfer Learning in Natural Language Processing Zaid Alyafeai Maged S. Alshaibani Irfan Ahmad 201 84 0 31 May 2020
LRG at SemEval-2020 Task 7: Assessing the Ability of BERT and Derivative Models to Perform Short-Edits based Humor GradingInternational Workshop on Semantic Evaluation (SemEval), 2020 Siddhant Mahurkar Rajaswa Patil 114 8 0 31 May 2020
Beyond Leaderboards: A survey of methods for revealing weaknesses in Natural Language Inference data and models Viktor Schlegel Goran Nenadic Riza Batista-Navarro ELM 171 18 0 29 May 2020
ValueNet: A Natural Language-to-SQL System that Learns from Database Information Ursin Brunner Kurt Stockinger 110 10 0 29 May 2020
Language Models are Few-Shot LearnersNeural Information Processing Systems (NeurIPS), 2020 Tom B. Brown Benjamin Mann Nick Ryder Melanie Subbiah Jared Kaplan ... Christopher Berner Sam McCandlish Alec Radford Ilya Sutskever Dario Amodei BDL 1.9K 51,003 0 28 May 2020
Language Representation Models for Fine-Grained Sentiment Classification Brian Cheang Bailey Wei David Kogan H. Qiu Masud Ahmed AI4MH 121 11 0 27 May 2020
Syntactic Structure Distillation Pretraining For Bidirectional EncodersTransactions of the Association for Computational Linguistics (TACL), 2020 A. Kuncoro Lingpeng Kong Daniel Fried Dani Yogatama Laura Rimell Chris Dyer Phil Blunsom 145 34 0 27 May 2020
GECToR -- Grammatical Error Correction: Tag, Not RewriteWorkshop on Innovative Use of NLP for Building Educational Applications (UNBEA), 2020 Kostiantyn Omelianchuk Vitaliy Atrasevych Artem Chernodub Oleksandr Skurzhanskyi 215 353 0 26 May 2020
ParsBERT: Transformer-based Model for Persian Language UnderstandingNeural Processing Letters (NPL), 2020 Mehrdad Farahani Mohammad Gharachorloo Marzieh Farahani Mohammad Manthouri 210 235 0 26 May 2020
An Audio-enriched BERT-based Framework for Spoken Multiple-choice Question AnsweringInterspeech (Interspeech), 2020 Chia-Chih Kuo Shang-Bao Luo Kuan-Yu Chen 122 18 0 25 May 2020
NILE : Natural Language Inference with Faithful Natural Language ExplanationsAnnual Meeting of the Association for Computational Linguistics (ACL), 2020 Sawan Kumar Partha P. Talukdar XAI LRM 251 169 0 25 May 2020
KaLM at SemEval-2020 Task 4: Knowledge-aware Language Models for Comprehension And Generation Jiajing Wan Xinting Huang LRM 118 5 0 24 May 2020
Transformer-based Context-aware Sarcasm Detection in Conversation Threads from Social Media Xiangjue Dong Changmao Li Jinho Choi 109 28 0 22 May 2020
Open-Retrieval Conversational Question Answering Chen Qu Liu Yang Cen Chen Minghui Qiu W. Bruce Croft Mohit Iyyer RALM 193 192 0 22 May 2020
Comparative Study of Machine Learning Models and BERT on SQuAD Devshree Patel Param Raval Ratnam Parikh Yesha Shastri 78 8 0 22 May 2020
PruneNet: Channel Pruning via Global Importance A. Khetan Zohar Karnin 100 12 0 22 May 2020
Med-BERT: pre-trained contextualized embeddings on large-scale structured electronic health records for disease prediction L. Rasmy Yang Xiang Z. Xie Cui Tao Degui Zhi AI4MH LM&MA 222 826 0 22 May 2020
Pretraining with Contrastive Sentence Objectives Improves Discourse Performance of Language Models Dan Iter Kelvin Guu L. Lansing Dan Jurafsky 154 83 0 20 May 2020
BiQGEMM: Matrix Multiplication with Lookup Table For Binary-Coding-based Quantized DNNs Yongkweon Jeon Baeseong Park S. Kwon Byeongwook Kim Jeongin Yun Dongsoo Lee MQ 277 39 0 20 May 2020
FashionBERT: Text and Image Matching with Adaptive Loss for Cross-modal Retrieval D. Gao Linbo Jin Ben Chen Minghui Qiu Peng Li Yi Wei Yitao Hu Haozhe Jasper Wang OOD 189 146 0 20 May 2020
Normalized Attention Without Probability Cage Oliver Richter Roger Wattenhofer 218 22 0 19 May 2020
Sketch-BERT: Learning Sketch Bidirectional Encoder Representation from Transformers by Self-supervised Learning of Sketch Gestalt Hangyu Lin Yanwei Fu Yu-Gang Jiang Xiangyang Xue SSL 199 75 0 19 May 2020
Audio ALBERT: A Lite BERT for Self-supervised Learning of Audio Representation Po-Han Chi Pei-Hung Chung Tsung-Han Wu Chun-Cheng Hsieh Yen-Hao Chen Shang-Wen Li Hung-yi Lee SSL 288 156 0 18 May 2020
Spatio-Temporal Graph Transformer Networks for Pedestrian Trajectory Prediction Cunjun Yu Xiao Ma Jiawei Ren Haiyu Zhao Shuai Yi 320 561 0 18 May 2020
T-VSE: Transformer-Based Visual Semantic Embedding M. Bastan Arnau Ramisa Mehmet Tek ViT 107 7 0 17 May 2020
CS-NLP team at SemEval-2020 Task 4: Evaluation of State-of-the-art NLP Deep Learning Architectures on Commonsense Reasoning Task Sirwe Saeedi Ali (Aliakbar) Panahi Seyran Saeedi A. Fong ReLM ELM LRM 213 12 0 17 May 2020
Speech Recognition and Multi-Speaker Diarization of Long Conversations H. H. Mao Shuyang Li Julian McAuley G. Cottrell VLM 209 49 0 16 May 2020
CERT: Contrastive Self-supervised Learning for Language Understanding Hongchao Fang Sicheng Wang Meng Zhou Jiayuan Ding P. Xie ELM SSL 181 368 0 16 May 2020
COVID-Twitter-BERT: A Natural Language Processing Model to Analyse COVID-19 Content on Twitter Martin Müller M. Salathé P. Kummervold VLM MedIm AI4MH 182 390 0 15 May 2020
Spelling Error Correction with Soft-Masked BERT Shaohua Zhang Haoran Huang Jicong Liu Hang Li 96 242 0 15 May 2020
WG-WaveNet: Real-Time High-Fidelity Speech Synthesis without GPU Po-Chun Hsu Hung-yi Lee 115 16 0 15 May 2020
Machine Reading Comprehension: The Role of Contextualized Language Models and Beyond Zhuosheng Zhang Hai Zhao Rui Wang 192 66 0 13 May 2020
Automated Extraction of Socio-political Events from News (AESPEN): Workshop and Shared Task Report Ali Hürriyetoǧlu Vanni Zavarella Hristo Tanev E. Yoruk Ali Safaya Osman Mutlu 106 31 0 12 May 2020
A Report on the 2020 Sarcasm Detection Shared Task Debanjan Ghosh Avijit Vajpayee Smaranda Muresan 113 63 0 12 May 2020
SKEP: Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis Hao Tian Can Gao Xinyan Xiao Hao Liu Bolei He Hua Wu Haifeng Wang Feng Wu 194 265 0 12 May 2020
How Context Affects Language Models' Factual Predictions Fabio Petroni Patrick Lewis Aleksandra Piktus Tim Rocktaschel Yuxiang Wu Alexander H. Miller Sebastian Riedel KELM 190 251 0 10 May 2020
schuBERT: Optimizing Elements of BERT A. Khetan Zohar Karnin 184 31 0 09 May 2020
Modeling Document Interactions for Learning to Rank with Regularized Self-Attention Shuo Sun Kevin Duh 93 5 0 08 May 2020