BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

11 October 2018

Papers citing "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"

50 / 15,000 papers shown

Title
A Neighbourhood Framework for Resource-Lean Content Flagging Sheikh Muhammad Sarwar Dimitrina Zlatkova Momchil Hardalov Yoan Dinkov Isabelle Augenstein Preslav Nakov 16 5 0 31 Mar 2021
Deep Neural Approaches to Relation Triplets Extraction: A Comprehensive Survey Tapas Nayak Navonil Majumder Pawan Goyal Soujanya Poria ViT 14 49 0 31 Mar 2021
Learning Generalizable Robotic Reward Functions from "In-The-Wild" Human Videos Annie S. Chen Suraj Nair Chelsea Finn 30 137 0 31 Mar 2021
Dual Contrastive Loss and Attention for GANs Ning Yu Guilin Liu Aysegül Dündar Andrew Tao Bryan Catanzaro Larry S. Davis Mario Fritz GAN 24 60 0 31 Mar 2021
EnergyVis: Interactively Tracking and Exploring Energy Consumption for ML Models Omar Shaikh Jon Saad-Falcon Austin P. Wright Nilaksh Das Scott Freitas O. Asensio Duen Horng Chau 24 18 0 30 Mar 2021
Grounding Dialogue Systems via Knowledge Graph Aware Decoding with Pre-trained Transformers Debanjan Chaudhuri Md. Rony Jens Lehmann 13 12 0 30 Mar 2021
Autocorrect in the Process of Translation -- Multi-task Learning Improves Dialogue Machine Translation Tao Wang Chengqi Zhao Mingxuan Wang Lei Li Deyi Xiong 20 13 0 30 Mar 2021
CaSiNo: A Corpus of Campsite Negotiation Dialogues for Automatic Negotiation Systems Kushal Chawla Jaysa Ramirez Rene Clever Gale M. Lucas Jonathan May Jonathan Gratch 15 50 0 29 Mar 2021
ViViT: A Video Vision Transformer Anurag Arnab Mostafa Dehghani G. Heigold Chen Sun Mario Lucic Cordelia Schmid ViT 30 2,086 0 29 Mar 2021
On the Adversarial Robustness of Vision Transformers Rulin Shao Zhouxing Shi Jinfeng Yi Pin-Yu Chen Cho-Jui Hsieh ViT 30 137 0 29 Mar 2021
Efficient Explanations from Empirical Explainers Robert Schwarzenberg Nils Feldhus Sebastian Möller FAtt 29 9 0 29 Mar 2021
Changing the Mind of Transformers for Topically-Controllable Language Generation Haw-Shiuan Chang Jiaming Yuan Mohit Iyyer Andrew McCallum 20 9 0 29 Mar 2021
PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS Ye Jia Heiga Zen Jonathan Shen Yu Zhang Yonghui Wu SSL 19 81 0 28 Mar 2021
TransICD: Transformer Based Code-wise Attention Model for Explainable ICD Coding Biplob Biswas Thai-Hoang Pham Ping Zhang 13 29 0 28 Mar 2021
Accurate and Reliable Forecasting using Stochastic Differential Equations Peng Cui Zhijie Deng Wenbo Hu Jun Zhu UQCV 30 1 0 28 Mar 2021
Automated Backend-Aware Post-Training Quantization Ziheng Jiang Animesh Jain An Liu Josh Fromm Chengqian Ma Tianqi Chen Luis Ceze MQ 35 2 0 27 Mar 2021
Machine Learning Meets Natural Language Processing -- The story so far N. Galanis P. Vafiadis K.-G. Mirzaev G. Papakostas 30 6 0 27 Mar 2021
Synthesis of Compositional Animations from Textual Descriptions Anindita Ghosh N. Cheema Cennet Oguz Christian Theobalt P. Slusallek 31 170 0 26 Mar 2021
Gated Transformer Networks for Multivariate Time Series Classification Minghao Liu Shengqi Ren Siyuan Ma Jiahui Jiao Yizhou Chen Zhiguang Wang Wei Song AI4TS 36 130 0 26 Mar 2021
Describing and Localizing Multiple Changes with Transformers Yue Qiu Shintaro Yamamoto Kodai Nakashima Ryota Suzuki K. Iwata Hirokatsu Kataoka Y. Satoh 27 55 0 25 Mar 2021
High-Fidelity Pluralistic Image Completion with Transformers Ziyu Wan Jingbo Zhang Dongdong Chen Jing Liao ViT 23 231 0 25 Mar 2021
Bertinho: Galician BERT Representations David Vilares Marcos Garcia Carlos Gómez-Rodríguez 57 22 0 25 Mar 2021
Vectorization and Rasterization: Self-Supervised Learning for Sketch and Handwriting A. Bhunia Pinaki Nath Chowdhury Yongxin Yang Timothy M. Hospedales Tao Xiang Yi-Zhe Song SSL 17 59 0 25 Mar 2021
An Approach to Improve Robustness of NLP Systems against ASR Errors Tong Cui Jinghui Xiao Liangyou Li Xin Jiang Qun Liu 19 11 0 25 Mar 2021
Improving Online Forums Summarization via Hierarchical Unified Deep Neural Network Sansiri Tarnpradab Fereshteh Jafariakinabad K. Hua 13 5 0 25 Mar 2021
Efficient Feature Transformations for Discriminative and Generative Continual Learning Vinay K. Verma Kevin J Liang Nikhil Mehta Piyush Rai Lawrence Carin CLL 35 76 0 25 Mar 2021
Vision Transformers for Dense Prediction René Ranftl Alexey Bochkovskiy V. Koltun ViT MDE 42 1,659 0 24 Mar 2021
FastMoE: A Fast Mixture-of-Expert Training System Jiaao He J. Qiu Aohan Zeng Zhilin Yang Jidong Zhai Jie Tang ALM MoE 22 94 0 24 Mar 2021
Representing Numbers in NLP: a Survey and a Vision Avijit Thawani Jay Pujara Pedro A. Szekely Filip Ilievski 24 114 0 24 Mar 2021
Thinking Aloud: Dynamic Context Generation Improves Zero-Shot Reasoning Performance of GPT-2 Gregor Betz Kyle Richardson Christian Voigt ReLM LRM 16 29 0 24 Mar 2021
Czert -- Czech BERT-like Model for Language Representation Jakub Sido O. Pražák P. Pribán Jan Pasek Michal Seják Miloslav Konopík 16 43 0 24 Mar 2021
UNICORN on RAINBOW: A Universal Commonsense Reasoning Model on a New Multitask Benchmark Nicholas Lourie Ronan Le Bras Chandra Bhagavatula Yejin Choi LRM 22 137 0 24 Mar 2021
Multi-view 3D Reconstruction with Transformer Dan Wang Xinrui Cui Xun Chen Zhengxia Zou Tianyang Shi Septimiu Salcudean Z. J. Wang Rabab Ward ViT 20 87 0 24 Mar 2021
Scene-Intuitive Agent for Remote Embodied Visual Grounding Xiangru Lin Guanbin Li Yizhou Yu LM&Ro 22 52 0 24 Mar 2021
Region Similarity Representation Learning Tete Xiao Colorado Reed Xiaolong Wang Kurt Keutzer Trevor Darrell VLM SSL 29 116 0 24 Mar 2021
The NLP Cookbook: Modern Recipes for Transformer based Deep Learning Architectures Sushant Singh A. Mahmood AI4TS 60 92 0 23 Mar 2021
Self-Supervised Pretraining Improves Self-Supervised Pretraining Colorado Reed Xiangyu Yue Aniruddha Nrusimha Sayna Ebrahimi Vivek Vijaykumar ... Shanghang Zhang Devin Guillory Sean L. Metzger Kurt Keutzer Trevor Darrell 25 105 0 23 Mar 2021
QuestEval: Summarization Asks for Fact-based Evaluation Thomas Scialom Paul-Alexis Dray Patrick Gallinari Sylvain Lamprier Benjamin Piwowarski Jacopo Staiano Alex Jinpeng Wang HILM 11 267 0 23 Mar 2021
How to decay your learning rate Aitor Lewkowycz 36 24 0 23 Mar 2021
Self-supervised representation learning from 12-lead ECG data Temesgen Mehari Nils Strodthoff SSL 18 141 0 23 Mar 2021
Are Neural Language Models Good Plagiarists? A Benchmark for Neural Paraphrase Detection Jan Philip Wahle Terry Ruas Norman Meuschke Bela Gipp 25 34 0 23 Mar 2021
Detecting Hate Speech with GPT-3 Ke-Li Chiu Annie Collins Rohan Alexander AILaw 15 108 0 23 Mar 2021
Instance-level Image Retrieval using Reranking Transformers Fuwen Tan Jiangbo Yuan Vicente Ordonez ViT 26 89 0 22 Mar 2021
Tiny Transformers for Environmental Sound Classification at the Edge David Elliott Carlos E. Otero Steven Wyatt Evan Martino 21 15 0 22 Mar 2021
Open Domain Question Answering over Tables via Dense Retrieval Jonathan Herzig Thomas Müller Syrine Krichene Julian Martin Eisenschlos LMTD VLM RALM 36 99 0 22 Mar 2021
Improving and Simplifying Pattern Exploiting Training Derek Tam Rakesh R Menon Mohit Bansal Shashank Srivastava Colin Raffel 13 149 0 22 Mar 2021
BERT: A Review of Applications in Natural Language Processing and Understanding M. V. Koroteev VLM 22 194 0 22 Mar 2021
Retrieve Fast, Rerank Smart: Cooperative and Joint Approaches for Improved Cross-Modal Retrieval Gregor Geigle Jonas Pfeiffer Nils Reimers Ivan Vulić Iryna Gurevych 27 59 0 22 Mar 2021
Identifying Machine-Paraphrased Plagiarism Jan Philip Wahle Terry Ruas Tomávs Foltýnek Norman Meuschke Bela Gipp 11 30 0 22 Mar 2021
DeepViT: Towards Deeper Vision Transformer Daquan Zhou Bingyi Kang Xiaojie Jin Linjie Yang Xiaochen Lian Zihang Jiang Qibin Hou Jiashi Feng ViT 42 510 0 22 Mar 2021