ResearchTrend.AI

BERT Rediscovers the Classical NLP Pipeline (arXiv:1905.05950)
15 May 2019
Ian Tenney, Dipanjan Das, Ellie Pavlick

Papers citing "BERT Rediscovers the Classical NLP Pipeline"

Showing 50 of 244 citing papers.

ZeroQuant: Efficient and Affordable Post-Training Quantization for Large-Scale Transformers
Z. Yao, Reza Yazdani Aminabadi, Minjia Zhang, Xiaoxia Wu, Conglong Li, Yuxiong He
04 Jun 2022

On Building Spoken Language Understanding Systems for Low Resourced Languages
Akshat Gupta
25 May 2022

What Drives the Use of Metaphorical Language? Negative Insights from Abstractness, Affect, Discourse Coherence and Contextualized Word Representations
P. Piccirilli, Sabine Schulte im Walde
23 May 2022

The Geometry of Multilingual Language Model Representations
Tyler A. Chang, Z. Tu, Benjamin Bergen
22 May 2022

Life after BERT: What do Other Muppets Understand about Language?
Vladislav Lialin, Kevin Zhao, Namrata Shivagunde, Anna Rumshisky
21 May 2022

Assessing the Limits of the Distributional Hypothesis in Semantic Spaces: Trait-based Relational Knowledge and the Impact of Co-occurrences
Mark Anderson, Jose Camacho-Collados
16 May 2022

Discovering Latent Concepts Learned in BERT
Fahim Dalvi, A. Khan, Firoj Alam, Nadir Durrani, Jia Xu, Hassan Sajjad
15 May 2022

Exploiting Inductive Bias in Transformers for Unsupervised Disentanglement of Syntax and Semantics with VAEs
G. Felhi, Joseph Le Roux, Djamé Seddah
12 May 2022

When a sentence does not introduce a discourse entity, Transformer-based models still sometimes refer to it
Sebastian Schuster, Tal Linzen
06 May 2022

Adaptable Adapters
N. Moosavi, Quentin Delfosse, Kristian Kersting, Iryna Gurevych
03 May 2022

AdapterBias: Parameter-efficient Token-dependent Representation Shift for Adapters in NLP Tasks
Chin-Lun Fu, Zih-Ching Chen, Yun-Ru Lee, Hung-yi Lee
30 Apr 2022

UniTE: Unified Translation Evaluation
Yu Wan, Dayiheng Liu, Baosong Yang, Haibo Zhang, Boxing Chen, Derek F. Wong, Lidia S. Chao
28 Apr 2022

LyS_ACoruña at SemEval-2022 Task 10: Repurposing Off-the-Shelf Tools for Sentiment Analysis as Semantic Dependency Parsing
I. Alonso-Alonso, David Vilares, Carlos Gómez-Rodríguez
27 Apr 2022

Mono vs Multilingual BERT for Hate Speech Detection and Text Classification: A Case Study in Marathi
Abhishek Velankar, H. Patil, Raviraj Joshi
19 Apr 2022

Text Revision by On-the-Fly Representation Optimization
Jingjing Li, Zichao Li, Tao Ge, Irwin King, M. Lyu
15 Apr 2022

An Exploratory Study on Code Attention in BERT
Rishab Sharma, Fuxiang Chen, Fatemeh H. Fard, David Lo
05 Apr 2022

Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language
Andy Zeng, Maria Attarian, Brian Ichter, K. Choromanski, Adrian S. Wong, ..., Michael S. Ryoo, Vikas Sindhwani, Johnny Lee, Vincent Vanhoucke, Peter R. Florence
01 Apr 2022

Effect and Analysis of Large-scale Language Model Rescoring on Competitive ASR Systems
Takuma Udagawa, Masayuki Suzuki, Gakuto Kurata, N. Itoh, G. Saon
01 Apr 2022

Transformer Feed-Forward Layers Build Predictions by Promoting Concepts in the Vocabulary Space
Mor Geva, Avi Caciularu, Ke Wang, Yoav Goldberg
28 Mar 2022

Metaphors in Pre-Trained Language Models: Probing and Generalization Across Datasets and Languages
Ehsan Aghazadeh, Mohsen Fayyaz, Yadollah Yaghoobzadeh
26 Mar 2022

Probing for Labeled Dependency Trees
Max Müller-Eberstein, Rob van der Goot, Barbara Plank
24 Mar 2022

Coloring the Blank Slate: Pre-training Imparts a Hierarchical Inductive Bias to Sequence-to-sequence Models
Aaron Mueller, Robert Frank, Tal Linzen, Luheng Wang, Sebastian Schuster
17 Mar 2022

Contrastive Visual Semantic Pretraining Magnifies the Semantics of Natural Language Representations
Robert Wolfe, Aylin Caliskan
14 Mar 2022

Grounding Commands for Autonomous Vehicles via Layer Fusion with Region-specific Dynamic Layer Attention
Hou Pong Chan, M. Guo, Chengguang Xu
14 Mar 2022

TrimBERT: Tailoring BERT for Trade-offs
S. N. Sridhar, Anthony Sarah, Sairam Sundaresan
24 Feb 2022

Probing BERT's priors with serial reproduction chains
Takateru Yamakoshi, Thomas L. Griffiths, Robert D. Hawkins
24 Feb 2022

Do Transformers know symbolic rules, and would we know if they did?
Tommi Gröndahl, Yu-Wen Guo, Nirmal Asokan
19 Feb 2022

Open-Ended Reinforcement Learning with Neural Reward Functions
Robert Meier, Asier Mujika
16 Feb 2022

What Do They Capture? -- A Structural Analysis of Pre-Trained Language Models for Source Code
Yao Wan, Wei-Ye Zhao, Hongyu Zhang, Yulei Sui, Guandong Xu, Hairong Jin
14 Feb 2022

Interpreting Arabic Transformer Models
Ahmed Abdelali, Nadir Durrani, Fahim Dalvi, Hassan Sajjad
19 Jan 2022

Does Entity Abstraction Help Generative Transformers Reason?
Nicolas Angelard-Gontier, Siva Reddy, C. Pal
05 Jan 2022

How Should Pre-Trained Language Models Be Fine-Tuned Towards Adversarial Robustness?
Xinhsuai Dong, Anh Tuan Luu, Min-Bin Lin, Shuicheng Yan, Hanwang Zhang
22 Dec 2021

Linguistic Frameworks Go Toe-to-Toe at Neuro-Symbolic Language Modeling
Jakob Prange, Nathan Schneider, Lingpeng Kong
15 Dec 2021

Inducing Causal Structure for Interpretable Neural Networks
Atticus Geiger, Zhengxuan Wu, Hanson Lu, J. Rozner, Elisa Kreiss, Thomas F. Icard, Noah D. Goodman, Christopher Potts
01 Dec 2021

To Augment or Not to Augment? A Comparative Study on Text Augmentation Techniques for Low-Resource NLP
Gözde Gül Sahin
18 Nov 2021

Interpreting Language Models Through Knowledge Graph Extraction
Vinitra Swamy, Angelika Romanou, Martin Jaggi
16 Nov 2021

Discovering Supply Chain Links with Augmented Intelligence
Achintya Gopal, Chun-Han Chang
02 Nov 2021

LMdiff: A Visual Diff Tool to Compare Language Models
Hendrik Strobelt, Benjamin Hoover, Arvind Satyanarayan, Sebastian Gehrmann
02 Nov 2021

Recent Advances in Natural Language Processing via Large Pre-Trained Language Models: A Survey
Bonan Min, Hayley L Ross, Elior Sulem, Amir Pouran Ben Veyseh, Thien Huu Nguyen, Oscar Sainz, Eneko Agirre, Ilana Heinz, Dan Roth
01 Nov 2021

Interpreting Deep Learning Models in Natural Language Processing: A Review
Xiaofei Sun, Diyi Yang, Xiaoya Li, Tianwei Zhang, Yuxian Meng, Han Qiu, Guoyin Wang, Eduard H. Hovy, Jiwei Li
20 Oct 2021

Inductive Biases and Variable Creation in Self-Attention Mechanisms
Benjamin L. Edelman, Surbhi Goel, Sham Kakade, Cyril Zhang
19 Oct 2021

BERMo: What can BERT learn from ELMo?
Sangamesh Kodge, Kaushik Roy
18 Oct 2021

Identifying Introductions in Podcast Episodes from Automatically Generated Transcripts
Elise Jing, K. Schneck, Dennis Egan, Scott A. Waterman
14 Oct 2021

Global Explainability of BERT-Based Evaluation Metrics by Disentangling along Linguistic Factors
Marvin Kaster, Wei-Ye Zhao, Steffen Eger
08 Oct 2021

BadPre: Task-agnostic Backdoor Attacks to Pre-trained NLP Foundation Models
Kangjie Chen, Yuxian Meng, Xiaofei Sun, Shangwei Guo, Tianwei Zhang, Jiwei Li, Chun Fan
06 Oct 2021

MoEfication: Transformer Feed-forward Layers are Mixtures of Experts
Zhengyan Zhang, Yankai Lin, Zhiyuan Liu, Peng Li, Maosong Sun, Jie Zhou
05 Oct 2021

Low Frequency Names Exhibit Bias and Overfitting in Contextualizing Language Models
Robert Wolfe, Aylin Caliskan
01 Oct 2021

SlovakBERT: Slovak Masked Language Model
Matúš Pikuliak, Stefan Grivalsky, Martin Konopka, Miroslav Blšták, Martin Tamajka, Viktor Bachratý, Marián Simko, Pavol Balázik, Michal Trnka, Filip Uhlárik
30 Sep 2021

Analysing the Effect of Masking Length Distribution of MLM: An Evaluation Framework and Case Study on Chinese MRC Datasets
Changchang Zeng, Shaobo Li
29 Sep 2021

Fine-Tuned Transformers Show Clusters of Similar Representations Across Layers
Jason Phang, Haokun Liu, Samuel R. Bowman
17 Sep 2021