Human Parity on CommonsenseQA: Augmenting Self-Attention with External
Attention

Human Parity on CommonsenseQA: Augmenting Self-Attention with External Attention

6 December 2021

Xiaodong Liu

Papers citing "Human Parity on CommonsenseQA: Augmenting Self-Attention with External Attention"

10 / 10 papers shown

Title
SpiritSight Agent: Advanced GUI Agent with One Look Zhiyuan Huang Ziming Cheng Junting Pan Zhaohui Hou Mingjie Zhan LLMAG 72 2 0 05 Mar 2025
CLOCR-C: Context Leveraging OCR Correction with Pre-trained Language Models Jonathan Bourne 39 4 0 30 Aug 2024
Thrust: Adaptively Propels Large Language Models with External Knowledge Xinran Zhao Hongming Zhang Xiaoman Pan Wenlin Yao Dong Yu Jianshu Chen KELM 38 4 0 19 Jul 2023
Leveraging Knowledge in Multilingual Commonsense Reasoning Yuwei Fang Shuohang Wang Yichong Xu Ruochen Xu Siqi Sun Chenguang Zhu Michael Zeng LRM 216 16 0 16 Oct 2021
Carbon Emissions and Large Neural Network Training David A. Patterson Joseph E. Gonzalez Quoc V. Le Chen Liang Lluís-Miquel Munguía D. Rothchild David R. So Maud Texier J. Dean AI4CE 233 626 0 21 Apr 2021
RiddleSense: Reasoning about Riddle Questions Featuring Linguistic Creativity and Commonsense Knowledge Bill Yuchen Lin Ziyi Wu Yichi Yang Dong-Ho Lee Xiang Ren ReLM LRM 215 62 0 02 Jan 2021
Learning Contextualized Knowledge Structures for Commonsense Reasoning Jun Yan Mrigank Raman Aaron Chan Tianyu Zhang Ryan Rossi Handong Zhao Sungchul Kim Nedim Lipka Xiang Ren 203 36 0 24 Oct 2020
Posterior Differential Regularization with f-divergence for Improving Model Robustness Hao Cheng Xiaodong Liu L. Pereira Yaoliang Yu Jianfeng Gao 224 31 0 23 Oct 2020
Scaling Laws for Neural Language Models Jared Kaplan Sam McCandlish T. Henighan Tom B. Brown B. Chess R. Child Scott Gray Alec Radford Jeff Wu Dario Amodei 220 3,054 0 23 Jan 2020
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding Alex Jinpeng Wang Amanpreet Singh Julian Michael Felix Hill Omer Levy Samuel R. Bowman ELM 294 6,003 0 20 Apr 2018