On the Compositional Generalization Gap of In-Context LearningBlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackboxNLP), 2022

Arian Hosseini

Ankit Vani

Dzmitry Bahdanau

Alessandro Sordoni

Rameswar Panda

201

15 Nov 2022

Mind Your Bias: A Critical Review of Bias Detection Methods for Contextual Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Silke Husse

Andreas Spitz

233

15 Nov 2022

Evaluating the Factual Consistency of Large Language Models Through News SummarizationAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

282

132

15 Nov 2022

FolkScope: Intention Knowledge Graph Construction for E-commerce Commonsense DiscoveryAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Xin Liu

209

15 Nov 2022

Prompting Language Models for Linguistic StructureAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Terra Blevins

Hila Gonen

Luke Zettlemoyer

LRM

249

15 Nov 2022

Breadth-First Pipeline Parallelism

J. Lamy-Poirier

GNN MoE AI4CE

125

11 Nov 2022

Measuring Reliability of Large Language Models through Semantic Consistency

273

10 Nov 2022

Nano: Nested Human-in-the-Loop Reward Learning for Few-shot Language Model ControlAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Louis-Philippe Morency

BDL

300

10 Nov 2022

Collateral facilitation in humans and language modelsConference on Computational Natural Language Learning (CoNLL), 2022

J. Michaelov

Benjamin Bergen

217

09 Nov 2022

Grammatical Error Correction: A Survey of the State of the ArtComputational Linguistics (CL), 2022

Hwee Tou Ng

252

115

09 Nov 2022

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Angela Fan

...

841

2,772

09 Nov 2022

Creative Writing with an AI-Powered Writing Assistant: Perspectives from Professional Writers

231

124

09 Nov 2022

Active Example Selection for In-Context LearningConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

329

253

08 Nov 2022

Intriguing Properties of Compression on Multilingual ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

302

04 Nov 2022

MolE: a molecular foundation model for drug discovery

Oscar Méndez-Lucio

C. Nicolaou

Berton Earnshaw

161

03 Nov 2022

LMentry: A Language Model Benchmark of Elementary Language TasksAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Avia Efrat

Or Honovich

Omer Levy

239

03 Nov 2022

Estimating the Carbon Footprint of BLOOM, a 176B Parameter Language ModelJournal of machine learning research (JMLR), 2022

A. Luccioni

S. Viguier

Anne-Laure Ligozat

607

427

03 Nov 2022

Large Language Models Are Human-Level Prompt EngineersInternational Conference on Learning Representations (ICLR), 2022

Silviu Pitis

Jimmy Ba

508

1,181

03 Nov 2022

Preventing Verbatim Memorization in Language Models Gives a False Sense of PrivacyInternational Conference on Natural Language Generation (INLG), 2022

Christopher A. Choquette-Choo

Nicholas Carlini

PILM MU

385

31 Oct 2022

SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular ControlAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

Xiaochuang Han

Sachin Kumar

Yulia Tsvetkov

325

147

31 Oct 2022

GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers

Elias Frantar

Saleh Ashkboos

Torsten Hoefler

Dan Alistarh

535

1,573

31 Oct 2022

A Solvable Model of Neural Scaling Laws

A. Maloney

Daniel A. Roberts

J. Sully

262

30 Oct 2022

Knowledge-in-Context: Towards Knowledgeable Semi-Parametric Language ModelsInternational Conference on Learning Representations (ICLR), 2022

Wenlin Yao

Dian Yu

569

28 Oct 2022

Class Based Thresholding in Early Exit Semantic Segmentation NetworksIEEE Signal Processing Letters (SPL), 2022

Alperen Görmez

Erdem Koyuncu

152

27 Oct 2022

What Language Model to Train if You Have One Million GPU Hours?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

...

576

120

27 Oct 2022

TRScore: A Novel GPT-based Readability Scorer for ASR Segmentation and Punctuation model evaluation and selection

184

27 Oct 2022

Personalized Dialogue Generation with Persona-Adaptive AttentionAAAI Conference on Artificial Intelligence (AAAI), 2022

304

27 Oct 2022

Multi-lingual Evaluation of Code Generation ModelsInternational Conference on Learning Representations (ICLR), 2022

...

765

217

26 Oct 2022

Scaling Laws Beyond Backpropagation

Matthew J. Filipovich

Alessandro Cappelli

Daniel Hesslow

Julien Launay

209

26 Oct 2022

RoMQA: A Benchmark for Robust, Multi-evidence, Multi-answer Question AnsweringConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Victor Zhong

Weijia Shi

Anuj Kumar

Luke Zettlemoyer

229

25 Oct 2022

Same Pre-training Loss, Better Downstream: Implicit Bias Matters for Language ModelsInternational Conference on Machine Learning (ICML), 2022

326

25 Oct 2022

Weakly Supervised Data Augmentation Through Prompting for Dialogue Understanding

Maximillian Chen

Alexandros Papangelis

Yang Liu

229

25 Oct 2022

Contrastive Search Is What You Need For Neural Text Generation

Yixuan Su

Nigel Collier

245

25 Oct 2022

Towards Better Few-Shot and Finetuning Performance with Forgetful Causal Language Models

Sharan Narang

Pieter Abbeel

KELM CLL

255

24 Oct 2022

Neural Theory-of-Mind? On the Limits of Social Intelligence in Large LMsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Maarten Sap

Ronan Le Bras

Daniel Fried

Yejin Choi

397

272

24 Oct 2022

The Curious Case of Absolute Position EmbeddingsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Koustuv Sinha

Amirhossein Kazemnejad

Siva Reddy

J. Pineau

Dieuwke Hupkes

Adina Williams

226

23 Oct 2022

Exploring The Landscape of Distributional Robustness for Question Answering ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

229

22 Oct 2022

Z-LaVI: Zero-Shot Language Solver Fueled by Visual ImaginationConference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Wenlin Yao

224

21 Oct 2022