GPT-NeoX-20B: An Open-Source Autoregressive Language Model

14 April 2022

Papers citing "GPT-NeoX-20B: An Open-Source Autoregressive Language Model"

50 / 603 papers shown

"We Demand Justice!": Towards Social Context Grounding of Political Texts

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

345

15 Nov 2023

XplainLLM: A QA Explanation Dataset for Understanding LLM Decision-Making

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

216

15 Nov 2023

A Ship of Theseus: Curious Cases of Paraphrasing in LLM-Generated Texts

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

360

14 Nov 2023

STEER: Unified Style Transfer with Expert Reinforcement

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Faeze Brahman

Yejin Choi

181

13 Nov 2023

In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering

International Conference on Machine Learning (ICML), 2023

Sheng Liu

Haotian Ye

Lei Xing

James Y. Zou

250

215

11 Nov 2023

Chain of Images for Intuitively Reasoning

239

09 Nov 2023

Ziya2: Data-centric Learning is All LLMs Need

...

299

06 Nov 2023

Large language models implicitly learn to straighten neural sentence trajectories to construct a predictive representation of natural language

bioRxiv (bioRxiv), 2023

Eghbal A. Hosseini

Evelina Fedorenko

LLMSV

189

05 Nov 2023

Vision-Language Foundation Models as Effective Robot Imitators

International Conference on Learning Representations (ICLR), 2023

...

494

308

02 Nov 2023

Predicting Question-Answering Performance of Large Language Models through Semantic Consistency

IEEE Games Entertainment Media Conference (IEEE GEM), 2023

554

02 Nov 2023

InstructCoder: Instruction Tuning Large Language Models for Code Editing

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

227

31 Oct 2023

TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language Modeling Likewise

...

254

29 Oct 2023

FP8-LM: Training FP8 Large Language Models

...

307

27 Oct 2023

Evaluation of large language models using an Indian language LGBTI+ lexicon

AI Ethics Journal (JAE), 2023

Aditya Joshi

S. Rawat

A. Dange

109

26 Oct 2023

Codebook Features: Sparse and Discrete Interpretability for Neural Networks

International Conference on Machine Learning (ICML), 2023

Alex Tamkin

Mohammad Taufeeque

Noah D. Goodman

214

26 Oct 2023

Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time

International Conference on Machine Learning (ICML), 2023

...

Anshumali Shrivastava

357

280

26 Oct 2023

Detecting Pretraining Data from Large Language Models

International Conference on Learning Representations (ICLR), 2023

Weijia Shi

Luke Zettlemoyer

436

315

25 Oct 2023

CLEX: Continuous Length Extrapolation for Large Language Models

International Conference on Learning Representations (ICLR), 2023

Xin Li

287

25 Oct 2023

Locally Differentially Private Document Generation Using Zero Shot Prompting

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Saiteja Utpala

Sara Hooker

Pin-Yu Chen

274

24 Oct 2023

BLESS: Benchmarking Large Language Models on Sentence Simplification

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Tannon Kew

Alison Chi

Laura Vásquez-Rodríguez

Sweta Agrawal

Dennis Aumiller

Fernando Alva-Manchego

Teven Le Scao

239

24 Oct 2023

Function Vectors in Large Language Models

International Conference on Learning Representations (ICLR), 2023

324

182

23 Oct 2023

Geographical Erasure in Language Generation

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

181

23 Oct 2023

Ensemble-Instruct: Generating Instruction-Tuning Data with a Heterogeneous Mixture of LMs

Young-Suk Lee

Md Arafat Sultan

Yousef El-Kurdi

Tahira Naseem

Asim Munawar

Radu Florian

Salim Roukos

Ramón Fernández Astudillo

SyDa

223

21 Oct 2023

CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion

Neural Information Processing Systems (NeurIPS), 2023

...

280

194

17 Oct 2023

H2O Open Ecosystem for State-of-the-art Large Language Models

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

152

17 Oct 2023

Llemma: An Open Language Model For Mathematics

International Conference on Learning Representations (ICLR), 2023

Albert Q. Jiang

331

388

16 Oct 2023

Generative Calibration for In-context Learning

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Zhongtao Jiang

Yuanzhe Zhang

Cao Liu

Jun Zhao

Kang Liu

416

16 Oct 2023

Unsupervised Domain Adaption for Neural Information Retrieval

167

13 Oct 2023

SeqXGPT: Sentence-Level AI-Generated Text Detection

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Xipeng Qiu

389

13 Oct 2023

Training Generative Question-Answering on Synthetic Data Obtained from an Instruct-tuned Model

Pacific Asia Conference on Language, Information and Computation (PACLIC), 2023

162

12 Oct 2023

GenTKG: Generative Forecasting on Temporal Knowledge Graph with Large Language Models

404

11 Oct 2023

LLMs Killed the Script Kiddie: How Agents Supported by Large Language Models Change the Landscape of Network Threat Testing

207

10 Oct 2023

CodeFuse-13B: A Pretrained Multi-lingual Code Large Language Model

Hang Yu

...

224

10 Oct 2023

FABRIC: Automated Scoring and Feedback Generation for Essays

Hyunseung Lim

...

Hwajung Hong

104

08 Oct 2023

MenatQA: A New Dataset for Testing the Temporal Comprehension and Reasoning Abilities of Large Language Models

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Yifan Wei

Yisong Su

Huanhuan Ma

Xiaoyan Yu

Fangyu Lei

Yuanzhe Zhang

Jun Zhao

Kang Liu

LRM

242

08 Oct 2023

Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature

International Conference on Learning Representations (ICLR), 2023

Yue Zhang

346

251

08 Oct 2023

How Reliable Are AI-Generated-Text Detectors? An Assessment Framework Using Evasive Soft Prompts

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Huan Liu

181

08 Oct 2023

Pushing the Limits of Pre-training for Time Series Forecasting in the CloudOps Domain

361

08 Oct 2023

GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction

International Conference on Learning Representations (ICLR), 2023

Oscar Sainz

Iker García-Ferrero

Rodrigo Agerri

Oier López de Lacalle

German Rigau

Eneko Agirre

706

139

05 Oct 2023

InstructProtein: Aligning Human and Protein Language via Knowledge Instruction

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Huajun Chen

245

05 Oct 2023

Low Resource Summarization using Pre-trained Language Models

180

04 Oct 2023

Large Language Models for Test-Free Fault Localization

International Conference on Software Engineering (ICSE), 2023

Aidan Z. H. Yang

Ruben Martins

Claire Le Goues

Vincent J. Hellendoorn

LRM

234

161

03 Oct 2023

Synthetic Data Generation in Low-Resource Settings via Fine-Tuning of Large Language Models

Jean Kaddour

Qi Liu

SyDa

241

02 Oct 2023

GrowLength: Accelerating LLMs Pretraining by Progressively Growing Training Length

Chia-Yuan Chang

157

01 Oct 2023

L2CEval: Evaluating Language-to-Code Generation Capabilities of Large Language Models

Transactions of the Association for Computational Linguistics (TACL), 2023

...

Yingbo Zhou

Arman Cohan

246

29 Sep 2023

Qwen Technical Report

Jinze Bai

Shuai Bai

Yunfei Chu

Zeyu Cui

Kai Dang

...

Zhenru Zhang

Chang Zhou

Jingren Zhou

Xiaohuan Zhou

Tianhang Zhu

OSLM

822

3,094

28 Sep 2023

Identifying and Mitigating Privacy Risks Stemming from Language Models: A Survey

Victoria Smith

Ali Shahin Shamsabadi

Carolyn Ashurst

Adrian Weller

PILM

484

27 Sep 2023

Joint Prediction and Denoising for Large-scale Multilingual Self-supervised Learning

Automatic Speech Recognition & Understanding (ASRU), 2023

Jiatong Shi

Wangyou Zhang

262

26 Sep 2023

Physics of Language Models: Part 3.2, Knowledge Manipulation

International Conference on Learning Representations (ICLR), 2023

Zeyuan Allen-Zhu

Yuanzhi Li

KELM

406

142

25 Sep 2023

Physics of Language Models: Part 3.1, Knowledge Storage and Extraction

International Conference on Machine Learning (ICML), 2023

Zeyuan Allen-Zhu

Yuanzhi Li

KELM

533

237

25 Sep 2023