2204.06745
GPT-NeoX-20B: An Open-Source Autoregressive Language Model
14 April 2022
Sid Black
Stella Biderman
Eric Hallahan
Quentin G. Anthony
Leo Gao
Laurence Golding
Horace He
Connor Leahy
Kyle McDonell
Jason Phang
Michael Pieler
USVSN Sai Prashanth
Shivanshu Purohit
Laria Reynolds
J. Tow
Benqi Wang
Samuel Weinbach
Papers citing
"GPT-NeoX-20B: An Open-Source Autoregressive Language Model"
50 / 554 papers shown
Ziya2: Data-centric Learning is All LLMs Need
Ruyi Gan
Ziwei Wu
Renliang Sun
Junyu Lu
Xiaojun Wu
...
Ping Yang
Qi Yang
Hao Wang
Jiaxing Zhang
Yan Song
VLM
ALM
21
16
0
06 Nov 2023
Large language models implicitly learn to straighten neural sentence trajectories to construct a predictive representation of natural language
Eghbal A. Hosseini
Evelina Fedorenko
LLMSV
20
4
0
05 Nov 2023
Vision-Language Foundation Models as Effective Robot Imitators
Xinghang Li
Minghuan Liu
Hanbo Zhang
Cunjun Yu
Jie Xu
...
Ya Jing
Weinan Zhang
Huaping Liu
Hang Li
Tao Kong
LM&Ro
24
134
0
02 Nov 2023
Predicting Question-Answering Performance of Large Language Models through Semantic Consistency
Ella Rabinovich
Samuel Ackerman
Orna Raz
E. Farchi
Ateret Anaby-Tavor
211
17
0
02 Nov 2023
InstructCoder: Instruction Tuning Large Language Models for Code Editing
Kaixin Li
Qisheng Hu
Xu Zhao
Hui Chen
Yuxi Xie
Tiedong Liu
Qizhe Xie
Junxian He
ALM
SyDa
36
11
0
31 Oct 2023
TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language Modeling Likewise
Nan He
Hanyu Lai
Chenyang Zhao
Zirui Cheng
Junting Pan
...
Zhaohui Hou
Zhiyuan Huang
Shaoqing Lu
Ding Liang
Mingjie Zhan
LRM
15
12
0
29 Oct 2023
FP8-LM: Training FP8 Large Language Models
Houwen Peng
Kan Wu
Yixuan Wei
Guoshuai Zhao
Yuxiang Yang
...
Zheng-Wei Zhang
Shuguang Liu
Joe Chau
Han Hu
Peng Cheng
MQ
59
37
0
27 Oct 2023
Evaluation of large language models using an Indian language LGBTI+ lexicon
Aditya Joshi
S. Rawat
A. Dange
19
1
0
26 Oct 2023
Codebook Features: Sparse and Discrete Interpretability for Neural Networks
Alex Tamkin
Mohammad Taufeeque
Noah D. Goodman
25
27
0
26 Oct 2023
Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time
Zichang Liu
Jue Wang
Tri Dao
Tianyi Zhou
Binhang Yuan
...
Anshumali Shrivastava
Ce Zhang
Yuandong Tian
Christopher Ré
Beidi Chen
BDL
17
191
0
26 Oct 2023
Detecting Pretraining Data from Large Language Models
Weijia Shi
Anirudh Ajith
Mengzhou Xia
Yangsibo Huang
Daogao Liu
Terra Blevins
Danqi Chen
Luke Zettlemoyer
MIALM
18
161
0
25 Oct 2023
CLEX: Continuous Length Extrapolation for Large Language Models
Guanzheng Chen
Xin Li
Zaiqiao Meng
Shangsong Liang
Li Bing
15
29
0
25 Oct 2023
Locally Differentially Private Document Generation Using Zero Shot Prompting
Saiteja Utpala
Sara Hooker
Pin-Yu Chen
13
36
0
24 Oct 2023
BLESS: Benchmarking Large Language Models on Sentence Simplification
Tannon Kew
Alison Chi
Laura Vásquez-Rodríguez
Sweta Agrawal
Dennis Aumiller
Fernando Alva-Manchego
Teven Le Scao
40
22
0
24 Oct 2023
Function Vectors in Large Language Models
Eric Todd
Millicent Li
Arnab Sen Sharma
Aaron Mueller
Byron C. Wallace
David Bau
8
99
0
23 Oct 2023
Geographical Erasure in Language Generation
Pola Schwöbel
Jacek Golebiowski
Michele Donini
Cédric Archambeau
Danish Pruthi
8
5
0
23 Oct 2023
Ensemble-Instruct: Generating Instruction-Tuning Data with a Heterogeneous Mixture of LMs
Young-Suk Lee
Md Arafat Sultan
Yousef El-Kurdi
Tahira Naseem
Asim Munawar
Radu Florian
Salim Roukos
Ramón Fernández Astudillo
SyDa
19
6
0
21 Oct 2023
CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion
Yangruibo Ding
Zijian Wang
Wasi Uddin Ahmad
Hantian Ding
Ming Tan
...
M. K. Ramanathan
Ramesh Nallapati
Parminder Bhatia
Dan Roth
Bing Xiang
ELM
21
115
0
17 Oct 2023
H2O Open Ecosystem for State-of-the-art Large Language Models
Arno Candel
Jon McKinney
Philipp Singer
Pascal Pfeiffer
Maximilian Jeblick
Chun Ming Lee
Marcos V. Conde
VLM
17
4
0
17 Oct 2023
Llemma: An Open Language Model For Mathematics
Zhangir Azerbayev
Hailey Schoelkopf
Keiran Paster
Marco Dos Santos
Stephen Marcus McAleer
Albert Q. Jiang
Jia Deng
Stella Biderman
Sean Welleck
CLL
24
270
0
16 Oct 2023
Generative Calibration for In-context Learning
Zhongtao Jiang
Yuanzhe Zhang
Cao Liu
Jun Zhao
Kang Liu
157
17
0
16 Oct 2023
Unsupervised Domain Adaption for Neural Information Retrieval
Carlos Dominguez
Jon Ander Campos
Eneko Agirre
Gorka Azkune
19
0
0
13 Oct 2023
SeqXGPT: Sentence-Level AI-Generated Text Detection
Pengyu Wang
Linyang Li
Ke Ren
Botian Jiang
Dong Zhang
Xipeng Qiu
DeLMO
21
49
0
13 Oct 2023
Training Generative Question-Answering on Synthetic Data Obtained from an Instruct-tuned Model
Kosuke Takahashi
Takahiro Omi
Kosuke Arima
Tatsuya Ishigaki
23
0
0
12 Oct 2023
GenTKG: Generative Forecasting on Temporal Knowledge Graph with Large Language Models
Ruotong Liao
Xu Jia
Yangzhe Li
Yunpu Ma
Volker Tresp
AI4TS
23
24
0
11 Oct 2023
LLMs Killed the Script Kiddie: How Agents Supported by Large Language Models Change the Landscape of Network Threat Testing
Stephen Moskal
Sam Laney
Erik Hemberg
Una-May O’Reilly
37
17
0
10 Oct 2023
CodeFuse-13B: A Pretrained Multi-lingual Code Large Language Model
Peng Di
Jianguo Li
Hang Yu
Wei Jiang
Wenting Cai
...
Zelin Zhao
Xunjin Zheng
Hailian Zhou
Lifu Zhu
Xianying Zhu
ELM
ALM
AI4CE
35
12
0
10 Oct 2023
MenatQA: A New Dataset for Testing the Temporal Comprehension and Reasoning Abilities of Large Language Models
Yifan Wei
Yisong Su
Huanhuan Ma
Xiaoyan Yu
Fangyu Lei
Yuanzhe Zhang
Jun Zhao
Kang Liu
LRM
17
9
0
08 Oct 2023
Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature
Guangsheng Bao
Yanbin Zhao
Zhiyang Teng
Linyi Yang
Yue Zhang
18
129
0
08 Oct 2023
How Reliable Are AI-Generated-Text Detectors? An Assessment Framework Using Evasive Soft Prompts
Tharindu Kumarage
Paras Sheth
Raha Moraffah
Joshua Garland
Huan Liu
DeLMO
26
23
0
08 Oct 2023
Pushing the Limits of Pre-training for Time Series Forecasting in the CloudOps Domain
Gerald Woo
Chenghao Liu
Akshat Kumar
Doyen Sahoo
AI4TS
AI4CE
27
12
0
08 Oct 2023
GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction
Oscar Sainz
Iker García-Ferrero
Rodrigo Agerri
Oier López de Lacalle
German Rigau
Eneko Agirre
30
75
0
05 Oct 2023
InstructProtein: Aligning Human and Protein Language via Knowledge Instruction
Zeyuan Wang
Qiang Zhang
Keyan Ding
Ming Qin
Zhuang Xiang
Xiaotong Li
Huajun Chen
16
28
0
05 Oct 2023
Low Resource Summarization using Pre-trained Language Models
Mubashir Munaf
Hammad Afzal
N. Iltaf
Khawir Mahmood
10
3
0
04 Oct 2023
Large Language Models for Test-Free Fault Localization
Aidan Z. H. Yang
Ruben Martins
Claire Le Goues
Vincent J. Hellendoorn
LRM
27
83
0
03 Oct 2023
Synthetic Data Generation in Low-Resource Settings via Fine-Tuning of Large Language Models
Jean Kaddour
Qi Liu
SyDa
17
2
0
02 Oct 2023
GrowLength: Accelerating LLMs Pretraining by Progressively Growing Training Length
Hongye Jin
Xiaotian Han
Jingfeng Yang
Zhimeng Jiang
Chia-Yuan Chang
Xia Hu
33
11
0
01 Oct 2023
L2CEval: Evaluating Language-to-Code Generation Capabilities of Large Language Models
Ansong Ni
Pengcheng Yin
Yilun Zhao
Chen Wei
Yanjun Wang
...
Mingyuan Zhang
Chen Change Loy
Yingbo Zhou
Dragomir R. Radev
Arman Cohan
ELM
19
16
0
29 Sep 2023
Qwen Technical Report
Jinze Bai
Shuai Bai
Yunfei Chu
Zeyu Cui
Kai Dang
...
Zhenru Zhang
Chang Zhou
Jingren Zhou
Xiaohuan Zhou
Tianhang Zhu
OSLM
29
1,568
0
28 Sep 2023
Identifying and Mitigating Privacy Risks Stemming from Language Models: A Survey
Victoria Smith
Ali Shahin Shamsabadi
Carolyn Ashurst
Adrian Weller
PILM
32
24
0
27 Sep 2023
Joint Prediction and Denoising for Large-scale Multilingual Self-supervised Learning
William Chen
Jiatong Shi
Brian Yan
Dan Berrebbi
Wangyou Zhang
Yifan Peng
Xuankai Chang
Soumi Maiti
Shinji Watanabe
24
8
0
26 Sep 2023
Physics of Language Models: Part 3.2, Knowledge Manipulation
Zeyuan Allen-Zhu
Yuanzhi Li
KELM
6
83
0
25 Sep 2023
Physics of Language Models: Part 3.1, Knowledge Storage and Extraction
Zeyuan Allen-Zhu
Yuanzhi Li
KELM
41
127
0
25 Sep 2023
User Simulation with Large Language Models for Evaluating Task-Oriented Dialogue
Sam Davidson
Salvatore Romeo
Raphael Shu
James Gung
Arshit Gupta
Saab Mansour
Yi Zhang
ALM
LLMAG
27
3
0
23 Sep 2023
Knowledge Sanitization of Large Language Models
Yoichi Ishibashi
Hidetoshi Shimodaira
KELM
18
19
0
21 Sep 2023
SlimPajama-DC: Understanding Data Combinations for LLM Training
Zhiqiang Shen
Tianhua Tao
Liqun Ma
W. Neiswanger
Zhengzhong Liu
...
Bowen Tan
Joel Hestness
Natalia Vassilieva
Daria Soboleva
Eric P. Xing
25
44
0
19 Sep 2023
CFGPT: Chinese Financial Assistant with Large Language Model
Jiangtong Li
Yuxuan Bian
Guoxuan Wang
Yang Lei
Dawei Cheng
Zhijun Ding
Changjun Jiang
37
10
0
19 Sep 2023
Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity
Haojun Xia
Zhen Zheng
Yuchao Li
Donglin Zhuang
Zhongzhu Zhou
Xiafei Qiu
Yong Li
Wei Lin
S. Song
49
11
0
19 Sep 2023
Generative modeling, design and analysis of spider silk protein sequences for enhanced mechanical properties
Wei Lu
David L. Kaplan
Markus J. Buehler
10
31
0
18 Sep 2023
Struc-Bench: Are Large Language Models Really Good at Generating Complex Structured Data?
Xiangru Tang
Yiming Zong
Jason Phang
Yilun Zhao
Wangchunshu Zhou
Arman Cohan
Mark B. Gerstein
LMTD
ELM
ALM
42
8
0
16 Sep 2023