2309.05463
Textbooks Are All You Need II: phi-1.5 technical report
11 September 2023
Yuanzhi Li
Sébastien Bubeck
Ronen Eldan
Allison Del Giorno
Suriya Gunasekar
Yin Tat Lee
ALM
LRM
Papers citing "Textbooks Are All You Need II: phi-1.5 technical report"
50 / 334 papers shown
Title
Experimental Pragmatics with Machines: Testing LLM Predictions for the Inferences of Plain and Embedded Disjunctions
Polina Tsvilodub
Paul Marty
Sonia Ramotowska
Jacopo Romoli
Michael Franke
21
0
0
09 May 2024
Seeds of Stereotypes: A Large-Scale Textual Analysis of Race and Gender Associations with Diseases in Online Sources
Lasse Hyldig Hansen
Nikolaj Andersen
Jack Gallifant
Liam G McCoy
James K Stone
Nura Izath
Marcela Aguirre-Jerez
Danielle S Bitterman
J. Gichoya
Leo Anthony Celi
18
3
0
08 May 2024
Learning from Students: Applying t-Distributions to Explore Accurate and Efficient Formats for LLMs
Jordan Dotzel
Yuzong Chen
Bahaa Kotb
Sushma Prasad
Gang Wu
Sheng R. Li
Mohamed S. Abdelfattah
Zhiru Zhang
24
7
0
06 May 2024
MedAdapter: Efficient Test-Time Adaptation of Large Language Models towards Medical Reasoning
Wenqi Shi
Ran Xu
Yuchen Zhuang
Yue Yu
Hang Wu
Carl Yang
M. D. Wang
MedIm
LM&MA
33
18
0
05 May 2024
UniGen: Universal Domain Generalization for Sentiment Classification via Zero-shot Dataset Generation
Juhwan Choi
Yeonghwa Kim
Seunguk Yu
Jungmin Yun
Youngbin Kim
36
1
0
02 May 2024
Benchmarking Benchmark Leakage in Large Language Models
Ruijie Xu
Zengzhi Wang
Run-Ze Fan
Pengfei Liu
53
42
0
29 Apr 2024
PatentGPT: A Large Language Model for Intellectual Property
Zilong Bai
Ruiji Zhang
Linqing Chen
Qijun Cai
Yuan Zhong
...
Fu Bian
Xiaolong Gu
Lisha Zhang
Weilei Wang
Changyang Tu
41
3
0
28 Apr 2024
MRScore: Evaluating Radiology Report Generation with LLM-based Reward System
Yunyi Liu
Zhanyu Wang
Yingshu Li
Xinyu Liang
Lingqiao Liu
Lei Wang
Luping Zhou
LM&MA
11
3
0
27 Apr 2024
HateTinyLLM : Hate Speech Detection Using Tiny Large Language Models
Tanmay Sen
Ansuman Das
Mrinmay Sen
36
4
0
26 Apr 2024
TinyChart: Efficient Chart Understanding with Visual Token Merging and Program-of-Thoughts Learning
Liang Zhang
Anwen Hu
Haiyang Xu
Mingshi Yan
Yichen Xu
Qin Jin
Ji Zhang
Fei Huang
39
15
0
25 Apr 2024
Fake Artificial Intelligence Generated Contents (FAIGC): A Survey of Theories, Detection Methods, and Opportunities
Xiaomin Yu
Yezhaohui Wang
Yanfang Chen
Zhen Tao
Dinghao Xi
Shichao Song
Simin Niu
Zhiyu Li
62
7
0
25 Apr 2024
Rethinking LLM Memorization through the Lens of Adversarial Compression
Avi Schwarzschild
Zhili Feng
Pratyush Maini
Zachary Chase Lipton
J. Zico Kolter
39
38
0
23 Apr 2024
Automated Multi-Language to English Machine Translation Using Generative Pre-Trained Transformers
Elijah Pelofske
Vincent Urias
L. Liebrock
26
0
0
23 Apr 2024
Automated Long Answer Grading with RiceChem Dataset
Shashank Sonkar
Kangqi Ni
Lesa Tran Lu
Kristi Kincaid
John S. Hutchinson
Richard G. Baraniuk
54
7
0
22 Apr 2024
A Survey on Efficient Inference for Large Language Models
Zixuan Zhou
Xuefei Ning
Ke Hong
Tianyu Fu
Jiaming Xu
...
Shengen Yan
Guohao Dai
Xiao-Ping Zhang
Yuhan Dong
Yu-Xiang Wang
46
80
0
22 Apr 2024
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Marah Abdin
Sam Ade Jacobs
A. A. Awan
J. Aneja
Ahmed Hassan Awadallah
...
Li Lyna Zhang
Yi Zhang
Yue Zhang
Yunan Zhang
Xiren Zhou
LRM
ALM
50
1,019
0
22 Apr 2024
Enabling Natural Zero-Shot Prompting on Encoder Models via Statement-Tuning
A. Elshabrawy
Yongxin Huang
Iryna Gurevych
Alham Fikri Aji
16
0
0
19 Apr 2024
Relevant or Random: Can LLMs Truly Perform Analogical Reasoning?
Chengwei Qin
Wenhan Xia
Tan Wang
Fangkai Jiao
Yuchen Hu
Bosheng Ding
Ruirui Chen
Shafiq R. Joty
LRM
35
3
0
19 Apr 2024
Towards Multi-modal Transformers in Federated Learning
Guangyu Sun
Matías Mendieta
Aritra Dutta
Xin Li
C. L. P. Chen
67
3
0
18 Apr 2024
Large Language Models in Targeted Sentiment Analysis
Nicolay Rusnachenko
A. Golubev
Natalia V. Loukachevitch
LRM
22
3
0
18 Apr 2024
Pack of LLMs: Model Fusion at Test-Time via Perplexity Optimization
Costas Mavromatis
Petros Karypis
George Karypis
MoMe
24
23
0
17 Apr 2024
AgentKit: Flow Engineering with Graphs, not Coding
Yue Wu
Yewen Fan
So Yeon Min
Shrimai Prabhumoye
Stephen Marcus McAleer
Yonatan Bisk
Ruslan Salakhutdinov
Yuanzhi Li
Tom Michael Mitchell
AI4CE
32
0
0
17 Apr 2024
High-Dimension Human Value Representation in Large Language Models
Samuel Cahyawijaya
Delong Chen
Yejin Bang
Leila Khalatbari
Bryan Wilie
Ziwei Ji
Etsuko Ishii
Pascale Fung
63
5
0
11 Apr 2024
Scaling Laws for Data Filtering -- Data Curation cannot be Compute Agnostic
Sachin Goyal
Pratyush Maini
Zachary Chase Lipton
Aditi Raghunathan
J. Zico Kolter
43
40
0
10 Apr 2024
MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies
Shengding Hu
Yuge Tu
Xu Han
Chaoqun He
Ganqu Cui
...
Chaochao Jia
Guoyang Zeng
Dahai Li
Zhiyuan Liu
Maosong Sun
MoE
38
275
0
09 Apr 2024
RoT: Enhancing Large Language Models with Reflection on Search Trees
Wenyang Hui
Kewei Tu
LRM
27
6
0
08 Apr 2024
ALERT: A Comprehensive Benchmark for Assessing Large Language Models' Safety through Red Teaming
Simone Tedeschi
Felix Friedrich
P. Schramowski
Kristian Kersting
Roberto Navigli
Huu Nguyen
Bo Li
ELM
33
45
0
06 Apr 2024
Prompt Public Large Language Models to Synthesize Data for Private On-device Applications
Shanshan Wu
Zheng Xu
Yanxiang Zhang
Yuanbo Zhang
Daniel Ramage
SyDa
21
9
0
05 Apr 2024
Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model
Xinrun Du
Zhouliang Yu
Songyang Gao
Ding Pan
Yuyang Cheng
...
Tianyu Zheng
Xinchen Luo
Guorui Zhou
Wenhu Chen
Ge Zhang
46
16
0
05 Apr 2024
CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues
Makesh Narsimhan Sreedhar
Traian Rebedea
Shaona Ghosh
Jiaqi Zeng
Christopher Parisien
ALM
27
4
0
04 Apr 2024
The Impact of Prompts on Zero-Shot Detection of AI-Generated Text
Kaito Taguchi
Yujie Gu
Kouichi Sakurai
AAML
DeLMO
24
6
0
29 Mar 2024
Juru: Legal Brazilian Large Language Model from Reputable Sources
Roseval Malaquias Junior
Ramon Pires
R. Romero
R. Nogueira
ELM
AILaw
30
0
0
26 Mar 2024
Untangling Knots: Leveraging LLM for Error Resolution in Computational Notebooks
Konstantin Grotov
Sergey Titov
Yaroslav Zharov
T. Bryksin
21
0
0
26 Mar 2024
The Unreasonable Ineffectiveness of the Deeper Layers
Andrey Gromov
Kushal Tirumala
Hassan Shapourian
Paolo Glorioso
Daniel A. Roberts
41
79
0
26 Mar 2024
Large Language Models in Biomedical and Health Informatics: A Bibliometric Review
Huizi Yu
Lizhou Fan
Lingyao Li
Jiayan Zhou
Zihui Ma
...
Sijia He
Mingyu Jin
Yongfeng Zhang
Ashvin Gandhi
Xin Ma
LM&MA
32
11
0
24 Mar 2024
SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate Time series
Badri N. Patro
Vijay Srinivas Agneeswaran
Mamba
51
50
0
22 Mar 2024
PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model
Zheng-Wei Zhang
Yeyao Ma
Enming Zhang
Xiang Bai
VLM
MLLM
32
29
0
21 Mar 2024
Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference
Han Zhao
Min Zhang
Wei Zhao
Pengxiang Ding
Siteng Huang
Donglin Wang
Mamba
36
65
0
21 Mar 2024
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models
Yaowei Zheng
Richong Zhang
Junhao Zhang
Yanhan Ye
Zheyan Luo
Zhangchi Feng
Yongqiang Ma
30
360
0
20 Mar 2024
From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models
Kung-Hsiang Huang
Hou Pong Chan
Yi Ren Fung
Haoyi Qiu
Mingyang Zhou
Shafiq R. Joty
Shih-Fu Chang
Heng Ji
AI4TS
64
14
0
18 Mar 2024
Komodo: A Linguistic Expedition into Indonesia's Regional Languages
Louis Owen
Vishesh Tripathi
Abhay Kumar
Biddwan Ahmed
ELM
27
7
0
14 Mar 2024
LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code
Naman Jain
King Han
Alex Gu
Wen-Ding Li
Fanjia Yan
Tianjun Zhang
Sida I. Wang
Armando Solar-Lezama
Koushik Sen
Ion Stoica
ELM
29
260
0
12 Mar 2024
SmallToLarge (S2L): Scalable Data Selection for Fine-tuning Large Language Models by Summarizing Training Trajectories of Small Models
Yu Yang
Siddhartha Mishra
Jeffrey N Chiang
Baharan Mirzasoleiman
32
17
0
12 Mar 2024
Mipha: A Comprehensive Overhaul of Multimodal Assistant with Small Language Models
Minjie Zhu
Yichen Zhu
Xin Liu
Ning Liu
Zhiyuan Xu
Chaomin Shen
Yaxin Peng
Zhicai Ou
Feifei Feng
Jian Tang
VLM
55
20
0
10 Mar 2024
Breeze-7B Technical Report
Chan-Jan Hsu
Chang-Le Liu
Feng-Ting Liao
Po-Chun Hsu
Yi-Chang Chen
Da-shan Shiu
21
2
0
05 Mar 2024
Accelerating Greedy Coordinate Gradient via Probe Sampling
Yiran Zhao
Wenyue Zheng
Tianle Cai
Xuan Long Do
Kenji Kawaguchi
Anirudh Goyal
Michael Shieh
36
11
0
02 Mar 2024
MobiLlama: Towards Accurate and Lightweight Fully Transparent GPT
Omkar Thawakar
Ashmal Vayani
Salman Khan
Hisham Cholakkal
Rao M. Anwer
M. Felsberg
Timothy Baldwin
Eric P. Xing
Fahad Shahbaz Khan
46
31
0
26 Feb 2024
RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering
Zihan Zhang
Meng Fang
Ling-Hao Chen
RALM
54
12
0
26 Feb 2024
An Integrated Data Processing Framework for Pretraining Foundation Models
Yiding Sun
Feng Wang
Yutao Zhu
Wayne Xin Zhao
Jiaxin Mao
46
4
0
26 Feb 2024
CriticBench: Benchmarking LLMs for Critique-Correct Reasoning
Zicheng Lin
Zhibin Gou
Tian Liang
Ruilin Luo
Haowei Liu
Yujiu Yang
LRM
40
43
0
22 Feb 2024