ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2309.05463
  4. Cited By
Textbooks Are All You Need II: phi-1.5 technical report

Textbooks Are All You Need II: phi-1.5 technical report

11 September 2023
Yuan-Fang Li
Sébastien Bubeck
Ronen Eldan
Allison Del Giorno
Suriya Gunasekar
Yin Tat Lee
    ALM
    LRM
ArXivPDFHTML

Papers citing "Textbooks Are All You Need II: phi-1.5 technical report"

50 / 334 papers shown
Title
Experimental Pragmatics with Machines: Testing LLM Predictions for the
  Inferences of Plain and Embedded Disjunctions
Experimental Pragmatics with Machines: Testing LLM Predictions for the Inferences of Plain and Embedded Disjunctions
Polina Tsvilodub
Paul Marty
Sonia Ramotowska
Jacopo Romoli
Michael Franke
21
0
0
09 May 2024
Seeds of Stereotypes: A Large-Scale Textual Analysis of Race and Gender
  Associations with Diseases in Online Sources
Seeds of Stereotypes: A Large-Scale Textual Analysis of Race and Gender Associations with Diseases in Online Sources
Lasse Hyldig Hansen
Nikolaj Andersen
Jack Gallifant
Liam G McCoy
James K Stone
Nura Izath
Marcela Aguirre-Jerez
Danielle S Bitterman
J. Gichoya
Leo Anthony Celi
18
3
0
08 May 2024
Learning from Students: Applying t-Distributions to Explore Accurate and
  Efficient Formats for LLMs
Learning from Students: Applying t-Distributions to Explore Accurate and Efficient Formats for LLMs
Jordan Dotzel
Yuzong Chen
Bahaa Kotb
Sushma Prasad
Gang Wu
Sheng R. Li
Mohamed S. Abdelfattah
Zhiru Zhang
24
7
0
06 May 2024
MedAdapter: Efficient Test-Time Adaptation of Large Language Models
  towards Medical Reasoning
MedAdapter: Efficient Test-Time Adaptation of Large Language Models towards Medical Reasoning
Wenqi Shi
Ran Xu
Yuchen Zhuang
Yue Yu
Hang Wu
Carl Yang
M. D. Wang
MedIm
LM&MA
33
18
0
05 May 2024
UniGen: Universal Domain Generalization for Sentiment Classification via
  Zero-shot Dataset Generation
UniGen: Universal Domain Generalization for Sentiment Classification via Zero-shot Dataset Generation
Juhwan Choi
Yeonghwa Kim
Seunguk Yu
Jungmin Yun
Youngbin Kim
36
1
0
02 May 2024
Benchmarking Benchmark Leakage in Large Language Models
Benchmarking Benchmark Leakage in Large Language Models
Ruijie Xu
Zengzhi Wang
Run-Ze Fan
Pengfei Liu
53
42
0
29 Apr 2024
PatentGPT: A Large Language Model for Intellectual Property
PatentGPT: A Large Language Model for Intellectual Property
Zilong Bai
Ruiji Zhang
Linqing Chen
Qijun Cai
Yuan Zhong
...
Fu Bian
Xiaolong Gu
Lisha Zhang
Weilei Wang
Changyang Tu
41
3
0
28 Apr 2024
MRScore: Evaluating Radiology Report Generation with LLM-based Reward
  System
MRScore: Evaluating Radiology Report Generation with LLM-based Reward System
Yunyi Liu
Zhanyu Wang
Yingshu Li
Xinyu Liang
Lingqiao Liu
Lei Wang
Luping Zhou
LM&MA
11
3
0
27 Apr 2024
HateTinyLLM : Hate Speech Detection Using Tiny Large Language Models
HateTinyLLM : Hate Speech Detection Using Tiny Large Language Models
Tanmay Sen
Ansuman Das
Mrinmay Sen
36
4
0
26 Apr 2024
TinyChart: Efficient Chart Understanding with Visual Token Merging and
  Program-of-Thoughts Learning
TinyChart: Efficient Chart Understanding with Visual Token Merging and Program-of-Thoughts Learning
Liang Zhang
Anwen Hu
Haiyang Xu
Mingshi Yan
Yichen Xu
Qin Jin
Ji Zhang
Fei Huang
39
15
0
25 Apr 2024
Fake Artificial Intelligence Generated Contents (FAIGC): A Survey of
  Theories, Detection Methods, and Opportunities
Fake Artificial Intelligence Generated Contents (FAIGC): A Survey of Theories, Detection Methods, and Opportunities
Xiaomin Yu
Yezhaohui Wang
Yanfang Chen
Zhen Tao
Dinghao Xi
Shichao Song
Simin Niu
Zhiyu Li
62
7
0
25 Apr 2024
Rethinking LLM Memorization through the Lens of Adversarial Compression
Rethinking LLM Memorization through the Lens of Adversarial Compression
Avi Schwarzschild
Zhili Feng
Pratyush Maini
Zachary Chase Lipton
J. Zico Kolter
39
38
0
23 Apr 2024
Automated Multi-Language to English Machine Translation Using Generative
  Pre-Trained Transformers
Automated Multi-Language to English Machine Translation Using Generative Pre-Trained Transformers
Elijah Pelofske
Vincent Urias
L. Liebrock
26
0
0
23 Apr 2024
Automated Long Answer Grading with RiceChem Dataset
Automated Long Answer Grading with RiceChem Dataset
Shashank Sonkar
Kangqi Ni
Lesa Tran Lu
Kristi Kincaid
John S. Hutchinson
Richard G. Baraniuk
54
7
0
22 Apr 2024
A Survey on Efficient Inference for Large Language Models
A Survey on Efficient Inference for Large Language Models
Zixuan Zhou
Xuefei Ning
Ke Hong
Tianyu Fu
Jiaming Xu
...
Shengen Yan
Guohao Dai
Xiao-Ping Zhang
Yuhan Dong
Yu-Xiang Wang
46
80
0
22 Apr 2024
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your
  Phone
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Marah Abdin
Sam Ade Jacobs
A. A. Awan
J. Aneja
Ahmed Hassan Awadallah
...
Li Lyna Zhang
Yi Zhang
Yue Zhang
Yunan Zhang
Xiren Zhou
LRM
ALM
50
1,019
0
22 Apr 2024
Enabling Natural Zero-Shot Prompting on Encoder Models via
  Statement-Tuning
Enabling Natural Zero-Shot Prompting on Encoder Models via Statement-Tuning
A. Elshabrawy
Yongix Huang
Iryna Gurevych
Alham Fikri Aji
16
0
0
19 Apr 2024
Relevant or Random: Can LLMs Truly Perform Analogical Reasoning?
Relevant or Random: Can LLMs Truly Perform Analogical Reasoning?
Chengwei Qin
Wenhan Xia
Tan Wang
Fangkai Jiao
Yuchen Hu
Bosheng Ding
Ruirui Chen
Shafiq R. Joty
LRM
35
3
0
19 Apr 2024
Towards Multi-modal Transformers in Federated Learning
Towards Multi-modal Transformers in Federated Learning
Guangyu Sun
Matías Mendieta
Aritra Dutta
Xin Li
C. L. P. Chen
67
3
0
18 Apr 2024
Large Language Models in Targeted Sentiment Analysis
Large Language Models in Targeted Sentiment Analysis
Nicolay Rusnachenko
A. Golubev
Natalia V. Loukachevitch
LRM
22
3
0
18 Apr 2024
Pack of LLMs: Model Fusion at Test-Time via Perplexity Optimization
Pack of LLMs: Model Fusion at Test-Time via Perplexity Optimization
Costas Mavromatis
Petros Karypis
George Karypis
MoMe
24
23
0
17 Apr 2024
AgentKit: Flow Engineering with Graphs, not Coding
AgentKit: Flow Engineering with Graphs, not Coding
Yue Wu
Yewen Fan
So Yeon Min
Shrimai Prabhumoye
Stephen Marcus McAleer
Yonatan Bisk
Ruslan Salakhutdinov
Yuanzhi Li
Tom Michael Mitchell
AI4CE
32
0
0
17 Apr 2024
High-Dimension Human Value Representation in Large Language Models
High-Dimension Human Value Representation in Large Language Models
Samuel Cahyawijaya
Delong Chen
Yejin Bang
Leila Khalatbari
Bryan Wilie
Ziwei Ji
Etsuko Ishii
Pascale Fung
63
5
0
11 Apr 2024
Scaling Laws for Data Filtering -- Data Curation cannot be Compute
  Agnostic
Scaling Laws for Data Filtering -- Data Curation cannot be Compute Agnostic
Sachin Goyal
Pratyush Maini
Zachary Chase Lipton
Aditi Raghunathan
J. Zico Kolter
43
40
0
10 Apr 2024
MiniCPM: Unveiling the Potential of Small Language Models with Scalable
  Training Strategies
MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies
Shengding Hu
Yuge Tu
Xu Han
Chaoqun He
Ganqu Cui
...
Chaochao Jia
Guoyang Zeng
Dahai Li
Zhiyuan Liu
Maosong Sun
MoE
38
275
0
09 Apr 2024
RoT: Enhancing Large Language Models with Reflection on Search Trees
RoT: Enhancing Large Language Models with Reflection on Search Trees
Wenyang Hui
Kewei Tu
LRM
27
6
0
08 Apr 2024
ALERT: A Comprehensive Benchmark for Assessing Large Language Models'
  Safety through Red Teaming
ALERT: A Comprehensive Benchmark for Assessing Large Language Models' Safety through Red Teaming
Simone Tedeschi
Felix Friedrich
P. Schramowski
Kristian Kersting
Roberto Navigli
Huu Nguyen
Bo Li
ELM
33
45
0
06 Apr 2024
Prompt Public Large Language Models to Synthesize Data for Private
  On-device Applications
Prompt Public Large Language Models to Synthesize Data for Private On-device Applications
Shanshan Wu
Zheng Xu
Yanxiang Zhang
Yuanbo Zhang
Daniel Ramage
SyDa
21
9
0
05 Apr 2024
Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model
Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model
Xinrun Du
Zhouliang Yu
Songyang Gao
Ding Pan
Yuyang Cheng
...
Tianyu Zheng
Xinchen Luo
Guorui Zhou
Wenhu Chen
Ge Zhang
46
16
0
05 Apr 2024
CantTalkAboutThis: Aligning Language Models to Stay on Topic in
  Dialogues
CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues
Makesh Narsimhan Sreedhar
Traian Rebedea
Shaona Ghosh
Jiaqi Zeng
Christopher Parisien
ALM
27
4
0
04 Apr 2024
The Impact of Prompts on Zero-Shot Detection of AI-Generated Text
The Impact of Prompts on Zero-Shot Detection of AI-Generated Text
Kaito Taguchi
Yujie Gu
Kouichi Sakurai
AAML
DeLMO
24
6
0
29 Mar 2024
Juru: Legal Brazilian Large Language Model from Reputable Sources
Juru: Legal Brazilian Large Language Model from Reputable Sources
Roseval Malaquias Junior
Ramon Pires
R. Romero
R. Nogueira
ELM
AILaw
30
0
0
26 Mar 2024
Untangling Knots: Leveraging LLM for Error Resolution in Computational
  Notebooks
Untangling Knots: Leveraging LLM for Error Resolution in Computational Notebooks
Konstantin Grotov
Sergey Titov
Yaroslav Zharov
T. Bryksin
21
0
0
26 Mar 2024
The Unreasonable Ineffectiveness of the Deeper Layers
The Unreasonable Ineffectiveness of the Deeper Layers
Andrey Gromov
Kushal Tirumala
Hassan Shapourian
Paolo Glorioso
Daniel A. Roberts
41
79
0
26 Mar 2024
Large Language Models in Biomedical and Health Informatics: A
  Bibliometric Review
Large Language Models in Biomedical and Health Informatics: A Bibliometric Review
Huizi Yu
Lizhou Fan
Lingyao Li
Jiayan Zhou
Zihui Ma
...
Sijia He
Mingyu Jin
Yongfeng Zhang
Ashvin Gandhi
Xin Ma
LM&MA
32
11
0
24 Mar 2024
SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate
  Time series
SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate Time series
Badri N. Patro
Vijay Srinivas Agneeswaran
Mamba
51
50
0
22 Mar 2024
PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model
PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model
Zheng-Wei Zhang
Yeyao Ma
Enming Zhang
Xiang Bai
VLM
MLLM
32
29
0
21 Mar 2024
Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference
Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference
Han Zhao
Min Zhang
Wei Zhao
Pengxiang Ding
Siteng Huang
Donglin Wang
Mamba
36
65
0
21 Mar 2024
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models
Yaowei Zheng
Richong Zhang
Junhao Zhang
Yanhan Ye
Zheyan Luo
Zhangchi Feng
Yongqiang Ma
30
360
0
20 Mar 2024
From Pixels to Insights: A Survey on Automatic Chart Understanding in
  the Era of Large Foundation Models
From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models
Kung-Hsiang Huang
Hou Pong Chan
Yi Ren Fung
Haoyi Qiu
Mingyang Zhou
Shafiq R. Joty
Shih-Fu Chang
Heng Ji
AI4TS
64
14
0
18 Mar 2024
Komodo: A Linguistic Expedition into Indonesia's Regional Languages
Komodo: A Linguistic Expedition into Indonesia's Regional Languages
Louis Owen
Vishesh Tripathi
Abhay Kumar
Biddwan Ahmed
ELM
27
7
0
14 Mar 2024
LiveCodeBench: Holistic and Contamination Free Evaluation of Large
  Language Models for Code
LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code
Naman Jain
King Han
Alex Gu
Wen-Ding Li
Fanjia Yan
Tianjun Zhang
Sida I. Wang
Armando Solar-Lezama
Koushik Sen
Ion Stoica
ELM
29
260
0
12 Mar 2024
SmallToLarge (S2L): Scalable Data Selection for Fine-tuning Large
  Language Models by Summarizing Training Trajectories of Small Models
SmallToLarge (S2L): Scalable Data Selection for Fine-tuning Large Language Models by Summarizing Training Trajectories of Small Models
Yu Yang
Siddhartha Mishra
Jeffrey N Chiang
Baharan Mirzasoleiman
32
17
0
12 Mar 2024
Mipha: A Comprehensive Overhaul of Multimodal Assistant with Small
  Language Models
Mipha: A Comprehensive Overhaul of Multimodal Assistant with Small Language Models
Minjie Zhu
Yichen Zhu
Xin Liu
Ning Liu
Zhiyuan Xu
Chaomin Shen
Yaxin Peng
Zhicai Ou
Feifei Feng
Jian Tang
VLM
55
20
0
10 Mar 2024
Breeze-7B Technical Report
Breeze-7B Technical Report
Chan-Jan Hsu
Chang-Le Liu
Feng-Ting Liao
Po-Chun Hsu
Yi-Chang Chen
Da-shan Shiu
21
2
0
05 Mar 2024
Accelerating Greedy Coordinate Gradient via Probe Sampling
Accelerating Greedy Coordinate Gradient via Probe Sampling
Yiran Zhao
Wenyue Zheng
Tianle Cai
Xuan Long Do
Kenji Kawaguchi
Anirudh Goyal
Michael Shieh
36
11
0
02 Mar 2024
MobiLlama: Towards Accurate and Lightweight Fully Transparent GPT
MobiLlama: Towards Accurate and Lightweight Fully Transparent GPT
Omkar Thawakar
Ashmal Vayani
Salman Khan
Hisham Cholakal
Rao M. Anwer
M. Felsberg
Timothy Baldwin
Eric P. Xing
Fahad Shahbaz Khan
46
31
0
26 Feb 2024
RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for
  Short-form Open-Domain Question Answering
RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering
Zihan Zhang
Meng Fang
Ling-Hao Chen
RALM
54
12
0
26 Feb 2024
An Integrated Data Processing Framework for Pretraining Foundation
  Models
An Integrated Data Processing Framework for Pretraining Foundation Models
Yiding Sun
Feng Wang
Yutao Zhu
Wayne Xin Zhao
Jiaxin Mao
46
4
0
26 Feb 2024
CriticBench: Benchmarking LLMs for Critique-Correct Reasoning
CriticBench: Benchmarking LLMs for Critique-Correct Reasoning
Zicheng Lin
Zhibin Gou
Tian Liang
Ruilin Luo
Haowei Liu
Yujiu Yang
LRM
40
43
0
22 Feb 2024
Previous
1234567
Next