ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2404.14219
  4. Cited By
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your
  Phone
v1v2v3 (latest)

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

22 April 2024
Marah Abdin
Sam Ade Jacobs
A. A. Awan
J. Aneja
Ahmed Hassan Awadallah
Hany Awadalla
Nguyen Bach
Amit Bahree
Arash Bakhtiari
Jianmin Bao
Harkirat Singh Behl
Alon Benhaim
Misha Bilenko
Johan Bjorck
Sébastien Bubeck
Qin Cai
Martin Cai
C. C. T. Mendes
Weizhu Chen
Vishrav Chaudhary
Dong Chen
DongDong Chen
Yen-Chun Chen
Yi-Ling Chen
Parul Chopra
Xiyang Dai
Allison Del Giorno
Gustavo de Rosa
Matthew Dixon
Ronen Eldan
Victor Fragoso
Dan Iter
Mei Gao
Min Gao
Jianfeng Gao
Amit Garg
Abhishek Goswami
Suriya Gunasekar
Emman Haider
Junheng Hao
Russell J. Hewett
Jamie Huynh
Mojan Javaheripi
Xin Jin
Piero Kauffmann
Nikos Karampatziakis
Dongwoo Kim
Mahoud Khademi
Lev Kurilenko
James R. Lee
Yin Tat Lee
Yuanzhi Li
Yunsheng Li
Chen Liang
Lars Liden
Ce Liu
Mengchen Liu
Weishung Liu
Eric Lin
Zeqi Lin
Chong Luo
Piyush Madan
Matt Mazzola
Arindam Mitra
Hardik Modi
Anh Nguyen
Brandon Norick
Barun Patra
Daniel Perez-Becker
Thomas Portet
Reid Pryzant
Heyang Qin
Marko Radmilac
Corby Rosset
Sambudha Roy
Olatunji Ruwase
Olli Saarikivi
Amin Saied
Adil Salim
Michael Santacroce
Shital Shah
Ning Shang
Hiteshi Sharma
Swadheen Shukla
Xianmin Song
Masahiro Tanaka
Andrea Tupini
Xin Eric Wang
Lijuan Wang
Chunyu Wang
Yu Wang
Rachel A. Ward
Guanhua Wang
Philipp A. Witte
Haiping Wu
Michael Wyatt
Bin Xiao
Can Xu
Jiahang Xu
Weijian Xu
Sonali Yadav
Fan Yang
Jianwei Yang
Ziyi Yang
Yifan Yang
Donghan Yu
Lu Yuan
Cheng-Yuan Zhang
Cyril Zhang
Jianwen Zhang
Li Zhang
Yi Zhang
Yue Zhang
Yunan Zhang
Xiren Zhou
    LRMALM
ArXiv (abs)PDFHTMLHuggingFace (257 upvotes)

Papers citing "Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone"

50 / 965 papers shown
OpenSep: Leveraging Large Language Models with Textual Inversion for
  Open World Audio Separation
OpenSep: Leveraging Large Language Models with Textual Inversion for Open World Audio SeparationConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Tanvir Mahmud
Diana Marculescu
VLM
206
3
0
28 Sep 2024
SciDoc2Diagrammer-MAF: Towards Generation of Scientific Diagrams from
  Documents guided by Multi-Aspect Feedback Refinement
SciDoc2Diagrammer-MAF: Towards Generation of Scientific Diagrams from Documents guided by Multi-Aspect Feedback RefinementConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Ishani Mondal
Zongxia Li
Yufang Hou
Anandhavelu Natarajan
Aparna Garimella
Jordan Boyd-Graber
229
11
0
28 Sep 2024
Data Analysis in the Era of Generative AI
Data Analysis in the Era of Generative AI
J. Inala
Chenglong Wang
Steven Drucker
Gonzalo Ramos
Victor C. Dibia
N. Riche
Dave Brown
Dan Marshall
Jianfeng Gao
234
15
0
27 Sep 2024
FoodMLLM-JP: Leveraging Multimodal Large Language Models for Japanese Recipe Generation
FoodMLLM-JP: Leveraging Multimodal Large Language Models for Japanese Recipe GenerationConference on Multimedia Modeling (MMM), 2024
Yuki Imajuku
Yoko Yamakata
Kiyoharu Aizawa
238
3
0
27 Sep 2024
E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding
E.T. Bench: Towards Open-Ended Event-Level Video-Language UnderstandingNeural Information Processing Systems (NeurIPS), 2024
Ye Liu
Zongyang Ma
Chen Ma
Yang Wu
Ying Shan
Chang Wen Chen
267
52
0
26 Sep 2024
Search and Detect: Training-Free Long Tail Object Detection via
  Web-Image Retrieval
Search and Detect: Training-Free Long Tail Object Detection via Web-Image RetrievalComputer Vision and Pattern Recognition (CVPR), 2024
Mankeerat Sidhu
Hetarth Chopra
Ansel Blume
Jeonghwan Kim
Revanth Gangi Reddy
Heng Ji
ObjDVLM
183
2
0
26 Sep 2024
Discovering the Gems in Early Layers: Accelerating Long-Context LLMs
  with 1000x Input Token Reduction
Discovering the Gems in Early Layers: Accelerating Long-Context LLMs with 1000x Input Token Reduction
Zhenmei Shi
Yifei Ming
Xuan-Phi Nguyen
Yingyu Liang
Shafiq Joty
254
42
0
25 Sep 2024
PACE: Marrying generalization in PArameter-efficient fine-tuning with Consistency rEgularization
PACE: Marrying generalization in PArameter-efficient fine-tuning with Consistency rEgularizationNeural Information Processing Systems (NeurIPS), 2024
Yao Ni
Shan Zhang
Piotr Koniusz
1.1K
14
0
25 Sep 2024
SynChart: Synthesizing Charts from Language Models
SynChart: Synthesizing Charts from Language Models
Mengchen Liu
Qixiu Li
Dongdong Chen
Dong Chen
Jianmin Bao
Yunsheng Li
MLLM
121
0
0
25 Sep 2024
A Comprehensive Evaluation of Large Language Models on Mental Illnesses
A Comprehensive Evaluation of Large Language Models on Mental Illnesses
Abdelrahman Hanafi
Mohammed Saad
Noureldin Zahran
Radwa J. Hanafy
Mohammed E. Fouda
AI4MHELMLM&MA
291
9
0
24 Sep 2024
Archon: An Architecture Search Framework for Inference-Time Techniques
Archon: An Architecture Search Framework for Inference-Time Techniques
Jon Saad-Falcon
Adrian Gamarra Lafuente
Shlok Natarajan
Nahum Maru
Hristo Todorov
...
E. Kelly Buchanan
Mayee Chen
Neel Guha
Christopher Ré
Azalia Mirhoseini
AI4CE
451
38
0
23 Sep 2024
Domino: Eliminating Communication in LLM Training via Generic Tensor
  Slicing and Overlapping
Domino: Eliminating Communication in LLM Training via Generic Tensor Slicing and Overlapping
Guanhua Wang
Chengming Zhang
Sihan Chen
Ang Li
Olatunji Ruwase
173
12
0
23 Sep 2024
Phantom of Latent for Large Language and Vision Models
Phantom of Latent for Large Language and Vision Models
Byung-Kwan Lee
Sangyun Chung
Chae Won Kim
Beomchan Park
Yong Man Ro
VLMLRM
271
12
0
23 Sep 2024
Instruction Tuning Vs. In-Context Learning: Revisiting Large Language
  Models in Few-Shot Computational Social Science
Instruction Tuning Vs. In-Context Learning: Revisiting Large Language Models in Few-Shot Computational Social Science
Taihang Wang
Xiaoman Xu
Yimin Wang
Ye Jiang
197
4
0
23 Sep 2024
MobileViews: A Large-Scale Mobile GUI Dataset
MobileViews: A Large-Scale Mobile GUI Dataset
Longxi Gao
Li Zhang
Shihe Wang
Shangguang Wang
Yuanchun Li
Mengwei Xu
207
13
0
22 Sep 2024
Normalized Narrow Jump To Conclusions: Normalized Narrow Shortcuts for
  Parameter Efficient Early Exit Transformer Prediction
Normalized Narrow Jump To Conclusions: Normalized Narrow Shortcuts for Parameter Efficient Early Exit Transformer PredictionConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Amrit Diggavi Seshadri
160
1
0
21 Sep 2024
Enhancing Logical Reasoning in Large Language Models through Graph-based
  Synthetic Data
Enhancing Logical Reasoning in Large Language Models through Graph-based Synthetic Data
Jiaming Zhou
Abbas Ghaddar
Ge Zhang
Liheng Ma
Yaochen Hu
Soumyasundar Pal
Mark Coates
Bin Wang
Yingxue Zhang
Jianye Hao
ReLMLRM
309
9
0
19 Sep 2024
CLAIR-A: Leveraging Large Language Models to Judge Audio Captions
CLAIR-A: Leveraging Large Language Models to Judge Audio Captions
Tsung-Han Wu
Joseph E. Gonzalez
Trevor Darrell
David M. Chan
222
3
0
19 Sep 2024
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoningInternational Conference on Learning Representations (ICLR), 2024
Zayne Sprague
Fangcong Yin
Juan Diego Rodriguez
Dongwei Jiang
Manya Wadhwa
Prasann Singhal
Xinyu Zhao
Xi Ye
Kyle Mahowald
Greg Durrett
ReLMLRM
632
232
0
18 Sep 2024
Ideal-LLM: Integrating Dual Encoders and Language-Adapted LLM for
  Multilingual Speech-to-Text
Ideal-LLM: Integrating Dual Encoders and Language-Adapted LLM for Multilingual Speech-to-Text
Hongfei Xue
Wei Ren
Xuelong Geng
Kun Wei
Longhao Li
Qijie Shao
Linju Yang
Kai Diao
Lei Xie
AuLLM
184
11
0
17 Sep 2024
CAST: Cross-modal Alignment Similarity Test for Vision Language Models
CAST: Cross-modal Alignment Similarity Test for Vision Language ModelsInternational Conference on Computational Linguistics (COLING), 2024
Gautier Dagan
Olga Loginova
Anil Batra
CoGe
236
1
0
17 Sep 2024
AI Suggestions Homogenize Writing Toward Western Styles and Diminish Cultural Nuances
AI Suggestions Homogenize Writing Toward Western Styles and Diminish Cultural NuancesInternational Conference on Human Factors in Computing Systems (CHI), 2024
Dhruv Agarwal
Mor Naaman
Aditya Vashistha
416
54
0
17 Sep 2024
Leveraging Open-Source Large Language Models for Native Language Identification
Leveraging Open-Source Large Language Models for Native Language Identification
Yee Man Ng
Ilia Markov
283
3
0
15 Sep 2024
Experimenting with Legal AI Solutions: The Case of Question-Answering
  for Access to Justice
Experimenting with Legal AI Solutions: The Case of Question-Answering for Access to Justice
Jonathan Li
R. Bhambhoria
Samuel Dahan
Xiaodan Zhu
ELMAILaw
216
5
0
12 Sep 2024
What is the Role of Small Models in the LLM Era: A Survey
What is the Role of Small Models in the LLM Era: A Survey
Lihu Chen
Gaël Varoquaux
ALM
777
55
0
10 Sep 2024
MoWE-Audio: Multitask AudioLLMs with Mixture of Weak Encoders
MoWE-Audio: Multitask AudioLLMs with Mixture of Weak EncodersIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Feiyu Xiong
Shuo Sun
Sijin Yu
Xunlong Zou
Zhuohan Liu
Yingxu He
Geyu Lin
Nancy F. Chen
Ai Ti Aw
AuLLM
511
3
0
10 Sep 2024
MarS: a Financial Market Simulation Engine Powered by Generative Foundation Model
MarS: a Financial Market Simulation Engine Powered by Generative Foundation ModelInternational Conference on Learning Representations (ICLR), 2024
Junjie Li
Yang Liu
Yuante Li
Shikai Fang
Lewen Wang
Chang Xu
Jiang Bian
VGen
285
15
0
04 Sep 2024
The Role of Large Language Models in Musicology: Are We Ready to Trust
  the Machines?
The Role of Large Language Models in Musicology: Are We Ready to Trust the Machines?
Pedro Ramoneda
Emilia Parada-Cabaleiro
Benno Weck
Xavier Serra
94
3
0
03 Sep 2024
EvoChart: A Benchmark and a Self-Training Approach Towards Real-World Chart Understanding
EvoChart: A Benchmark and a Self-Training Approach Towards Real-World Chart UnderstandingAAAI Conference on Artificial Intelligence (AAAI), 2024
Muye Huang
Han Lai
Xinyu Zhang
Wenjun Wu
Jie Ma
Lingling Zhang
Jun Liu
256
22
0
03 Sep 2024
Evaluating the Performance of Large Language Models in Competitive
  Programming: A Multi-Year, Multi-Grade Analysis
Evaluating the Performance of Large Language Models in Competitive Programming: A Multi-Year, Multi-Grade AnalysisInternational Symposium on INnovations in Intelligent SysTems and Applications (INISTA), 2024
Adrian Marius Dumitran
Adrian Catalin Badea
Stefan-Gabriel Muscalu
ELMLRM
215
5
0
31 Aug 2024
An Investigation of Warning Erroneous Chat Translations in Cross-lingual
  Communication
An Investigation of Warning Erroneous Chat Translations in Cross-lingual CommunicationInternational Joint Conference on Natural Language Processing (IJCNLP), 2024
Yunmeng Li
Jun Suzuki
Makoto Morishita
Kaori Abe
Kentaro Inui
245
26
0
28 Aug 2024
LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs
LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Chansung Park
Juyong Jiang
Fan Wang
Sayak Paul
Jing Tang
369
6
0
24 Aug 2024
A Law of Next-Token Prediction in Large Language Models
A Law of Next-Token Prediction in Large Language ModelsPhysical Review E (Phys. Rev. E), 2024
Hangfeng He
Weijie J. Su
386
7
0
24 Aug 2024
CRUXEval-X: A Benchmark for Multilingual Code Reasoning, Understanding and Execution
CRUXEval-X: A Benchmark for Multilingual Code Reasoning, Understanding and ExecutionAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Ruiyang Xu
Jialun Cao
Yaojie Lu
Ming Wen
Hongyu Lin
Xianpei Han
Ben He
Shing-Chi Cheung
Le Sun
LRMELM
519
15
0
23 Aug 2024
Video-to-Text Pedestrian Monitoring (VTPM): Leveraging Computer Vision
  and Large Language Models for Privacy-Preserve Pedestrian Activity Monitoring
  at Intersections
Video-to-Text Pedestrian Monitoring (VTPM): Leveraging Computer Vision and Large Language Models for Privacy-Preserve Pedestrian Activity Monitoring at Intersections
Ahmed S. Abdelrahman
Mohamed Abdel-Aty
Dongdong Wang
176
6
0
21 Aug 2024
SEA: Supervised Embedding Alignment for Token-Level Visual-Textual Integration in MLLMs
SEA: Supervised Embedding Alignment for Token-Level Visual-Textual Integration in MLLMs
Yuanyang Yin
Yaqi Zhao
Yajie Zhang
Yuanxing Zhang
Ke Lin
Jiahao Wang
Pengfei Wan
Di Zhang
Baoqun Yin
Wentao Zhang
LRM
332
11
0
21 Aug 2024
Reefknot: A Comprehensive Benchmark for Relation Hallucination Evaluation, Analysis and Mitigation in Multimodal Large Language Models
Reefknot: A Comprehensive Benchmark for Relation Hallucination Evaluation, Analysis and Mitigation in Multimodal Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Kening Zheng
Junkai Chen
Yibo Yan
Xin Zou
Xuming Hu
690
17
0
18 Aug 2024
CyberPal.AI: Empowering LLMs with Expert-Driven Cybersecurity
  Instructions
CyberPal.AI: Empowering LLMs with Expert-Driven Cybersecurity InstructionsAAAI Conference on Artificial Intelligence (AAAI), 2024
Matan Levi
Yair Alluouche
Daniel Ohayon
Anton Puzanov
218
12
0
17 Aug 2024
See What LLMs Cannot Answer: A Self-Challenge Framework for Uncovering
  LLM Weaknesses
See What LLMs Cannot Answer: A Self-Challenge Framework for Uncovering LLM Weaknesses
Yulong Chen
Yang Liu
Jianhao Yan
X. Bai
Ming Zhong
Yinghao Yang
Ziyi Yang
Chenguang Zhu
Yue Zhang
ALMELM
202
18
0
16 Aug 2024
PsychoLex: Unveiling the Psychological Mind of Large Language Models
PsychoLex: Unveiling the Psychological Mind of Large Language Models
Mohammad Amin Abbasi
Farnaz Sadat Mirnezami
Hassan Naderi
LM&MA
199
2
0
16 Aug 2024
SelectLLM: Query-Aware Efficient Selection Algorithm for Large Language Models
SelectLLM: Query-Aware Efficient Selection Algorithm for Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Kaushal Kumar Maurya
KV Aditya Srivatsa
Ekaterina Kochmar
323
6
0
16 Aug 2024
Fast Training Dataset Attribution via In-Context Learning
Fast Training Dataset Attribution via In-Context Learning
Milad Fotouhi
M. T. Bahadori
Oluwaseyi Feyisetan
P. Arabshahi
David Heckerman
312
1
0
14 Aug 2024
The advantages of context specific language models: the case of the Erasmian Language Model
The advantages of context specific language models: the case of the Erasmian Language Model
João Gonçalves
Nick Jelicic
Michele Murgia
Evert Stamhuis
202
1
0
13 Aug 2024
Compromesso! Italian Many-Shot Jailbreaks Undermine the Safety of Large
  Language Models
Compromesso! Italian Many-Shot Jailbreaks Undermine the Safety of Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Fabio Pernisi
Dirk Hovy
Paul Röttger
160
3
0
08 Aug 2024
WalledEval: A Comprehensive Safety Evaluation Toolkit for Large Language
  Models
WalledEval: A Comprehensive Safety Evaluation Toolkit for Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Prannaya Gupta
Le Qi Yau
Hao Han Low
I-Shiang Lee
Hugo Maximus Lim
...
Jia Hng Koh
Dar Win Liew
Rishabh Bhardwaj
Rajat Bhardwaj
Soujanya Poria
ELMLM&MA
233
15
0
07 Aug 2024
Openstory++: A Large-scale Dataset and Benchmark for Instance-aware
  Open-domain Visual Storytelling
Openstory++: A Large-scale Dataset and Benchmark for Instance-aware Open-domain Visual Storytelling
Zilyu Ye
Yu Lei
Ruotian Peng
Jinjin Cao
Zhiyang Chen
...
Mingyuan Zhou
Xiaoqian Shen
Mohamed Elhoseiny
Nan Zhuang
Guo-Jun Qi
VGenVLM
195
2
0
07 Aug 2024
RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented
  Generation
RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation
Daniel Fleischer
Moshe Berchansky
Moshe Wasserblat
Peter Izsak
3DV
293
8
0
05 Aug 2024
Large Language Model Aided QoS Prediction for Service Recommendation
Large Language Model Aided QoS Prediction for Service Recommendation
Huiying Liu
Zekun Zhang
Honghao Li
Qilin Wu
Yiwen Zhang
212
4
0
05 Aug 2024
RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework
RAGEval: Scenario Specific RAG Evaluation Dataset Generation FrameworkAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Kunlun Zhu
Yifan Luo
Dingling Xu
Ruobing Wang
Shi Yu
...
Yishan Li
Zhiyuan Liu
Xu Han
Zhiyuan Liu
Maosong Sun
666
38
0
02 Aug 2024
AI-Assisted Generation of Difficult Math Questions
AI-Assisted Generation of Difficult Math Questions
Vedant Shah
Dingli Yu
Kaifeng Lyu
Simon Park
Nan Rosemary Ke
...
Yoshua Bengio
Sanjeev Arora
Anirudh Goyal
Sanjeev Arora
Anirudh Goyal
396
31
0
30 Jul 2024
Previous
123...17181920
Next