Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

22 April 2024
Marah Abdin
Sam Ade Jacobs
A. A. Awan
J. Aneja
Ahmed Hassan Awadallah
Hany Awadalla
Nguyen Bach
Amit Bahree
Arash Bakhtiari
Jianmin Bao
Harkirat Singh Behl
Alon Benhaim
Misha Bilenko
Johan Bjorck
Sébastien Bubeck
Qin Cai
Martin Cai
C. C. T. Mendes
Weizhu Chen
Vishrav Chaudhary
Dong Chen
DongDong Chen
Yen-Chun Chen
Yi-Ling Chen
Parul Chopra
Xiyang Dai
Allison Del Giorno
Gustavo de Rosa
Matthew Dixon
Ronen Eldan
Victor Fragoso
Dan Iter
Mei Gao
Min Gao
Jianfeng Gao
Amit Garg
Abhishek Goswami
Suriya Gunasekar
Emman Haider
Junheng Hao
Russell J. Hewett
Jamie Huynh
Mojan Javaheripi
Xin Jin
Piero Kauffmann
Nikos Karampatziakis
Dongwoo Kim
Mahoud Khademi
Lev Kurilenko
James R. Lee
Yin Tat Lee
Yuanzhi Li
Yunsheng Li
Chen Liang
Lars Liden
Ce Liu
Mengchen Liu
Weishung Liu
Eric Lin
Zeqi Lin
Chong Luo
Piyush Madan
Matt Mazzola
Arindam Mitra
Hardik Modi
Anh Nguyen
Brandon Norick
Barun Patra
Daniel Perez-Becker
Thomas Portet
Reid Pryzant
Heyang Qin
Marko Radmilac
Corby Rosset
Sambudha Roy
Olatunji Ruwase
Olli Saarikivi
Amin Saied
Adil Salim
Michael Santacroce
Shital Shah
Ning Shang
Hiteshi Sharma
Swadheen Shukla
Xianmin Song
Masahiro Tanaka
Andrea Tupini
Xin Eric Wang
Lijuan Wang
Chunyu Wang
Yu Wang
Rachel A. Ward
Guanhua Wang
Philipp A. Witte
Haiping Wu
Michael Wyatt
Bin Xiao
Can Xu
Jiahang Xu
Weijian Xu
Sonali Yadav
Fan Yang
Jianwei Yang
Ziyi Yang
Yifan Yang
Donghan Yu
Lu Yuan
Cheng-Yuan Zhang
Cyril Zhang
Jianwen Zhang
Li Zhang
Yi Zhang
Yue Zhang
Yunan Zhang
Xiren Zhou
LRM ALM
ArXiv (abs) | PDF | HTML | HuggingFace (257 upvotes)

Papers citing "Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone"

50 / 963 papers shown
Title
A Comparative Study of Task Adaptation Techniques of Large Language Models for Identifying Sustainable Development Goals
IEEE Access, 2025
Andrea Cadeddu
Alessandro Chessa
Vincenzo De Leo
Gianni Fenu
Enrico Motta
Francesco Osborne
Diego Reforgiato Recupero
Angelo Salatino
Luca Secchi
176
0
0
18 Jun 2025
GenRecal: Generation after Recalibration from Large to Small Vision-Language Models
Byung-Kwan Lee
Ryo Hachiuma
Yong Man Ro
Yu-Chun Wang
Yueh-Hua Wu
VLM
307
3
0
18 Jun 2025
SciVer: Evaluating Foundation Models for Multimodal Scientific Claim Verification
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Chengye Wang
Yifei Shen
Zexi Kuang
Arman Cohan
Yilun Zhao
188
1
0
18 Jun 2025
Demystifying the Visual Quality Paradox in Multimodal Large Language Models
Shuo Xing
Lanqing Guo
Hongyuan Hua
Seoyoung Lee
Peiran Li
Yufei Wang
Zinan Lin
Zhengzhong Tu
VLM
256
1
0
18 Jun 2025
ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLM
Yujun Wang
Aniri
Jinhe Bi
Soeren Pirk
Yunpu Ma
MLLM
345
11
0
17 Jun 2025
PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning
Yizhen Zhang
Yang Ding
Shuoshuo Zhang
Xinchen Zhang
Haoling Li
...
Jie Wu
Lei Ji
Haoran Pan
Y. Yang
Yeyun Gong
OffRL VLM LRM
193
5
0
17 Jun 2025
Optimal Embedding Learning Rate in LLMs: The Effect of Vocabulary Size
Soufiane Hayou
Liyuan Liu
137
2
0
17 Jun 2025
Enhancing Goal-oriented Proactive Dialogue Systems via Consistency Reflection and Correction
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Didi Zhang
Yaxin Fan
Peifeng Li
Qiaoming Zhu
180
0
0
16 Jun 2025
Rethinking Explainability in the Era of Multimodal AI
Chirag Agarwal
211
1
0
16 Jun 2025
MotiveBench: How Far Are We From Human-Like Motivational Reasoning in Large Language Models?
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Xixian Yong
Jianxun Lian
Xiaoyuan Yi
Xiao Zhou
Xing Xie
LRM
203
0
0
16 Jun 2025
Align-then-Unlearn: Embedding Alignment for LLM Unlearning
Philipp Spohn
Leander Girrbach
Jessica Bader
Zeynep Akata
MU
205
0
0
16 Jun 2025
SPOT: Bridging Natural Language and Geospatial Search for Investigative Journalists
Lynn Khellaf
Ipek Baris Schlicht
Tilman Mirass
Julia Bayer
Tilman Wagner
Ruben Bouwmeester
90
1
0
16 Jun 2025
Assessing the Limits of In-Context Learning beyond Functions using Partially Ordered Relation
Debanjan Dutta
Faizanuddin Ansari
Swagatam Das
114
0
0
16 Jun 2025
PRISM2: Unlocking Multi-Modal General Pathology AI with Clinical Dialogue
Eugene Vorontsov
Adam Casson
Julian Viret
Eric Zimmermann
...
Razik Yousfi
Nicolò Fusi
Thomas J. Fuchs
Kristen Severson
Siqi Liu
MedIm LM&MA
173
5
0
16 Jun 2025
ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional Dependencies
Chenglin Wang
Yucheng Zhou
Qianning Wang
Zhe Wang
Kai Zhang
CoGe
154
10
0
15 Jun 2025
Jailbreak Transferability Emerges from Shared Representations
Rico Angell
Jannik Brinkmann
He He
296
1
0
15 Jun 2025
ConsistencyChecker: Tree-based Evaluation of LLM Generalization Capabilities
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Zhaochen Hong
Haofei Yu
Jiaxuan You
178
1
0
14 Jun 2025
OpenUnlearning: Accelerating LLM Unlearning via Unified Benchmarking of Methods and Metrics
Pratyush Maini
Anmol Mekala
Wenlong Zhao
Andrew McCallum
Zachary Chase Lipton
J. Zico Kolter
MU ELM
319
11
0
14 Jun 2025
MTabVQA: Evaluating Multi-Tabular Reasoning of Language Models in Visual Space
Anshul Singh
Chris Biemann
Jan Strich
LMTD LRM
167
3
0
13 Jun 2025
Curriculum-Guided Layer Scaling for Language Model Pretraining
Karanpartap Singh
Neil Band
Ehsan Adeli
ALM LRM
223
0
0
13 Jun 2025
Burn After Reading: Do Multimodal Large Language Models Truly Capture Order of Events in Image Sequences?
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yingjin Song
Yupei Du
Denis Paperno
Albert Gatt
MLLM
253
1
0
12 Jun 2025
VRBench: A Benchmark for Multi-Step Reasoning in Long Narrative Videos
Jiashuo Yu
Y. Wu
Meng Chu
Zhifei Ren
Z. Huang
...
Conghui He
Yu Qiao
Yali Wang
Yi Wang
L. Wang
LRM
431
4
0
12 Jun 2025
Discovering Hierarchical Latent Capabilities of Language Models via Causal Representation Learning
Jikai Jin
Vasilis Syrgkanis
Sham Kakade
Hanlin Zhang
ELM
315
0
0
12 Jun 2025
Dataset of News Articles with Provenance Metadata for Media Relevance Assessment
Tomas Peterka
Matyas Bohacek
165
0
0
11 Jun 2025
Query-Level Uncertainty in Large Language Models
Lihu Chen
Gerard de Melo
Fabian M. Suchanek
Gaël Varoquaux
367
1
0
11 Jun 2025
Scaling Laws for Uncertainty in Deep Learning
Mattia Rosso
Simone Rossi
Giulio Franzese
Markus Heinonen
Maurizio Filippone
BDL UQCV
222
0
0
11 Jun 2025
Do LLMs Give Psychometrically Plausible Responses in Educational Assessments?
Workshop on Innovative Use of NLP for Building Educational Applications (UNBEA), 2025
Andreas Säuberli
Diego Frassinelli
Barbara Plank
AI4Ed
239
2
0
11 Jun 2025
Enhancing Reasoning Capabilities of Small Language Models with Blueprints and Prompt Template Search
Dongge Han
Menglin Xia
Daniel Madrigal Diaz
Samuel Kessler
Ankur Mallick
Xuchao Zhang
Mirian Hipolito Garcia
Jin Xu
Victor Rühle
Saravan Rajmohan
LRM
195
0
0
10 Jun 2025
Evaluating LLMs Across Multi-Cognitive Levels: From Medical Knowledge Mastery to Scenario-Based Problem Solving
Yuxuan Zhou
Xien Liu
Chenwei Yan
Chen Ning
X. Zhang
...
Xiangling Fu
Shijin Wang
Guoping Hu
Yu Wang
Ji Wu
ELM
205
1
0
10 Jun 2025
Beyond Bias Scores: Unmasking Vacuous Neutrality in Small Language Models
Sumanth Manduru
Carlotta Domeniconi
ALM
224
0
0
10 Jun 2025
Can AI Validate Science? Benchmarking LLMs for Accurate Scientific Claim → Evidence Reasoning
Shashidhar Reddy Javaji
Yun Feng
Haohang Li
Yangyang Yu
Nikhil Muralidhar
Zining Zhu
ELM
159
1
0
09 Jun 2025
Instruction-Tuned Video-Audio Models Elucidate Functional Specialization in the Brain
R. Mamidi
Khushbu Pahwa
Prachi Jindal
Satya Sai Srinath Namburi
Maneesh Singh
Tanmoy Chakraborty
Bapi S. Raju
Subba Reddy Oota
141
0
0
09 Jun 2025
Synthetic Visual Genome
Computer Vision and Pattern Recognition (CVPR), 2025
J. S. Park
Zixian Ma
Linjie Li
Chenhao Zheng
Cheng-Yu Hsieh
...
Quan Kong
Norimasa Kobori
Ali Farhadi
Yejin Choi
Ranjay Krishna
188
1
0
09 Jun 2025
Snap, Segment, Deploy: A Visual Data and Detection Pipeline for Wearable Industrial Assistants
Di Wen
Junwei Zheng
R. Liu
Yi Xu
Kunyu Peng
Rainer Stiefelhagen
159
1
0
09 Jun 2025
WebUIBench: A Comprehensive Benchmark for Evaluating Multimodal Large Language Models in WebUI-to-Code
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Zhiyu Lin
Zhengda Zhou
Zhiyuan Zhao
Tianrui Wan
Yilun Ma
Junyu Gao
Xuelong Li
ELM
193
4
0
09 Jun 2025
A Neurosymbolic Agent System for Compositional Visual Reasoning
Yichang Xu
Gaowen Liu
Ramana Rao Kompella
Sihao Hu
Tiansheng Huang
Fatih Ilhan
Selim Furkan Tekin
Zachary Yahn
LRM VLM
217
0
0
09 Jun 2025
EgoM2P: Egocentric Multimodal Multitask Pretraining
Gen Li
Yutong Chen
Yiqian Wu
Kaifeng Zhao
Marc Pollefeys
Siyu Tang
EgoV VLM
395
4
0
09 Jun 2025
Chain of Methodologies: Scaling Test Time Computation without Training
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Cong Liu
Jie Wu
Weigang Wu
Xu Chen
Guanbin Li
Wei-Shi Zheng
LLMAG LRM AI4CE
193
1
0
08 Jun 2025
Theorem-of-Thought: A Multi-Agent Framework for Abductive, Deductive, and Inductive Reasoning in Language Models
Samir Abdaljalil
Hasan Kurban
K. Qaraqe
E. Serpedin
LM&Ro LRM
163
2
0
08 Jun 2025
Dual-Priv Pruning: Efficient Differential Private Fine-Tuning in Multimodal Large Language Models
Qianshan Wei
Jiaqi Li
Zihan You
Yi Zhan
Kecen Li
...
Yi Yu
Bin Cao
Yiwen Xu
Wenshu Fan
Guilin Qi
AAML VLM
151
1
0
08 Jun 2025
Vision-EKIPL: External Knowledge-Infused Policy Learning for Visual Reasoning
Chaoyang Wang
Zeyu Zhang
Meng Meng
Xu Zhou
Haiyun Jiang
OffRL LRM
210
1
0
07 Jun 2025
Quantile Regression with Large Language Models for Price Prediction
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Nikhita Vedula
Dushyanta Dhyani
Laleh Jalali
Boris Oreshkin
Mohsen Bayati
S. Malmasi
131
3
0
07 Jun 2025
VisioMath: Benchmarking Figure-based Mathematical Reasoning in LMMs
Can Li
Ting Zhang
Mei Wang
Hua Huang
LRM
190
4
0
07 Jun 2025
The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity
Robotics (RAS), 2025
Parshin Shojaee
Iman Mirzadeh
Keivan Alizadeh
Maxwell Horton
Samy Bengio
Mehrdad Farajtabar
LRM
266
215
0
07 Jun 2025
Movie Facts and Fibs (MF$^2$): A Benchmark for Long Movie Understanding
Emmanouil Zaranis
António Farinhas
Saul Santos
Beatriz Canaverde
Miguel Moura Ramos
...
Raffaella Bernardi
Raquel Fernández
Sandro Pezzelle
Vlad Niculae
Andre F. T. Martins
227
3
0
06 Jun 2025
Token Signature: Predicting Chain-of-Thought Gains with Token Decoding Feature in Large Language Models
Peijie Liu
Fengli Xu
Yong Li
LRM
263
1
0
06 Jun 2025
A MISMATCHED Benchmark for Scientific Natural Language Inference
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Firoz Shaik
Mobashir Sadat
Nikita Gautam
Doina Caragea
Cornelia Caragea
182
0
0
05 Jun 2025
Structured Labeling Enables Faster Vision-Language Models for End-to-End Autonomous Driving
Hao Jiang
Chuan Hu
Yukang Shi
Yuan He
Ke Wang
X. Zhang
Zhipeng Zhang
3DV VLM
172
1
0
05 Jun 2025
A Statistical Physics of Language Model Reasoning
Jack David Carson
Amir Reisizadeh
LRM AI4CE
174
1
0
04 Jun 2025
AuthGuard: Generalizable Deepfake Detection via Language Guidance
Guangyu Shen
Zhihua Li
Xiang Xu
Tianchen Zhao
Zheng Zhang
Dongsheng An
Zhuowen Tu
Yifan Xing
Qin Zhang
188
1
0
04 Jun 2025