Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2404.14219
Cited By

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your
Phone

v1v2v3 (latest)

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

22 April 2024

Ahmed Hassan Awadallah

Arash Bakhtiari

Jianmin Bao

Harkirat Singh Behl

Sébastien Bubeck

C. C. T. Mendes

Vishrav Chaudhary

Allison Del Giorno

Gustavo de Rosa

Abhishek Goswami

Suriya Gunasekar

Russell J. Hewett

Mojan Javaheripi

Xin Jin

Piero Kauffmann

Nikos Karampatziakis

Yunsheng Li

Daniel Perez-Becker

Olatunji Ruwase

Michael Santacroce

Swadheen Shukla

Masahiro Tanaka

Philipp A. Witte

Fan Yang

Jianwei Yang

Lu Yuan

Cheng-Yuan Zhang

Yue Zhang

ArXiv (abs)PDF HTML HuggingFace (257 upvotes)

Papers citing "Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone"

50 / 965 papers shown

Understanding the Effects of Domain Finetuning on LLMs

Understanding the Effects of Domain Finetuning on LLMs

William Yang Wang

Tanmoy Chakraborty

128

0

0

10 Oct 2025

PhysToolBench: Benchmarking Physical Tool Understanding for MLLMs

PhysToolBench: Benchmarking Physical Tool Understanding for MLLMs

91

3

0

10 Oct 2025

ProxRouter: Proximity-Weighted LLM Query Routing for Improved Robustness to Outliers

ProxRouter: Proximity-Weighted LLM Query Routing for Improved Robustness to Outliers

128

0

0

10 Oct 2025

Zero-shot image privacy classification with Vision-Language Models

Zero-shot image privacy classification with Vision-Language Models

Alina Elena Baia

Alessio Xompero

Andrea Cavallaro

88

0

0

10 Oct 2025

On the Representations of Entities in Auto-regressive Large Language Models

On the Representations of Entities in Auto-regressive Large Language Models

Benjamin Piwowarski

117

0

0

10 Oct 2025

To Sink or Not to Sink: Visual Information Pathways in Large Vision-Language Models

To Sink or Not to Sink: Visual Information Pathways in Large Vision-Language Models

Purang Abolmaesumi

140

0

0

09 Oct 2025

RetouchLLM: Training-free Code-based Image Retouching with Vision Language Models

RetouchLLM: Training-free Code-based Image Retouching with Vision Language Models

128

0

0

09 Oct 2025

Stress-Testing Model Specs Reveals Character Differences among Language Models

Stress-Testing Model Specs Reveals Character Differences among Language Models

181

0

0

09 Oct 2025

Efficient Discriminative Joint Encoders for Large Scale Vision-Language Reranking

Efficient Discriminative Joint Encoders for Large Scale Vision-Language Reranking

Mitchell Keren Taraday

110

1

0

08 Oct 2025

Deploying Tiny LVLM Judges for Real-World Evaluation of Chart Models: Lessons Learned and Best Practices

Deploying Tiny LVLM Judges for Real-World Evaluation of Chart Models: Lessons Learned and Best Practices

Md Tahmid Rahman Laskar

Mohammed Saidul Islam

Mir Tafseer Nayeem

171

0

0

08 Oct 2025

Compressed Convolutional Attention: Efficient Attention in a Compressed Latent Space

Compressed Convolutional Attention: Efficient Attention in a Compressed Latent Space

Tomás Figliolia

Nicholas Alonso

Quentin Anthony

120

1

0

06 Oct 2025

A Set of Quebec-French Corpus of Regional Expressions and Terms

A Set of Quebec-French Corpus of Regional Expressions and Terms

David Beauchemin

Mohamed Amine Youssef

132

2

0

06 Oct 2025

Person-Centric Annotations of LAION-400M: Auditing Bias and Its Transfer to Models

Person-Centric Annotations of LAION-400M: Auditing Bias and Its Transfer to Models

Leander Girrbach

Genevieve Smith

201

3

0

04 Oct 2025

H-DDx: A Hierarchical Evaluation Framework for Differential Diagnosis

H-DDx: A Hierarchical Evaluation Framework for Differential Diagnosis

144

0

0

04 Oct 2025

MITS: Enhanced Tree Search Reasoning for LLMs via Pointwise Mutual Information

MITS: Enhanced Tree Search Reasoning for LLMs via Pointwise Mutual Information

133

0

0

04 Oct 2025

Towards Sampling Data Structures for Tensor Products in Turnstile Streams

Towards Sampling Data Structures for Tensor Products in Turnstile Streams

140

0

0

04 Oct 2025

Beyond CNNs: Efficient Fine-Tuning of Multi-Modal LLMs for Object Detection on Low-Data Regimes

Beyond CNNs: Efficient Fine-Tuning of Multi-Modal LLMs for Object Detection on Low-Data Regimes

Rouzbeh Davoudi

157

0

0

03 Oct 2025

Dirichlet-Prior Shaping: Guiding Expert Specialization in Upcycled MoEs

Dirichlet-Prior Shaping: Guiding Expert Specialization in Upcycled MoEs

Leyla Mirvakhabova

Paul N. Whatmough

88

0

0

01 Oct 2025

ModernVBERT: Towards Smaller Visual Document Retrievers

ModernVBERT: Towards Smaller Visual Document Retrievers

António Loison

284

2

0

01 Oct 2025

Generalized Correctness Models: Learning Calibrated and Model-Agnostic Correctness Predictors from Historical Patterns

Generalized Correctness Models: Learning Calibrated and Model-Agnostic Correctness Predictors from Historical Patterns

Elias Stengel-Eskin

168

1

0

29 Sep 2025

Predicting Training Re-evaluation Curves Enables Effective Data Curriculums for LLMs

Predicting Training Re-evaluation Curves Enables Effective Data Curriculums for LLMs

162

0

0

29 Sep 2025

Towards Trustworthy Lexical Simplification: Exploring Safety and Efficiency with Small LLMs

Towards Trustworthy Lexical Simplification: Exploring Safety and Efficiency with Small LLMs

Horacio Saggion

56

0

0

29 Sep 2025

AstroMMBench: A Benchmark for Evaluating Multimodal Large Language Models Capabilities in Astronomy

AstroMMBench: A Benchmark for Evaluating Multimodal Large Language Models Capabilities in Astronomy

193

0

0

29 Sep 2025

Analyzing and Evaluating Unbiased Language Model Watermark

Analyzing and Evaluating Unbiased Language Model Watermark

164

1

0

28 Sep 2025

LUQ: Layerwise Ultra-Low Bit Quantization for Multimodal Large Language Models

LUQ: Layerwise Ultra-Low Bit Quantization for Multimodal Large Language Models

Shubhang Bhatnagar

194

0

0

28 Sep 2025

Evaluating Program Semantics Reasoning with Type Inference in System F

Evaluating Program Semantics Reasoning with Type Inference in System F

Christopher Castro Gaw Gonzalo

551

1

0

28 Sep 2025

LLMSQL: Upgrading WikiSQL for the LLM Era of Text-to-SQL

LLMSQL: Upgrading WikiSQL for the LLM Era of Text-to-SQL

Dzmitry Pihulski

Viktoria Novogrodskaia

286

1

0

27 Sep 2025

Customizing Visual Emotion Evaluation for MLLMs: An Open-vocabulary, Multifaceted, and Scalable Approach

Customizing Visual Emotion Evaluation for MLLMs: An Open-vocabulary, Multifaceted, and Scalable Approach

152

1

0

26 Sep 2025

Quantifying the Impact of Structured Output Format on Large Language Models through Causal Inference

Quantifying the Impact of Structured Output Format on Large Language Models through Causal Inference

164

0

0

26 Sep 2025

Learning Human-Perceived Fakeness in AI-Generated Videos via Multimodal LLMs

Learning Human-Perceived Fakeness in AI-Generated Videos via Multimodal LLMs

...

Chris Callison-Burch

153

0

0

26 Sep 2025

Query-Centric Graph Retrieval Augmented Generation

Query-Centric Graph Retrieval Augmented Generation

108

0

0

25 Sep 2025

Accelerate Creation of Product Claims Using Generative AI

Accelerate Creation of Product Claims Using Generative AI

86

0

0

25 Sep 2025

GEP: A GCG-Based method for extracting personally identifiable information from chatbots built on small language models

GEP: A GCG-Based method for extracting personally identifiable information from chatbots built on small language models

Vi Ngoc-Nha Tran

220

0

0

25 Sep 2025

Seeing Through Words, Speaking Through Pixels: Deep Representational Alignment Between Vision and Language Models

Seeing Through Words, Speaking Through Pixels: Deep Representational Alignment Between Vision and Language Models

Meenakshi Khosla

124

1

0

25 Sep 2025

Tokenization and Representation Biases in Multilingual Models on Dialectal NLP Tasks

Tokenization and Representation Biases in Multilingual Models on Dialectal NLP Tasks

Vani Kanjirangat

Tanja Samardžić

Ljiljana Dolamic

84

1

0

24 Sep 2025

Polarity Detection of Sustainable Development Goals in News Text

Polarity Detection of Sustainable Development Goals in News Text

Alessandro Chessa

Vincenzo De Leo

Francesco Osborne

Diego Reforgiato Recupero

Angelo Salatino

196

0

0

24 Sep 2025

OmniBridge: Unified Multimodal Understanding, Generation, and Retrieval via Latent Space Alignment

OmniBridge: Unified Multimodal Understanding, Generation, and Retrieval via Latent Space Alignment

178

1

0

23 Sep 2025

Rule Encoding and Compliance in Large Language Models: An Information-Theoretic Analysis

Rule Encoding and Compliance in Large Language Models: An Information-Theoretic Analysis

Joachim Diederich

204

0

0

23 Sep 2025

Are VLMs Ready for Lane Topology Awareness in Autonomous Driving?

Are VLMs Ready for Lane Topology Awareness in Autonomous Driving?

169

0

0

20 Sep 2025

ORIC: Benchmarking Object Recognition under Contextual Incongruity in Large Vision-Language Models

ORIC: Benchmarking Object Recognition under Contextual Incongruity in Large Vision-Language Models

211

0

0

19 Sep 2025

Language-Instructed Reasoning for Group Activity Detection via Multimodal Large Language Model

Language-Instructed Reasoning for Group Activity Detection via Multimodal Large Language Model

103

0

0

19 Sep 2025

MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer

MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer

...

Zhengdong Zhang

204

4

0

19 Sep 2025

CLEAR: A Comprehensive Linguistic Evaluation of Argument Rewriting by Large Language Models

CLEAR: A Comprehensive Linguistic Evaluation of Argument Rewriting by Large Language Models

Christina Niklaus

113

0

0

18 Sep 2025

Understand Before You Generate: Self-Guided Training for Autoregressive Image Generation

Understand Before You Generate: Self-Guided Training for Autoregressive Image Generation

249

2

0

18 Sep 2025

AToken: A Unified Tokenizer for Vision

AToken: A Unified Tokenizer for Vision

236

7

0

17 Sep 2025

Estimating Semantic Alphabet Size for LLM Uncertainty Quantification

Estimating Semantic Alphabet Size for LLM Uncertainty Quantification

Lucas H. McCabe

Thomas Hartvigsen

120

0

0

17 Sep 2025

Positional Encoding via Token-Aware Phase Attention

Positional Encoding via Token-Aware Phase Attention

182

0

0

16 Sep 2025

CLAIRE: A Dual Encoder Network with RIFT Loss and Phi-3 Small Language Model Based Interpretability for Cross-Modality Synthetic Aperture Radar and Optical Land Cover Segmentation

CLAIRE: A Dual Encoder Network with RIFT Loss and Phi-3 Small Language Model Based Interpretability for Cross-Modality Synthetic Aperture Radar and Optical Land Cover Segmentation

Debopom Sutradhar

Arefin Ittesafun Abian

Reem E. Mohamed

Sheikh Izzal Azid

120

0

0

15 Sep 2025

A Dynamic Knowledge Update-Driven Model with Large Language Models for Fake News Detection

A Dynamic Knowledge Update-Driven Model with Large Language Models for Fake News DetectionInternational Joint Conference on Artificial Intelligence (IJCAI), 2025

98

1

0

15 Sep 2025

Pluralistic Off-policy Evaluation and Alignment

Pluralistic Off-policy Evaluation and Alignment

172

1

0

15 Sep 2025

1 2 3 4 5...18 19 20