ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2404.14219
  4. Cited By
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your
  Phone
v1v2v3 (latest)

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

22 April 2024
Marah Abdin
Sam Ade Jacobs
A. A. Awan
J. Aneja
Ahmed Hassan Awadallah
Hany Awadalla
Nguyen Bach
Amit Bahree
Arash Bakhtiari
Jianmin Bao
Harkirat Singh Behl
Alon Benhaim
Misha Bilenko
Johan Bjorck
Sébastien Bubeck
Qin Cai
Martin Cai
C. C. T. Mendes
Weizhu Chen
Vishrav Chaudhary
Dong Chen
DongDong Chen
Yen-Chun Chen
Yi-Ling Chen
Parul Chopra
Xiyang Dai
Allison Del Giorno
Gustavo de Rosa
Matthew Dixon
Ronen Eldan
Victor Fragoso
Dan Iter
Mei Gao
Min Gao
Jianfeng Gao
Amit Garg
Abhishek Goswami
Suriya Gunasekar
Emman Haider
Junheng Hao
Russell J. Hewett
Jamie Huynh
Mojan Javaheripi
Xin Jin
Piero Kauffmann
Nikos Karampatziakis
Dongwoo Kim
Mahoud Khademi
Lev Kurilenko
James R. Lee
Yin Tat Lee
Yuanzhi Li
Yunsheng Li
Chen Liang
Lars Liden
Ce Liu
Mengchen Liu
Weishung Liu
Eric Lin
Zeqi Lin
Chong Luo
Piyush Madan
Matt Mazzola
Arindam Mitra
Hardik Modi
Anh Nguyen
Brandon Norick
Barun Patra
Daniel Perez-Becker
Thomas Portet
Reid Pryzant
Heyang Qin
Marko Radmilac
Corby Rosset
Sambudha Roy
Olatunji Ruwase
Olli Saarikivi
Amin Saied
Adil Salim
Michael Santacroce
Shital Shah
Ning Shang
Hiteshi Sharma
Swadheen Shukla
Xianmin Song
Masahiro Tanaka
Andrea Tupini
Xin Eric Wang
Lijuan Wang
Chunyu Wang
Yu Wang
Rachel A. Ward
Guanhua Wang
Philipp A. Witte
Haiping Wu
Michael Wyatt
Bin Xiao
Can Xu
Jiahang Xu
Weijian Xu
Sonali Yadav
Fan Yang
Jianwei Yang
Ziyi Yang
Yifan Yang
Donghan Yu
Lu Yuan
Cheng-Yuan Zhang
Cyril Zhang
Jianwen Zhang
Li Zhang
Yi Zhang
Yue Zhang
Yunan Zhang
Xiren Zhou
    LRMALM
ArXiv (abs)PDFHTMLHuggingFace (257 upvotes)

Papers citing "Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone"

50 / 966 papers shown
LOGO -- Long cOntext aliGnment via efficient preference Optimization
LOGO -- Long cOntext aliGnment via efficient preference Optimization
Zecheng Tang
Zechen Sun
Juntao Li
Qiaoming Zhu
Min Zhang
206
6
0
24 Oct 2024
High-dimensional Analysis of Knowledge Distillation: Weak-to-Strong Generalization and Scaling Laws
High-dimensional Analysis of Knowledge Distillation: Weak-to-Strong Generalization and Scaling LawsInternational Conference on Learning Representations (ICLR), 2024
M. E. Ildiz
Halil Alperen Gozeten
Ege Onur Taga
Marco Mondelli
Samet Oymak
501
13
0
24 Oct 2024
ELAICHI: Enhancing Low-resource TTS by Addressing Infrequent and
  Low-frequency Character Bigrams
ELAICHI: Enhancing Low-resource TTS by Addressing Infrequent and Low-frequency Character Bigrams
Srija Anand
Praveen Srinivasa Varadhan
Mehak Singal
Mitesh M. Khapra
179
2
0
23 Oct 2024
CLR-Bench: Evaluating Large Language Models in College-level Reasoning
CLR-Bench: Evaluating Large Language Models in College-level Reasoning
Hao-Heng Chen
Zijin Hong
Yuanchen Bei
Feiran Huang
Xinrun Wang
Yi-Ju Chang
ELMLRM
175
4
0
23 Oct 2024
Captions Speak Louder than Images: Generalizing Foundation Models for E-commerce from High-quality Multimodal Instruction Data
Captions Speak Louder than Images: Generalizing Foundation Models for E-commerce from High-quality Multimodal Instruction Data
Xinyi Ling
Bo Peng
Hanwen Du
Zhihui Zhu
Xia Ning
336
2
0
22 Oct 2024
JMMMU: A Japanese Massive Multi-discipline Multimodal Understanding Benchmark for Culture-aware Evaluation
JMMMU: A Japanese Massive Multi-discipline Multimodal Understanding Benchmark for Culture-aware EvaluationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
Shota Onohara
Atsuyuki Miyai
Yuki Imajuku
Kazuki Egashira
Jeonghun Baek
Xiang Yue
Graham Neubig
Kiyoharu Aizawa
OSLM
699
14
0
22 Oct 2024
ETHIC: Evaluating Large Language Models on Long-Context Tasks with High Information Coverage
ETHIC: Evaluating Large Language Models on Long-Context Tasks with High Information CoverageNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
Taewhoo Lee
Chanwoong Yoon
Kyochul Jang
Donghyeon Lee
Minju Song
Hyunjae Kim
Jaewoo Kang
ELM
349
11
0
22 Oct 2024
Teach Multimodal LLMs to Comprehend Electrocardiographic Images
Teach Multimodal LLMs to Comprehend Electrocardiographic Images
Ruoqi Liu
Yuelin Bai
Xiang Yue
Ping Zhang
135
15
0
21 Oct 2024
Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5%
  Parameters and 90% Performance
Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance
Zhangwei Gao
Zhe Chen
Erfei Cui
Yiming Ren
Weiyun Wang
...
Lewei Lu
Tong Lu
Yu Qiao
Jifeng Dai
Wenhai Wang
VLM
403
88
0
21 Oct 2024
Augmenting Legal Decision Support Systems with LLM-based NLI for
  Analyzing Social Media Evidence
Augmenting Legal Decision Support Systems with LLM-based NLI for Analyzing Social Media Evidence
Ram Mohan Rao Kadiyala
Siddartha Pullakhandam
Kanwal Mehreen
Subhasya Tippareddy
Ashay Srivastava
AILaw
136
1
0
21 Oct 2024
Do Large Language Models Have an English Accent? Evaluating and Improving the Naturalness of Multilingual LLMs
Do Large Language Models Have an English Accent? Evaluating and Improving the Naturalness of Multilingual LLMsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Yanzhu Guo
Simone Conia
Zelin Zhou
Min Li
Saloni Potdar
Henry Xiao
336
16
0
21 Oct 2024
BRIEF: Bridging Retrieval and Inference for Multi-hop Reasoning via Compression
BRIEF: Bridging Retrieval and Inference for Multi-hop Reasoning via CompressionNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
Yuankai Li
Jia-Chen Gu
Di Wu
Kai-Wei Chang
Nanyun Peng
RALMMQ
318
1
0
20 Oct 2024
A Comprehensive Evaluation of Cognitive Biases in LLMs
A Comprehensive Evaluation of Cognitive Biases in LLMs
Simon Malberg
Roman Poletukhin
Carolin M. Schuster
Georg Groh
ELM
332
19
0
20 Oct 2024
Understanding Forgetting in LLM Supervised Fine-Tuning and Preference Learning - A Convex Optimization Perspective
Understanding Forgetting in LLM Supervised Fine-Tuning and Preference Learning - A Convex Optimization Perspective
H. Fernando
Han Shen
Parikshit Ram
Yi Zhou
Horst Samulowitz
Nathalie Baracaldo
Tianyi Chen
CLL
480
10
0
20 Oct 2024
Large Language Models Are Overparameterized Text Encoders
Large Language Models Are Overparameterized Text EncodersWorkshop on Representation Learning for NLP (RepL4NLP), 2024
Thennal D K
Tim Fischer
Chris Biemann
218
4
0
18 Oct 2024
Tell me what I need to know: Exploring LLM-based (Personalized)
  Abstractive Multi-Source Meeting Summarization
Tell me what I need to know: Exploring LLM-based (Personalized) Abstractive Multi-Source Meeting SummarizationConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Frederic Kirstein
Terry Ruas
Robert Kratel
Bela Gipp
136
8
0
18 Oct 2024
TimeSeriesExam: A time series understanding exam
TimeSeriesExam: A time series understanding exam
Yifu Cai
Arjun Choudhry
Mononito Goswami
Artur Dubrawski
ELMAI4TS
204
30
0
18 Oct 2024
LLM The Genius Paradox: A Linguistic and Math Expert's Struggle with Simple Word-based Counting Problems
LLM The Genius Paradox: A Linguistic and Math Expert's Struggle with Simple Word-based Counting Problems
Nan Xu
Xuezhe Ma
LRM
394
5
0
18 Oct 2024
Do LLMs estimate uncertainty well in instruction-following?
Do LLMs estimate uncertainty well in instruction-following?International Conference on Learning Representations (ICLR), 2024
Juyeon Heo
Miao Xiong
Christina Heinze-Deml
Jaya Narain
ELM
386
13
0
18 Oct 2024
NaturalBench: Evaluating Vision-Language Models on Natural Adversarial Samples
NaturalBench: Evaluating Vision-Language Models on Natural Adversarial SamplesNeural Information Processing Systems (NeurIPS), 2024
Baiqi Li
Zhiqiu Lin
Wenxuan Peng
Jean de Dieu Nyandwi
Daniel Jiang
Zixian Ma
Simran Khanuja
Ranjay Krishna
Graham Neubig
Deva Ramanan
AAMLCoGeVLM
658
62
0
18 Oct 2024
EvoPress: Accurate Dynamic Model Compression via Evolutionary Search
EvoPress: Accurate Dynamic Model Compression via Evolutionary Search
Oliver Sieberling
Denis Kuznedelev
Eldar Kurtic
Dan Alistarh
MQ
416
5
0
18 Oct 2024
Do LLMs "know" internally when they follow instructions?
Do LLMs "know" internally when they follow instructions?International Conference on Learning Representations (ICLR), 2024
Juyeon Heo
Christina Heinze-Deml
Oussama Elachqar
Shirley Ren
Udhay Nallasamy
Andy Miller
Kwan Ho Ryan Chan
Jaya Narain
408
22
0
18 Oct 2024
$γ-$MoD: Exploring Mixture-of-Depth Adaptation for Multimodal Large
  Language Models
γ−γ-γ−MoD: Exploring Mixture-of-Depth Adaptation for Multimodal Large Language Models
Yaxin Luo
Gen Luo
Jinfa Huang
Weihao Ye
Xiaoshuai Sun
Zhiqiang Shen
Rongrong Ji
VLMMoE
279
9
0
17 Oct 2024
BenTo: Benchmark Task Reduction with In-Context Transferability
BenTo: Benchmark Task Reduction with In-Context Transferability
Hongyu Zhao
Ming Li
Lichao Sun
Tianyi Zhou
298
2
0
17 Oct 2024
Large Language Models as Narrative-Driven Recommenders
Large Language Models as Narrative-Driven RecommendersThe Web Conference (WWW), 2024
Lukas Eberhard
Thorsten Ruprechter
Denis Helic
LRM
261
1
0
17 Oct 2024
Breaking Chains: Unraveling the Links in Multi-Hop Knowledge Unlearning
Breaking Chains: Unraveling the Links in Multi-Hop Knowledge Unlearning
Minseok Choi
C. Park
Dohyun Lee
Jaegul Choo
KELMMU
164
4
0
17 Oct 2024
Trust but Verify: Programmatic VLM Evaluation in the Wild
Trust but Verify: Programmatic VLM Evaluation in the Wild
Viraj Prabhu
Senthil Purushwalkam
An Yan
Caiming Xiong
Ran Xu
MLLM
169
2
0
17 Oct 2024
BQA: Body Language Question Answering Dataset for Video Large Language Models
BQA: Body Language Question Answering Dataset for Video Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Shintaro Ozaki
Kazuki Hayashi
Miyu Oba
Yusuke Sakai
Hidetaka Kamigaito
Taro Watanabe
429
3
0
17 Oct 2024
MIRAGE-Bench: Automatic Multilingual Benchmark Arena for Retrieval-Augmented Generation Systems
MIRAGE-Bench: Automatic Multilingual Benchmark Arena for Retrieval-Augmented Generation SystemsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
Nandan Thakur
Suleman Kazi
Ge Luo
Jimmy J. Lin
Amin Ahmad
VLMRALM
465
14
0
17 Oct 2024
Understanding the Role of LLMs in Multimodal Evaluation Benchmarks
Understanding the Role of LLMs in Multimodal Evaluation Benchmarks
Botian Jiang
Lei Li
Xiaonan Li
Zhaowei Li
Xiachong Feng
Dianbo Sui
Qiang Liu
Xipeng Qiu
211
5
0
16 Oct 2024
Table-LLM-Specialist: Language Model Specialists for Tables using
  Iterative Generator-Validator Fine-tuning
Table-LLM-Specialist: Language Model Specialists for Tables using Iterative Generator-Validator Fine-tuning
Junjie Xing
Yeye He
Mengyu Zhou
Haoyu Dong
Shi Han
Dongmei Zhang
S. Chaudhuri
LMTD
194
6
0
16 Oct 2024
WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines
WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global CuisinesNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024
Genta Indra Winata
Frederikus Hudi
Patrick Amadeus Irawan
David Anugraha
Rifki Afina Putri
...
Alham Fikri Aji
Taro Watanabe
Derry Wijaya
Alice Oh
Chong-Wah Ngo
CoGe
500
33
0
16 Oct 2024
Enabling Data-Driven and Empathetic Interactions: A Context-Aware 3D
  Virtual Agent in Mixed Reality for Enhanced Financial Customer Experience
Enabling Data-Driven and Empathetic Interactions: A Context-Aware 3D Virtual Agent in Mixed Reality for Enhanced Financial Customer Experience
Cindy Xu
Mengyu Chen
Pranav Deshpande
Elvir Azanli
Runqing Yang
Joseph Ligman
111
3
0
15 Oct 2024
Scaling Laws for Multilingual Language Models
Scaling Laws for Multilingual Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Yifei He
Alon Benhaim
Barun Patra
Praneetha Vaddamanu
Sanchit Ahuja
Parul Chopra
Vishrav Chaudhary
Han Zhao
Xia Song
230
13
0
15 Oct 2024
DISP-LLM: Dimension-Independent Structural Pruning for Large Language
  Models
DISP-LLM: Dimension-Independent Structural Pruning for Large Language ModelsNeural Information Processing Systems (NeurIPS), 2024
Shangqian Gao
Chi-Heng Lin
Ting Hua
Tang Zheng
Yilin Shen
Hongxia Jin
Yen-Chang Hsu
245
19
0
15 Oct 2024
BSM: Small but Powerful Biological Sequence Model for Genes and Proteins
BSM: Small but Powerful Biological Sequence Model for Genes and Proteins
Weixi Xiang
Xueting Han
Xiujuan Chai
Jing Bai
118
1
0
15 Oct 2024
Towards More Effective Table-to-Text Generation: Assessing In-Context
  Learning and Self-Evaluation with Open-Source Models
Towards More Effective Table-to-Text Generation: Assessing In-Context Learning and Self-Evaluation with Open-Source Models
Sahar Iravani
Tim . O . F Conrad
LMTD
275
0
0
15 Oct 2024
Survey and Evaluation of Converging Architecture in LLMs based on
  Footsteps of Operations
Survey and Evaluation of Converging Architecture in LLMs based on Footsteps of OperationsIEEE Open Journal of the Computer Society (JOCS), 2024
Seongho Kim
Jihyun Moon
Juntaek Oh
Insu Choi
Joon-Sung Yang
162
0
0
15 Oct 2024
Latent Action Pretraining from Videos
Latent Action Pretraining from VideosInternational Conference on Learning Representations (ICLR), 2024
Seonghyeon Ye
Joel Jang
Byeongguk Jeon
Sejune Joo
Jianwei Yang
...
Kimin Lee
J. Gao
Luke Zettlemoyer
Dieter Fox
Minjoon Seo
441
145
0
15 Oct 2024
SHAKTI: A 2.5 Billion Parameter Small Language Model Optimized for Edge AI and Low-Resource Environments
SHAKTI: A 2.5 Billion Parameter Small Language Model Optimized for Edge AI and Low-Resource EnvironmentsArtificial Intelligence Applications and Innovations (AIAI), 2024
Syed Abdul Gaffar Shakhadri
Kruthika KR
Rakshit Aralimatti
VLM
191
4
0
15 Oct 2024
PAVLM: Advancing Point Cloud based Affordance Understanding Via Vision-Language Model
PAVLM: Advancing Point Cloud based Affordance Understanding Via Vision-Language Model
Shang-Ching Liu
Van-Nhiem Tran
Wenkai Chen
Wei-Lun Cheng
Yen-Lin Huang
I-Bin Liao
Yung-Hui Li
Jianwei Zhang
298
2
0
15 Oct 2024
SlideChat: A Large Vision-Language Assistant for Whole-Slide Pathology Image Understanding
SlideChat: A Large Vision-Language Assistant for Whole-Slide Pathology Image UnderstandingComputer Vision and Pattern Recognition (CVPR), 2024
Ying Chen
Guoan Wang
Yuanfeng Ji
Yanjun Li
Jin Ye
Tianbin Li
Bin Zhang
Nana Pei
Rongshan Yu
Yu Qiao
VLMLM&MA
371
26
0
15 Oct 2024
Measuring Spiritual Values and Bias of Large Language Models
Measuring Spiritual Values and Bias of Large Language Models
Songyuan Liu
Ziyang Zhang
Runze Yan
Wei Wu
Carl Yang
Jiaying Lu
151
1
0
15 Oct 2024
Liger Kernel: Efficient Triton Kernels for LLM Training
Liger Kernel: Efficient Triton Kernels for LLM Training
Pin-Lun Hsu
Ata Fatahibaarzi
Vignesh Kothapalli
Qingquan Song
Shao Tang
Sirou Zhu
Steven Shimizu
Shivam Sahni
Haowen Ning
Yanning Chen
492
97
0
14 Oct 2024
When Does Perceptual Alignment Benefit Vision Representations?
When Does Perceptual Alignment Benefit Vision Representations?Neural Information Processing Systems (NeurIPS), 2024
Shobhita Sundaram
Stephanie Fu
Lukas Muttenthaler
Netanel Y. Tamir
Lucy Chai
Simon Kornblith
Trevor Darrell
Phillip Isola
284
43
1
14 Oct 2024
HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
HART: Efficient Visual Generation with Hybrid Autoregressive TransformerInternational Conference on Learning Representations (ICLR), 2024
Haotian Tang
Yecheng Wu
Shang Yang
Enze Xie
Junsong Chen
Junyu Chen
Zhuoyang Zhang
Han Cai
Yaojie Lu
Song Han
404
105
0
14 Oct 2024
HSR-Enhanced Sparse Attention Acceleration
HSR-Enhanced Sparse Attention Acceleration
Bo Chen
Yingyu Liang
Zhizhou Sha
Zhenmei Shi
Zhao Song
818
23
0
14 Oct 2024
Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts
Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family ExpertsInternational Conference on Learning Representations (ICLR), 2024
Guorui Zheng
Xidong Wang
Juhao Liang
Nuo Chen
Yuping Zheng
Benyou Wang
MoE
312
11
0
14 Oct 2024
Assessing Dialect Fairness and Robustness of Large Language Models in Reasoning Tasks
Assessing Dialect Fairness and Robustness of Large Language Models in Reasoning TasksAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Fangru Lin
Shaoguang Mao
Emanuele La Malfa
Valentin Hofmann
Adrian de Wynter
Jing Yao
Si-Qing Chen
Michael Wooldridge
J. Pierrehumbert
Furu Wei
521
3
0
14 Oct 2024
3DArticCyclists: Generating Synthetic Articulated 8D Pose-Controllable Cyclist Data for Computer Vision Applications
3DArticCyclists: Generating Synthetic Articulated 8D Pose-Controllable Cyclist Data for Computer Vision Applications
Eduardo R. Corral-Soto
Yang Liu
Tongtong Cao
Y. Ren
Liu Bingbing
458
11
0
14 Oct 2024
Previous
123...151617181920
Next
Page 16 of 20
Pageof 20