ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2404.14219
  4. Cited By
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your
  Phone
v1v2v3 (latest)

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

22 April 2024
Marah Abdin
Sam Ade Jacobs
A. A. Awan
J. Aneja
Ahmed Hassan Awadallah
Hany Awadalla
Nguyen Bach
Amit Bahree
Arash Bakhtiari
Jianmin Bao
Harkirat Singh Behl
Alon Benhaim
Misha Bilenko
Johan Bjorck
Sébastien Bubeck
Qin Cai
Martin Cai
C. C. T. Mendes
Weizhu Chen
Vishrav Chaudhary
Dong Chen
DongDong Chen
Yen-Chun Chen
Yi-Ling Chen
Parul Chopra
Xiyang Dai
Allison Del Giorno
Gustavo de Rosa
Matthew Dixon
Ronen Eldan
Victor Fragoso
Dan Iter
Mei Gao
Min Gao
Jianfeng Gao
Amit Garg
Abhishek Goswami
Suriya Gunasekar
Emman Haider
Junheng Hao
Russell J. Hewett
Jamie Huynh
Mojan Javaheripi
Xin Jin
Piero Kauffmann
Nikos Karampatziakis
Dongwoo Kim
Mahoud Khademi
Lev Kurilenko
James R. Lee
Yin Tat Lee
Yuanzhi Li
Yunsheng Li
Chen Liang
Lars Liden
Ce Liu
Mengchen Liu
Weishung Liu
Eric Lin
Zeqi Lin
Chong Luo
Piyush Madan
Matt Mazzola
Arindam Mitra
Hardik Modi
Anh Nguyen
Brandon Norick
Barun Patra
Daniel Perez-Becker
Thomas Portet
Reid Pryzant
Heyang Qin
Marko Radmilac
Corby Rosset
Sambudha Roy
Olatunji Ruwase
Olli Saarikivi
Amin Saied
Adil Salim
Michael Santacroce
Shital Shah
Ning Shang
Hiteshi Sharma
Swadheen Shukla
Xianmin Song
Masahiro Tanaka
Andrea Tupini
Xin Eric Wang
Lijuan Wang
Chunyu Wang
Yu Wang
Rachel A. Ward
Guanhua Wang
Philipp A. Witte
Haiping Wu
Michael Wyatt
Bin Xiao
Can Xu
Jiahang Xu
Weijian Xu
Sonali Yadav
Fan Yang
Jianwei Yang
Ziyi Yang
Yifan Yang
Donghan Yu
Lu Yuan
Cheng-Yuan Zhang
Cyril Zhang
Jianwen Zhang
Li Zhang
Yi Zhang
Yue Zhang
Yunan Zhang
Xiren Zhou
    LRMALM
ArXiv (abs)PDFHTMLHuggingFace (257 upvotes)

Papers citing "Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone"

50 / 966 papers shown
Cool-Fusion: Fuse Large Language Models without Training
Cool-Fusion: Fuse Large Language Models without TrainingAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Cong Liu
Xiaojun Quan
Yan Pan
Liangzhi Li
Weigang Wu
Xu Chen
MoMeVLM
385
10
0
29 Jul 2024
Urban Safety Perception Assessments via Integrating Multimodal Large Language Models with Street View Images
Urban Safety Perception Assessments via Integrating Multimodal Large Language Models with Street View ImagesCities (Cities), 2024
Jiaxin Zhanga
Yunqin Lia
Tomohiro Fukudab
Bowen Wang
263
1
0
29 Jul 2024
Stretching Each Dollar: Diffusion Training from Scratch on a
  Micro-Budget
Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budget
Vikash Sehwag
Xianghao Kong
Jingtao Li
Michael Spranger
Lingjuan Lyu
DiffM
244
25
0
22 Jul 2024
MINI-SEQUENCE TRANSFORMER: Optimizing Intermediate Memory for Long
  Sequences Training
MINI-SEQUENCE TRANSFORMER: Optimizing Intermediate Memory for Long Sequences Training
Cheng Luo
Jiawei Zhao
Zhuoming Chen
Beidi Chen
A. Anandkumar
266
5
0
22 Jul 2024
Compact Language Models via Pruning and Knowledge Distillation
Compact Language Models via Pruning and Knowledge Distillation
Saurav Muralidharan
Sharath Turuvekere Sreenivas
Raviraj Joshi
Marcin Chochowski
M. Patwary
Mohammad Shoeybi
Bryan Catanzaro
Jan Kautz
Pavlo Molchanov
SyDaMQ
357
117
0
19 Jul 2024
Data-Centric Human Preference with Rationales for Direct Preference Alignment
Data-Centric Human Preference with Rationales for Direct Preference Alignment
H. Just
Ming Jin
Anit Kumar Sahu
Huy Phan
Ruoxi Jia
527
3
0
19 Jul 2024
What's Wrong? Refining Meeting Summaries with LLM Feedback
What's Wrong? Refining Meeting Summaries with LLM Feedback
Frederic Kirstein
Terry Ruas
Bela Gipp
301
9
0
16 Jul 2024
Does Refusal Training in LLMs Generalize to the Past Tense?
Does Refusal Training in LLMs Generalize to the Past Tense?
Maksym Andriushchenko
Nicolas Flammarion
572
66
0
16 Jul 2024
VLMEvalKit: An Open-Source Toolkit for Evaluating Large Multi-Modality Models
VLMEvalKit: An Open-Source Toolkit for Evaluating Large Multi-Modality Models
Haodong Duan
Xinyu Fang
Junming Yang
Xiangyu Zhao
Lin Chen
...
Yuhang Zang
Pan Zhang
Jiaqi Wang
Dahua Lin
Kai Chen
LM&MAVLM
734
359
0
16 Jul 2024
DOCBENCH: A Benchmark for Evaluating LLM-based Document Reading Systems
DOCBENCH: A Benchmark for Evaluating LLM-based Document Reading Systems
Anni Zou
Wenhao Yu
Hongming Zhang
Kaixin Ma
Deng Cai
Zhuosheng Zhang
Hai Zhao
Dong Yu
197
28
0
15 Jul 2024
Uncovering Semantics and Topics Utilized by Threat Actors to Deliver
  Malicious Attachments and URLs
Uncovering Semantics and Topics Utilized by Threat Actors to Deliver Malicious Attachments and URLs
Andrey Yakymovych
Abhishek Singh
99
0
0
11 Jul 2024
Is Your Model Really A Good Math Reasoner? Evaluating Mathematical
  Reasoning with Checklist
Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist
Zihao Zhou
Shudong Liu
Maizhen Ning
Wei Liu
Jindong Wang
Yang Li
Xiaowei Huang
Qiufeng Wang
Kaizhu Huang
ELMLRM
257
45
0
11 Jul 2024
Converging Paradigms: The Synergy of Symbolic and Connectionist AI in
  LLM-Empowered Autonomous Agents
Converging Paradigms: The Synergy of Symbolic and Connectionist AI in LLM-Empowered Autonomous Agents
Haoyi Xiong
Zhiyuan Wang
Xuhong Li
Jiang Bian
Bo Han
Shahid Mumtaz
Laura E. Barnes
LLMAG
586
14
0
11 Jul 2024
Are Large Language Models Really Bias-Free? Jailbreak Prompts for Assessing Adversarial Robustness to Bias Elicitation
Are Large Language Models Really Bias-Free? Jailbreak Prompts for Assessing Adversarial Robustness to Bias Elicitation
Riccardo Cantini
Giada Cosenza
A. Orsino
Domenico Talia
AAML
365
14
0
11 Jul 2024
Teaching Transformers Causal Reasoning through Axiomatic Training
Teaching Transformers Causal Reasoning through Axiomatic Training
Aniket Vashishtha
Abhinav Kumar
Atharva Pandey
Abbavaram Gowtham Reddy
Amit Sharma
Vineeth N. Balasubramanian
Amit Sharma
425
8
0
10 Jul 2024
Internet of Agents: Weaving a Web of Heterogeneous Agents for
  Collaborative Intelligence
Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence
Weize Chen
Ziming You
Ran Li
Yitong Guan
Chen Qian
Chenyang Zhao
Cheng Yang
Ruobing Xie
Zhiyuan Liu
Maosong Sun
LLMAG
332
67
0
09 Jul 2024
What's Wrong with Your Code Generated by Large Language Models? An Extensive Study
What's Wrong with Your Code Generated by Large Language Models? An Extensive Study
Jiajun Sun
Haoxiang Jia
Shenxi Wu
Huiyuan Zheng
Muling Wu
...
Ming-bo Wen
Yuhao Zhou
Y. Wu
Rui Zheng
Ming-bo Wen
285
74
0
08 Jul 2024
Evaluating Language Models for Generating and Judging Programming
  Feedback
Evaluating Language Models for Generating and Judging Programming Feedback
Charles Koutcheme
Nicola Dainese
Arto Hellas
Sami Sarsa
Juho Leinonen
Syed Ashraf
Paul Denny
ELM
188
23
0
05 Jul 2024
Rethinking Visual Prompting for Multimodal Large Language Models with
  External Knowledge
Rethinking Visual Prompting for Multimodal Large Language Models with External Knowledge
Yuanze Lin
Yunsheng Li
Dongdong Chen
Weijian Xu
Ronald Clark
Juil Sock
Lu Yuan
LRMVLM
228
11
0
05 Jul 2024
Stephanie: Step-by-Step Dialogues for Mimicking Human Interactions in
  Social Conversations
Stephanie: Step-by-Step Dialogues for Mimicking Human Interactions in Social Conversations
Hao Yang
Hongyuan Lu
Xinhua Zeng
Yang Liu
Xiang Zhang
Haoran Yang
Yumeng Zhang
Shan Huang
Yiran Wei
Wai Lam
232
5
0
04 Jul 2024
IncogniText: Privacy-enhancing Conditional Text Anonymization via LLM-based Private Attribute Randomization
IncogniText: Privacy-enhancing Conditional Text Anonymization via LLM-based Private Attribute Randomization
Ahmed Frikha
Nassim Walha
Krishna Kanth Nakka
Ricardo Mendes
Xue Jiang
Xuebing Zhou
263
14
0
03 Jul 2024
UnSeenTimeQA: Time-Sensitive Question-Answering Beyond LLMs' Memorization
UnSeenTimeQA: Time-Sensitive Question-Answering Beyond LLMs' Memorization
Md Nayem Uddin
Amir Saeidi
Divij Handa
Agastya Seth
Tran Cao Son
Eduardo Blanco
Steven Corman
Chitta Baral
504
14
0
03 Jul 2024
Can Small Language Models Learn, Unlearn, and Retain Noise Patterns?
Can Small Language Models Learn, Unlearn, and Retain Noise Patterns?
Nicy Scaria
Silvester John Joseph Kennedy
Deepak N. Subramani
MU
374
2
0
01 Jul 2024
MIA-Bench: Towards Better Instruction Following Evaluation of Multimodal LLMs
MIA-Bench: Towards Better Instruction Following Evaluation of Multimodal LLMs
Yusu Qian
Hanrong Ye
J. Fauconnier
Peter Grasch
Yinfei Yang
Zhe Gan
672
41
0
01 Jul 2024
Too Late to Train, Too Early To Use? A Study on Necessity and Viability
  of Low-Resource Bengali LLMs
Too Late to Train, Too Early To Use? A Study on Necessity and Viability of Low-Resource Bengali LLMs
Tamzeed Mahfuz
Satak Kumar Dey
Ruwad Naswan
Hasnaen Adil
Khondker Salman Sayeed
Haz Sameen Shahgir
233
5
0
29 Jun 2024
Scaling Synthetic Data Creation with 1,000,000,000 Personas
Scaling Synthetic Data Creation with 1,000,000,000 Personas
Tao Ge
Xin Chan
Dian Yu
Haitao Mi
Dong Yu
Dong Yu
SyDa
577
277
0
28 Jun 2024
OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and
  Understanding
OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding
Tao Zhang
Xiangtai Li
Hao Fei
Haobo Yuan
Shengqiong Wu
Shunping Ji
Chen Change Loy
Shuicheng Yan
LRMMLLMVLM
329
123
0
27 Jun 2024
Learning to Correct for QA Reasoning with Black-box LLMs
Learning to Correct for QA Reasoning with Black-box LLMs
Jaehyung Kim
Dongyoung Kim
Yiming Yang
LRM
250
6
0
26 Jun 2024
MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning
MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning
Xiangyu Zhao
Xiangtai Li
Haodong Duan
Haian Huang
Yining Li
Kai Chen
Hua Yang
VLMMLLM
335
21
0
25 Jun 2024
VarBench: Robust Language Model Benchmarking Through Dynamic Variable
  Perturbation
VarBench: Robust Language Model Benchmarking Through Dynamic Variable Perturbation
Kun Qian
Shunji Wan
Claudia Tang
Youzhi Wang
Xuanming Zhang
Maximillian Chen
Zhou Yu
AAML
341
21
0
25 Jun 2024
DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning
  Graph
DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning Graph
Zhehao Zhang
Jiaao Chen
Diyi Yang
LRM
228
24
0
25 Jun 2024
Task Oriented In-Domain Data Augmentation
Task Oriented In-Domain Data Augmentation
Xiao Liang
Xinyu Hu
Simiao Zuo
Yeyun Gong
Qiang Lou
Yi Liu
Shao-Lun Huang
Jian Jiao
194
8
0
24 Jun 2024
Evaluation of Language Models in the Medical Context Under
  Resource-Constrained Settings
Evaluation of Language Models in the Medical Context Under Resource-Constrained Settings
Andrea Posada
Daniel Rueckert
Felix Meissen
Philip Muller
LM&MAELM
238
1
0
24 Jun 2024
Can Tool-augmented Large Language Models be Aware of Incomplete Conditions?
Can Tool-augmented Large Language Models be Aware of Incomplete Conditions?
Seungbin Yang
Yujin Baek
Taehee Kim
Jaegul Choo
469
6
0
18 Jun 2024
A Label is Worth a Thousand Images in Dataset Distillation
A Label is Worth a Thousand Images in Dataset DistillationNeural Information Processing Systems (NeurIPS), 2024
Tian Qin
Zhiwei Deng
David Alvarez-Melis
DD
457
24
0
15 Jun 2024
BABILong: Testing the Limits of LLMs with Long Context
  Reasoning-in-a-Haystack
BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-HaystackNeural Information Processing Systems (NeurIPS), 2024
Yuri Kuratov
Aydar Bulatov
Petr Anokhin
Ivan Rodkin
Dmitry Sorokin
Artyom Sorokin
Andrey Kravchenko
RALMALMLRMReLMELM
282
151
0
14 Jun 2024
First Multi-Dimensional Evaluation of Flowchart Comprehension for
  Multimodal Large Language Models
First Multi-Dimensional Evaluation of Flowchart Comprehension for Multimodal Large Language Models
Enming Zhang
Ruobing Yao
Huanyong Liu
Junhui Yu
Jiale Wang
ELMLRM
263
3
0
14 Jun 2024
VideoGPT+: Integrating Image and Video Encoders for Enhanced Video
  Understanding
VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding
Muhammad Maaz
H. Rasheed
Salman Khan
Fahad A Khan
VLMMLLM
263
102
0
13 Jun 2024
LLM Reading Tea Leaves: Automatically Evaluating Topic Models with Large Language Models
LLM Reading Tea Leaves: Automatically Evaluating Topic Models with Large Language Models
Xiaohao Yang
He Zhao
Dinh Q. Phung
Wray Buntine
Lan Du
ALMELM
416
7
0
13 Jun 2024
AlignMMBench: Evaluating Chinese Multimodal Alignment in Large Vision-Language Models
AlignMMBench: Evaluating Chinese Multimodal Alignment in Large Vision-Language Models
Yuhang Wu
Wenmeng Yu
Yean Cheng
Yan Wang
Xiaohan Zhang
Jiazheng Xu
Ming Ding
Yuxiao Dong
358
7
0
13 Jun 2024
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs
  with Nothing
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing
Zhangchen Xu
Fengqing Jiang
Luyao Niu
Yuntian Deng
Radha Poovendran
Yejin Choi
Bill Yuchen Lin
SyDa
355
259
0
12 Jun 2024
Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
Liliang Ren
Yang Liu
Yadong Lu
Haoran Pan
Chen Liang
Weizhu Chen
Mamba
385
115
0
11 Jun 2024
Hints-In-Browser: Benchmarking Language Models for Programming Feedback Generation
Hints-In-Browser: Benchmarking Language Models for Programming Feedback GenerationNeural Information Processing Systems (NeurIPS), 2024
Nachiket Kotalwar
Alkis Gotovos
Adish Singla
ALM
299
12
0
07 Jun 2024
A Survey on Large Language Models for Code Generation
A Survey on Large Language Models for Code Generation
Juyong Jiang
Fan Wang
Jiasi Shen
Sungju Kim
Sunghun Kim
545
526
0
01 Jun 2024
OR-Bench: An Over-Refusal Benchmark for Large Language Models
OR-Bench: An Over-Refusal Benchmark for Large Language Models
Justin Cui
Wei-Lin Chiang
Ion Stoica
Cho-Jui Hsieh
ALM
737
97
0
31 May 2024
ReMoDetect: Reward Models Recognize Aligned LLM's Generations
ReMoDetect: Reward Models Recognize Aligned LLM's Generations
Hyunseok Lee
Jihoon Tack
Jinwoo Shin
DeLMO
293
6
0
27 May 2024
How many samples are needed to train a deep neural network?
How many samples are needed to train a deep neural network?
Pegah Golestaneh
Mahsa Taheri
Johannes Lederer
245
8
0
26 May 2024
Small Language Models for Application Interactions: A Case Study
Small Language Models for Application Interactions: A Case Study
Beibin Li
Yi Zhang
Sébastien Bubeck
Jeevan Pathuri
Ishai Menache
258
7
0
23 May 2024
Super Tiny Language Models
Super Tiny Language Models
Dylan Hillier
Leon Guertler
Cheston Tan
Palaash Agrawal
Ruirui Chen
Bobby Cheng
294
10
0
23 May 2024
Curriculum Direct Preference Optimization for Diffusion and Consistency Models
Curriculum Direct Preference Optimization for Diffusion and Consistency Models
Florinel-Alin Croitoru
Vlad Hondru
Radu Tudor Ionescu
Andrii Zadaianchuk
Mubarak Shah
EGVM
631
20
0
22 May 2024
Previous
123...181920
Next
Page 19 of 20
Pageof 20