2404.14219
Cited By
v1
v2
v3 (latest)
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
22 April 2024
Marah Abdin
Sam Ade Jacobs
A. A. Awan
J. Aneja
Ahmed Hassan Awadallah
Hany Awadalla
Nguyen Bach
Amit Bahree
Arash Bakhtiari
Jianmin Bao
Harkirat Singh Behl
Alon Benhaim
Misha Bilenko
Johan Bjorck
Sébastien Bubeck
Qin Cai
Martin Cai
C. C. T. Mendes
Weizhu Chen
Vishrav Chaudhary
Dong Chen
DongDong Chen
Yen-Chun Chen
Yi-Ling Chen
Parul Chopra
Xiyang Dai
Allison Del Giorno
Gustavo de Rosa
Matthew Dixon
Ronen Eldan
Victor Fragoso
Dan Iter
Mei Gao
Min Gao
Jianfeng Gao
Amit Garg
Abhishek Goswami
Suriya Gunasekar
Emman Haider
Junheng Hao
Russell J. Hewett
Jamie Huynh
Mojan Javaheripi
Xin Jin
Piero Kauffmann
Nikos Karampatziakis
Dongwoo Kim
Mahmoud Khademi
Lev Kurilenko
James R. Lee
Yin Tat Lee
Yuanzhi Li
Yunsheng Li
Chen Liang
Lars Liden
Ce Liu
Mengchen Liu
Weishung Liu
Eric Lin
Zeqi Lin
Chong Luo
Piyush Madan
Matt Mazzola
Arindam Mitra
Hardik Modi
Anh Nguyen
Brandon Norick
Barun Patra
Daniel Perez-Becker
Thomas Portet
Reid Pryzant
Heyang Qin
Marko Radmilac
Corby Rosset
Sambudha Roy
Olatunji Ruwase
Olli Saarikivi
Amin Saied
Adil Salim
Michael Santacroce
Shital Shah
Ning Shang
Hiteshi Sharma
Swadheen Shukla
Xianmin Song
Masahiro Tanaka
Andrea Tupini
Xin Eric Wang
Lijuan Wang
Chunyu Wang
Yu Wang
Rachel A. Ward
Guanhua Wang
Philipp A. Witte
Haiping Wu
Michael Wyatt
Bin Xiao
Can Xu
Jiahang Xu
Weijian Xu
Sonali Yadav
Fan Yang
Jianwei Yang
Ziyi Yang
Yifan Yang
Donghan Yu
Lu Yuan
Cheng-Yuan Zhang
Cyril Zhang
Jianwen Zhang
Li Zhang
Yi Zhang
Yue Zhang
Yunan Zhang
Xiren Zhou
LRM
ALM
HuggingFace (257 upvotes)
Papers citing
"Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone"
50 / 966 papers shown
LoRALib: A Standardized Benchmark for Evaluating LoRA-MoE Methods
Shaoheng Wang
Yao Lu
Yuqi Li
Yaxin Gao
Jiaqi Nie
Shanqing Yu
Yingli Tian
Qi Xuan
MoE
MoMe
154
0
0
14 Sep 2025
LLMAP: LLM-Assisted Multi-Objective Route Planning with User Preferences
Liangqi Yuan
Dong-Jun Han
Christopher G. Brinton
Sabine Brunswicker
166
2
0
14 Sep 2025
Continually Adding New Languages to Multilingual Language Models
A. Owodunni
Sachin Kumar
CLL
KELM
MoMe
206
2
0
14 Sep 2025
Enhancing Generalization in Vision-Language-Action Models by Preserving Pretrained Representations
Shresth Grover
Akshay Gopalkrishnan
Bo Ai
Henrik I. Christensen
H. Su
Xuanlin Li
VLM
225
4
0
14 Sep 2025
TrueSkin: Towards Fair and Accurate Skin Tone Recognition and Generation
Haoming Lu
SyDa
127
1
0
13 Sep 2025
RefactorCoderQA: Benchmarking LLMs for Multi-Domain Coding Question Solutions in Cloud and Edge Deployment
Shadikur Rahman
Aroosa Hameed
Gautam Srivastava
Syed Muhammad Danish
191
1
0
12 Sep 2025
GrACE: A Generative Approach to Better Confidence Elicitation in Large Language Models
Zhaohan Zhang
Ziquan Liu
Ioannis Patras
162
2
0
11 Sep 2025
Towards Better Dental AI: A Multimodal Benchmark and Instruction Dataset for Panoramic X-ray Analysis
Jing Hao
Yuxuan Fan
Yanpeng Sun
Kaixin Guo
Lizhuo Lin
Jinrong Yang
Qi Yong H. Ai
Lun M. Wong
Hao Tang
Kuo Feng Hung
LM&MA
190
5
0
11 Sep 2025
Competitive Audio-Language Models with Data-Efficient Single-Stage Training on Public Data
Gokul Karthik Kumar
Rishabh Saraf
Ludovick Lepauloux
Abdul Muneer
Billel Mokeddem
Hakim Hacid
AuLLM
153
1
0
09 Sep 2025
Spatial Reasoning with Vision-Language Models in Ego-Centric Multi-View Scenes
Mohsen Gholami
A. Rezaei
Zhou Weimin
Sitong Mao
Shunbo Zhou
Yong Zhang
Mohammad Akbari
LRM
208
17
0
08 Sep 2025
HealthSLM-Bench: Benchmarking Small Language Models for Mobile and Wearable Healthcare Monitoring
Xin Wang
Ting Dang
X. Zhang
V. Kostakos
Michael J. Witbrock
Hong Jia
LM&MA
AI4MH
385
1
0
08 Sep 2025
MoGU V2: Toward a Higher Pareto Frontier Between Model Usability and Security
Yanrui Du
Fenglei Fan
Sendong Zhao
Jiawei Cao
Ting Liu
Bing Qin
121
0
0
08 Sep 2025
MedBench-IT: A Comprehensive Benchmark for Evaluating Large Language Models on Italian Medical Entrance Examinations
Ruggero Marino Lazzaroni
Alessandro Angioi
Michelangelo Puliga
Davide Sanna
Roberto Marras
LM&MA
ELM
149
1
0
08 Sep 2025
Self-Aligned Reward: Towards Effective and Efficient Reasoners
Peixuan Han
Adit Krishnan
Gerald Friedland
Jiaxuan You
Chris Kong
LRM
162
1
0
05 Sep 2025
WildScore: Benchmarking MLLMs in-the-Wild Symbolic Music Reasoning
Gagan Mundada
Yash Vishe
Amit Namburi
Xin Xu
Zachary Novack
Julian McAuley
Junda Wu
LRM
140
4
0
05 Sep 2025
Strefer: Empowering Video LLMs with Space-Time Referring and Reasoning via Synthetic Instruction Data
Honglu Zhou
Xiangyu Peng
Shrikant B. Kendre
Michael S Ryoo
Silvio Savarese
Caiming Xiong
Juan Carlos Niebles
130
1
0
03 Sep 2025
Implicit Reasoning in Large Language Models: A Comprehensive Survey
Jindong Li
Yali Fu
Li Fan
Jiahong Liu
Yao Shu
Chengwei Qin
Menglin Yang
Irwin King
Rex Ying
OffRL
LRM
AI4CE
234
14
0
02 Sep 2025
Top-H Decoding: Adapting the Creativity and Coherence with Bounded Entropy in Text Generation
Erfan Baghaei Potraghloo
Seyedarmin Azizi
Souvik Kundu
Massoud Pedram
94
4
0
02 Sep 2025
Unlearning That Lasts: Utility-Preserving, Robust, and Almost Irreversible Forgetting in LLMs
Naman D. Singh
Maximilian Müller
Francesco Croce
Matthias Hein
MU
KELM
CLL
201
4
0
02 Sep 2025
DaMoC: Efficiently Selecting the Optimal Large Language Model for Fine-tuning Domain Tasks Based on Data and Model Compression
Wei Huang
Huang Wei
Yinggui Wang
224
0
0
01 Sep 2025
Kwai Keye-VL 1.5 Technical Report
Biao Yang
Bin Wen
Boyang Ding
Changyi Liu
Chenglong Chu
...
S. Wang
X. Luo
Yan Li
Yuhang Hu
Zixing Zhang
VLM
333
17
0
01 Sep 2025
Improving Large Vision and Language Models by Learning from a Panel of Peers
J. Hernandez
Jing Shi
Simon Jenni
Vicente Ordonez
Kushal Kafle
139
1
0
01 Sep 2025
VideoRewardBench: Comprehensive Evaluation of Multimodal Reward Models for Video Understanding
Zhihong Zhang
Xiaojian Huang
Jin Xu
Zhuodong Luo
Xinzhi Wang
Jiansheng Wei
Xuejin Chen
VLM
125
1
0
30 Aug 2025
DriveQA: Passing the Driving Knowledge Test
Maolin Wei
Wanzhou Liu
Eshed Ohn-Bar
ELM
135
1
0
29 Aug 2025
Med-RewardBench: Benchmarking Reward Models and Judges for Medical Multimodal Large Language Models
Meidan Ding
Jipeng Zhang
Wenxuan Wang
Cheng-Yi Li
Wei-Chieh Fang
Hsin-Yu Wu
Haiqin Zhong
Wenting Chen
LinLin Shen
99
0
0
29 Aug 2025
Leveraging Large Language Models for Generating Research Topic Ontologies: A Multi-Disciplinary Study
Tanay Aggarwal
Angelo Salatino
Francesco Osborne
Enrico Motta
97
0
0
28 Aug 2025
MindGuard: Intrinsic Decision Inspection for Securing LLM Agents Against Metadata Poisoning
Zhiqiang Wang
Junyang Zhang
Guanquan Shi
Haoran Cheng
Yunhao Yao
Kaiwen Guo
Haohua Du
Xiang-Yang Li
165
0
0
28 Aug 2025
NLKI: A lightweight Natural Language Knowledge Integration Framework for Improving Small VLMs in Commonsense VQA Tasks
Aritra Dutta
Swapnanil Mukherjee
Deepanway Ghosal
Somak Aditya
VLM
105
0
0
27 Aug 2025
Ensemble Debates with Local Large Language Models for AI Alignment
Ephraiem Sarabamoun
ELM
360
0
0
27 Aug 2025
KRETA: A Benchmark for Korean Reading and Reasoning in Text-Rich VQA Attuned to Diverse Visual Contexts
Taebaek Hwang
Minseo Kim
Gisang Lee
Seonuk Kim
Hyunjun Eun
VLM
161
1
0
27 Aug 2025
Scalable Object Detection in the Car Interior With Vision Foundation Models
Bálint Mészáros
Ahmet Firintepe
Sebastian Schmidt
Stephan Günnemann
97
0
0
27 Aug 2025
Knowing or Guessing? Robust Medical Visual Question Answering via Joint Consistency and Contrastive Learning
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025
Songtao Jiang
Yihao Chen
Sibo Song
Yan Zhang
Yeying Jin
Yang Feng
Jian Wu
Zuozhu Liu
131
0
0
26 Aug 2025
Hidden Tail: Adversarial Image Causing Stealthy Resource Consumption in Vision-Language Models
Rui Zhang
Z. Wang
Tianli Yang
Hongwei Li
Wenbo Jiang
Qingchuan Zhao
Wenshu Fan
Guowen Xu
AAML
VLM
84
1
0
26 Aug 2025
PKG-DPO: Optimizing Domain-Specific AI systems with Physics Knowledge Graphs and Direct Preference Optimization
Nitin Nagesh Kulkarni
Bryson Wilcox
Max Sawa
Jason Thom
AI4CE
56
0
0
25 Aug 2025
From Global to Local: Social Bias Transfer in CLIP
Ryan Ramos
Yusuke Hirota
Yuta Nakashima
Noa Garcia
122
0
0
25 Aug 2025
MoE-Inference-Bench: Performance Evaluation of Mixture of Expert Large Language and Vision Models
Krishna Teja Chitty-Venkata
Sylvia Howland
Golara Azar
Daria Soboleva
Natalia Vassilieva
Siddhisanket Raskar
M. Emani
V. Vishwanath
MoE
121
1
0
24 Aug 2025
TaDiCodec: Text-aware Diffusion Speech Tokenizer for Speech Language Modeling
Yuancheng Wang
Dekun Chen
Xueyao Zhang
Junan Zhang
Jiaqi Li
Zhizheng Wu
228
4
0
22 Aug 2025
WebMMU: A Benchmark for Multimodal Multilingual Website Understanding and Code Generation
Rabiul Awal
Mahsa Massoud
Aarash Feizi
Zichao Li
Suyuchen Wang
...
Siva Reddy
Juan A. Rodriguez
Perouz Taslakian
Spandana Gella
Sai Rajeswar
LRM
128
10
0
22 Aug 2025
Assess and Prompt: A Generative RL Framework for Improving Engagement in Online Mental Health Communities
Bhagesh Gaur
Karan Gupta
Aseem Srivastava
Manish Gupta
Md. Shad Akhtar
90
0
0
22 Aug 2025
Dynamic Sparse Attention on Mobile SoCs
Wangsong Yin
Daliang Xu
Mengwei Xu
Gang Huang
Xuanzhe Liu
MQ
177
3
0
22 Aug 2025
RoboBuddy in the Classroom: Exploring LLM-Powered Social Robots for Storytelling in Learning and Integration Activities
Daniel Tozadore
Nur Ertug
Yasmine Chaker
Mortadha Abderrahim
57
0
0
22 Aug 2025
Unveiling Trust in Multimodal Large Language Models: Evaluation, Analysis, and Mitigation
Yichi Zhang
Yao Huang
Yifan Wang
Yitong Sun
Chang-rui Liu
...
Xiao Yang
Xingxing Wei
Hang Su
Yinpeng Dong
Jun Zhu
164
1
0
21 Aug 2025
Nemotron-CC-Math: A 133 Billion-Token-Scale High Quality Math Pretraining Dataset
Rabeeh Karimi Mahabadi
S. Satheesh
Shrimai Prabhumoye
M. Patwary
Mohammad Shoeybi
Bryan Catanzaro
156
9
0
20 Aug 2025
Evaluating Open-Source Vision Language Models for Facial Emotion Recognition against Traditional Deep Learning Models
Vamsi Krishna Mulukutla
Sai Supriya Pavarala
Srinivasa Raju Rudraraju
Sridevi Bonthu
VLM
82
0
0
19 Aug 2025
The Hidden Cost of Readability: How Code Formatting Silently Consumes Your LLM Budget
Dangfeng Pan
Zhensu Sun
Cenyuan Zhang
David Lo
Xiaoning Du
114
5
0
19 Aug 2025
Prompt Orchestration Markup Language
Yuge Zhang
Nan Chen
Jiahang Xu
Yuqing Yang
VLM
131
2
0
19 Aug 2025
Beyond Ethical Alignment: Evaluating LLMs as Artificial Moral Assistants
Alessio Galatolo
Luca Alberto Rappuoli
Katie Winkle
Meriem Beloucif
ELM
149
2
0
18 Aug 2025
Is GPT-OSS Good? A Comprehensive Evaluation of OpenAI's Latest Open Source Models
Ziqian Bi
Keyu Chen
Chiung-Yi Tseng
Danyang Zhang
Pohsun Feng
...
Junming Huang
Jibin Guan
Junfeng Hao
Junhao Song
ELM
230
5
0
17 Aug 2025
Rethinking Safety in LLM Fine-tuning: An Optimization Perspective
Minseon Kim
Jin Myung Kwak
Lama Alssum
Bernard Ghanem
Juil Sock
David M. Krueger
Fazl Barez
Adel Bibi
147
4
0
17 Aug 2025
VimoRAG: Video-based Retrieval-augmented 3D Motion Generation for Motion Language Models
Haidong Xu
Guangwei Xu
Zhedong Zheng
Xiatian Zhu
Wei Ji
Xiangtai Li
Ruijie Guo
Meishan Zhang
M. Zhang
Hao Fei
184
1
0
16 Aug 2025