Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2404.14219
Cited By
v1
v2
v3 (latest)
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
22 April 2024
Marah Abdin
Sam Ade Jacobs
A. A. Awan
J. Aneja
Ahmed Hassan Awadallah
Hany Awadalla
Nguyen Bach
Amit Bahree
Arash Bakhtiari
Jianmin Bao
Harkirat Singh Behl
Alon Benhaim
Misha Bilenko
Johan Bjorck
Sébastien Bubeck
Qin Cai
Martin Cai
C. C. T. Mendes
Weizhu Chen
Vishrav Chaudhary
Dong Chen
DongDong Chen
Yen-Chun Chen
Yi-Ling Chen
Parul Chopra
Xiyang Dai
Allison Del Giorno
Gustavo de Rosa
Matthew Dixon
Ronen Eldan
Victor Fragoso
Dan Iter
Mei Gao
Min Gao
Jianfeng Gao
Amit Garg
Abhishek Goswami
Suriya Gunasekar
Emman Haider
Junheng Hao
Russell J. Hewett
Jamie Huynh
Mojan Javaheripi
Xin Jin
Piero Kauffmann
Nikos Karampatziakis
Dongwoo Kim
Mahoud Khademi
Lev Kurilenko
James R. Lee
Yin Tat Lee
Yuanzhi Li
Yunsheng Li
Chen Liang
Lars Liden
Ce Liu
Mengchen Liu
Weishung Liu
Eric Lin
Zeqi Lin
Chong Luo
Piyush Madan
Matt Mazzola
Arindam Mitra
Hardik Modi
Anh Nguyen
Brandon Norick
Barun Patra
Daniel Perez-Becker
Thomas Portet
Reid Pryzant
Heyang Qin
Marko Radmilac
Corby Rosset
Sambudha Roy
Olatunji Ruwase
Olli Saarikivi
Amin Saied
Adil Salim
Michael Santacroce
Shital Shah
Ning Shang
Hiteshi Sharma
Swadheen Shukla
Xianmin Song
Masahiro Tanaka
Andrea Tupini
Xin Eric Wang
Lijuan Wang
Chunyu Wang
Yu Wang
Rachel A. Ward
Guanhua Wang
Philipp A. Witte
Haiping Wu
Michael Wyatt
Bin Xiao
Can Xu
Jiahang Xu
Weijian Xu
Sonali Yadav
Fan Yang
Jianwei Yang
Ziyi Yang
Yifan Yang
Donghan Yu
Lu Yuan
Cheng-Yuan Zhang
Cyril Zhang
Jianwen Zhang
Li Zhang
Yi Zhang
Yue Zhang
Yunan Zhang
Xiren Zhou
LRM
ALM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (257 upvotes)
Papers citing
"Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone"
50 / 965 papers shown
Understanding the Effects of Domain Finetuning on LLMs
Eshaan Tanwar
Deepak Nathani
William Yang Wang
Tanmoy Chakraborty
128
0
0
10 Oct 2025
PhysToolBench: Benchmarking Physical Tool Understanding for MLLMs
Zixin Zhang
Kanghao Chen
Xingwang Lin
Lutao Jiang
Xu Zheng
Yuanhuiyi Lyu
Litao Guo
Yinchuan Li
Ying-Cong Chen
91
3
0
10 Oct 2025
ProxRouter: Proximity-Weighted LLM Query Routing for Improved Robustness to Outliers
Shivam Patel
Neharika Jali
Ankur Mallick
Gauri Joshi
128
0
0
10 Oct 2025
Zero-shot image privacy classification with Vision-Language Models
Alina Elena Baia
Alessio Xompero
Andrea Cavallaro
VLM
88
0
0
10 Oct 2025
On the Representations of Entities in Auto-regressive Large Language Models
Victor Morand
Josiane Mothe
Benjamin Piwowarski
117
0
0
10 Oct 2025
To Sink or Not to Sink: Visual Information Pathways in Large Vision-Language Models
Jiayun Luo
Wan-Cyuan Fan
Lyuyang Wang
Xiangteng He
Tanzila Rahman
Purang Abolmaesumi
Leonid Sigal
LRM
140
0
0
09 Oct 2025
RetouchLLM: Training-free Code-based Image Retouching with Vision Language Models
Moon Ye-Bin
Roy Miles
Tae-Hyun Oh
Ismail Elezi
Jiankang Deng
OffRL
VLM
128
0
0
09 Oct 2025
Stress-Testing Model Specs Reveals Character Differences among Language Models
Jifan Zhang
Henry Sleight
Andi Peng
John Schulman
Esin Durmus
181
0
0
09 Oct 2025
Efficient Discriminative Joint Encoders for Large Scale Vision-Language Reranking
Mitchell Keren Taraday
Shahaf Wagner
Chaim Baskin
VLM
110
1
0
08 Oct 2025
Deploying Tiny LVLM Judges for Real-World Evaluation of Chart Models: Lessons Learned and Best Practices
Md Tahmid Rahman Laskar
Mohammed Saidul Islam
Ridwan Mahbub
Mizanur Rahman
Amran Bhuiyan
Israt Jahan
Mir Tafseer Nayeem
Shafiq Joty
E. Hoque
J. Huang
ELM
ALM
171
0
0
08 Oct 2025
Compressed Convolutional Attention: Efficient Attention in a Compressed Latent Space
Tomás Figliolia
Nicholas Alonso
Rishi Iyer
Quentin Anthony
Beren Millidge
MQ
120
1
0
06 Oct 2025
A Set of Quebec-French Corpus of Regional Expressions and Terms
David Beauchemin
Yan Tremblay
Mohamed Amine Youssef
Richard Khoury
132
2
0
06 Oct 2025
Person-Centric Annotations of LAION-400M: Auditing Bias and Its Transfer to Models
Leander Girrbach
Stephan Alaniz
Genevieve Smith
Trevor Darrell
Zeynep Akata
201
3
0
04 Oct 2025
H-DDx: A Hierarchical Evaluation Framework for Differential Diagnosis
Seungseop Lim
Gibaeg Kim
Hyunkyung Lee
Wooseok Han
Jean Seo
Jaehyo Yoo
Eunho Yang
LM&MA
ELM
144
0
0
04 Oct 2025
MITS: Enhanced Tree Search Reasoning for LLMs via Pointwise Mutual Information
Jiaxi Li
Yucheng Shi
Jin Lu
Ninghao Liu
LRM
133
0
0
04 Oct 2025
Towards Sampling Data Structures for Tensor Products in Turnstile Streams
Zhao Song
Shenghao Xie
Samson Zhou
140
0
0
04 Oct 2025
Beyond CNNs: Efficient Fine-Tuning of Multi-Modal LLMs for Object Detection on Low-Data Regimes
Nirmal Elamon
Rouzbeh Davoudi
ObjD
157
0
0
03 Oct 2025
Dirichlet-Prior Shaping: Guiding Expert Specialization in Upcycled MoEs
Leyla Mirvakhabova
B. Bejnordi
Gaurav Kumar
Hanxue Liang
Wanru Zhao
Paul N. Whatmough
MoE
88
0
0
01 Oct 2025
ModernVBERT: Towards Smaller Visual Document Retrievers
Paul Teiletche
Quentin Macé
Max Conti
António Loison
Gautier Viaud
Pierre Colombo
Manuel Faysse
VLM
284
2
0
01 Oct 2025
Generalized Correctness Models: Learning Calibrated and Model-Agnostic Correctness Predictors from Historical Patterns
Hanqi Xiao
Vaidehi Patil
Hyunji Lee
Elias Stengel-Eskin
Mohit Bansal
168
1
0
29 Sep 2025
Predicting Training Re-evaluation Curves Enables Effective Data Curriculums for LLMs
Shane Bergsma
Nolan Dey
Joel Hestness
162
0
0
29 Sep 2025
Towards Trustworthy Lexical Simplification: Exploring Safety and Efficiency with Small LLMs
Akio Hayakawa
Stefan Bott
Horacio Saggion
56
0
0
29 Sep 2025
AstroMMBench: A Benchmark for Evaluating Multimodal Large Language Models Capabilities in Astronomy
Jinghang Shi
Xiao Yu Tang
Yang Hunag
Yuyang Li
Xiaokong
Yanxia Zhang
Caizhan Yue
193
0
0
29 Sep 2025
Analyzing and Evaluating Unbiased Language Model Watermark
Yihan Wu
Xuehao Cui
Ruibo Chen
Tianyi Zhou
WaLM
164
1
0
28 Sep 2025
LUQ: Layerwise Ultra-Low Bit Quantization for Multimodal Large Language Models
Shubhang Bhatnagar
Andy Xu
Kar-Han Tan
Narendra Ahuja
MQ
194
0
0
28 Sep 2025
Evaluating Program Semantics Reasoning with Type Inference in System F
Yifeng He
Luning Yang
Christopher Castro Gaw Gonzalo
Hao Chen
ReLM
LRM
551
1
0
28 Sep 2025
LLMSQL: Upgrading WikiSQL for the LLM Era of Text-to-SQL
Dzmitry Pihulski
Karol Charchut
Viktoria Novogrodskaia
Jan Kocoń
KELM
286
1
0
27 Sep 2025
Customizing Visual Emotion Evaluation for MLLMs: An Open-vocabulary, Multifaceted, and Scalable Approach
Daiqing Wu
Dongbao Yang
Sicheng Zhao
Can Ma
Can Ma
MLLM
152
1
0
26 Sep 2025
Quantifying the Impact of Structured Output Format on Large Language Models through Causal Inference
Han Yuan
Yue Zhao
Li Zhang
Wuqiong Luo
Zheng Ma
164
0
0
26 Sep 2025
Learning Human-Perceived Fakeness in AI-Generated Videos via Multimodal LLMs
Xingyu Fu
Siyi Liu
Yinuo Xu
Pan Lu
Guangqiuse Hu
...
Chung Un Lee
Yejin Choi
James Zou
Dan Roth
Chris Callison-Burch
153
0
0
26 Sep 2025
Query-Centric Graph Retrieval Augmented Generation
Yaxiong Wu
Jianyuan Bo
Yongyue Zhang
Sheng Liang
Yong Liu
108
0
0
25 Sep 2025
Accelerate Creation of Product Claims Using Generative AI
Po-Yu Liang
Yong Zhang
Tatiana Hwa
Aaron Byers
86
0
0
25 Sep 2025
GEP: A GCG-Based method for extracting personally identifiable information from chatbots built on small language models
Jieli Zhu
Vi Ngoc-Nha Tran
220
0
0
25 Sep 2025
Seeing Through Words, Speaking Through Pixels: Deep Representational Alignment Between Vision and Language Models
Zoe Wanying He
Sean Trott
Meenakshi Khosla
VLM
124
1
0
25 Sep 2025
Tokenization and Representation Biases in Multilingual Models on Dialectal NLP Tasks
Vani Kanjirangat
Tanja Samardžić
Ljiljana Dolamic
Fabio Rinaldi
84
1
0
24 Sep 2025
Polarity Detection of Sustainable Development Goals in News Text
Andrea Cadeddu
Alessandro Chessa
Vincenzo De Leo
Gianni Fenu
Francesco Osborne
Diego Reforgiato Recupero
Angelo Salatino
Luca Secchi
196
0
0
24 Sep 2025
OmniBridge: Unified Multimodal Understanding, Generation, and Retrieval via Latent Space Alignment
Teng Xiao
Zuchao Li
Lefei Zhang
178
1
0
23 Sep 2025
Rule Encoding and Compliance in Large Language Models: An Information-Theoretic Analysis
Joachim Diederich
204
0
0
23 Sep 2025
Are VLMs Ready for Lane Topology Awareness in Autonomous Driving?
Xin Chen
Jia He
Maozheng Li
Dongliang Xu
Tianyu Wang
Yixiao Chen
Zhixin Lin
Yue Yao
169
0
0
20 Sep 2025
ORIC: Benchmarking Object Recognition under Contextual Incongruity in Large Vision-Language Models
Zhaoyang Li
Z. Ling
Yuchen Zhou
Litian Gong
Erdem Bıyık
H. Su
211
0
0
19 Sep 2025
Language-Instructed Reasoning for Group Activity Detection via Multimodal Large Language Model
Jihua Peng
Qianxiong Xu
Yichen Liu
Chenxi Liu
Cheng Long
Rui Zhao
Ziyue Li
LRM
103
0
0
19 Sep 2025
MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer
Yanghao Li
Rui Qian
Bowen Pan
Haotian Zhang
Haoshuo Huang
...
Zhengdong Zhang
Chen Chen
Yang Zhao
Ruoming Pang
Zhifeng Chen
MLLM
204
4
0
19 Sep 2025
CLEAR: A Comprehensive Linguistic Evaluation of Argument Rewriting by Large Language Models
Thomas Huber
Christina Niklaus
113
0
0
18 Sep 2025
Understand Before You Generate: Self-Guided Training for Autoregressive Image Generation
Xiaoyu Yue
Zidong Wang
Yuqing Wang
Wenlong Zhang
Xihui Liu
Wanli Ouyang
Wenlong Zhang
Luping Zhou
GAN
249
2
0
18 Sep 2025
AToken: A Unified Tokenizer for Vision
Jiasen Lu
Liangchen Song
Mingze Xu
Byeongjoo Ahn
Yanjun Wang
Chen Chen
Afshin Dehghan
Yinfei Yang
ViT
236
7
0
17 Sep 2025
Estimating Semantic Alphabet Size for LLM Uncertainty Quantification
Lucas H. McCabe
Rimon Melamed
Thomas Hartvigsen
H. H. Huang
120
0
0
17 Sep 2025
Positional Encoding via Token-Aware Phase Attention
Wang
Sheng Shen
Rémi Munos
Hongyuan Zhan
Yuandong Tian
182
0
0
16 Sep 2025
CLAIRE: A Dual Encoder Network with RIFT Loss and Phi-3 Small Language Model Based Interpretability for Cross-Modality Synthetic Aperture Radar and Optical Land Cover Segmentation
Debopom Sutradhar
Arefin Ittesafun Abian
M. R
Reem E. Mohamed
Sheikh Izzal Azid
Sami Azam
120
0
0
15 Sep 2025
A Dynamic Knowledge Update-Driven Model with Large Language Models for Fake News Detection
International Joint Conference on Artificial Intelligence (IJCAI), 2025
Di Jin
Jun Yang
Xiaobao Wang
Junwei Zhang
Shuqi Li
Dongxiao He
KELM
98
1
0
15 Sep 2025
Pluralistic Off-policy Evaluation and Alignment
Chengkai Huang
Junda Wu
Zhouhang Xie
Yu Xia
Rui Wang
Tong Yu
Subrata Mitra
Julian McAuley
L. Yao
OffRL
172
1
0
15 Sep 2025
Previous
1
2
3
4
5
...
18
19
20
Next