2404.14219
Cited By
v1
v2
v3 (latest)
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
22 April 2024
Marah Abdin
Sam Ade Jacobs
A. A. Awan
J. Aneja
Ahmed Hassan Awadallah
Hany Awadalla
Nguyen Bach
Amit Bahree
Arash Bakhtiari
Jianmin Bao
Harkirat Singh Behl
Alon Benhaim
Misha Bilenko
Johan Bjorck
Sébastien Bubeck
Qin Cai
Martin Cai
C. C. T. Mendes
Weizhu Chen
Vishrav Chaudhary
Dong Chen
DongDong Chen
Yen-Chun Chen
Yi-Ling Chen
Parul Chopra
Xiyang Dai
Allison Del Giorno
Gustavo de Rosa
Matthew Dixon
Ronen Eldan
Victor Fragoso
Dan Iter
Mei Gao
Min Gao
Jianfeng Gao
Amit Garg
Abhishek Goswami
Suriya Gunasekar
Emman Haider
Junheng Hao
Russell J. Hewett
Jamie Huynh
Mojan Javaheripi
Xin Jin
Piero Kauffmann
Nikos Karampatziakis
Dongwoo Kim
Mahmoud Khademi
Lev Kurilenko
James R. Lee
Yin Tat Lee
Yuanzhi Li
Yunsheng Li
Chen Liang
Lars Liden
Ce Liu
Mengchen Liu
Weishung Liu
Eric Lin
Zeqi Lin
Chong Luo
Piyush Madan
Matt Mazzola
Arindam Mitra
Hardik Modi
Anh Nguyen
Brandon Norick
Barun Patra
Daniel Perez-Becker
Thomas Portet
Reid Pryzant
Heyang Qin
Marko Radmilac
Corby Rosset
Sambudha Roy
Olatunji Ruwase
Olli Saarikivi
Amin Saied
Adil Salim
Michael Santacroce
Shital Shah
Ning Shang
Hiteshi Sharma
Swadheen Shukla
Xianmin Song
Masahiro Tanaka
Andrea Tupini
Xin Eric Wang
Lijuan Wang
Chunyu Wang
Yu Wang
Rachel A. Ward
Guanhua Wang
Philipp A. Witte
Haiping Wu
Michael Wyatt
Bin Xiao
Can Xu
Jiahang Xu
Weijian Xu
Sonali Yadav
Fan Yang
Jianwei Yang
Ziyi Yang
Yifan Yang
Donghan Yu
Lu Yuan
Cheng-Yuan Zhang
Cyril Zhang
Jianwen Zhang
Li Zhang
Yi Zhang
Yue Zhang
Yunan Zhang
Xiren Zhou
LRM
ALM
HuggingFace (257 upvotes)
Papers citing
"Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone"
50 / 958 papers shown
Title
RetouchLLM: Training-free Code-based Image Retouching with Vision Language Models
Moon Ye-Bin
Roy Miles
Tae-Hyun Oh
Ismail Elezi
Jiankang Deng
OffRL
VLM
111
0
0
09 Oct 2025
To Sink or Not to Sink: Visual Information Pathways in Large Vision-Language Models
Jiayun Luo
Wan-Cyuan Fan
Lyuyang Wang
Xiangteng He
Tanzila Rahman
Purang Abolmaesumi
Leonid Sigal
LRM
128
0
0
09 Oct 2025
Stress-Testing Model Specs Reveals Character Differences among Language Models
Jifan Zhang
Henry Sleight
Andi Peng
John Schulman
Esin Durmus
141
0
0
09 Oct 2025
Deploying Tiny LVLM Judges for Real-World Evaluation of Chart Models: Lessons Learned and Best Practices
Md Tahmid Rahman Laskar
Mohammed Saidul Islam
Ridwan Mahbub
Mizanur Rahman
Amran Bhuiyan
Israt Jahan
Mir Tafseer Nayeem
Shafiq Joty
E. Hoque
J. Huang
ELM
ALM
159
0
0
08 Oct 2025
Efficient Discriminative Joint Encoders for Large Scale Vision-Language Reranking
Mitchell Keren Taraday
Shahaf Wagner
Chaim Baskin
VLM
105
1
0
08 Oct 2025
Compressed Convolutional Attention: Efficient Attention in a Compressed Latent Space
Tomás Figliolia
Nicholas Alonso
Rishi Iyer
Quentin Anthony
Beren Millidge
MQ
112
1
0
06 Oct 2025
A Set of Quebec-French Corpus of Regional Expressions and Terms
David Beauchemin
Yan Tremblay
Mohamed Amine Youssef
Richard Khoury
104
2
0
06 Oct 2025
Towards Sampling Data Structures for Tensor Products in Turnstile Streams
Zhao Song
Shenghao Xie
Samson Zhou
128
0
0
04 Oct 2025
H-DDx: A Hierarchical Evaluation Framework for Differential Diagnosis
Seungseop Lim
Gibaeg Kim
Hyunkyung Lee
Wooseok Han
Jean Seo
Jaehyo Yoo
Eunho Yang
LM&MA
ELM
111
0
0
04 Oct 2025
Person-Centric Annotations of LAION-400M: Auditing Bias and Its Transfer to Models
Leander Girrbach
Stephan Alaniz
Genevieve Smith
Trevor Darrell
Zeynep Akata
197
1
0
04 Oct 2025
MITS: Enhanced Tree Search Reasoning for LLMs via Pointwise Mutual Information
Jiaxi Li
Yucheng Shi
Jin Lu
Ninghao Liu
LRM
120
0
0
04 Oct 2025
Beyond CNNs: Efficient Fine-Tuning of Multi-Modal LLMs for Object Detection on Low-Data Regimes
Nirmal Elamon
Rouzbeh Davoudi
ObjD
149
0
0
03 Oct 2025
Dirichlet-Prior Shaping: Guiding Expert Specialization in Upcycled MoEs
Leyla Mirvakhabova
B. Bejnordi
Gaurav Kumar
Hanxue Liang
Wanru Zhao
Paul N. Whatmough
MoE
84
0
0
01 Oct 2025
ModernVBERT: Towards Smaller Visual Document Retrievers
Paul Teiletche
Quentin Macé
Max Conti
António Loison
Gautier Viaud
Pierre Colombo
Manuel Faysse
VLM
214
2
0
01 Oct 2025
Generalized Correctness Models: Learning Calibrated and Model-Agnostic Correctness Predictors from Historical Patterns
Hanqi Xiao
Vaidehi Patil
Hyunji Lee
Elias Stengel-Eskin
Mohit Bansal
164
1
0
29 Sep 2025
AstroMMBench: A Benchmark for Evaluating Multimodal Large Language Models Capabilities in Astronomy
Jinghang Shi
Xiao Yu Tang
Yang Huang
Yuyang Li
Xiaokong
Yanxia Zhang
Caizhan Yue
169
0
0
29 Sep 2025
Predicting Training Re-evaluation Curves Enables Effective Data Curriculums for LLMs
Shane Bergsma
Nolan Dey
Joel Hestness
150
0
0
29 Sep 2025
Towards Trustworthy Lexical Simplification: Exploring Safety and Efficiency with Small LLMs
Akio Hayakawa
Stefan Bott
Horacio Saggion
44
0
0
29 Sep 2025
Analyzing and Evaluating Unbiased Language Model Watermark
Yihan Wu
Xuehao Cui
Ruibo Chen
Tianyi Zhou
WaLM
160
1
0
28 Sep 2025
Evaluating Program Semantics Reasoning with Type Inference in System F
Yifeng He
Luning Yang
Christopher Castro Gaw Gonzalo
Hao Chen
ReLM
LRM
538
1
0
28 Sep 2025
LUQ: Layerwise Ultra-Low Bit Quantization for Multimodal Large Language Models
Shubhang Bhatnagar
Andy Xu
Kar-Han Tan
Narendra Ahuja
MQ
182
0
0
28 Sep 2025
LLMSQL: Upgrading WikiSQL for the LLM Era of Text-to-SQL
Dzmitry Pihulski
Karol Charchut
Viktoria Novogrodskaia
Jan Kocoń
KELM
236
1
0
27 Sep 2025
Customizing Visual Emotion Evaluation for MLLMs: An Open-vocabulary, Multifaceted, and Scalable Approach
Daiqing Wu
Dongbao Yang
Sicheng Zhao
Can Ma
MLLM
144
1
0
26 Sep 2025
Quantifying the Impact of Structured Output Format on Large Language Models through Causal Inference
Han Yuan
Yue Zhao
Li Zhang
Wuqiong Luo
Zheng Ma
148
0
0
26 Sep 2025
Learning Human-Perceived Fakeness in AI-Generated Videos via Multimodal LLMs
Xingyu Fu
Siyi Liu
Yinuo Xu
Pan Lu
Guangqiuse Hu
...
Chung Un Lee
Yejin Choi
James Zou
Dan Roth
Chris Callison-Burch
121
0
0
26 Sep 2025
Query-Centric Graph Retrieval Augmented Generation
Yaxiong Wu
Jianyuan Bo
Yongyue Zhang
Sheng Liang
Yong Liu
85
0
0
25 Sep 2025
Accelerate Creation of Product Claims Using Generative AI
Po-Yu Liang
Yong Zhang
Tatiana Hwa
Aaron Byers
70
0
0
25 Sep 2025
Seeing Through Words, Speaking Through Pixels: Deep Representational Alignment Between Vision and Language Models
Zoe Wanying He
Sean Trott
Meenakshi Khosla
VLM
108
1
0
25 Sep 2025
GEP: A GCG-Based method for extracting personally identifiable information from chatbots built on small language models
Jieli Zhu
Vi Ngoc-Nha Tran
208
0
0
25 Sep 2025
Polarity Detection of Sustainable Development Goals in News Text
Andrea Cadeddu
Alessandro Chessa
Vincenzo De Leo
Gianni Fenu
Francesco Osborne
Diego Reforgiato Recupero
Angelo Salatino
Luca Secchi
148
0
0
24 Sep 2025
Tokenization and Representation Biases in Multilingual Models on Dialectal NLP Tasks
Vani Kanjirangat
Tanja Samardžić
Ljiljana Dolamic
Fabio Rinaldi
80
0
0
24 Sep 2025
Rule Encoding and Compliance in Large Language Models: An Information-Theoretic Analysis
Joachim Diederich
160
0
0
23 Sep 2025
OmniBridge: Unified Multimodal Understanding, Generation, and Retrieval via Latent Space Alignment
Teng Xiao
Zuchao Li
Lefei Zhang
165
0
0
23 Sep 2025
Are VLMs Ready for Lane Topology Awareness in Autonomous Driving?
Xin Chen
Jia He
Maozheng Li
Dongliang Xu
Tianyu Wang
Yixiao Chen
Zhixin Lin
Yue Yao
157
0
0
20 Sep 2025
Language-Instructed Reasoning for Group Activity Detection via Multimodal Large Language Model
Jihua Peng
Qianxiong Xu
Yichen Liu
Chenxi Liu
Cheng Long
Rui Zhao
Ziyue Li
LRM
68
0
0
19 Sep 2025
MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer
Yanghao Li
Rui Qian
Bowen Pan
Haotian Zhang
Haoshuo Huang
...
Zhengdong Zhang
Chen Chen
Yang Zhao
Ruoming Pang
Zhifeng Chen
MLLM
200
4
0
19 Sep 2025
ORIC: Benchmarking Object Recognition under Contextual Incongruity in Large Vision-Language Models
Zhaoyang Li
Z. Ling
Yuchen Zhou
Litian Gong
Erdem Bıyık
H. Su
195
0
0
19 Sep 2025
CLEAR: A Comprehensive Linguistic Evaluation of Argument Rewriting by Large Language Models
Thomas Huber
Christina Niklaus
109
0
0
18 Sep 2025
Understand Before You Generate: Self-Guided Training for Autoregressive Image Generation
Xiaoyu Yue
Zidong Wang
Yuqing Wang
Wenlong Zhang
Xihui Liu
Wanli Ouyang
Luping Zhou
GAN
241
2
0
18 Sep 2025
Estimating Semantic Alphabet Size for LLM Uncertainty Quantification
Lucas H. McCabe
Rimon Melamed
Thomas Hartvigsen
H. H. Huang
106
0
0
17 Sep 2025
AToken: A Unified Tokenizer for Vision
Jiasen Lu
Liangchen Song
Mingze Xu
Byeongjoo Ahn
Yanjun Wang
Chen Chen
Afshin Dehghan
Yinfei Yang
ViT
212
7
0
17 Sep 2025
Positional Encoding via Token-Aware Phase Attention
Wang
Sheng Shen
Rémi Munos
Hongyuan Zhan
Yuandong Tian
174
0
0
16 Sep 2025
A Dynamic Knowledge Update-Driven Model with Large Language Models for Fake News Detection
International Joint Conference on Artificial Intelligence (IJCAI), 2025
Di Jin
Jun Yang
Xiaobao Wang
Junwei Zhang
Shuqi Li
Dongxiao He
KELM
89
1
0
15 Sep 2025
CLAIRE: A Dual Encoder Network with RIFT Loss and Phi-3 Small Language Model Based Interpretability for Cross-Modality Synthetic Aperture Radar and Optical Land Cover Segmentation
Debopom Sutradhar
Arefin Ittesafun Abian
M. R
Reem E. Mohamed
Sheikh Izzal Azid
Sami Azam
96
0
0
15 Sep 2025
Pluralistic Off-policy Evaluation and Alignment
Chengkai Huang
Junda Wu
Zhouhang Xie
Yu Xia
Rui Wang
Tong Yu
Subrata Mitra
Julian McAuley
L. Yao
OffRL
160
1
0
15 Sep 2025
LoRALib: A Standardized Benchmark for Evaluating LoRA-MoE Methods
Shaoheng Wang
Yao Lu
Yuqi Li
Yaxin Gao
Jiaqi Nie
Shanqing Yu
Yingli Tian
Qi Xuan
MoE
MoMe
136
0
0
14 Sep 2025
Enhancing Generalization in Vision-Language-Action Models by Preserving Pretrained Representations
Shresth Grover
Akshay Gopalkrishnan
Bo Ai
Henrik I. Christensen
H. Su
Xuanlin Li
VLM
197
4
0
14 Sep 2025
Continually Adding New Languages to Multilingual Language Models
A. Owodunni
Sachin Kumar
CLL
KELM
MoMe
177
2
0
14 Sep 2025
LLMAP: LLM-Assisted Multi-Objective Route Planning with User Preferences
Liangqi Yuan
Dong-Jun Han
Christopher G. Brinton
Sabine Brunswicker
134
2
0
14 Sep 2025
TrueSkin: Towards Fair and Accurate Skin Tone Recognition and Generation
Haoming Lu
SyDa
94
1
0
13 Sep 2025