Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2404.14219
Cited By
v1
v2
v3 (latest)
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
22 April 2024
Marah Abdin
Sam Ade Jacobs
A. A. Awan
J. Aneja
Ahmed Hassan Awadallah
Hany Awadalla
Nguyen Bach
Amit Bahree
Arash Bakhtiari
Jianmin Bao
Harkirat Singh Behl
Alon Benhaim
Misha Bilenko
Johan Bjorck
Sébastien Bubeck
Qin Cai
Martin Cai
C. C. T. Mendes
Weizhu Chen
Vishrav Chaudhary
Dong Chen
DongDong Chen
Yen-Chun Chen
Yi-Ling Chen
Parul Chopra
Xiyang Dai
Allison Del Giorno
Gustavo de Rosa
Matthew Dixon
Ronen Eldan
Victor Fragoso
Dan Iter
Mei Gao
Min Gao
Jianfeng Gao
Amit Garg
Abhishek Goswami
Suriya Gunasekar
Emman Haider
Junheng Hao
Russell J. Hewett
Jamie Huynh
Mojan Javaheripi
Xin Jin
Piero Kauffmann
Nikos Karampatziakis
Dongwoo Kim
Mahoud Khademi
Lev Kurilenko
James R. Lee
Yin Tat Lee
Yuanzhi Li
Yunsheng Li
Chen Liang
Lars Liden
Ce Liu
Mengchen Liu
Weishung Liu
Eric Lin
Zeqi Lin
Chong Luo
Piyush Madan
Matt Mazzola
Arindam Mitra
Hardik Modi
Anh Nguyen
Brandon Norick
Barun Patra
Daniel Perez-Becker
Thomas Portet
Reid Pryzant
Heyang Qin
Marko Radmilac
Corby Rosset
Sambudha Roy
Olatunji Ruwase
Olli Saarikivi
Amin Saied
Adil Salim
Michael Santacroce
Shital Shah
Ning Shang
Hiteshi Sharma
Swadheen Shukla
Xianmin Song
Masahiro Tanaka
Andrea Tupini
Xin Eric Wang
Lijuan Wang
Chunyu Wang
Yu Wang
Rachel A. Ward
Guanhua Wang
Philipp A. Witte
Haiping Wu
Michael Wyatt
Bin Xiao
Can Xu
Jiahang Xu
Weijian Xu
Sonali Yadav
Fan Yang
Jianwei Yang
Ziyi Yang
Yifan Yang
Donghan Yu
Lu Yuan
Cheng-Yuan Zhang
Cyril Zhang
Jianwen Zhang
Li Zhang
Yi Zhang
Yue Zhang
Yunan Zhang
Xiren Zhou
LRM
ALM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (257 upvotes)
Papers citing
"Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone"
50 / 963 papers shown
Title
A Comparative Study of Task Adaptation Techniques of Large Language Models for Identifying Sustainable Development Goals
IEEE Access (IEEE Access), 2025
Andrea Cadeddu
Alessandro Chessa
Vincenzo De Leo
Gianni Fenu
Enrico Motta
Francesco Osborne
Diego Reforgiato Recupero
Angelo Salatino
Luca Secchi
176
0
0
18 Jun 2025
GenRecal: Generation after Recalibration from Large to Small Vision-Language Models
Byung-Kwan Lee
Ryo Hachiuma
Yong Man Ro
Yu-Chun Wang
Yueh-Hua Wu
VLM
307
3
0
18 Jun 2025
SciVer: Evaluating Foundation Models for Multimodal Scientific Claim Verification
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Chengye Wang
Yifei Shen
Zexi Kuang
Arman Cohan
Yilun Zhao
188
1
0
18 Jun 2025
Demystifying the Visual Quality Paradox in Multimodal Large Language Models
Shuo Xing
Lanqing guo
Hongyuan Hua
Seoyoung Lee
Peiran Li
Yufei Wang
Zinan Lin
Zhengzhong Tu
VLM
256
1
0
18 Jun 2025
ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLM
Yujun Wang
Aniri
Jinhe Bi
Soeren Pirk
Yunpu Ma
MLLM
345
11
0
17 Jun 2025
PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning
Yizhen Zhang
Yang Ding
Shuoshuo Zhang
Xinchen Zhang
Haoling Li
...
Jie Wu
Lei Ji
Haoran Pan
Y. Yang
Yeyun Gong
OffRL
VLM
LRM
193
5
0
17 Jun 2025
Optimal Embedding Learning Rate in LLMs: The Effect of Vocabulary Size
Soufiane Hayou
Liyuan Liu
137
2
0
17 Jun 2025
Enhancing Goal-oriented Proactive Dialogue Systems via Consistency Reflection and Correction
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Didi Zhang
Yaxin Fan
Peifeng Li
Qiaoming Zhu
180
0
0
16 Jun 2025
Rethinking Explainability in the Era of Multimodal AI
Chirag Agarwal
211
1
0
16 Jun 2025
MotiveBench: How Far Are We From Human-Like Motivational Reasoning in Large Language Models?
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Xixian Yong
Jianxun Lian
Xiaoyuan Yi
Xiao Zhou
Xing Xie
LRM
203
0
0
16 Jun 2025
Align-then-Unlearn: Embedding Alignment for LLM Unlearning
Philipp Spohn
Leander Girrbach
Jessica Bader
Zeynep Akata
MU
205
0
0
16 Jun 2025
SPOT: Bridging Natural Language and Geospatial Search for Investigative Journalists
Lynn Khellaf
Ipek Baris Schlicht
Tilman Mirass
Julia Bayer
Tilman Wagner
Ruben Bouwmeester
90
1
0
16 Jun 2025
Assessing the Limits of In-Context Learning beyond Functions using Partially Ordered Relation
Debanjan Dutta
Faizanuddin Ansari
Swagatam Das
114
0
0
16 Jun 2025
PRISM2: Unlocking Multi-Modal General Pathology AI with Clinical Dialogue
Eugene Vorontsov
Eugene Vorontsov
Adam Casson
Julian Viret
Eric Zimmermann
...
Razik Yousfi
Nicolò Fusi
Thomas J. Fuchs
Kristen Severson
Siqi Liu
MedIm
LM&MA
173
5
0
16 Jun 2025
ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional Dependencies
Chenglin Wang
Yucheng Zhou
Qianning Wang
Zhe Wang
Kai Zhang
CoGe
154
10
0
15 Jun 2025
Jailbreak Transferability Emerges from Shared Representations
Rico Angell
Jannik Brinkmann
He He
296
1
0
15 Jun 2025
ConsistencyChecker: Tree-based Evaluation of LLM Generalization Capabilities
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Zhaochen Hong
Haofei Yu
Jiaxuan You
178
1
0
14 Jun 2025
OpenUnlearning: Accelerating LLM Unlearning via Unified Benchmarking of Methods and Metrics
Pratyush Maini
Anmol Mekala
Wenlong Zhao
Andrew McCallum
Zachary Chase Lipton
J. Zico Kolter
Pratyush Maini
MU
ELM
319
11
0
14 Jun 2025
MTabVQA: Evaluating Multi-Tabular Reasoning of Language Models in Visual Space
Anshul Singh
Chris Biemann
Jan Strich
LMTD
LRM
167
3
0
13 Jun 2025
Curriculum-Guided Layer Scaling for Language Model Pretraining
Karanpartap Singh
Neil Band
Ehsan Adeli
ALM
LRM
223
0
0
13 Jun 2025
Burn After Reading: Do Multimodal Large Language Models Truly Capture Order of Events in Image Sequences?
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yingjin Song
Yupei Du
Denis Paperno
Albert Gatt
MLLM
253
1
0
12 Jun 2025
VRBench: A Benchmark for Multi-Step Reasoning in Long Narrative Videos
Jiashuo Yu
Y. Wu
Meng Chu
Zhifei Ren
Z. Huang
...
Conghui He
Yu Qiao
Yali Wang
Yi Wang
L. Wang
LRM
431
4
0
12 Jun 2025
Discovering Hierarchical Latent Capabilities of Language Models via Causal Representation Learning
Jikai Jin
Vasilis Syrgkanis
Sham Kakade
Hanlin Zhang
ELM
315
0
0
12 Jun 2025
Dataset of News Articles with Provenance Metadata for Media Relevance Assessment
Tomas Peterka
Matyas Bohacek
165
0
0
11 Jun 2025
Query-Level Uncertainty in Large Language Models
Lihu Chen
Gerard de Melo
Fabian M. Suchanek
Gaël Varoquaux
367
1
0
11 Jun 2025
Scaling Laws for Uncertainty in Deep Learning
Mattia Rosso
Simone Rossi
Giulio Franzese
Markus Heinonen
Maurizio Filippone
BDL
UQCV
222
0
0
11 Jun 2025
Do LLMs Give Psychometrically Plausible Responses in Educational Assessments?
Workshop on Innovative Use of NLP for Building Educational Applications (UNBEA), 2025
Andreas Säuberli
Diego Frassinelli
Barbara Plank
AI4Ed
239
2
0
11 Jun 2025
Enhancing Reasoning Capabilities of Small Language Models with Blueprints and Prompt Template Search
Dongge Han
Menglin Xia
Daniel Madrigal Diaz
Samuel Kessler
Ankur Mallick
Xuchao Zhang
Mirian Hipolito Garcia
Jin Xu
Victor Rühle
Saravan Rajmohan
LRM
195
0
0
10 Jun 2025
Evaluating LLMs Across Multi-Cognitive Levels: From Medical Knowledge Mastery to Scenario-Based Problem Solving
Yuxuan Zhou
Xien Liu
Chenwei Yan
Chen Ning
X. Zhang
...
Xiangling Fu
Shijin Wang
Guoping Hu
Yu Wang
Ji Wu
ELM
205
1
0
10 Jun 2025
Beyond Bias Scores: Unmasking Vacuous Neutrality in Small Language Models
Sumanth Manduru
Carlotta Domeniconi
ALM
224
0
0
10 Jun 2025
Can AI Validate Science? Benchmarking LLMs for Accurate Scientific Claim
→
\rightarrow
→
Evidence Reasoning
Shashidhar Reddy Javaji
Yun Feng
Haohang Li
Yangyang Yu
Nikhil Muralidhar
Zining Zhu
ELM
159
1
0
09 Jun 2025
Instruction-Tuned Video-Audio Models Elucidate Functional Specialization in the Brain
R. Mamidi
Khushbu Pahwa
Prachi Jindal
Satya Sai Srinath Namburi
Maneesh Singh
Tanmoy Chakraborty
Bapi S. Raju
Subba Reddy Oota
141
0
0
09 Jun 2025
Synthetic Visual Genome
Computer Vision and Pattern Recognition (CVPR), 2025
J. S. Park
Zixian Ma
Linjie Li
Chenhao Zheng
Cheng-Yu Hsieh
...
Quan Kong
Norimasa Kobori
Ali Farhadi
Yejin Choi
Ranjay Krishna
188
1
0
09 Jun 2025
Snap, Segment, Deploy: A Visual Data and Detection Pipeline for Wearable Industrial Assistants
Di Wen
Junwei Zheng
R. Liu
Yi Xu
Kunyu Peng
Rainer Stiefelhagen
159
1
0
09 Jun 2025
WebUIBench: A Comprehensive Benchmark for Evaluating Multimodal Large Language Models in WebUI-to-Code
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Zhiyu Lin
Zhengda Zhou
Zhiyuan Zhao
Tianrui Wan
Yilun Ma
Junyu Gao
Xuelong Li
ELM
193
4
0
09 Jun 2025
A Neurosymbolic Agent System for Compositional Visual Reasoning
Yichang Xu
Gaowen Liu
Ramana Rao Kompella
Sihao Hu
Tiansheng Huang
Fatih Ilhan
Selim Furkan Tekin
Zachary Yahn
LRM
VLM
217
0
0
09 Jun 2025
EgoM2P: Egocentric Multimodal Multitask Pretraining
Gen Li
Yutong Chen
Yiqian Wu
Kaifeng Zhao
Marc Pollefeys
Siyu Tang
EgoV
VLM
395
4
0
09 Jun 2025
Chain of Methodologies: Scaling Test Time Computation without Training
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Cong Liu
Jie Wu
Weigang Wu
Xu Chen
Guanbin Li
Wei-Shi Zheng
LLMAG
LRM
AI4CE
193
1
0
08 Jun 2025
Theorem-of-Thought: A Multi-Agent Framework for Abductive, Deductive, and Inductive Reasoning in Language Models
Samir Abdaljalil
Hasan Kurban
K. Qaraqe
E. Serpedin
LM&Ro
LRM
163
2
0
08 Jun 2025
Dual-Priv Pruning : Efficient Differential Private Fine-Tuning in Multimodal Large Language Models
Qianshan Wei
Jiaqi Li
Zihan You
Yi Zhan
Kecen Li
...
Yi Yu
Bin Cao
Yiwen Xu
Wenshu Fan
Guilin Qi
AAML
VLM
151
1
0
08 Jun 2025
Vision-EKIPL: External Knowledge-Infused Policy Learning for Visual Reasoning
Chaoyang Wang
Zeyu Zhang
Meng Meng
Xu Zhou
Haiyun Jiang
OffRL
LRM
210
1
0
07 Jun 2025
Quantile Regression with Large Language Models for Price Prediction
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Nikhita Vedula
Dushyanta Dhyani
Laleh Jalali
Boris Oreshkin
Mohsen Bayati
S. Malmasi
131
3
0
07 Jun 2025
VisioMath: Benchmarking Figure-based Mathematical Reasoning in LMMs
Can Li
Ting Zhang
Ting Zhang
Mei Wang
Hua Huang
LRM
190
4
0
07 Jun 2025
The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity
Robotics (RAS), 2025
Parshin Shojaee
Iman Mirzadeh
Keivan Alizadeh
Maxwell Horton
Samy Bengio
Mehrdad Farajtabar
LRM
266
215
0
07 Jun 2025
Movie Facts and Fibs (MF
2
^2
2
): A Benchmark for Long Movie Understanding
Emmanouil Zaranis
António Farinhas
Saul Santos
Beatriz Canaverde
Miguel Moura Ramos
...
Raffaella Bernardi
Raquel Fernández
Sandro Pezzelle
Vlad Niculae
Andre F. T. Martins
227
3
0
06 Jun 2025
Token Signature: Predicting Chain-of-Thought Gains with Token Decoding Feature in Large Language Models
Peijie Liu
Fengli Xu
Yong Li
LRM
263
1
0
06 Jun 2025
A MISMATCHED Benchmark for Scientific Natural Language Inference
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Firoz Shaik
Mobashir Sadat
Nikita Gautam
Doina Caragea
Cornelia Caragea
182
0
0
05 Jun 2025
Structured Labeling Enables Faster Vision-Language Models for End-to-End Autonomous Driving
Hao Jiang
Chuan Hu
Yukang Shi
Yuan He
Ke Wang
X. Zhang
Zhipeng Zhang
3DV
VLM
172
1
0
05 Jun 2025
A Statistical Physics of Language Model Reasoning
Jack David Carson
Amir Reisizadeh
LRM
AI4CE
174
1
0
04 Jun 2025
AuthGuard: Generalizable Deepfake Detection via Language Guidance
Guangyu Shen
Zhihua Li
Xiang Xu
Tianchen Zhao
Zheng Zhang
Dongsheng An
Zhuowen Tu
Yifan Xing
Qin Zhang
188
1
0
04 Jun 2025
Previous
1
2
3
4
5
6
...
18
19
20
Next