2404.14219
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
22 April 2024
Marah Abdin
Sam Ade Jacobs
A. A. Awan
J. Aneja
Ahmed Hassan Awadallah
Hany Awadalla
Nguyen Bach
Amit Bahree
Arash Bakhtiari
Jianmin Bao
Harkirat Singh Behl
Alon Benhaim
Misha Bilenko
Johan Bjorck
Sébastien Bubeck
Qin Cai
Martin Cai
C. C. T. Mendes
Weizhu Chen
Vishrav Chaudhary
Dong Chen
DongDong Chen
Yen-Chun Chen
Yi-Ling Chen
Parul Chopra
Xiyang Dai
Allison Del Giorno
Gustavo de Rosa
Matthew Dixon
Ronen Eldan
Victor Fragoso
Dan Iter
Mei Gao
Min Gao
Jianfeng Gao
Amit Garg
Abhishek Goswami
Suriya Gunasekar
Emman Haider
Junheng Hao
Russell J. Hewett
Jamie Huynh
Mojan Javaheripi
Xin Jin
Piero Kauffmann
Nikos Karampatziakis
Dongwoo Kim
Mahmoud Khademi
Lev Kurilenko
James R. Lee
Yin Tat Lee
Yuanzhi Li
Yunsheng Li
Chen Liang
Lars Liden
Ce Liu
Mengchen Liu
Weishung Liu
Eric Lin
Zeqi Lin
Chong Luo
Piyush Madan
Matt Mazzola
Arindam Mitra
Hardik Modi
Anh Nguyen
Brandon Norick
Barun Patra
Daniel Perez-Becker
Thomas Portet
Reid Pryzant
Heyang Qin
Marko Radmilac
Corby Rosset
Sambudha Roy
Olatunji Ruwase
Olli Saarikivi
Amin Saied
Adil Salim
Michael Santacroce
Shital Shah
Ning Shang
Hiteshi Sharma
Swadheen Shukla
Xianmin Song
Masahiro Tanaka
Andrea Tupini
Xin Eric Wang
Lijuan Wang
Chunyu Wang
Yu Wang
Rachel A. Ward
Guanhua Wang
Philipp A. Witte
Haiping Wu
Michael Wyatt
Bin Xiao
Can Xu
Jiahang Xu
Weijian Xu
Sonali Yadav
Fan Yang
Jianwei Yang
Ziyi Yang
Yifan Yang
Donghan Yu
Lu Yuan
Cheng-Yuan Zhang
Cyril Zhang
Jianwen Zhang
Li Zhang
Yi Zhang
Yue Zhang
Yunan Zhang
Xiren Zhou
LRM
ALM
HuggingFace (257 upvotes)
Papers citing
"Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone"
50 / 965 papers shown
MMCOMPOSITION: Revisiting the Compositionality of Pre-trained Vision-Language Models
Hang Hua
Yunlong Tang
Ziyun Zeng
Liangliang Cao
Zhengyuan Yang
Hangfeng He
Chenliang Xu
Jiebo Luo
VLM
CoGe
235
22
0
13 Oct 2024
ChroKnowledge: Unveiling Chronological Knowledge of Language Models in Multiple Domains
International Conference on Learning Representations (ICLR), 2024
Yein Park
Chanwoong Yoon
Jungwoo Park
Donghyeon Lee
Minbyul Jeong
Jaewoo Kang
KELM
478
3
0
13 Oct 2024
VERITAS-NLI : Validation and Extraction of Reliable Information Through Automated Scraping and Natural Language Inference
Arjun Shah
Hetansh Shah
Vedica Bafna
Charmi Khandor
Sindhu Nair
147
1
0
12 Oct 2024
CAMPHOR: Collaborative Agents for Multi-input Planning and High-Order Reasoning On Device
Yicheng Fu
R. Anantha
Jianpeng Cheng
LRM
LLMAG
216
6
0
12 Oct 2024
Fine-grained Attention I/O Complexity: Comprehensive Analysis for Backward Passes
Xiaoyu Li
Yingyu Liang
Zhenmei Shi
Zhao Song
Yufa Zhou
250
19
0
12 Oct 2024
FB-Bench: A Fine-Grained Multi-Task Benchmark for Evaluating LLMs' Responsiveness to Human Feedback
Yongbin Li
Miao Zheng
Fan Yang
Bin Cui
Tengjiao Wang
Xin Wu
Guosheng Dong
Wentao Zhang
ALM
337
10
0
12 Oct 2024
MedMobile: A mobile-sized language model with clinical capabilities
Krithik Vishwanath
Jaden Stryker
Anton Alaykin
Daniel Alexander Alber
E. Oermann
LM&MA
MedIm
LRM
463
2
0
11 Oct 2024
Autonomous Evaluation of LLMs for Truth Maintenance and Reasoning Tasks
International Conference on Learning Representations (ICLR), 2024
Rushang Karia
Daniel Bramblett
D. Dobhal
Siddharth Srivastava
ELM
LRM
324
2
0
11 Oct 2024
KV Prediction for Improved Time to First Token
Maxwell Horton
Qingqing Cao
Chenfan Sun
Yanzi Jin
Sachin Mehta
Mohammad Rastegari
Moin Nabi
AI4TS
240
7
0
10 Oct 2024
News Reporter: A Multi-lingual LLM Framework for Broadcast T.V News
Tarun Jain
Yufei Gao
Sridhar Vanga
Karan Singla
185
1
0
10 Oct 2024
COMPL-AI Framework: A Technical Interpretation and LLM Benchmarking Suite for the EU Artificial Intelligence Act
Philipp Guldimann
Alexander Spiridonov
Robin Staab
Nikola Jovanović
Mark Vero
...
Mislav Balunović
Nikola Konstantinov
Pavol Bielik
Petar Tsankov
Martin Vechev
ELM
353
20
0
10 Oct 2024
SEAL: Safety-enhanced Aligned LLM Fine-tuning via Bilevel Data Selection
International Conference on Learning Representations (ICLR), 2024
Han Shen
Pin-Yu Chen
Payel Das
Tianyi Chen
ALM
287
50
0
09 Oct 2024
TextLap: Customizing Language Models for Text-to-Layout Planning
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Jian Chen
Ruiyi Zhang
Jiuxiang Gu
Jennifer Healey
J. Gu
Zhiqiang Xu
Chong Chen
VLM
271
5
0
09 Oct 2024
Exploring the Readiness of Prominent Small Language Models for the Democratization of Financial Literacy
Tagore Rao Kosireddy
Jeffrey D. Wall
Evan Lucas
220
3
0
09 Oct 2024
Unleashing Multi-Hop Reasoning Potential in Large Language Models through Repetition of Misordered Context
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Sangwon Yu
Ik-hwan Kim
Jongyoon Song
Saehyung Lee
Junsung Park
Sungroh Yoon
LRM
352
4
0
09 Oct 2024
TorchTitan: One-stop PyTorch native solution for production ready LLM pre-training
International Conference on Learning Representations (ICLR), 2024
Wanchao Liang
Tianyu Liu
Less Wright
Will Constable
Andrew Gu
...
Howard Huang
Junjie Wang
Sanket Purandare
Gokul Nadathur
Stratos Idreos
OffRL
335
52
0
09 Oct 2024
Context-Aware Command Understanding for Tabletop Scenarios
Paul Gajewski
Antonio Galiza Cerdeira Gonzalez
B. Indurkhya
LM&Ro
71
0
0
08 Oct 2024
QERA: an Analytical Framework for Quantization Error Reconstruction
Cheng Zhang
Jeffrey T. H. Wong
Can Xiao
George A. Constantinides
Yiren Zhao
MQ
198
8
0
08 Oct 2024
DecorateLM: Data Engineering through Corpus Rating, Tagging, and Editing with Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Ranchi Zhao
Zhen Leng Thai
Yifan Zhang
Shengding Hu
Yunqi Ba
Jie Zhou
Jie Cai
Zhiyuan Liu
Maosong Sun
237
4
0
08 Oct 2024
R-Bench: Are your Large Multimodal Model Robust to Real-world Corruptions?
Chunyi Li
Junxuan Zhang
Zicheng Zhang
H. Wu
Yuan Tian
...
Guo Lu
Xiaohong Liu
Xiongkuo Min
Weisi Lin
Guangtao Zhai
AAML
181
14
0
07 Oct 2024
Precise Model Benchmarking with Only a Few Observations
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Riccardo Fogliato
Pratik Patil
Nil-Jana Akpinar
Mathew Monfort
208
1
0
07 Oct 2024
ACDC: Autoregressive Coherent Multimodal Generation using Diffusion Correction
Hyungjin Chung
Dohun Lee
Jong Chul Ye
VGen
DiffM
195
2
0
07 Oct 2024
GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models
International Conference on Learning Representations (ICLR), 2024
Iman Mirzadeh
Keivan Alizadeh
Hooman Shahrokhi
Oncel Tuzel
Samy Bengio
Mehrdad Farajtabar
AIMat
LRM
493
410
0
07 Oct 2024
ErrorRadar: Benchmarking Complex Mathematical Reasoning of Multimodal Large Language Models Via Error Detection
Yibo Yan
Shen Wang
Jiahao Huo
Hang Li
Yangqiu Song
...
Kun Wang
Hui Xiong
Philip S. Yu
Xuming Hu
Qingsong Wen
LRM
291
33
0
06 Oct 2024
CiMaTe: Citation Count Prediction Effectively Leveraging the Main Text
Jun Hirako
Ryohei Sasano
Koichi Takeda
334
4
0
06 Oct 2024
DiDOTS: Knowledge Distillation from Large-Language-Models for Dementia Obfuscation in Transcribed Speech
Proceedings on Privacy Enhancing Technologies (PoPETs), 2024
Dominika Woszczyk
Soteris Demetriou
298
3
0
05 Oct 2024
Improving LLM Reasoning through Scaling Inference Computation with Collaborative Verification
Zhenwen Liang
Ye Liu
Tong Niu
Xiangliang Zhang
Yingbo Zhou
Semih Yavuz
LRM
247
35
0
05 Oct 2024
Gamified crowd-sourcing of high-quality data for visual fine-tuning
Shashank Yadav
Rohan Tomar
Garvit Jain
Chirag Ahooja
Shubham Chaudhary
Charles Elkan
299
1
0
05 Oct 2024
ASPIRER: Bypassing System Prompts With Permutation-based Backdoors in LLMs
Lu Yan
Siyuan Cheng
Xuan Chen
Kaiyuan Zhang
Guangyu Shen
Zhuo Zhang
Xiangyu Zhang
AAML
SILM
245
1
0
05 Oct 2024
Towards a Benchmark for Large Language Models for Business Process Management Tasks
Proceedings of the Annual Hawaii International Conference on System Sciences (HICSS), 2024
Kiran Busch
Henrik Leopold
221
4
0
04 Oct 2024
Scaling Parameter-Constrained Language Models with Quality Data
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Ernie Chang
Matteo Paltenghi
Yang Li
Pin-Jie Lin
Changsheng Zhao
Patrick Huber
Zechun Liu
Rastislav Rabatin
Yangyang Shi
Vikas Chandra
234
10
0
04 Oct 2024
L-CiteEval: Do Long-Context Models Truly Leverage Context for Responding?
Zecheng Tang
Keyan Zhou
Juntao Li
Baibei Ji
Jianye Hou
Min Zhang
266
7
0
03 Oct 2024
Jailbreak Antidote: Runtime Safety-Utility Balance via Sparse Representation Adjustment in Large Language Models
International Conference on Learning Representations (ICLR), 2024
Guobin Shen
Dongcheng Zhao
Yiting Dong
Xiang He
Yi Zeng
AAML
336
11
0
03 Oct 2024
How to Train Long-Context Language Models (Effectively)
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Tianyu Gao
Alexander Wettig
Howard Yen
Danqi Chen
RALM
664
90
0
03 Oct 2024
Training Language Models on Synthetic Edit Sequences Improves Code Synthesis
International Conference on Learning Representations (ICLR), 2024
Ulyana Piterbarg
Lerrel Pinto
Rob Fergus
SyDa
444
7
0
03 Oct 2024
LLaVA-Critic: Learning to Evaluate Multimodal Models
Computer Vision and Pattern Recognition (CVPR), 2024
Tianyi Xiong
Xinze Wang
Dong Guo
Qinghao Ye
Haoqi Fan
Quanquan Gu
Heng Huang
Chunyuan Li
MLLM
VLM
LRM
350
94
0
03 Oct 2024
Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads on Consumer-Grade Devices
Yuxiang Huang
Binhang Yuan
Xu Han
Chaojun Xiao
Zhiyuan Liu
RALM
469
11
0
02 Oct 2024
FactAlign: Long-form Factuality Alignment of Large Language Models
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Chao-Wei Huang
Yun-Nung Chen
HILM
146
11
0
02 Oct 2024
InfiniPot: Infinite Context Processing on Memory-Constrained LLMs
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Minsoo Kim
Kyuhong Shim
Jungwook Choi
Simyung Chang
313
16
0
02 Oct 2024
House of Cards: Massive Weights in LLMs
Jaehoon Oh
Seungjun Shin
Dokwan Oh
348
3
0
02 Oct 2024
Reasoning Elicitation in Language Models via Counterfactual Feedback
International Conference on Learning Representations (ICLR), 2024
Alihan Hüyük
Xinnuo Xu
Jacqueline R. M. A. Maasch
Aditya V. Nori
Javier González
ReLM
LRM
901
7
0
02 Oct 2024
Disentangling Latent Shifts of In-Context Learning with Weak Supervision
Josip Jukić
Jan Snajder
297
1
0
02 Oct 2024
Mixing It Up: The Cocktail Effect of Multi-Task Fine-Tuning on LLM Performance -- A Case Study in Finance
Meni Brief
Oded Ovadia
Gil Shenderovitz
Noga Ben Yoash
Rachel Lemberg
Eitam Sheetrit
253
12
0
01 Oct 2024
PyRIT: A Framework for Security Risk Identification and Red Teaming in Generative AI System
Gary D. Lopez Munoz
Amanda Minnich
Roman Lutz
Richard Lundeen
Raja Sekhar Rao Dheekonda
...
Tori Westerhoff
Chang Kawaguchi
Christian Seifert
Ram Shankar Siva Kumar
Yonatan Zunger
SILM
220
14
0
01 Oct 2024
On the Implications of Verbose LLM Outputs: A Case Study in Translation Evaluation
Eleftheria Briakou
Zhongtao Liu
Colin Cherry
Markus Freitag
122
17
0
01 Oct 2024
VLMGuard: Defending VLMs against Malicious Prompts via Unlabeled Data
Xuefeng Du
Reshmi Ghosh
Robert Sim
Ahmed Salem
Vitor Carvalho
Emily Lawton
Yixuan Li
Jack W. Stokes
VLM
AAML
222
16
0
01 Oct 2024
MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning
Haotian Zhang
Mingfei Gao
Zhe Gan
Philipp Dufter
Nina Wenzel
...
Haoxuan You
Zirui Wang
Afshin Dehghan
Peter Grasch
Yinfei Yang
VLM
MLLM
303
66
1
30 Sep 2024
ACE: All-round Creator and Editor Following Instructions via Diffusion Transformer
International Conference on Learning Representations (ICLR), 2024
Zhen Han
Zeyinzi Jiang
Yulin Pan
Jingfeng Zhang
Chaojie Mao
Chenwei Xie
Yu Liu
Jingren Zhou
DiffM
358
42
0
30 Sep 2024
FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows"
International Conference on Learning Representations (ICLR), 2024
Yifei Ming
Senthil Purushwalkam
Shrey Pandit
Zixuan Ke
Xuan-Phi Nguyen
Caiming Xiong
Shafiq Joty
HILM
629
44
0
30 Sep 2024
One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos
Neural Information Processing Systems (NeurIPS), 2024
Zechen Bai
Tong He
Haiyang Mei
Pichao Wang
Ziteng Gao
Joya Chen
Lei Liu
Zheng Zhang
Mike Zheng Shou
VLM
VOS
MLLM
251
74
0
29 Sep 2024