Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

22 April 2024

Ahmed Hassan Awadallah

Jianmin Bao

Xin Jin

Yunsheng Li

Fan Yang

Jianwei Yang

Lu Yuan

Yue Zhang

Papers citing "Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone"

50 / 965 papers shown

MMCOMPOSITION: Revisiting the Compositionality of Pre-trained Vision-Language Models

Chenliang Xu

235

13 Oct 2024

ChroKnowledge: Unveiling Chronological Knowledge of Language Models in Multiple Domains

International Conference on Learning Representations (ICLR), 2024

478

13 Oct 2024

VERITAS-NLI : Validation and Extraction of Reliable Information Through Automated Scraping and Natural Language Inference

147

12 Oct 2024

CAMPHOR: Collaborative Agents for Multi-input Planning and High-Order Reasoning On Device

216

12 Oct 2024

Fine-grained Attention I/O Complexity: Comprehensive Analysis for Backward Passes

250

12 Oct 2024

FB-Bench: A Fine-Grained Multi-Task Benchmark for Evaluating LLMs' Responsiveness to Human Feedback

337

12 Oct 2024

MedMobile: A mobile-sized language model with clinical capabilities

Krithik Vishwanath

Jaden Stryker

Anton Alaykin

Daniel Alexander Alber

E. Oermann

LM&MA MedIm LRM

463

11 Oct 2024

Autonomous Evaluation of LLMs for Truth Maintenance and Reasoning Tasks

International Conference on Learning Representations (ICLR), 2024

324

11 Oct 2024

KV Prediction for Improved Time to First Token

240

10 Oct 2024

News Reporter: A Multi-lingual LLM Framework for Broadcast T.V News

185

10 Oct 2024

COMPL-AI Framework: A Technical Interpretation and LLM Benchmarking Suite for the EU Artificial Intelligence Act

...

Martin Vechev

353

10 Oct 2024

SEAL: Safety-enhanced Aligned LLM Fine-tuning via Bilevel Data Selection

International Conference on Learning Representations (ICLR), 2024

Pin-Yu Chen

287

09 Oct 2024

TextLap: Customizing Language Models for Text-to-Layout Planning

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Ruiyi Zhang

271

09 Oct 2024

Exploring the Readiness of Prominent Small Language Models for the Democratization of Financial Literacy

Tagore Rao Kosireddy

Jeffrey D. Wall

Evan Lucas

220

09 Oct 2024

Unleashing Multi-Hop Reasoning Potential in Large Language Models through Repetition of Misordered Context

North American Chapter of the Association for Computational Linguistics (NAACL), 2024

352

09 Oct 2024

TorchTitan: One-stop PyTorch native solution for production ready LLM pre-training

International Conference on Learning Representations (ICLR), 2024

...

335

09 Oct 2024

Context-Aware Command Understanding for Tabletop Scenarios

Paul Gajewski

Antonio Galiza Cerdeira Gonzalez

B. Indurkhya

LM&Ro

08 Oct 2024

QERA: an Analytical Framework for Quantization Error Reconstruction

Cheng Zhang

Jeffrey T. H. Wong

Can Xiao

George A. Constantinides

Yiren Zhao

198

08 Oct 2024

DecorateLM: Data Engineering through Corpus Rating, Tagging, and Editing with Language Models

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Yifan Zhang

Zhiyuan Liu

Maosong Sun

237

08 Oct 2024

R-Bench: Are your Large Multimodal Model Robust to Real-world Corruptions?

Chunyi Li

Junxuan Zhang

Zicheng Zhang

H. Wu

Yuan Tian

...

Guo Lu

Xiaohong Liu

Xiongkuo Min

Weisi Lin

Guangtao Zhai

AAML

181

07 Oct 2024

Precise Model Benchmarking with Only a Few Observations

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Riccardo Fogliato

Pratik Patil

Nil-Jana Akpinar

Mathew Monfort

208

07 Oct 2024

ACDC: Autoregressive Coherent Multimodal Generation using Diffusion Correction

Hyungjin Chung

Dohun Lee

Jong Chul Ye

VGen DiffM

195

07 Oct 2024

GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models

International Conference on Learning Representations (ICLR), 2024

493

410

07 Oct 2024

ErrorRadar: Benchmarking Complex Mathematical Reasoning of Multimodal Large Language Models Via Error Detection

Hang Li

...

Kun Wang

Hui Xiong

Philip S. Yu

Xuming Hu

Qingsong Wen

LRM

291

06 Oct 2024

CiMaTe: Citation Count Prediction Effectively Leveraging the Main Text

Jun Hirako

Ryohei Sasano

Koichi Takeda

334

06 Oct 2024

DiDOTS: Knowledge Distillation from Large-Language-Models for Dementia Obfuscation in Transcribed Speech

Proceedings on Privacy Enhancing Technologies (PoPETs), 2024

Dominika Woszczyk

Soteris Demetriou

298

05 Oct 2024

Improving LLM Reasoning through Scaling Inference Computation with Collaborative Verification

Ye Liu

Yingbo Zhou

247

05 Oct 2024

Gamified crowd-sourcing of high-quality data for visual fine-tuning

299

05 Oct 2024

ASPIRER: Bypassing System Prompts With Permutation-based Backdoors in LLMs

Xuan Chen

Xiangyu Zhang

245

05 Oct 2024

Towards a Benchmark for Large Language Models for Business Process Management Tasks

Proceedings of the Annual Hawaii International Conference on System Sciences (HICSS), 2024

Kiran Busch

Henrik Leopold

221

04 Oct 2024

Scaling Parameter-Constrained Language Models with Quality Data

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Ernie Chang

234

04 Oct 2024

L-CiteEval: Do Long-Context Models Truly Leverage Context for Responding?

Juntao Li

Min Zhang

266

03 Oct 2024

Jailbreak Antidote: Runtime Safety-Utility Balance via Sparse Representation Adjustment in Large Language Models

International Conference on Learning Representations (ICLR), 2024

Yi Zeng

336

03 Oct 2024

How to Train Long-Context Language Models (Effectively)

Annual Meeting of the Association for Computational Linguistics (ACL), 2024

664

03 Oct 2024

Training Language Models on Synthetic Edit Sequences Improves Code Synthesis

International Conference on Learning Representations (ICLR), 2024

Ulyana Piterbarg

Lerrel Pinto

Rob Fergus

SyDa

444

03 Oct 2024

LLaVA-Critic: Learning to Evaluate Multimodal Models

Computer Vision and Pattern Recognition (CVPR), 2024

Dong Guo

Heng Huang

Chunyuan Li

MLLM VLM LRM

350

03 Oct 2024

Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads on Consumer-Grade Devices

Yuxiang Huang

Binhang Yuan

Xu Han

Chaojun Xiao

Zhiyuan Liu

RALM

469

02 Oct 2024

FactAlign: Long-form Factuality Alignment of Large Language Models

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Chao-Wei Huang

Yun-Nung Chen

HILM

146

02 Oct 2024

InfiniPot: Infinite Context Processing on Memory-Constrained LLMs

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

313

02 Oct 2024

House of Cards: Massive Weights in LLMs

Jaehoon Oh

Seungjun Shin

Dokwan Oh

348

02 Oct 2024

Reasoning Elicitation in Language Models via Counterfactual Feedback

International Conference on Learning Representations (ICLR), 2024

Alihan Hüyük

Xinnuo Xu

Jacqueline R. M. A. Maasch

Aditya V. Nori

Javier González

ReLM LRM

901

02 Oct 2024

Disentangling Latent Shifts of In-Context Learning with Weak Supervision

Josip Jukić

Jan Snajder

297

02 Oct 2024

Mixing It Up: The Cocktail Effect of Multi-Task Fine-Tuning on LLM Performance -- A Case Study in Finance

253

01 Oct 2024

PyRIT: A Framework for Security Risk Identification and Red Teaming in Generative AI System

Raja Sekhar Rao Dheekonda

...

Tori Westerhoff

Chang Kawaguchi

Christian Seifert

Ram Shankar Siva Kumar

Yonatan Zunger

SILM

220

01 Oct 2024

On the Implications of Verbose LLM Outputs: A Case Study in Translation Evaluation

122

01 Oct 2024

VLMGuard: Defending VLMs against Malicious Prompts via Unlabeled Data

Ahmed Salem

Yixuan Li

222

01 Oct 2024

MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning

Haotian Zhang

Mingfei Gao

...

Zirui Wang

Yinfei Yang

303

30 Sep 2024

ACE: All-round Creator and Editor Following Instructions via Diffusion Transformer

International Conference on Learning Representations (ICLR), 2024

Zhen Han

Zeyinzi Jiang

Yulin Pan

Jingfeng Zhang

Chaojie Mao

Chenwei Xie

Yu Liu

Jingren Zhou

DiffM

358

30 Sep 2024

FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows"

International Conference on Learning Representations (ICLR), 2024

629

30 Sep 2024

One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos

Neural Information Processing Systems (NeurIPS), 2024

Tong He

Joya Chen

Zheng Zhang

Mike Zheng Shou

VLM VOS MLLM

251

29 Sep 2024