v1v2v3 (latest)

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

22 April 2024

Ahmed Hassan Awadallah

Jianmin Bao

Xin Jin

Yunsheng Li

Fan Yang

Jianwei Yang

Lu Yuan

Yue Zhang

ArXiv (abs)PDF HTML HuggingFace (257 upvotes)

Papers citing "Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone"

50 / 966 papers shown

LOGO -- Long cOntext aliGnment via efficient preference Optimization

Zecheng Tang

Zechen Sun

Juntao Li

Qiaoming Zhu

Min Zhang

206

24 Oct 2024

High-dimensional Analysis of Knowledge Distillation: Weak-to-Strong Generalization and Scaling LawsInternational Conference on Learning Representations (ICLR), 2024

M. E. Ildiz

Halil Alperen Gozeten

Ege Onur Taga

Marco Mondelli

Samet Oymak

501

24 Oct 2024

ELAICHI: Enhancing Low-resource TTS by Addressing Infrequent and Low-frequency Character Bigrams

Srija Anand

Praveen Srinivasa Varadhan

Mehak Singal

Mitesh M. Khapra

179

23 Oct 2024

CLR-Bench: Evaluating Large Language Models in College-level Reasoning

175

23 Oct 2024

Captions Speak Louder than Images: Generalizing Foundation Models for E-commerce from High-quality Multimodal Instruction Data

336

22 Oct 2024

JMMMU: A Japanese Massive Multi-discipline Multimodal Understanding Benchmark for Culture-aware EvaluationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024

Graham Neubig

699

22 Oct 2024

ETHIC: Evaluating Large Language Models on Long-Context Tasks with High Information CoverageNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024

349

22 Oct 2024

Teach Multimodal LLMs to Comprehend Electrocardiographic Images

Ruoqi Liu

Yuelin Bai

Xiang Yue

Ping Zhang

135

21 Oct 2024

Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance

Zhe Chen

...

403

21 Oct 2024

Augmenting Legal Decision Support Systems with LLM-based NLI for Analyzing Social Media Evidence

Ram Mohan Rao Kadiyala

Siddartha Pullakhandam

136

21 Oct 2024

Do Large Language Models Have an English Accent? Evaluating and Improving the Naturalness of Multilingual LLMsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

336

21 Oct 2024

BRIEF: Bridging Retrieval and Inference for Multi-hop Reasoning via CompressionNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024

318

20 Oct 2024

A Comprehensive Evaluation of Cognitive Biases in LLMs

332

20 Oct 2024

Understanding Forgetting in LLM Supervised Fine-Tuning and Preference Learning - A Convex Optimization Perspective

480

20 Oct 2024

Large Language Models Are Overparameterized Text EncodersWorkshop on Representation Learning for NLP (RepL4NLP), 2024

Thennal D K

Tim Fischer

Chris Biemann

218

18 Oct 2024

Tell me what I need to know: Exploring LLM-based (Personalized) Abstractive Multi-Source Meeting SummarizationConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

136

18 Oct 2024

TimeSeriesExam: A time series understanding exam

204

18 Oct 2024

LLM The Genius Paradox: A Linguistic and Math Expert's Struggle with Simple Word-based Counting Problems

Nan Xu

Xuezhe Ma

LRM

394

18 Oct 2024

Do LLMs estimate uncertainty well in instruction-following?International Conference on Learning Representations (ICLR), 2024

Juyeon Heo

Miao Xiong

Christina Heinze-Deml

Jaya Narain

ELM

386

18 Oct 2024

NaturalBench: Evaluating Vision-Language Models on Natural Adversarial SamplesNeural Information Processing Systems (NeurIPS), 2024

658

18 Oct 2024

EvoPress: Accurate Dynamic Model Compression via Evolutionary Search

416

18 Oct 2024

Do LLMs "know" internally when they follow instructions?International Conference on Learning Representations (ICLR), 2024

Juyeon Heo

Christina Heinze-Deml

Jaya Narain

408

18 Oct 2024

γ-

MoD: Exploring Mixture-of-Depth Adaptation for Multimodal Large Language Models

279

17 Oct 2024

BenTo: Benchmark Task Reduction with In-Context Transferability

Hongyu Zhao

Ming Li

Lichao Sun

Tianyi Zhou

298

17 Oct 2024

Large Language Models as Narrative-Driven RecommendersThe Web Conference (WWW), 2024

261

17 Oct 2024

Breaking Chains: Unraveling the Links in Multi-Hop Knowledge Unlearning

164

17 Oct 2024

Trust but Verify: Programmatic VLM Evaluation in the Wild

169

17 Oct 2024

BQA: Body Language Question Answering Dataset for Video Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

429

17 Oct 2024

MIRAGE-Bench: Automatic Multilingual Benchmark Arena for Retrieval-Augmented Generation SystemsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024

Nandan Thakur

Suleman Kazi

Ge Luo

Jimmy J. Lin

Amin Ahmad

VLM RALM

465

17 Oct 2024

Understanding the Role of LLMs in Multimodal Evaluation Benchmarks

Zhaowei Li

211

16 Oct 2024

Table-LLM-Specialist: Language Model Specialists for Tables using Iterative Generator-Validator Fine-tuning

194

16 Oct 2024

WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global CuisinesNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024

Genta Indra Winata

Frederikus Hudi

Patrick Amadeus Irawan

...

500

16 Oct 2024

Enabling Data-Driven and Empathetic Interactions: A Context-Aware 3D Virtual Agent in Mixed Reality for Enhanced Financial Customer Experience

111

15 Oct 2024

Scaling Laws for Multilingual Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

230

15 Oct 2024

DISP-LLM: Dimension-Independent Structural Pruning for Large Language ModelsNeural Information Processing Systems (NeurIPS), 2024

245

15 Oct 2024

BSM: Small but Powerful Biological Sequence Model for Genes and Proteins

118

15 Oct 2024

Towards More Effective Table-to-Text Generation: Assessing In-Context Learning and Self-Evaluation with Open-Source Models

Sahar Iravani

Tim . O . F Conrad

LMTD

275

15 Oct 2024

Survey and Evaluation of Converging Architecture in LLMs based on Footsteps of OperationsIEEE Open Journal of the Computer Society (JOCS), 2024

162

15 Oct 2024

Latent Action Pretraining from VideosInternational Conference on Learning Representations (ICLR), 2024

...

441

145

15 Oct 2024

SHAKTI: A 2.5 Billion Parameter Small Language Model Optimized for Edge AI and Low-Resource EnvironmentsArtificial Intelligence Applications and Innovations (AIAI), 2024

Syed Abdul Gaffar Shakhadri

Kruthika KR

Rakshit Aralimatti

VLM

191

15 Oct 2024

PAVLM: Advancing Point Cloud based Affordance Understanding Via Vision-Language Model

298

15 Oct 2024

SlideChat: A Large Vision-Language Assistant for Whole-Slide Pathology Image UnderstandingComputer Vision and Pattern Recognition (CVPR), 2024

371

15 Oct 2024

Measuring Spiritual Values and Bias of Large Language Models

151

15 Oct 2024

Liger Kernel: Efficient Triton Kernels for LLM Training

492

14 Oct 2024

When Does Perceptual Alignment Benefit Vision Representations?Neural Information Processing Systems (NeurIPS), 2024

284

14 Oct 2024

HART: Efficient Visual Generation with Hybrid Autoregressive TransformerInternational Conference on Learning Representations (ICLR), 2024

Enze Xie

Han Cai

404

105

14 Oct 2024

HSR-Enhanced Sparse Attention Acceleration

818

14 Oct 2024

Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family ExpertsInternational Conference on Learning Representations (ICLR), 2024

Xidong Wang

312

14 Oct 2024

Assessing Dialect Fairness and Robustness of Large Language Models in Reasoning TasksAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

521

14 Oct 2024

3DArticCyclists: Generating Synthetic Articulated 8D Pose-Controllable Cyclist Data for Computer Vision Applications

Eduardo R. Corral-Soto

458

14 Oct 2024