v1v2v3 (latest)

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

22 April 2024

Ahmed Hassan Awadallah

Jianmin Bao

Xin Jin

Yunsheng Li

Fan Yang

Jianwei Yang

Lu Yuan

Yue Zhang

ArXiv (abs)PDF HTML HuggingFace (257 upvotes)

Papers citing "Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone"

50 / 965 papers shown

OpenSep: Leveraging Large Language Models with Textual Inversion for Open World Audio SeparationConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Tanvir Mahmud

Diana Marculescu

VLM

206

28 Sep 2024

SciDoc2Diagrammer-MAF: Towards Generation of Scientific Diagrams from Documents guided by Multi-Aspect Feedback RefinementConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Ishani Mondal

Zongxia Li

Yufang Hou

Anandhavelu Natarajan

Aparna Garimella

Jordan Boyd-Graber

229

28 Sep 2024

Data Analysis in the Era of Generative AI

Chenglong Wang

234

27 Sep 2024

FoodMLLM-JP: Leveraging Multimodal Large Language Models for Japanese Recipe GenerationConference on Multimedia Modeling (MMM), 2024

Yuki Imajuku

Yoko Yamakata

Kiyoharu Aizawa

238

27 Sep 2024

E.T. Bench: Towards Open-Ended Event-Level Video-Language UnderstandingNeural Information Processing Systems (NeurIPS), 2024

Ye Liu

Zongyang Ma

Chen Ma

Yang Wu

Ying Shan

Chang Wen Chen

267

26 Sep 2024

Search and Detect: Training-Free Long Tail Object Detection via Web-Image RetrievalComputer Vision and Pattern Recognition (CVPR), 2024

Revanth Gangi Reddy

Heng Ji

ObjD VLM

183

26 Sep 2024

Discovering the Gems in Early Layers: Accelerating Long-Context LLMs with 1000x Input Token Reduction

Zhenmei Shi

Yifei Ming

Xuan-Phi Nguyen

Yingyu Liang

Shafiq Joty

254

25 Sep 2024

PACE: Marrying generalization in PArameter-efficient fine-tuning with Consistency rEgularizationNeural Information Processing Systems (NeurIPS), 2024

Yao Ni

Shan Zhang

Piotr Koniusz

1.1K

25 Sep 2024

SynChart: Synthesizing Charts from Language Models

Mengchen Liu

Qixiu Li

Dongdong Chen

Dong Chen

Jianmin Bao

Yunsheng Li

MLLM

121

25 Sep 2024

A Comprehensive Evaluation of Large Language Models on Mental Illnesses

291

24 Sep 2024

Archon: An Architecture Search Framework for Inference-Time Techniques

Jon Saad-Falcon

Adrian Gamarra Lafuente

...

Mayee Chen

451

23 Sep 2024

Domino: Eliminating Communication in LLM Training via Generic Tensor Slicing and Overlapping

Ang Li

173

23 Sep 2024

Phantom of Latent for Large Language and Vision Models

Yong Man Ro

271

23 Sep 2024

Instruction Tuning Vs. In-Context Learning: Revisiting Large Language Models in Few-Shot Computational Social Science

197

23 Sep 2024

MobileViews: A Large-Scale Mobile GUI Dataset

Longxi Gao

Li Zhang

Shihe Wang

Shangguang Wang

Yuanchun Li

Mengwei Xu

207

22 Sep 2024

Normalized Narrow Jump To Conclusions: Normalized Narrow Shortcuts for Parameter Efficient Early Exit Transformer PredictionConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Amrit Diggavi Seshadri

160

21 Sep 2024

Enhancing Logical Reasoning in Large Language Models through Graph-based Synthetic Data

Jiaming Zhou

Liheng Ma

Soumyasundar Pal

Bin Wang

Yingxue Zhang

Jianye Hao

ReLM LRM

309

19 Sep 2024

CLAIR-A: Leveraging Large Language Models to Judge Audio Captions

222

19 Sep 2024

To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoningInternational Conference on Learning Representations (ICLR), 2024

632

232

18 Sep 2024

Ideal-LLM: Integrating Dual Encoders and Language-Adapted LLM for Multilingual Speech-to-Text

Hongfei Xue

Kun Wei

Qijie Shao

Lei Xie

184

17 Sep 2024

CAST: Cross-modal Alignment Similarity Test for Vision Language ModelsInternational Conference on Computational Linguistics (COLING), 2024

236

17 Sep 2024

AI Suggestions Homogenize Writing Toward Western Styles and Diminish Cultural NuancesInternational Conference on Human Factors in Computing Systems (CHI), 2024

Dhruv Agarwal

Mor Naaman

Aditya Vashistha

416

17 Sep 2024

Leveraging Open-Source Large Language Models for Native Language Identification

Yee Man Ng

Ilia Markov

283

15 Sep 2024

Experimenting with Legal AI Solutions: The Case of Question-Answering for Access to Justice

Xiaodan Zhu

216

12 Sep 2024

What is the Role of Small Models in the LLM Era: A Survey

Lihu Chen

Gaël Varoquaux

ALM

777

10 Sep 2024

MoWE-Audio: Multitask AudioLLMs with Mixture of Weak EncodersIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024

511

10 Sep 2024

MarS: a Financial Market Simulation Engine Powered by Generative Foundation ModelInternational Conference on Learning Representations (ICLR), 2024

Junjie Li

Chang Xu

285

04 Sep 2024

The Role of Large Language Models in Musicology: Are We Ready to Trust the Machines?

Pedro Ramoneda

Emilia Parada-Cabaleiro

Benno Weck

Xavier Serra

03 Sep 2024

EvoChart: A Benchmark and a Self-Training Approach Towards Real-World Chart UnderstandingAAAI Conference on Artificial Intelligence (AAAI), 2024

Muye Huang

Han Lai

Xinyu Zhang

Wenjun Wu

Jie Ma

Lingling Zhang

Jun Liu

256

03 Sep 2024

Evaluating the Performance of Large Language Models in Competitive Programming: A Multi-Year, Multi-Grade AnalysisInternational Symposium on INnovations in Intelligent SysTems and Applications (INISTA), 2024

Adrian Marius Dumitran

Adrian Catalin Badea

Stefan-Gabriel Muscalu

ELM LRM

215

31 Aug 2024

An Investigation of Warning Erroneous Chat Translations in Cross-lingual CommunicationInternational Joint Conference on Natural Language Processing (IJCNLP), 2024

Yunmeng Li

Jun Suzuki

Makoto Morishita

Kaori Abe

Kentaro Inui

245

28 Aug 2024

LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

369

24 Aug 2024

A Law of Next-Token Prediction in Large Language ModelsPhysical Review E (Phys. Rev. E), 2024

Hangfeng He

Weijie J. Su

386

24 Aug 2024

CRUXEval-X: A Benchmark for Multilingual Code Reasoning, Understanding and ExecutionAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

519

23 Aug 2024

Video-to-Text Pedestrian Monitoring (VTPM): Leveraging Computer Vision and Large Language Models for Privacy-Preserve Pedestrian Activity Monitoring at Intersections

Ahmed S. Abdelrahman

Mohamed Abdel-Aty

Dongdong Wang

176

21 Aug 2024

SEA: Supervised Embedding Alignment for Token-Level Visual-Textual Integration in MLLMs

332

21 Aug 2024

Reefknot: A Comprehensive Benchmark for Relation Hallucination Evaluation, Analysis and Mitigation in Multimodal Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

690

18 Aug 2024

CyberPal.AI: Empowering LLMs with Expert-Driven Cybersecurity InstructionsAAAI Conference on Artificial Intelligence (AAAI), 2024

218

17 Aug 2024

See What LLMs Cannot Answer: A Self-Challenge Framework for Uncovering LLM Weaknesses

Yulong Chen

Yang Liu

Jianhao Yan

X. Bai

Ming Zhong

Yinghao Yang

Ziyi Yang

Chenguang Zhu

Yue Zhang

ALM ELM

202

16 Aug 2024

PsychoLex: Unveiling the Psychological Mind of Large Language Models

Mohammad Amin Abbasi

Farnaz Sadat Mirnezami

Hassan Naderi

LM&MA

199

16 Aug 2024

SelectLLM: Query-Aware Efficient Selection Algorithm for Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

Kaushal Kumar Maurya

KV Aditya Srivatsa

Ekaterina Kochmar

323

16 Aug 2024

Fast Training Dataset Attribution via In-Context Learning

312

14 Aug 2024

The advantages of context specific language models: the case of the Erasmian Language Model

João Gonçalves

Nick Jelicic

Michele Murgia

Evert Stamhuis

202

13 Aug 2024

Compromesso! Italian Many-Shot Jailbreaks Undermine the Safety of Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

Fabio Pernisi

Dirk Hovy

Paul Röttger

160

08 Aug 2024

WalledEval: A Comprehensive Safety Evaluation Toolkit for Large Language ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

...

Soujanya Poria

233

07 Aug 2024

Openstory++: A Large-scale Dataset and Benchmark for Instance-aware Open-domain Visual Storytelling

Zhiyang Chen

...

Mingyuan Zhou

195

07 Aug 2024

RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation

293

05 Aug 2024

Large Language Model Aided QoS Prediction for Service Recommendation

212

05 Aug 2024

RAGEval: Scenario Specific RAG Evaluation Dataset Generation FrameworkAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

...

Yishan Li

Zhiyuan Liu

Xu Han

Zhiyuan Liu

Maosong Sun

666

02 Aug 2024

AI-Assisted Generation of Difficult Math Questions

Dingli Yu

...

Sanjeev Arora

Anirudh Goyal

396

30 Jul 2024