v1v2v3v4 (latest)

OPT: Open Pre-trained Transformer Language Models

2 May 2022

Xian Li

Luke Zettlemoyer

ArXiv (abs)PDF HTML HuggingFace (2 upvotes)

Papers citing "OPT: Open Pre-trained Transformer Language Models"

50 / 2,926 papers shown

Position: Beyond Euclidean -- Foundation Models Should Embrace Non-Euclidean Geometries

290

11 Apr 2025

Embodied Image Captioning: Self-supervised Learning Agents for Spatially Coherent Image Descriptions

405

11 Apr 2025

Knowledge Graph-extended Retrieval Augmented Generation for Question Answering

Jasper Linders

Jakub M. Tomczak

RALM

218

11 Apr 2025

On The Landscape of Spoken Language Models: A Comprehensive Survey

391

11 Apr 2025

Apt-Serve: Adaptive Request Scheduling on Hybrid Cache for Scalable LLM Inference Serving

268

10 Apr 2025

Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora

...

643

177

10 Apr 2025

Exploring the Effectiveness and Interpretability of Texts in LLM-based Time Series Models

169

09 Apr 2025

Classifying the Unknown: In-Context Learning for Open-Vocabulary Text and Symbol RecognitionIEEE International Conference on Document Analysis and Recognition (ICDAR), 2025

212

09 Apr 2025

GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography

416

09 Apr 2025

Data Augmentation for Fake Reviews Detection in Multiple Languages and Multiple Domains

Ming Liu

Massimo Poesio

286

09 Apr 2025

AccLLM: Accelerating Long-Context LLM Inference Via Algorithm-Hardware Co-Design

258

07 Apr 2025

URECA: Unique Region Caption Anything

315

07 Apr 2025

Thanos: A Block-wise Pruning Algorithm for Efficient Large Language Model Compression

Ivan Ilin

Peter Richtárik

184

06 Apr 2025

Hessian of Perplexity for Large Language Models by PyTorch autograd (Open Source)

Ivan Ilin

258

06 Apr 2025

Domain Generalization for Face Anti-spoofing via Content-aware Composite Prompt EngineeringIEEE transactions on multimedia (TMM), 2025

342

06 Apr 2025

Your Image Generator Is Your New Private DatasetImage and Vision Computing (IVC), 2025

359

06 Apr 2025

SLOs-Serve: Optimized Serving of Multi-SLO LLMs

253

05 Apr 2025

A Perplexity and Menger Curvature-Based Approach for Similarity Evaluation of Large Language Models

Yuantao Zhang

Zhankui Yang

AAML

319

05 Apr 2025

Scaling Analysis of Interleaved Speech-Text Language Models

461

03 Apr 2025

When Reasoning Meets Compression: Understanding the Effects of LLMs Compression on Large Reasoning Models

568

02 Apr 2025

HERA: Hybrid Edge-cloud Resource Allocation for Cost-Efficient AI Agents

381

01 Apr 2025

Short-PHD: Detecting Short LLM-generated Text with Topological Data Analysis After Off-topic Content Insertion

293

01 Apr 2025

WinoWhat: A Parallel Corpus of Paraphrased WinoGrande Sentences with Common Sense Categorization

320

31 Mar 2025

Model Hemorrhage and the Robustness Limits of Large Language Models

319

31 Mar 2025

PIM-LLM: A High-Throughput Hybrid PIM Architecture for 1-bit LLMs

187

31 Mar 2025

Whisper-LM: Improving ASR Models with Language Models for Low-Resource Languages

328

30 Mar 2025

Leaking LoRa: An Evaluation of Password Leaks and Knowledge Storage in Large Language Models

Ryan Marinelli

Magnus Eckhoff

PILM

190

29 Mar 2025

Monte Carlo Sampling for Analyzing In-Context Examples

S. Schoch

Yangfeng Ji

220

27 Mar 2025

TempTest: Local Normalization Distortion and the Detection of Machine-generated TextInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2025

279

26 Mar 2025

Rethinking Vision-Language Model in Face Forensics: Multi-Modal Interpretable Forged Face DetectorComputer Vision and Pattern Recognition (CVPR), 2025

438

26 Mar 2025

CubeRobot: Grounding Language in Rubik's Cube Manipulation via Vision-Language ModelThe Web Conference (WWW), 2025

261

25 Mar 2025

Oaken: Fast and Efficient LLM Serving with Online-Offline Hybrid KV Cache QuantizationInternational Symposium on Computer Architecture (ISCA), 2025

309

24 Mar 2025

CEFW: A Comprehensive Evaluation Framework for Watermark in Large Language Models

292

24 Mar 2025

Maximum Redundancy Pruning: A Principle-Driven Layerwise Sparsity Allocation for LLMs

306

24 Mar 2025

ExpertRAG: Efficient RAG with Mixture of Experts -- Optimizing Context Retrieval for Adaptive LLM Responses

Esmail Gumaan

MoE

337

23 Mar 2025

ComfyGPT: A Self-Optimizing Multi-Agent System for Comprehensive ComfyUI Workflow Generation

311

22 Mar 2025

Large Language Model Compression via the Nested Activation-Aware Decomposition

250

21 Mar 2025

NdLinear: Preserving Multi-Dimensional Structure for Parameter-Efficient Neural Networks

457

21 Mar 2025

REVAL: A Comprehension Evaluation on Reliability and Values of Large Vision-Language Models

355

20 Mar 2025

Disentangling Fine-Tuning from Pre-Training in Visual Captioning with Hybrid Markov LogicBigData Congress [Services Society] (BSS), 2024

335

18 Mar 2025

MMR: A Large-scale Benchmark Dataset for Multi-target and Multi-granularity Reasoning SegmentationInternational Conference on Learning Representations (ICLR), 2025

269

18 Mar 2025

ClusComp: A Simple Paradigm for Model Compression and Efficient FinetuningAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

391

17 Mar 2025

HIS-GPT: Towards 3D Human-In-Scene Multimodal Understanding

402

17 Mar 2025

ROMA: a Read-Only-Memory-based Accelerator for QLoRA-based On-Device LLM

298

17 Mar 2025

AccelGen: Heterogeneous SLO-Guaranteed High-Throughput LLM Inference Serving for Diverse Applications

Haiying Shen

Tanmoy Sen

280

17 Mar 2025

ZO2: Scalable Zeroth-Order Fine-Tuning for Extremely Large Language Models with Limited GPU Memory

381

16 Mar 2025

SVD-LLM V2: Optimizing Singular Value Truncation for Large Language Model CompressionNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025

277

16 Mar 2025

PIPO: Pipelined Offloading for Efficient Inference on Consumer Devices

Yangyijian Liu

Jun Yu Li

Wu-Jun Li

304

15 Mar 2025

Text Compression for Efficient Language GenerationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025

David Gu

Peter Belcak

Roger Wattenhofer

251

14 Mar 2025

Towards Extreme Pruning of LLMs with Plug-and-Play Mixed Sparsity

193

14 Mar 2025