v1v2v3v4 (latest)

OPT: Open Pre-trained Transformer Language Models

2 May 2022

Xian Li

Luke Zettlemoyer

ArXiv (abs)PDF HTML HuggingFace (2 upvotes)

Papers citing "OPT: Open Pre-trained Transformer Language Models"

50 / 2,922 papers shown

SoK: Are Watermarks in LLMs Ready for Deployment?

170

24 Dec 2025

KVNAND: Efficient On-Device Large Language Model Inference Using DRAM-Free In-Flash Computing

03 Dec 2025

TokenPowerBench: Benchmarking the Power Consumption of LLM Inference

02 Dec 2025

$Fairy2i: Training Complex LLMs from Real LLMs with All Parameters in $\{\pm 1, \pm i\}$$

Fairy2i: Training Complex LLMs from Real LLMs with All Parameters in

\{\pm 1, \pm i\}

476

02 Dec 2025

Context-Enriched Contrastive Loss: Enhancing Presentation of Inherent Sample Connections in Contrastive Learning FrameworkIEEE transactions on multimedia (TMM), 2025

Haojin Deng

Yimin Yang

01 Dec 2025

Tangram: Accelerating Serverless LLM Loading through GPU Memory Reuse and Affinity

01 Dec 2025

Comparative Analysis of 47 Context-Based Question Answer Models Across 8 Diverse Datasets

Muhammad Muneeb

David B. Ascher

Ahsan Baidar Bakht

29 Nov 2025

Serving Heterogeneous LoRA Adapters in Distributed LLM Inference Systems

...

111

28 Nov 2025

Experts are all you need: A Composable Framework for Large Language Model Inference

173

28 Nov 2025

Towards Audio Token Compression in Large Audio Language Models

310

26 Nov 2025

CDLM: Consistency Diffusion Language Models For Faster Sampling

200

24 Nov 2025

FastForward Pruning: Efficient LLM Pruning via Single-Step Reinforcement Learning

377

24 Nov 2025

Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models

Y. Fu

Xin Dong

Shizhe Diao

Matthijs Van Keirsbilck

...

164

24 Nov 2025

Adaptive Layer-Wise Transformations for Post-Training Quantization of Large Language Models

146

21 Nov 2025

Layer-Wise High-Impact Parameter Ratio Optimization in Post-Training Quantization for Large Language Models

136

21 Nov 2025

R2Q: Towards Robust 2-Bit Large Language Models via Residual Refinement Quantization

176

21 Nov 2025

Robot Confirmation Generation and Action Planning Using Long-context Q-Former Integrated with Multimodal LLM

101

21 Nov 2025

An Image Is Worth Ten Thousand Words: Verbose-Text Induction Attacks on VLMs

207

20 Nov 2025

10Cache: Heterogeneous Resource-Aware Tensor Caching and Migration for LLM Training

Sabiha Afroz

Redwan Ibne Seraj Khan

Hadeel Albahar

Jingoo Han

A. R. Butt

157

18 Nov 2025

GPS: General Per-Sample Prompter

Pawel Batorski

Paul Swoboda

18 Nov 2025

Neo: Real-Time On-Device 3D Gaussian Splatting with Reuse-and-Update Sorting Acceleration

211

17 Nov 2025

Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance

...

168

17 Nov 2025

MACKO: Sparse Matrix-Vector Multiplication for Low Sparsity

Vladimír Macko

Vladimír Boža

136

17 Nov 2025

BitSnap: Checkpoint Sparsification and Quantization in LLM Training

326

15 Nov 2025

Don't Think of the White Bear: Ironic Negation in Transformer Models Under Cognitive Load

15 Nov 2025

OAD-Promoter: Enhancing Zero-shot VQA using Large Language Models with Object Attribute Description

261

15 Nov 2025

Dynamic Temperature Scheduler for Knowledge Distillation

102

14 Nov 2025

iSeal: Encrypted Fingerprinting for Reliable LLM Ownership Verification

160

12 Nov 2025

LLM-GROP: Visually Grounded Robot Task and Motion Planning with Large Language ModelsThe international journal of robotics research (IJRR), 2025

245

11 Nov 2025

ProcGen3D: Learning Neural Procedural Graph Representations for Image-to-3D Reconstruction

204

10 Nov 2025

Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence

279

10 Nov 2025

Rethinking Parameter Sharing as Graph Coloring for Structured Compression

Boyang Zhang

Daning Cheng

Yunquan Zhang

184

10 Nov 2025

Private-RAG: Answering Multiple Queries with LLMs while Keeping Your Data Private

266

10 Nov 2025

Ghost in the Transformer: Detecting Model Reuse with Invariant Spectral Signatures

160

09 Nov 2025

Chain-of-Thought as a Lens: Evaluating Structured Reasoning Alignment between Human Preferences and Large Language Models

113

09 Nov 2025

HatePrototypes: Interpretable and Transferable Representations for Implicit and Explicit Hate Speech Detection

Irina Proskurina

Marc-Antoine Carpentier

Julien Velcin

VLM

129

09 Nov 2025

DRAGON: Guard LLM Unlearning in Context via Negative Detection and ReasoningConference on Empirical Methods in Natural Language Processing (EMNLP), 2025

355

08 Nov 2025

The Future of Fully Homomorphic Encryption System: from a Storage I/O Perspective

...

07 Nov 2025

DartQuant: Efficient Rotational Distribution Calibration for LLM Quantization

365

06 Nov 2025

From Prompts to Power: Measuring the Energy Footprint of LLM Inference

Francisco Caravaca

Ángel Cuevas

R. Cuevas

116

05 Nov 2025

FP8-Flow-MoE: A Casting-Free FP8 Recipe without Double Quantization Error

127

04 Nov 2025

ConMeZO: Adaptive Descent-Direction Sampling for Gradient-Free Finetuning of Large Language Models

141

04 Nov 2025

Analyzing the Power of Chain of Thought through Memorization Capabilities

216

03 Nov 2025

A CPU-Centric Perspective on Agentic AI

Ritik Raj

Hong Wang

Tushar Krishna

313

01 Nov 2025

Encoder-Decoder or Decoder-Only? Revisiting Encoder-Decoder Large Language Model

148

30 Oct 2025

MMEdge: Accelerating On-device Multimodal Inference via Pipelined Sensing and Encoding

281

29 Oct 2025

Layer of Truth: Probing Belief Shifts under Continual Pre-Training Poisoning

369

29 Oct 2025

MISA: Memory-Efficient LLMs Optimization with Module-wise Importance Sampling

148

28 Oct 2025

Language Model Behavioral Phases are Consistent Across Architecture, Training Data, and Scale

133

28 Oct 2025

DualCap: Enhancing Lightweight Image Captioning via Dual Retrieval with Similar Scenes Visual Prompts

337

28 Oct 2025