v1v2v3v4 (latest)

OPT: Open Pre-trained Transformer Language Models

2 May 2022

Xian Li

Luke Zettlemoyer

ArXiv (abs)PDF HTML HuggingFace (2 upvotes)

Papers citing "OPT: Open Pre-trained Transformer Language Models"

50 / 2,924 papers shown

Challenging GPU Dominance: When CPUs Outperform for On-Device LLM Inference

Haolin Zhang

Jeff Huang

234

09 May 2025

Diffusion Model Quantization: A Review

378

08 May 2025

Reliably Bounding False Positives: A Zero-Shot Machine-Generated Text Detection Framework via Multiscaled Conformal PredictionAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

531

08 May 2025

Revealing Weaknesses in Text Watermarking Through Self-Information Rewrite Attacks

603

08 May 2025

X-Transfer Attacks: Towards Super Transferable Adversarial Attacks on CLIP

485

08 May 2025

SPAP: Structured Pruning via Alternating Optimization and Penalty Methods

Hanyu Hu

Xiaoming Yuan

222

06 May 2025

Automatic Calibration for Membership Inference Attack on Large Language Models

Mohammad Amin Roshani

Prashant Khanduri

Dongxiao Zhu

270

06 May 2025

Optimizing LLMs for Resource-Constrained Environments: A Survey of Model Compression TechniquesAnnual International Computer Software and Applications Conference (COMPSAC), 2025

Sanjay Surendranath Girija

392

05 May 2025

Radio: Rate-Distortion Optimization for Large Language Model Compression

Sean I. Young

322

05 May 2025

An End-to-End Model For Logits Based Large Language Models Watermarking

381

05 May 2025

Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities

...

1.2K

05 May 2025

Demystifying optimized prompts in language models

Rimon Melamed

Lucas H. McCabe

H. H. Huang

263

04 May 2025

LLM Watermarking Using Mixtures and Statistical-to-Computational Gaps

Pedro Abdalla

Roman Vershynin

WaLM

404

02 May 2025

MateICL: Mitigating Attention Dispersion in Large-Scale In-Context Learning

Murtadha Ahmed

Wenbo

Liu yunfeng

237

02 May 2025

Don't be lazy: CompleteP enables compute-efficient deep transformers

514

02 May 2025

CSE-SFP: Enabling Unsupervised Sentence Representation Learning via a Single Forward PassAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025

Bowen Zhang

Zixin Song

Chunping Li

229

01 May 2025

Fast and Low-Cost Genomic Foundation Models via Outlier Removal

457

01 May 2025

An Evaluation of a Visual Question Answering Strategy for Zero-shot Facial Expression Recognition in Still Images

Modesto Castrillón-Santana

Oliverio J. Santana

David Freire-Obregón

Daniel Hernández-Sosa

J. Lorenzo-Navarro

330

30 Apr 2025

From Precision to Perception: User-Centred Evaluation of Keyword Extraction Algorithms for Internet-Scale Contextual Advertising

Jingwen Cai

Sara Leckner

Johanna Björklund

227

30 Apr 2025

Detecting and Mitigating Hateful Content in Multimodal Memes with Vision-Language Models

Minh-Hao Van

Xintao Wu

VLM

366

30 Apr 2025

Combatting Dimensional Collapse in LLM Pre-Training Data via Diversified File Selection

272

29 Apr 2025

A Domain-Agnostic Scalable AI Safety Ensuring Framework

634

29 Apr 2025

R-Sparse: Rank-Aware Activation Sparsity for Efficient LLM InferenceInternational Conference on Learning Representations (ICLR), 2025

267

28 Apr 2025

Perturbation-efficient Zeroth-order Optimization for Hardware-friendly On-device Training

...

409

28 Apr 2025

AndroidGen: Building an Android Language Agent under Data ScarcityAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

321

27 Apr 2025

Toward Generalizable Evaluation in the LLM Era: A Survey Beyond Benchmarks

...

561

26 Apr 2025

Revisiting Transformers through the Lens of Low Entropy and Dynamic Sparsity

Ruifeng Ren

Yong Liu

970

26 Apr 2025

The Big Send-off: High Performance Collectives on GPU-based Supercomputers

Siddharth Singh

Mahua Singh

A. Bhatele

250

25 Apr 2025

Comparing Uncertainty Measurement and Mitigation Methods for Large Language Models: A Systematic Review

628

25 Apr 2025

Leveraging Decoder Architectures for Learned Sparse Retrieval

303

25 Apr 2025

Evaluating Evaluation Metrics -- The Mirage of Hallucination Detection

Atharva Kulkarni

Yuan-kang Zhang

Joel Ruben Antony Moniz

385

25 Apr 2025

HMI: Hierarchical Knowledge Management for Efficient Multi-Tenant Inference in Pretrained Language ModelsThe VLDB journal (VLDB J.), 2025

192

24 Apr 2025

CoheMark: A Novel Sentence-Level Watermark for Enhanced Text Quality

313

24 Apr 2025

A Survey of Foundation Model-Powered Recommender Systems: From Feature-Based, Generative to Agentic Paradigms

321

23 Apr 2025

Context-Enhanced Contrastive Search for Improved LLM Text Generation

Jaydip Sen

Rohit Pandey

Hetvi Waghela

344

22 Apr 2025

ResNetVLLM -- Multi-modal Vision LLM for the Video Understanding Task

327

20 Apr 2025

Analysing the Robustness of Vision-Language-Models to Common Corruptions

345

18 Apr 2025

EmoVoice: LLM-based Emotional Text-To-Speech Model with Freestyle Text Prompting

...

525

17 Apr 2025

One Model to Rig Them All: Diverse Skeleton Rigging with UniRigACM Transactions on Graphics (TOG), 2025

259

16 Apr 2025

Shared Disk KV Cache Management for Efficient Multi-Instance Inference in RAG-Powered LLMs

282

16 Apr 2025

Entropy-Guided Watermarking for LLMs: A Test-Time Framework for Robust and Traceable Text Generation

286

16 Apr 2025

Efficient Distributed Retrieval-Augmented Generation for Enhancing Language Model Performance

330

15 Apr 2025

CSPLADE: Learned Sparse Retrieval with Causal Language Models

445

15 Apr 2025

Dynamic Compressing Prompts for Efficient Inference of Large Language Models

350

15 Apr 2025

Transferable text data distillation by trajectory matching

338

14 Apr 2025

HalluShift: Measuring Distribution Shifts towards Hallucination Detection in LLMs

277

13 Apr 2025

Efficient LLM Serving on Hybrid Real-time and Best-effort Requests

304

13 Apr 2025

Quantization Error Propagation: Revisiting Layer-Wise Post-Training Quantization

Yamato Arai

Yuma Ichikawa

427

13 Apr 2025

AeroLite: Tag-Guided Lightweight Generation of Aerial Image Captions

161

13 Apr 2025

Knowledge Graph-extended Retrieval Augmented Generation for Question Answering

Jasper Linders

Jakub M. Tomczak

RALM

210

11 Apr 2025