
2205.01068

OPT: Open Pre-trained Transformer Language Models



2 May 2022

Christopher Dewan

Xian Li

Punit Singh Koura

Luke Zettlemoyer


Papers citing "OPT: Open Pre-trained Transformer Language Models"

50 / 2,924 papers shown

Towards Sampling Data Structures for Tensor Products in Turnstile Streams


147

0

0

04 Oct 2025

On the Empirical Power of Goodness-of-Fit Tests in Watermark Detection


260

0

0

04 Oct 2025

Brain-Language Model Alignment: Insights into the Platonic Hypothesis and Intermediate-Layer Advantage


Angela Lopez-Cardona

Sebastian Idesis

Mireia Masias Bruns

Ioannis Arapakis

120

0

0

03 Oct 2025

AgenticRAG: Tool-Augmented Foundation Models for Zero-Shot Explainable Recommender Systems


129

0

0

03 Oct 2025

Don't Just Chase "Highlighted Tokens" in MLLMs: Revisiting Visual Holistic Context Retention


292

7

0

03 Oct 2025

Neural Correlates of Language Models Are Specific to Human Language


144

0

0

03 Oct 2025

Detecting Post-generation Edits to Watermarked LLM Outputs via Combinatorial Watermarking


Muhammad Siddeek

Andrea J. Goldsmith

139

1

0

02 Oct 2025

The Unseen Frontier: Pushing the Limits of LLM Sparsity with Surrogate-Free ADMM


101

1

0

02 Oct 2025

Bridging Collaborative Filtering and Large Language Models with Dynamic Alignment, Multimodal Fusion and Evidence-grounded Explanations


65

0

0

02 Oct 2025

Learning a Zeroth-Order Optimizer for Fine-Tuning LLMs


182

0

0

01 Oct 2025

CAST: Continuous and Differentiable Semi-Structured Sparsity-Aware Training for Large Language Models


112

0

0

30 Sep 2025

Revealing the Power of Post-Training for Small Language Models via Knowledge Distillation


165

2

0

30 Sep 2025

Understanding the Mixture-of-Experts with Nadaraya-Watson Kernel


Chuanyang Zheng

...

Anderson Schneider

Yuriy Nevmyvaka

231

2

0

30 Sep 2025

Scaling Spoken Language Models with Syllabic Speech Tokenization


Gopala K. Anumanchipalli

132

1

0

30 Sep 2025

UniPruning: Unifying Local Metric and Global Feedback for Scalable Sparse LLMs


182

0

0

29 Sep 2025

Negative Pre-activations Differentiate Syntax


127

0

0

29 Sep 2025

OIG-Bench: A Multi-Agent Annotated Benchmark for Multimodal One-Image Guides Understanding


90

0

0

29 Sep 2025

Tequila: Trapping-free Ternary Quantization for Large Language Models


271

3

0

28 Sep 2025

Knowledge distillation through geometry-aware representational alignment


Prajjwal Bhattarai

177

0

0

27 Sep 2025

PT$^2$-LLM: Post-Training Ternarization for Large Language Models


228

0

0

27 Sep 2025

GeoBS: Information-Theoretic Quantification of Geographic Bias in AI Models


Zhongliang Zhou

...

133

1

0

27 Sep 2025

SDQ-LLM: Sigma-Delta Quantization for 1-bit LLMs of any size


110

0

0

27 Sep 2025

LLM Watermark Evasion via Bias Inversion


Jeongyeon Hwang

345

0

0

27 Sep 2025

Black-Box Hallucination Detection via Consistency Under the Uncertain Expression


118

2

0

26 Sep 2025

SuperOffload: Unleashing the Power of Large-Scale LLM Training on Superchips


Masahiro Tanaka

Olatunji Ruwase

122

2

0

25 Sep 2025

SCRA-VQA: Summarized Caption-Rerank for Augmented Large Language Models in Visual Question Answering


112

0

0

25 Sep 2025

PMark: Towards Robust and Distortion-free Semantic-level Watermarking with Channel Constraints


192

4

0

25 Sep 2025

GEP: A GCG-Based method for extracting personally identifiable information from chatbots built on small language models


Vi Ngoc-Nha Tran

230

0

0

25 Sep 2025

Detoxifying Large Language Models via Autoregressive Reward Guided Representation Editing


154

2

0

24 Sep 2025

Confidence-Aware Routing for Large Language Model Reliability Enhancement: A Multi-Signal Approach to Pre-Generation Hallucination Mitigation


129

0

0

23 Sep 2025

When Long Helps Short: How Context Length in Supervised Fine-tuning Affects Behavior of Large Language Models


255

0

0

23 Sep 2025

Are We Scaling the Right Thing? A System Perspective on Test-Time Scaling


Christopher Gooley

108

0

0

23 Sep 2025

On-the-Fly Adaptation to Quantization: Configuration-Aware LoRA for Efficient Fine-Tuning of Quantized LLMs


Edith C. H. Ngai

100

0

0

22 Sep 2025

LIMI: Less is More for Agency


...

219

6

0

22 Sep 2025

BEFT: Bias-Efficient Fine-Tuning of Language Models


Ananth Balashankar

134

0

0

19 Sep 2025

Fair-GPTQ: Bias-Aware Quantization for Large Language Models


Irina Proskurina

Guillaume Metzler

146

0

0

18 Sep 2025

Do LLMs Align Human Values Regarding Social Biases? Judging and Explaining Social Biases with LLMs


167

0

0

17 Sep 2025

Prompt Stability in Code LLMs: Measuring Sensitivity across Emotion- and Personality-Driven Variations


152

0

0

17 Sep 2025

A Framework for Generating Artificial Datasets to Validate Absolute and Relative Position Concepts


George Correa de Araujo

144

0

0

17 Sep 2025

EvoEmpirBench: Dynamic Spatial Reasoning with Agent-ExpVer


209

0

0

16 Sep 2025

Character-Level Perturbations Disrupt LLM Watermarks


411

1

0

11 Sep 2025

Collaborate, Deliberate, Evaluate: How LLM Alignment Affects Coordinated Multi-Agent Outcomes


Nikhil Krishnaswamy

185

3

0

07 Sep 2025

SMooGPT: Stylized Motion Generation using Large Language Models


115

1

0

04 Sep 2025

RecBase: Generative Foundation Model Pretraining for Zero-Shot Recommendation


133

4

0

03 Sep 2025

Behavioral Fingerprinting of Large Language Models


90

2

0

02 Sep 2025

Evaluating Recabilities of Foundation Models: A Multi-Domain, Multi-Dataset Benchmark


113

2

0

29 Aug 2025

MM-SeR: Multimodal Self-Refinement for Lightweight Image Captioning


226

0

0

29 Aug 2025

VeriLoRA: Fine-Tuning Large Language Models with Verifiable Security via Zero-Knowledge Proofs


236

0

0

29 Aug 2025

PDTrim: Targeted Pruning for Prefill-Decode Disaggregation in Inference


484

1

0

29 Aug 2025

GUARD: Glocal Uncertainty-Aware Robust Decoding for Effective and Efficient Open-Ended Text Generation


Esteban Garces Arias

Julian Rodemann

Matthias Aßenmacher

Chongsheng Zhang

166

3

0

28 Aug 2025


Page 3 of 59
