ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.01068
  4. Cited By
OPT: Open Pre-trained Transformer Language Models
v1v2v3v4 (latest)

OPT: Open Pre-trained Transformer Language Models

2 May 2022
Susan Zhang
Stephen Roller
Naman Goyal
Mikel Artetxe
Moya Chen
Shuohui Chen
Christopher Dewan
Mona T. Diab
Xian Li
Xi Lin
Todor Mihaylov
Myle Ott
Sam Shleifer
Kurt Shuster
Daniel Simig
Punit Singh Koura
Anjali Sridhar
Tianlu Wang
Luke Zettlemoyer
    VLMOSLMAI4CE
ArXiv (abs)PDFHTMLHuggingFace (2 upvotes)

Papers citing "OPT: Open Pre-trained Transformer Language Models"

50 / 2,922 papers shown
APT-LLM: Exploiting Arbitrary-Precision Tensor Core Computing for LLM Acceleration
APT-LLM: Exploiting Arbitrary-Precision Tensor Core Computing for LLM AccelerationIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), 2025
Shaobo Ma
Chao Fang
Haikuo Shao
Zhongfeng Wang
124
1
0
26 Aug 2025
Task-Stratified Knowledge Scaling Laws for Post-Training Quantized Large Language Models
Task-Stratified Knowledge Scaling Laws for Post-Training Quantized Large Language Models
Chenxi Zhou
Pengfei Cao
Jiang Li
Jun Zhao
Kang Liu
Jun Zhao
Kang Liu
MQ
196
0
0
26 Aug 2025
Better Language Model-Based Judging Reward Modeling through Scaling Comprehension Boundaries
Better Language Model-Based Judging Reward Modeling through Scaling Comprehension Boundaries
Meiling Ning
Zhongbao Zhang
Junda Ye
Jiabao Guo
Qingyuan Guan
LRM
132
0
0
25 Aug 2025
Dynamic Sparse Attention on Mobile SoCs
Dynamic Sparse Attention on Mobile SoCs
Wangsong Yin
Daliang Xu
Mengwei Xu
Gang Huang
Xuanzhe Liu
MQ
162
3
0
22 Aug 2025
Interpreting the Effects of Quantization on LLMs
Interpreting the Effects of Quantization on LLMs
Manpreet Singh
Hassan Sajjad
MQMILM
382
3
0
22 Aug 2025
Subjective Behaviors and Preferences in LLM: Language of Browsing
Subjective Behaviors and Preferences in LLM: Language of Browsing
Sai Sundaresan
Harshita Chopra
Atanu R. Sinha
Koustava Goswami
Nagasai Saketh Naidu
Raghav Karan
N Anushka
246
0
0
21 Aug 2025
Discrete Optimization of Min-Max Violation and its Applications Across Computational Sciences
Discrete Optimization of Min-Max Violation and its Applications Across Computational Sciences
Cheikh Ahmed
Mahdi Mostajabdaveh
Samin Aref
Zirui Zhou
135
1
0
19 Aug 2025
Two Birds with One Stone: Multi-Task Detection and Attribution of LLM-Generated Text
Two Birds with One Stone: Multi-Task Detection and Attribution of LLM-Generated Text
Zixin Rao
Youssef Mohamed
Shang Liu
Zeyan Liu
DeLMO
176
0
0
19 Aug 2025
GLASS: Test-Time Acceleration for LLMs via Global-Local Neural Importance Aggregation
GLASS: Test-Time Acceleration for LLMs via Global-Local Neural Importance Aggregation
Amirmohsen Sattarifard
Sepehr Lavasani
Ehsan Imani
Kunlin Zhang
Hanlin Xu
Fengyu Sun
Negar Hassanpour
Chao Gao
VLM
107
1
0
19 Aug 2025
Z-Pruner: Post-Training Pruning of Large Language Models for Efficiency without Retraining
Z-Pruner: Post-Training Pruning of Large Language Models for Efficiency without Retraining
Samiul Basir Bhuiyan
Md. Sazzad Hossain Adib
Mohammed Aman Bhuiyan
Muhammad Rafsan Kabir
Moshiur Farazi
Shafin Rahman
Nabeel Mohammed
180
1
0
18 Aug 2025
The Cultural Gene of Large Language Models: A Study on the Impact of Cross-Corpus Training on Model Values and Biases
The Cultural Gene of Large Language Models: A Study on the Impact of Cross-Corpus Training on Model Values and Biases
Emanuel Z. Fenech-Borg
Tilen P. Meznaric-Kos
Milica D. Lekovic-Bojovic
Arni J. Hentze-Djurhuus
253
0
0
17 Aug 2025
STEM: Efficient Relative Capability Evaluation of LLMs through Structured Transition Samples
STEM: Efficient Relative Capability Evaluation of LLMs through Structured Transition Samples
Haiquan Hu
Jiazhi Jiang
Shiyou Xu
Ruhan Zeng
Tian Wang
124
0
0
16 Aug 2025
A Survey on Diffusion Language Models
A Survey on Diffusion Language Models
Tianyi Li
Mingda Chen
Bowei Guo
Zhiqiang Shen
319
32
0
14 Aug 2025
Puppeteer: Rig and Animate Your 3D Models
Puppeteer: Rig and Animate Your 3D Models
Chaoyue Song
Xiu Li
Fan Yang
Zhongcong Xu
Jiacheng Wei
Fayao Liu
Jiashi Feng
Guosheng Lin
Jianfeng Zhang
110
12
0
14 Aug 2025
A Study of Commonsense Reasoning over Visual Object Properties
A Study of Commonsense Reasoning over Visual Object Properties
Abhishek Kolari
Mohammadhossein Khojasteh
Yifan Jiang
Floris den Hengst
Filip Ilievski
OCL
233
0
0
14 Aug 2025
Unpacking the Implicit Norm Dynamics of Sharpness-Aware Minimization in Tensorized Models
Unpacking the Implicit Norm Dynamics of Sharpness-Aware Minimization in Tensorized Models
Tianxiao Cao
Kyohei Atarashi
H. Kashima
230
0
0
14 Aug 2025
Shadow in the Cache: Unveiling and Mitigating Privacy Risks of KV-cache in LLM Inference
Shadow in the Cache: Unveiling and Mitigating Privacy Risks of KV-cache in LLM Inference
Zhifan Luo
Shuo Shao
Su Zhang
Lijing Zhou
Yuke Hu
Chenxu Zhao
Zhihao Liu
Zhan Qin
240
4
0
13 Aug 2025
VertexRegen: Mesh Generation with Continuous Level of Detail
VertexRegen: Mesh Generation with Continuous Level of Detail
Xiang Zhang
Yawar Siddiqui
A. Avetisyan
Chris Xie
Jakob Julian Engel
Henry Howard-Jenkins
3DH
72
4
0
12 Aug 2025
SinLlama -- A Large Language Model for Sinhala
SinLlama -- A Large Language Model for SinhalaMoratuwa Engineering Research Conference (MERCon), 2025
H.W.K.Aravinda
Rashad Sirajudeen
Samith Karunathilake
Nisansa de Silva
Surangika Ranathunga
Rishemjit Kaur
LRM
290
1
0
12 Aug 2025
Semantic-Enhanced Time-Series Forecasting via Large Language Models
Semantic-Enhanced Time-Series Forecasting via Large Language Models
Hao Liu
Chun Yang
Zhang xiaoxing
Xiaobin Zhu
AI4TSAIFin
246
1
0
11 Aug 2025
Efficient Edge LLMs Deployment via HessianAware Quantization and CPU GPU Collaborative
Efficient Edge LLMs Deployment via HessianAware Quantization and CPU GPU Collaborative
Tuo Zhang
Ning Li
Xin Yuan
Wenchao Xu
Quan Chen
Song Guo
Haijun Zhang
MQ
130
0
0
10 Aug 2025
Rethinking 1-bit Optimization Leveraging Pre-trained Large Language Models
Rethinking 1-bit Optimization Leveraging Pre-trained Large Language Models
Zhijun Tu
Hanting Chen
Siqi Liu
Chuanjian Liu
Jian Li
Jie Hu
Yunhe Wang
MQ
127
0
0
09 Aug 2025
Fed MobiLLM: Efficient Federated LLM Fine-Tuning over Heterogeneous Mobile Devices via Server Assisted Side-Tuning
Fed MobiLLM: Efficient Federated LLM Fine-Tuning over Heterogeneous Mobile Devices via Server Assisted Side-Tuning
Xingke Yang
Liang Li
Sicong Li
Liwei Guan
Hao Wang
Xiaoqi Qi
Jiang-Dong Liu
Xin Fu
Miao Pan
121
1
0
09 Aug 2025
Decision-Making with Deliberation: Meta-reviewing as a Document-grounded Dialogue
Decision-Making with Deliberation: Meta-reviewing as a Document-grounded Dialogue
Sukannya Purkayastha
Nils Dycke
Anne Lauscher
Iryna Gurevych
104
1
0
07 Aug 2025
A Survey on Video Temporal Grounding with Multimodal Large Language Model
A Survey on Video Temporal Grounding with Multimodal Large Language ModelIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
Yue Yu
Wei Liu
Y. Liu
Meng-yang Liu
Liqiang Nie
Zhouchen Lin
C. Chen
AI4TSVLMLRM
145
7
0
07 Aug 2025
FlexQ: Efficient Post-training INT6 Quantization for LLM Serving via Algorithm-System Co-Design
FlexQ: Efficient Post-training INT6 Quantization for LLM Serving via Algorithm-System Co-Design
Hao Zhang
Aining Jia
Weifeng Bu
Y. Cai
Kai Sheng
Hao Chen
Xin He
MQ
131
0
0
06 Aug 2025
Share Your Attention: Transformer Weight Sharing via Matrix-based Dictionary Learning
Share Your Attention: Transformer Weight Sharing via Matrix-based Dictionary Learning
Magauiya Zhussip
Dmitriy Shopkhoev
Ammar Ali
Stamatios Lefkimmiatis
112
2
0
06 Aug 2025
CTR-Sink: Attention Sink for Language Models in Click-Through Rate Prediction
CTR-Sink: Attention Sink for Language Models in Click-Through Rate Prediction
Zixuan Li
Binzong Geng
Jing Xiong
Yong He
Yuxuan Hu
...
Liang Zhang
Linjian Mo
Chengming Li
Chuan Yuan
Zhenan Sun
136
0
0
05 Aug 2025
MegaWika 2: A More Comprehensive Multilingual Collection of Articles and their Sources
MegaWika 2: A More Comprehensive Multilingual Collection of Articles and their Sources
Samuel Barham
Chandler May
Benjamin Van Durme
SyDa
186
3
0
05 Aug 2025
When Truth Is Overridden: Uncovering the Internal Origins of Sycophancy in Large Language Models
When Truth Is Overridden: Uncovering the Internal Origins of Sycophancy in Large Language Models
Keyu Wang
Jin Li
Shu Yang
Zhuoran Zhang
Haiyan Zhao
452
6
0
04 Aug 2025
Context-Adaptive Multi-Prompt Embedding with Large Language Models for Vision-Language Alignment
Context-Adaptive Multi-Prompt Embedding with Large Language Models for Vision-Language Alignment
Dahun Kim
A. Angelova
VLM
209
1
0
03 Aug 2025
Mitigating Information Loss under High Pruning Rates for Efficient Large Vision Language Models
Mitigating Information Loss under High Pruning Rates for Efficient Large Vision Language Models
Mingyu Fu
Wei Suo
Ji Ma
Lin Yuanbo Wu
Peng Wang
Yanning Zhang
VLM
168
1
0
02 Aug 2025
FlashSVD: Memory-Efficient Inference with Streaming for Low-Rank Models
FlashSVD: Memory-Efficient Inference with Streaming for Low-Rank Models
Zishan Shao
Yixiao Wang
Qinsi Wang
Ting Jiang
Zhixu Du
Hancheng Ye
Danyang Zhuo
Yiran Chen
Xue Yang
119
3
0
02 Aug 2025
A Bayesian Hybrid Parameter-Efficient Fine-Tuning Method for Large Language Models
A Bayesian Hybrid Parameter-Efficient Fine-Tuning Method for Large Language Models
Yidong Chai
Yang Liu
Yonghang Zhou
Jiaheng Xie
Daniel Zeng
95
0
0
31 Jul 2025
Shapley Uncertainty in Natural Language Generation
Shapley Uncertainty in Natural Language Generation
Meilin Zhu
Gaojie Jin
Xiaowei Huang
Lijun Zhang
151
0
0
29 Jul 2025
When Truthful Representations Flip Under Deceptive Instructions?
When Truthful Representations Flip Under Deceptive Instructions?
Xianxuan Long
Y. Fu
Runchao Li
Mu Sheng
Haotian Yu
Xiaotian Han
Pan Li
LLMSV
373
4
0
29 Jul 2025
Adversarial Defence without Adversarial Defence: Enhancing Language Model Robustness via Instance-level Principal Component Removal
Adversarial Defence without Adversarial Defence: Enhancing Language Model Robustness via Instance-level Principal Component Removal
Yang Wang
Chenghao Xiao
Yi Zhou
Stuart E. Middleton
Noura Al Moubayed
C. D. Lin
AAML
304
1
0
29 Jul 2025
FMimic: Foundation Models are Fine-grained Action Learners from Human Videos
FMimic: Foundation Models are Fine-grained Action Learners from Human VideosThe international journal of robotics research (IJRR), 2025
Guangyan Chen
Meiling Wang
Te Cui
Yao Mu
Haoyang Lu
...
Mengxiao Hu
Tianxing Zhou
M. Fu
Yi Yang
Yufeng Yue
LM&RoVLM
158
5
0
28 Jul 2025
Do Large Language Models Understand Morality Across Cultures?
Do Large Language Models Understand Morality Across Cultures?
Hadi Mohammadi
Yasmeen F.S.S. Meijer
Efthymia Papadopoulou
Ayoub Bagheri
214
2
0
28 Jul 2025
Flora: Effortless Context Construction to Arbitrary Length and Scale
Flora: Effortless Context Construction to Arbitrary Length and Scale
Tianxiang Chen
Zhentao Tan
Xiaofan Bo
Yue Wu
Tao Gong
Qi Chu
Jieping Ye
Nenghai Yu
CLLLRM
253
1
0
26 Jul 2025
HCAttention: Extreme KV Cache Compression via Heterogeneous Attention Computing for LLMs
HCAttention: Extreme KV Cache Compression via Heterogeneous Attention Computing for LLMs
Dongquan Yang
Yifan Yang
Xiaotian Yu
Xianbiao Qi
Rong Xiao
MQ
173
0
0
26 Jul 2025
The Carbon Cost of Conversation, Sustainability in the Age of Language Models
The Carbon Cost of Conversation, Sustainability in the Age of Language Models
Sayed Mahbub Hasan Amiri
Prasun Goswami
Md. Mainul Islam
Mohammad Shakhawat Hossen
Sayed Majhab Hasan Amiri
Naznin Akter
SILMSyDa
260
3
0
26 Jul 2025
A Survey on Generative Model Unlearning: Fundamentals, Taxonomy, Evaluation, and Future Direction
A Survey on Generative Model Unlearning: Fundamentals, Taxonomy, Evaluation, and Future Direction
Xiaohua Feng
Jiaming Zhang
Fengyuan Yu
C. Wang
Li Zhang
Kaixiang Li
Yuyuan Li
Chaochao Chen
Jianwei Yin
MU
262
2
0
26 Jul 2025
MLLM-based Speech Recognition: When and How is Multimodality Beneficial?
MLLM-based Speech Recognition: When and How is Multimodality Beneficial?
Yiwen Guan
V. Trinh
Vivek Voleti
Jacob Whitehill
222
1
0
25 Jul 2025
Modality Agnostic Efficient Long Range Encoder
Modality Agnostic Efficient Long Range Encoder
T. Parag
Ahmed Elgammal
158
0
0
25 Jul 2025
SLoW: Select Low-frequency Words! Automatic Dictionary Selection for Translation on Large Language Models
SLoW: Select Low-frequency Words! Automatic Dictionary Selection for Translation on Large Language Models
Hongyuan Lu
Zixuan Li
Zefan Zhang
Wai Lam
127
0
0
25 Jul 2025
BucketServe: Bucket-Based Dynamic Batching for Smart and Efficient LLM Inference Serving
BucketServe: Bucket-Based Dynamic Batching for Smart and Efficient LLM Inference Serving
Wanyi Zheng
Minxian Xu
Shengye Song
Kejiang Ye
145
1
0
23 Jul 2025
FedChip: Federated LLM for Artificial Intelligence Accelerator Chip Design
FedChip: Federated LLM for Artificial Intelligence Accelerator Chip Design
Mahmoud Nazzal
Khoa Nguyen
Deepak Vungarala
Ramtin Zand
Shaahin Angizi
Hai Phan
Abdallah Khreishah
134
1
0
23 Jul 2025
Spatial 3D-LLM: Exploring Spatial Awareness in 3D Vision-Language Models
Spatial 3D-LLM: Exploring Spatial Awareness in 3D Vision-Language Models
Xiaoyan Wang
Zeju Li
Yifan Xu
Jiaxing Qi
Zhifei Yang
Ruifei Ma
Xiangde Liu
Chao Zhang
130
3
0
22 Jul 2025
Discrete Tokenization for Multimodal LLMs: A Comprehensive Survey
Discrete Tokenization for Multimodal LLMs: A Comprehensive Survey
Jindong Li
Yali Fu
Jiahong Liu
Linxiao Cao
Wei Ji
Menglin Yang
Irwin King
Ming-Hsuan Yang
OffRL
157
3
0
21 Jul 2025
Previous
12345...575859
Next
Page 4 of 59
Pageof 59