2205.01068
OPT: Open Pre-trained Transformer Language Models
2 May 2022
Susan Zhang
Stephen Roller
Naman Goyal
Mikel Artetxe
Moya Chen
Shuohui Chen
Christopher Dewan
Mona T. Diab
Xian Li
Xi Lin
Todor Mihaylov
Myle Ott
Sam Shleifer
Kurt Shuster
Daniel Simig
Punit Singh Koura
Anjali Sridhar
Tianlu Wang
Luke Zettlemoyer
VLM
OSLM
AI4CE
Papers citing "OPT: Open Pre-trained Transformer Language Models"
50 / 2,926 papers shown
Position: Beyond Euclidean -- Foundation Models Should Embrace Non-Euclidean Geometries
Neil He
Jiahong Liu
Buze Zhang
N. Bui
Ali Maatouk
Menglin Yang
Irwin King
Melanie Weber
Rex Ying
290
7
0
11 Apr 2025
Embodied Image Captioning: Self-supervised Learning Agents for Spatially Coherent Image Descriptions
Tommaso Galliena
Tommaso Apicella
Stefano Rosa
Pietro Morerio
Alessio Del Bue
Lorenzo Natale
405
1
0
11 Apr 2025
Knowledge Graph-extended Retrieval Augmented Generation for Question Answering
Jasper Linders
Jakub M. Tomczak
RALM
218
11
0
11 Apr 2025
On The Landscape of Spoken Language Models: A Comprehensive Survey
Siddhant Arora
Kai-Wei Chang
Chung-Ming Chien
Yifan Peng
Haibin Wu
Yossi Adi
Emmanuel Dupoux
Hung-yi Lee
Karen Livescu
Shinji Watanabe
391
78
0
11 Apr 2025
Apt-Serve: Adaptive Request Scheduling on Hybrid Cache for Scalable LLM Inference Serving
Shihong Gao
Wei Wei
Yanyan Shen
Lei Chen
268
9
0
10 Apr 2025
Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora
Alex Warstadt
Aaron Mueller
Leshem Choshen
E. Wilcox
Chengxu Zhuang
...
Rafael Mosquera
Bhargavi Paranjape
Adina Williams
Tal Linzen
Robert Bamler
643
177
0
10 Apr 2025
Exploring the Effectiveness and Interpretability of Texts in LLM-based Time Series Models
Zhengke Sun
Hangwei Qian
Ivor Tsang
AI4TS
169
0
0
09 Apr 2025
Classifying the Unknown: In-Context Learning for Open-Vocabulary Text and Symbol Recognition
IEEE International Conference on Document Analysis and Recognition (ICDAR), 2025
Tom Simon
William Mocaer
Pierrick Tranouez
Clément Chatelain
Thierry Paquet
MLLM
VLM
212
0
0
09 Apr 2025
GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography
Mengchen Zhang
Tong Wu
Jing Tan
Yu Qiao
Gordon Wetzstein
Dahua Lin
VGen
416
7
0
09 Apr 2025
Data Augmentation for Fake Reviews Detection in Multiple Languages and Multiple Domains
Ming Liu
Massimo Poesio
286
3
0
09 Apr 2025
AccLLM: Accelerating Long-Context LLM Inference Via Algorithm-Hardware Co-Design
Yanbiao Liang
Huihong Shi
Haikuo Shao
Zhongfeng Wang
258
4
0
07 Apr 2025
URECA: Unique Region Caption Anything
Sangbeom Lim
J. Kim
Heeji Yoon
Jaewoo Jung
Seungryong Kim
315
2
0
07 Apr 2025
Thanos: A Block-wise Pruning Algorithm for Efficient Large Language Model Compression
Ivan Ilin
Peter Richtárik
184
5
0
06 Apr 2025
Hessian of Perplexity for Large Language Models by PyTorch autograd (Open Source)
Ivan Ilin
258
2
0
06 Apr 2025
Domain Generalization for Face Anti-spoofing via Content-aware Composite Prompt Engineering
IEEE transactions on multimedia (TMM), 2025
Jiaxin Guo
Ajian Liu
Yunfeng Diao
Jing Zhang
Hui Ma
Bo Zhao
Richang Hong
Meng Wang
342
12
0
06 Apr 2025
Your Image Generator Is Your New Private Dataset
Image and Vision Computing (IVC), 2025
Nicolo Resmini
Eugenio Lomurno
Cristian Sbrolli
Matteo Matteucci
359
1
0
06 Apr 2025
SLOs-Serve: Optimized Serving of Multi-SLO LLMs
Siyuan Chen
Zhipeng Jia
S. Khan
Arvind Krishnamurthy
Phillip B. Gibbons
253
11
0
05 Apr 2025
A Perplexity and Menger Curvature-Based Approach for Similarity Evaluation of Large Language Models
Yuantao Zhang
Zhankui Yang
AAML
319
0
0
05 Apr 2025
Scaling Analysis of Interleaved Speech-Text Language Models
Gallil Maimon
Michael Hassid
Amit Roth
Yossi Adi
AuLLM
461
6
0
03 Apr 2025
When Reasoning Meets Compression: Understanding the Effects of LLMs Compression on Large Reasoning Models
Nan Zhang
Eugene Kwek
Yusen Zhang
Ngoc-Hieu Nguyen
Prasenjit Mitra
Rui Zhang
MQ
LRM
568
8
0
02 Apr 2025
HERA: Hybrid Edge-cloud Resource Allocation for Cost-Efficient AI Agents
Shiyi Liu
Haiying Shen
Shuai Che
Mahdi Ghandi
Mingqin Li
LLMAG
381
4
0
01 Apr 2025
Short-PHD: Detecting Short LLM-generated Text with Topological Data Analysis After Off-topic Content Insertion
Dongjun Wei
Minjia Mao
Xiao Fang
Michael Chau
DeLMO
293
3
0
01 Apr 2025
WinoWhat: A Parallel Corpus of Paraphrased WinoGrande Sentences with Common Sense Categorization
I. Gevers
Victor De Marez
Luna De Bruyne
Walter Daelemans
320
2
0
31 Mar 2025
Model Hemorrhage and the Robustness Limits of Large Language Models
Ziyang Ma
Hui Yuan
Guang Dai
Gui-Song Xia
Bo Du
Liangpei Zhang
Dacheng Tao
319
1
0
31 Mar 2025
PIM-LLM: A High-Throughput Hybrid PIM Architecture for 1-bit LLMs
Jinendra Malekar
Peyton S. Chandarana
Md Hasibul Amin
Mohammed E. Elbtity
Ramtin Zand
187
2
0
31 Mar 2025
Whisper-LM: Improving ASR Models with Language Models for Low-Resource Languages
Xabier de Zuazo
Eva Navas
Ibon Saratxaga
Inma Hernáez Rioja
328
4
0
30 Mar 2025
Leaking LoRa: An Evaluation of Password Leaks and Knowledge Storage in Large Language Models
Ryan Marinelli
Magnus Eckhoff
PILM
190
1
0
29 Mar 2025
Monte Carlo Sampling for Analyzing In-Context Examples
S. Schoch
Yangfeng Ji
220
1
0
27 Mar 2025
TempTest: Local Normalization Distortion and the Detection of Machine-generated Text
International Conference on Artificial Intelligence and Statistics (AISTATS), 2025
Tom Kempton
Stuart Burrell
Connor Cheverall
DeLMO
279
2
0
26 Mar 2025
Rethinking Vision-Language Model in Face Forensics: Multi-Modal Interpretable Forged Face Detector
Computer Vision and Pattern Recognition (CVPR), 2025
Xiao Guo
Xiufeng Song
Yue Zhang
Xiaohong Liu
Xuyang Liu
438
30
0
26 Mar 2025
CubeRobot: Grounding Language in Rubik's Cube Manipulation via Vision-Language Model
The Web Conference (WWW), 2025
Feiyang Wang
Xiaomin Yu
Wangyu Wu
LM&Ro
261
4
0
25 Mar 2025
Oaken: Fast and Efficient LLM Serving with Online-Offline Hybrid KV Cache Quantization
International Symposium on Computer Architecture (ISCA), 2025
Minsu Kim
Seongmin Hong
RyeoWook Ko
S. Choi
Hunjong Lee
Junsoo Kim
Joo-Young Kim
Jongse Park
309
9
0
24 Mar 2025
CEFW: A Comprehensive Evaluation Framework for Watermark in Large Language Models
Shuhao Zhang
B. Cheng
Jiale Han
Yuli Chen
Zhixuan Wu
Changbao Li
Pingli Gu
WaLM
292
0
0
24 Mar 2025
Maximum Redundancy Pruning: A Principle-Driven Layerwise Sparsity Allocation for LLMs
Chang Gao
Kang Zhao
Runqi Wang
Jianfei Chen
Liping Jing
306
1
0
24 Mar 2025
ExpertRAG: Efficient RAG with Mixture of Experts -- Optimizing Context Retrieval for Adaptive LLM Responses
Esmail Gumaan
MoE
337
2
0
23 Mar 2025
ComfyGPT: A Self-Optimizing Multi-Agent System for Comprehensive ComfyUI Workflow Generation
Oucheng Huang
Yuhang Ma
Zeng Zhao
Mingrui Wu
Jinfa Huang
Rongsheng Zhang
Zhibo Hu
Xiaoshuai Sun
Rongrong Ji
311
3
0
22 Mar 2025
Large Language Model Compression via the Nested Activation-Aware Decomposition
Jun Lu
Tianyi Xu
Bill Ding
David Li
Yu Kang
250
1
0
21 Mar 2025
NdLinear: Preserving Multi-Dimensional Structure for Parameter-Efficient Neural Networks
Alex Reneau
Jerry Yao-Chieh Hu
Zhongfang Zhuang
Ting-Chun Liu
Xiang He
Judah Goldfeder
Nadav Timor
Allen Roush
Ravid Shwartz-Ziv
HAI
457
0
0
21 Mar 2025
REVAL: A Comprehension Evaluation on Reliability and Values of Large Vision-Language Models
Jie M. Zhang
Zheng Yuan
Ziyi Wang
Bei Yan
Sibo Wang
Xiangkui Cao
Zonghui Guo
Shiguang Shan
Xilin Chen
ELM
355
3
0
20 Mar 2025
Disentangling Fine-Tuning from Pre-Training in Visual Captioning with Hybrid Markov Logic
BigData Congress [Services Society] (BSS), 2024
Monika Shah
Somdeb Sarkhel
Deepak Venugopal
MLLM
BDL
VLM
335
1
0
18 Mar 2025
MMR: A Large-scale Benchmark Dataset for Multi-target and Multi-granularity Reasoning Segmentation
International Conference on Learning Representations (ICLR), 2025
Donggon Jang
Yucheol Cho
Suin Lee
Taehyeon Kim
Dae-Shik Kim
VLM
269
19
0
18 Mar 2025
ClusComp: A Simple Paradigm for Model Compression and Efficient Finetuning
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Baohao Liao
Christian Herold
Seyyed Hadi Hashemi
Stefan Vasilev
Shahram Khadivi
Christof Monz
MQ
391
1
0
17 Mar 2025
HIS-GPT: Towards 3D Human-In-Scene Multimodal Understanding
Jiahe Zhao
Ruibing Hou
Zejie Tian
Hong Chang
Shiguang Shan
402
0
0
17 Mar 2025
ROMA: a Read-Only-Memory-based Accelerator for QLoRA-based On-Device LLM
Wenqiang Wang
Yijia Zhang
Zikai Zhang
Guanting Huo
Hao Liang
Shijie Cao
Ningyi Xu
298
1
0
17 Mar 2025
AccelGen: Heterogeneous SLO-Guaranteed High-Throughput LLM Inference Serving for Diverse Applications
Haiying Shen
Tanmoy Sen
280
2
0
17 Mar 2025
ZO2: Scalable Zeroth-Order Fine-Tuning for Extremely Large Language Models with Limited GPU Memory
Liangyu Wang
Jie Ren
Hang Xu
Junxiao Wang
Huanyi Xie
David E. Keyes
Di Wang
381
3
0
16 Mar 2025
SVD-LLM V2: Optimizing Singular Value Truncation for Large Language Model Compression
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Xin Wang
Samiul Alam
Zhongwei Wan
Mengqi Li
Hao Fei
MQ
277
35
0
16 Mar 2025
PIPO: Pipelined Offloading for Efficient Inference on Consumer Devices
Yangyijian Liu
Jun Yu Li
Wu-Jun Li
304
0
0
15 Mar 2025
Text Compression for Efficient Language Generation
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
David Gu
Peter Belcak
Roger Wattenhofer
251
1
0
14 Mar 2025
Towards Extreme Pruning of LLMs with Plug-and-Play Mixed Sparsity
Chi Xu
Gefei Zhang
Yantong Zhu
Luca Benini
Guosheng Hu
Yawei Li
Zhihong Zhang
193
1
0
14 Mar 2025