Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2304.04052
Cited By
Decoder-Only or Encoder-Decoder? Interpreting Language Model as a Regularized Encoder-Decoder
8 April 2023
Z. Fu
W. Lam
Qian Yu
Anthony Man-Cho So
Shengding Hu
Zhiyuan Liu
Nigel Collier
AuLLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Decoder-Only or Encoder-Decoder? Interpreting Language Model as a Regularized Encoder-Decoder"
33 / 33 papers shown
Title
LLMs are All You Need? Improving Fuzz Testing for MOJO with Large Language Models
Linghan Huang
Peizhou Zhao
Huaming Chen
147
0
0
11 Oct 2025
Optimal Control for Transformer Architectures: Enhancing Generalization, Robustness and Efficiency
Kelvin Kan
Xingjian Li
Benjamin J. Zhang
Tuhin Sahai
Stanley Osher
Markos A. Katsoulakis
220
0
0
16 May 2025
VNJPTranslate: A comprehensive pipeline for Vietnamese-Japanese translation
Hoang Hai Phan
Nguyen Duc Minh Vu
Nam Dang Phuong
213
0
0
01 Apr 2025
Evaluating book summaries from internal knowledge in Large Language Models: a cross-model and semantic consistency approach
Javier Coronado-Blázquez
HILM
ELM
271
0
0
27 Mar 2025
Theoretical limitations of multi-layer Transformer
Lijie Chen
Binghui Peng
Hongxun Wu
AI4CE
451
21
0
04 Dec 2024
The Future of Intelligent Healthcare: A Systematic Analysis and Discussion on the Integration and Impact of Robots Using Large Language Models for Healthcare
Souren Pashangpour
Goldie Nejat
LM&MA
221
13
0
05 Nov 2024
Strada-LLM: Graph LLM for traffic prediction
Seyed Mohamad Moghadas
Yangxintong Lyu
Alexandre Alahi
Alexandre Alahi
AI4TS
552
4
0
28 Oct 2024
CCSBench: Evaluating Compositional Controllability in LLMs for Scientific Document Summarization
Yixi Ding
Jiaying Wu
Tongyao Zhu
Yanxia Qin
Qian Liu
Min-Yen Kan
CoGe
195
0
0
16 Oct 2024
Scaling Laws of Decoder-Only Models on the Multilingual Machine Translation Task
Conference on Machine Translation (WMT), 2024
Gaëtan Caillaut
Raheel Qader
Mariam Nakhlé
Jingshu Liu
Jean-Gabriel Barthélemy
157
4
0
23 Sep 2024
On-Device Language Models: A Comprehensive Review
Jiajun Xu
Zhiyuan Li
Wei Chen
Qun Wang
Xin Gao
Qi Cai
Ziyuan Ling
492
97
0
26 Aug 2024
Summarizing long regulatory documents with a multi-step pipeline
Mika Sie
Ruby Beek
Michiel Bots
S. Brinkkemper
Albert Gatt
AILaw
ELM
172
5
0
19 Aug 2024
Towards Achieving Human Parity on End-to-end Simultaneous Speech Translation via LLM Agent
Shanbo Cheng
Zhichao Huang
Tom Ko
Hang Li
Ningxin Peng
Lu Xu
Qini Zhang
284
11
0
31 Jul 2024
Towards Chapter-to-Chapter Context-Aware Literary Translation via Large Language Models
Linghao Jin
Li An
Xuezhe Ma
296
0
0
12 Jul 2024
Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene Synthesis
Qi Sun
Hang Zhou
Wengang Zhou
Li Li
Houqiang Li
3DPC
3DV
245
12
0
07 Jul 2024
TacoLM: GaTed Attention Equipped Codec Language Model are Efficient Zero-Shot Text to Speech Synthesizers
Yakun Song
Zhuo Chen
Xiaofei Wang
Ziyang Ma
Guanrou Yang
Xie Chen
AuLLM
116
6
0
22 Jun 2024
Investigating the translation capabilities of Large Language Models trained on parallel data only
Javier García Gilabert
Carlos Escolano
Aleix Sant Savall
Francesca de Luca Fornaciari
Audrey Mash
Xixian Liao
Maite Melero
LRM
312
2
0
13 Jun 2024
Decoder-only Streaming Transformer for Simultaneous Translation
Shoutao Guo
Shaolei Zhang
Yang Feng
295
13
0
06 Jun 2024
Efficient Encoder-Decoder Transformer Decoding for Decomposable Tasks
Bo-Ru Lu
Nikita Haduong
Chien-Yu Lin
Hao Cheng
Noah A. Smith
Mari Ostendorf
AI4CE
203
1
0
19 Mar 2024
Denoising Autoregressive Representation Learning
International Conference on Machine Learning (ICML), 2024
Yazhe Li
J. Bornschein
Ting Chen
DiffM
244
7
0
08 Mar 2024
Transformers and Language Models in Form Understanding: A Comprehensive Review of Scanned Document Analysis
Abdelrahman Abdallah
Daniel Eberharter
Zoe Pfister
Adam Jatowt
187
15
0
06 Mar 2024
A Survey of Deep Learning and Foundation Models for Time Series Forecasting
John A. Miller
Mohammed Aldosari
Farah Saeed
Nasid Habib Barna
Subas Rana
I. Arpinar
Ninghao Liu
AI4TS
AI4CE
263
48
0
25 Jan 2024
ELLA-V: Stable Neural Codec Language Modeling with Alignment-guided Sequence Reordering
AAAI Conference on Artificial Intelligence (AAAI), 2024
Ya-Zhen Song
Zhuo Chen
Xiaofei Wang
Ziyang Ma
Xie Chen
AuLLM
216
62
0
14 Jan 2024
Customizable Combination of Parameter-Efficient Modules for Multi-Task Learning
Haowen Wang
Tao Sun
Cong Fan
Jinjie Gu
MoE
200
7
0
06 Dec 2023
TPPoet: Transformer-Based Persian Poem Generation using Minimal Data and Advanced Decoding Techniques
Amir Panahandeh
Hanie Asemi
Esmail Nourani
275
2
0
04 Dec 2023
Understanding the Natural Language of DNA using Encoder-Decoder Foundation Models with Byte-level Precision
Bioinformatics Advances (BA), 2023
Aditya Malusare
Harish Kothandaraman
Dipesh Tamboli
N. Lanman
Vaneet Aggarwal
AI4CE
247
0
0
04 Nov 2023
An Introduction to Natural Language Processing Techniques and Framework for Clinical Implementation in Radiation Oncology
Reza Khanmohammadi
Mohammad Mahdi Ghassemi
Kyle Verdecchia
A. Ghanem
Luo Bing
...
H. Bagher-Ebadian
Farzan Siddiqui
Mohamed Elshaikh
B. Movsas
Kundan Thind
LM&MA
269
3
0
03 Nov 2023
ArcheType: A Novel Framework for Open-Source Column Type Annotation using Large Language Models
Proceedings of the VLDB Endowment (PVLDB), 2023
Ben Feuer
Yurong Liu
Chinmay Hegde
Juliana Freire
AI4TS
VLM
195
24
0
27 Oct 2023
MSG-BART: Multi-granularity Scene Graph-Enhanced Encoder-Decoder Language Model for Video-grounded Dialogue Generation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Hongcheng Liu
Zhe Chen
Hui Li
Pingjie Wang
Yanfeng Wang
Yu Wang
VGen
163
4
0
26 Sep 2023
OpenBA: An Open-sourced 15B Bilingual Asymmetric seq2seq Model Pre-trained from Scratch
Science China Information Sciences (Sci China Inf Sci), 2023
Juntao Li
Zecheng Tang
Yuyang Ding
Pinzheng Wang
Pei Guo
...
Wenliang Chen
Guohong Fu
Qiaoming Zhu
Guodong Zhou
Hao Fei
356
8
0
19 Sep 2023
Reward Engineering for Generating Semi-structured Explanation
Findings (Findings), 2023
Jiuzhou Han
Wray Buntine
Ehsan Shareghi
LRM
150
0
0
15 Sep 2023
On decoder-only architecture for speech-to-text and large language model integration
Automatic Speech Recognition & Understanding (ASRU), 2023
Jian Wu
Yashesh Gaur
Zhuo Chen
Long Zhou
Yilun Zhu
...
Jinyu Li
Shujie Liu
Bo Ren
Linquan Liu
Yu-Huan Wu
AuLLM
507
183
0
08 Jul 2023
BiomedGPT: A Unified and Generalist Biomedical Generative Pre-trained Transformer for Vision, Language, and Multimodal Tasks
Nature Network Boston (NNB), 2023
Kai Zhang
Jun Yu
Eashan Adhikarla
Rong Zhou
Zhilin Yan
...
Hang Zhang
Yong Chen
Shijie Zhao
Hongfang Liu
Lichao Sun
LM&MA
MedIm
302
11
0
26 May 2023
Multi-Task Instruction Tuning of LLaMa for Specific Scenarios: A Preliminary Study on Writing Assistance
Yue Zhang
Leyang Cui
Deng Cai
Xinting Huang
Tao Fang
Wei Bi
ALM
195
47
0
22 May 2023
1