Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2308.12219
Cited By
Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning
23 August 2023
Jiasheng Ye
Zaixiang Zheng
Yu Bao
Lihua Qian
Quanquan Gu
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning"
27 / 27 papers shown
Title
Target Concrete Score Matching: A Holistic Framework for Discrete Diffusion
Ruixiang Zhang
Shuangfei Zhai
Yizhe Zhang
James Thornton
Zijing Ou
Joshua M. Susskind
Navdeep Jaitly
DiffM
25
0
0
23 Apr 2025
Diffusion Models for Tabular Data: Challenges, Current Progress, and Future Directions
Zhong Li
Qi Huang
Lincen Yang
Jiayang Shi
Zhao Yang
N. V. Stein
Thomas Bäck
M. Leeuwen
DiffM
37
0
0
24 Feb 2025
Theoretical Benefit and Limitation of Diffusion Language Model
Guhao Feng
Yihan Geng
Jian-Yu Guan
Wei Yu Wu
Liwei Wang
Di He
DiffM
41
0
0
13 Feb 2025
Scaling up Masked Diffusion Models on Text
Shen Nie
Fengqi Zhu
Chao Du
Tianyu Pang
Qian Liu
Guangtao Zeng
Min-Bin Lin
Chongxuan Li
AI4CE
28
13
0
24 Oct 2024
Scaling Diffusion Language Models via Adaptation from Autoregressive Models
Shansan Gong
Shivam Agarwal
Yizhe Zhang
Jiacheng Ye
Lin Zheng
...
Peilin Zhao
W. Bi
Jiawei Han
Hao Peng
Lingpeng Kong
AI4CE
46
14
0
23 Oct 2024
Latent Diffusion Models for Controllable RNA Sequence Generation
Kaixuan Huang
Yukang Yang
Kaidi Fu
Yanyi Chu
Le Cong
Mengdi Wang
29
1
0
15 Sep 2024
Guided Discrete Diffusion for Electronic Health Record Generation
Jun Han
Zixiang Chen
Yongqian Li
Yiwen Kou
Eran Halperin
Robert E. Tillman
Quanquan Gu
MedIm
DiffM
29
6
0
18 Apr 2024
Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models
Jiacheng Ye
Shansan Gong
Liheng Chen
Lin Zheng
Jiahui Gao
...
Chuan Wu
Xin Jiang
Zhenguo Li
Wei Bi
Lingpeng Kong
DiffM
LRM
AI4CE
30
12
0
12 Feb 2024
Transfer Learning for Text Diffusion Models
Kehang Han
Kathleen Kenealy
Aditya Barua
Noah Fiedel
Noah Constant
VLM
AI4CE
62
3
0
30 Jan 2024
Fast Sampling via Discrete Non-Markov Diffusion Models
Zixiang Chen
Huizhuo Yuan
Yongqian Li
Yiwen Kou
Junkai Zhang
Quanquan Gu
DiffM
21
5
0
14 Dec 2023
AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation
Tong Wu
Zhihao Fan
Xiao Liu
Yeyun Gong
Yelong Shen
...
Juntao Li
Zhongyu Wei
Jian Guo
Nan Duan
Weizhu Chen
VLM
71
22
0
16 May 2023
Directed Acyclic Transformer Pre-training for High-quality Non-autoregressive Text Generation
Fei Huang
Pei Ke
Minlie Huang
AI4CE
27
7
0
24 Apr 2023
Sparks of Artificial General Intelligence: Early experiments with GPT-4
Sébastien Bubeck
Varun Chandrasekaran
Ronen Eldan
J. Gehrke
Eric Horvitz
...
Scott M. Lundberg
Harsha Nori
Hamid Palangi
Marco Tulio Ribeiro
Yi Zhang
ELM
AI4MH
AI4CE
ALM
197
2,232
0
22 Mar 2023
DINOISER: Diffused Conditional Sequence Learning by Manipulating Noises
Jiasheng Ye
Zaixiang Zheng
Yu Bao
Lihua Qian
Mingxuan Wang
DiffM
16
44
0
20 Feb 2023
What Language Model to Train if You Have One Million GPU Hours?
Teven Le Scao
Thomas Wang
Daniel Hesslow
Lucile Saulnier
Stas Bekman
...
Lintang Sutawika
Jaesung Tae
Zheng-Xin Yong
Julien Launay
Iz Beltagy
MoE
AI4CE
212
103
0
27 Oct 2022
Language Models are Multilingual Chain-of-Thought Reasoners
Freda Shi
Mirac Suzgun
Markus Freitag
Xuezhi Wang
Suraj Srivats
...
Yi Tay
Sebastian Ruder
Denny Zhou
Dipanjan Das
Jason W. Wei
ReLM
LRM
160
320
0
06 Oct 2022
GLM-130B: An Open Bilingual Pre-trained Model
Aohan Zeng
Xiao Liu
Zhengxiao Du
Zihan Wang
Hanyu Lai
...
Jidong Zhai
Wenguang Chen
Peng-Zhen Zhang
Yuxiao Dong
Jie Tang
BDL
LRM
237
840
0
05 Oct 2022
A Survey on Generative Diffusion Model
Hanqun Cao
Cheng Tan
Zhangyang Gao
Yilun Xu
Guangyong Chen
Pheng-Ann Heng
Stan Z. Li
MedIm
29
195
0
06 Sep 2022
Diffusion-LM Improves Controllable Text Generation
Xiang Lisa Li
John Thickstun
Ishaan Gulrajani
Percy Liang
Tatsunori B. Hashimoto
AI4CE
163
537
0
27 May 2022
Large Language Models are Zero-Shot Reasoners
Takeshi Kojima
S. Gu
Machel Reid
Yutaka Matsuo
Yusuke Iwasawa
ReLM
LRM
291
2,712
0
24 May 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
313
8,261
0
28 Jan 2022
Multitask Prompted Training Enables Zero-Shot Task Generalization
Victor Sanh
Albert Webson
Colin Raffel
Stephen H. Bach
Lintang Sutawika
...
T. Bers
Stella Biderman
Leo Gao
Thomas Wolf
Alexander M. Rush
LRM
203
1,651
0
15 Oct 2021
Larger-Scale Transformers for Multilingual Masked Language Modeling
Naman Goyal
Jingfei Du
Myle Ott
Giridhar Anantharaman
Alexis Conneau
88
125
0
02 May 2021
Argmax Flows and Multinomial Diffusion: Learning Categorical Distributions
Emiel Hoogeboom
Didrik Nielsen
P. Jaini
Patrick Forré
Max Welling
DiffM
191
392
0
10 Feb 2021
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
220
3,054
0
23 Jan 2020
Efficient Estimation of Word Representations in Vector Space
Tomáš Mikolov
Kai Chen
G. Corrado
J. Dean
3DV
228
29,632
0
16 Jan 2013
1