Diffusion
updated
Large Language Diffusion Models
Paper
•
2502.09992
•
Published
•
123
Block Diffusion: Interpolating Between Autoregressive and Diffusion
Language Models
Paper
•
2503.09573
•
Published
•
74
MMaDA: Multimodal Large Diffusion Language Models
Paper
•
2505.15809
•
Published
•
97
Diffusion vs. Autoregressive Language Models: A Text Embedding
Perspective
Paper
•
2505.15045
•
Published
•
54
LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning
Paper
•
2505.16933
•
Published
•
34
LaViDa: A Large Diffusion Language Model for Multimodal Understanding
Paper
•
2505.16839
•
Published
•
13
Scaling Diffusion Transformers Efficiently via μP
Paper
•
2505.15270
•
Published
•
35
Paper
•
2505.14513
•
Published
•
29
D-AR: Diffusion via Autoregressive Models
Paper
•
2505.23660
•
Published
•
34
Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed
Inference
Paper
•
2508.02193
•
Published
•
133
A Survey on Diffusion Language Models
Paper
•
2508.10875
•
Published
•
34
SparseD: Sparse Attention for Diffusion Language Models
Paper
•
2509.24014
•
Published
•
30
Sequential Diffusion Language Models
Paper
•
2509.24007
•
Published
•
45
Fast-dLLM v2: Efficient Block-Diffusion LLM
Paper
•
2509.26328
•
Published
•
55
Attention Sinks in Diffusion Language Models
Paper
•
2510.15731
•
Published
•
48
Diffusion Language Models are Super Data Learners
Paper
•
2511.03276
•
Published
•
128
LLaDA2.0: Scaling Up Diffusion Language Models to 100B
Paper
•
2512.15745
•
Published
•
77