Diffusion Language Modeling with LoRA-Adapted Diffusion (LAD)

Note: Paper available here, and more info and demo's here!

Overview

This project introduces a diffusion-style language model built upon LLaMA 3.1 8B, fine-tuned using LoRA adapters and a unique (de-)noising schedule. Unlike conventional autoregressive models, this approach employs diffusion to generate text. And unlike conventional diffusion, this approach enables noiseless refinement.

🔍 Key Features

Diffusion-based generation: Forward corruption and reverse denoising for generation.
Unique noising schedule: The novel noising implementation enables the model to denoise without remasking.
Scalable test time compute: Improved output quality by increasing the number of diffusion iterations.
LoRA fine-tuning: Efficient adaptation using Low-Rank Adapters enables training within hours on a single GPU.

🚀 Demo

👉 Try the Demo on Hugging Face Spaces

📊 Preliminary Benchmark Results

Benchmark	Score
ARC-Easy	88.5
MMLU	60.5
ARC-Challenge	81.0
HellaSwag	70.0

These preliminary results were obtained from a randomly selected subset of 200 samples of each benchmark

🧪 Sample Output

Prompt: What is the capital of France?

🔧 Settings

Disable Intermediate Noising: Speeds up convergence by skipping the noising step between iterations. Works best for short, factual questions.
Iterations: Number of refinement steps. More iterations means more time to refine the answer.
Pause Between Steps: Slows down the process so you can visually follow the changes.

🖍️ Visualization

Red tokens: Masked (noised) tokens that will be regenerated.
Green tokens: Newly generated tokens compared to the previous step.

🧪 Example Prompt

For noiseless diffusion, try short questions like:

What's the capital of France?

For more in-depth questions, enable intermediate noising. Increasing the number of iterations generally improves answer quality.

What do you know about Amsterdam?

See how low you can go with the number of iterations while still receiving adequate answers!

Name		Name	Last commit message	Last commit date
Latest commit History 63 Commits
.github/workflows		.github/workflows
.nojekyll		.nojekyll
README.md		README.md
_config.yml		_config.yml
figure_lad_noremask.png		figure_lad_noremask.png
figure_lad_remask.png		figure_lad_remask.png
index.html		index.html
style.css		style.css

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Diffusion Language Modeling with LoRA-Adapted Diffusion (LAD)

Overview

🔍 Key Features

🚀 Demo

📊 Preliminary Benchmark Results

🧪 Sample Output

🔧 Settings

🖍️ Visualization

🧪 Example Prompt

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Diffusion Language Modeling with LoRA-Adapted Diffusion (LAD)

Overview

🔍 Key Features

🚀 Demo

📊 Preliminary Benchmark Results

🧪 Sample Output

🔧 Settings

🖍️ Visualization

🧪 Example Prompt

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages