Home Knowledge Base Absorbing State Diffusion

Absorbing State Diffusion

Keywords: absorbing state diffusion, generative models


Absorbing State Diffusion for text is a diffusion approach where tokens gradually transition toward a special mask token (absorbing state) — providing a natural discrete diffusion process where the forward process masks tokens with increasing probability and the reverse process learns to unmask, connecting diffusion models to masked language modeling like BERT.

What Is Absorbing State Diffusion?

Why Absorbing State Diffusion?

How It Works

Forward Process (Masking):

Transition Probabilities:

P(x_t = [MASK] | x_{t-1} = token) = β_t
P(x_t = token | x_{t-1} = token) = 1 - β_t
P(x_t = token | x_{t-1} = [MASK]) = 0  (absorbing!)

Reverse Process (Unmasking):

Connection to BERT

Similarities:

Differences:

Insight: Absorbing state diffusion generalizes BERT to iterative generation.

Training

Objective:

Training Algorithm:

1. Sample clean sequence x_0 from dataset
2. Sample timestep t ~ Uniform(1, T)
3. Mask tokens according to schedule q(t)
4. Model predicts original tokens from masked sequence
5. Compute cross-entropy loss on masked positions
6. Backpropagate and update model

Masking Schedule:

Generation (Sampling)

Iterative Unmasking:

1. Start with fully masked sequence x_T = [MASK, ..., MASK]
2. For t = T down to 1:
   a. Model predicts token probabilities for each [MASK]
   b. Sample tokens from predicted distributions
   c. Unmask some positions (according to schedule)
   d. Keep other positions masked for next iteration
3. Final x_0 is generated text

Unmasking Strategy:

Temperature:

Advantages

Natural Discrete Process:

Leverages BERT Insights:

Flexible Generation:

Controllable:

Limitations

Multiple Steps Required:

Unmasking Order:

Long-Range Dependencies:

Examples & Implementations

D3PM (Discrete Denoising Diffusion Probabilistic Models):

MDLM (Masked Diffusion Language Model):

Applications

Text Infilling:

Controlled Generation:

Text Editing:

Tools & Resources

Absorbing State Diffusion is a promising approach for discrete diffusion — by using masking as the corruption process, it provides a natural, interpretable way to apply diffusion to text that connects to successful masked language modeling, offering advantages in infilling, editing, and controllable generation while remaining simpler than continuous embedding approaches.


Source: ChipFoundryServicesSearch this topicAsk CFSGPT

absorbing state diffusiongenerative models

Explore 500+ Semiconductor & AI Topics

From EUV lithography to CUDA optimization — search the full knowledge base or chat with our AI assistant.