Home Knowledge Base Semiconductor Manufacturing Process Yield Modeling: Mathematical Foundations

Semiconductor Manufacturing Process Yield Modeling: Mathematical Foundations

1. Overview

Yield modeling in semiconductor manufacturing is the mathematical framework for predicting the fraction of functional dies on a wafer. Since fabrication involves hundreds of process steps where defects can occur, accurate yield prediction is critical for:

2. Fundamental Definitions

Yield ($Y$) is defined as:

$$ Y = \frac{\text{Number of good dies}}{\text{Total dies on wafer}} $$

The mathematical challenge involves relating yield to:

3. The Poisson Model (Baseline)

The simplest model assumes defects are randomly and uniformly distributed across the wafer.

3.1 Basic Equation

$$ Y = e^{-AD} $$

Where:

3.2 Mathematical Derivation

If defects follow a Poisson distribution with mean $\lambda = AD$, the probability of zero defects (functional die) is:

$$ P(X = 0) = \frac{e^{-\lambda} \lambda^0}{0!} = e^{-AD} $$

3.3 Limitations

4. Defect Clustering Models

Real defects cluster due to:

4.1 Murphy's Model (1964)

Assumes defect density is uniformly distributed between $0$ and $2D_0$:

$$ Y = \frac{1 - e^{-2AD_0}}{2AD_0} $$

For large $AD_0$, this approximates to:

$$ Y \approx \frac{1}{2AD_0} $$

4.2 Seeds' Model

Assumes exponential distribution of defect density:

$$ Y = e^{-\sqrt{AD}} $$

4.3 Negative Binomial Model (Industry Standard)

This is the most widely used model in semiconductor manufacturing.

4.3.1 Main Equation

$$ Y = \left(1 + \frac{AD}{\alpha}\right)^{-\alpha} $$

Where $\alpha$ is the clustering parameter:

4.3.2 Mathematical Origin

The negative binomial arises from a compound Poisson process:

1. Let $X \sim \text{Poisson}(\lambda)$ be the defect count 2. Let $\lambda \sim \text{Gamma}(\alpha, \beta)$ be the varying rate 3. Marginalizing over $\lambda$ gives $X \sim \text{Negative Binomial}$

The probability mass function is:

$$ P(X = k) = \binom{k + \alpha - 1}{k} \left(\frac{\beta}{\beta + 1}\right)^\alpha \left(\frac{1}{\beta + 1}\right)^k $$

The yield (probability of zero defects) becomes:

$$ Y = P(X = 0) = \left(\frac{\beta}{\beta + 1}\right)^\alpha = \left(1 + \frac{AD}{\alpha}\right)^{-\alpha} $$

4.4 Model Comparison

At $AD = 1$:

ModelYield
Poisson36.8%
Murphy43.2%
Negative Binomial ($\alpha = 2$)57.7%
Negative Binomial ($\alpha = 1$)50.0%
Seeds36.8%

5. Critical Area Analysis

Not all die area is equally sensitive to defects. Critical area ($A_c$) is the region where a defect of given size causes failure.

5.1 Definition

For a defect of radius $r$:

5.2 Stapper's Critical Area Model

For parallel lines of width $w$, spacing $s$, and length $l$:

$$ A_c(r) = \begin{cases} 0 & \text{if } r < \frac{s}{2} \\[8pt] 2l\left(r - \frac{s}{2}\right) & \text{if } \frac{s}{2} \leq r < \frac{w+s}{2} \\[8pt] lw & \text{if } r \geq \frac{w+s}{2} \end{cases} $$

5.3 Integration Over Defect Size Distribution

The total critical area integrates over the defect size distribution $f(r)$:

$$ A_c = \int_0^\infty A_c(r) \cdot f(r) \, dr $$

Common distributions for $f(r)$:

5.4 Yield with Critical Area

$$ Y = \exp\left(-\int_0^\infty A_c(r) \cdot D(r) \, dr\right) $$

6. Yield Decomposition

Total yield is typically factored into independent components:

$$ Y_{\text{total}} = Y_{\text{gross}} \times Y_{\text{random}} \times Y_{\text{parametric}} $$

6.1 Component Definitions

ComponentDescriptionTypical Range
$Y_{\text{gross}}$Catastrophic defects, edge loss, handling damage95–99%
$Y_{\text{random}}$Random particle defects (main focus of yield modeling)70–95%
$Y_{\text{parametric}}$Process variation causing spec failures90–99%

6.2 Extended Decomposition

For more detailed analysis:

$$ Y_{\text{total}} = Y_{\text{gross}} \times \prod_{i=1}^{N_{\text{layers}}} Y_{\text{random},i} \times \prod_{j=1}^{M_{\text{params}}} Y_{\text{param},j} $$

7. Parametric Yield Modeling

Dies may function but fail to meet performance specifications due to process variation.

7.1 Single Parameter Model

If parameter $X \sim \mathcal{N}(\mu, \sigma^2)$ with specification limits $[L, U]$:

$$ Y_p = \Phi\left(\frac{U - \mu}{\sigma}\right) - \Phi\left(\frac{L - \mu}{\sigma}\right) $$

Where $\Phi(\cdot)$ is the standard normal cumulative distribution function:

$$ \Phi(z) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{z} e^{-t^2/2} \, dt $$

7.2 Process Capability Indices

7.2.1 Cp (Process Capability)

$$ C_p = \frac{USL - LSL}{6\sigma} $$

7.2.2 Cpk (Process Capability Index)

$$ C_{pk} = \min\left(\frac{USL - \mu}{3\sigma}, \frac{\mu - LSL}{3\sigma}\right) $$

7.3 Cpk to Yield Conversion

$C_{pk}$Sigma LevelYieldDPMO
0.3368.27%317,300
0.6795.45%45,500
1.0099.73%2,700
1.3399.9937%63
1.6799.999943%0.57
2.0099.9999998%0.002

7.4 Multiple Correlated Parameters

For $n$ parameters with mean vector $\boldsymbol{\mu}$ and covariance matrix $\boldsymbol{\Sigma}$:

$$ Y_p = \int \int \cdots \int_{\mathcal{R}} \frac{1}{(2\pi)^{n/2}|\boldsymbol{\Sigma}|^{1/2}} \exp\left(-\frac{1}{2}(\mathbf{x}-\boldsymbol{\mu})^T \boldsymbol{\Sigma}^{-1}(\mathbf{x}-\boldsymbol{\mu})\right) d\mathbf{x} $$

Where $\mathcal{R}$ is the specification region.

Computational Methods:

8. Spatial Yield Models

Modern fabs analyze spatial patterns using wafer maps to identify systematic issues.

8.1 Radial Defect Density Model

Accounts for edge effects:

$$ D(r) = D_0 + D_1 r^2 $$

Where:

8.2 General Spatial Model

$$ D(x, y) = D_0 + \sum_{i} \beta_i \phi_i(x, y) $$

Where $\phi_i(x, y)$ are spatial basis functions (e.g., Zernike polynomials).

8.3 Spatial Autocorrelation (Moran's I)

$$ I = \frac{n \sum_i \sum_j w_{ij}(Z_i - \bar{Z})(Z_j - \bar{Z})}{W \sum_i (Z_i - \bar{Z})^2} $$

Where:

Interpretation:

8.4 Variogram Analysis

The semi-variogram $\gamma(h)$ measures spatial dependence:

$$ \gamma(h) = \frac{1}{2|N(h)|} \sum_{(i,j) \in N(h)} (Z_i - Z_j)^2 $$

Where $N(h)$ is the set of die pairs separated by distance $h$.

9. Multi-Layer Yield

Modern ICs have many process layers, each contributing to yield loss.

9.1 Independent Layers

$$ Y_{\text{total}} = \prod_{i=1}^{N} Y_i = \prod_{i=1}^{N} \left(1 + \frac{A_i D_i}{\alpha_i}\right)^{-\alpha_i} $$

9.2 Simplified Model

If defects are independent across layers with similar clustering:

$$ Y = \left(1 + \frac{A \cdot D_{\text{total}}}{\alpha}\right)^{-\alpha} $$

Where:

$$ D_{\text{total}} = \sum_{i=1}^{N} D_i $$

9.3 Layer-Specific Critical Areas

$$ Y = \prod_{i=1}^{N} \exp\left(-A_{c,i} \cdot D_i\right) $$

For Poisson model, or:

$$ Y = \prod_{i=1}^{N} \left(1 + \frac{A_{c,i} D_i}{\alpha_i}\right)^{-\alpha_i} $$

For negative binomial.

10. Yield Learning Curves

Yield improves over time as processes mature and defect sources are eliminated.

10.1 Exponential Learning Model

$$ D(t) = D_\infty + (D_0 - D_\infty)e^{-t/\tau} $$

Where:

10.2 Power Law (Wright's Learning Curve)

$$ D(n) = D_1 \cdot n^{-b} $$

Where:

10.3 Yield vs. Time

Combining with yield model:

$$ Y(t) = \left(1 + \frac{A \cdot D(t)}{\alpha}\right)^{-\alpha} $$

11. Yield-Redundancy Models (Memory)

Memory arrays use redundant rows/columns for defect tolerance through laser repair or electrical fusing.

11.1 Poisson Model with Redundancy

If a memory has $R$ spare elements and defects follow Poisson:

$$ Y_{\text{repaired}} = \sum_{k=0}^{R} \frac{(AD)^k e^{-AD}}{k!} $$

This is the CDF of the Poisson distribution:

$$ Y_{\text{repaired}} = \frac{\Gamma(R+1, AD)}{\Gamma(R+1)} = \frac{\gamma(R+1, AD)}{R!} $$

Where $\gamma(\cdot, \cdot)$ is the lower incomplete gamma function.

11.2 Negative Binomial Model with Redundancy

$$ Y_{\text{repaired}} = \sum_{k=0}^{R} \binom{k+\alpha-1}{k} \left(\frac{\alpha}{\alpha + AD}\right)^\alpha \left(\frac{AD}{\alpha + AD}\right)^k $$

11.3 Repair Coverage Factor

$$ Y_{\text{repaired}} = Y_{\text{base}} + (1 - Y_{\text{base}}) \cdot RC $$

Where $RC$ is the repair coverage (fraction of defective dies that can be repaired).

12. Statistical Estimation

12.1 Maximum Likelihood Estimation for Negative Binomial

Given wafer data with $n_i$ dies and $k_i$ failures per wafer $i$:

Likelihood function:

$$ \mathcal{L}(D, \alpha) = \prod_{i=1}^{W} \binom{n_i}{k_i} (1-Y)^{k_i} Y^{n_i - k_i} $$

Log-likelihood:

$$ \ell(D, \alpha) = \sum_{i=1}^{W} \left[ \ln\binom{n_i}{k_i} + k_i \ln(1-Y) + (n_i - k_i) \ln Y \right] $$

Estimation: Requires iterative numerical methods:

12.2 Bayesian Estimation

With prior distributions $P(D)$ and $P(\alpha)$:

$$ P(D, \alpha \mid \text{data}) \propto P(\text{data} \mid D, \alpha) \cdot P(D) \cdot P(\alpha) $$

Common priors:

12.3 Model Selection

Use information criteria to compare models:

Akaike Information Criterion (AIC):

$$ AIC = -2\ln(\mathcal{L}) + 2k $$

Bayesian Information Criterion (BIC):

$$ BIC = -2\ln(\mathcal{L}) + k\ln(n) $$

Where $k$ = number of parameters, $n$ = sample size.

13. Economic Model

13.1 Die Cost

$$ \text{Cost}_{\text{die}} = \frac{\text{Cost}_{\text{wafer}}}{N_{\text{dies}} \times Y} $$

13.2 Dies Per Wafer

Accounting for edge exclusion (dies must fit entirely within usable area):

$$ N \approx \frac{\pi D_w^2}{4A} - \frac{\pi D_w}{\sqrt{2A}} $$

Where:

More accurate formula:

$$ N = \frac{\pi (D_w/2 - E)^2}{A} \cdot \eta $$

Where:

13.3 Cost Sensitivity Analysis

Marginal cost impact of yield change:

$$ \frac{\partial \text{Cost}_{\text{die}}}{\partial Y} = -\frac{\text{Cost}_{\text{wafer}}}{N \cdot Y^2} $$

13.4 Break-Even Analysis

Minimum yield for profitability:

$$ Y_{\text{min}} = \frac{\text{Cost}_{\text{wafer}}}{N \cdot \text{Price}_{\text{die}}} $$

14. Key Models

14.1 Yield Models Comparison

ModelFormulaBest Application
Poisson$Y = e^{-AD}$Lower bound estimate, theoretical baseline
Murphy$Y = \frac{1-e^{-2AD}}{2AD}$Moderate clustering
Seeds$Y = e^{-\sqrt{AD}}$Exponential clustering
Negative Binomial$Y = \left(1 + \frac{AD}{\alpha}\right)^{-\alpha}$Industry standard, tunable clustering
Critical Area$Y = e^{-\int A_c(r)D(r)dr}$Layout-aware prediction

14.2 Key Parameters

ParameterSymbolTypical RangeDescription
Defect Density$D$0.01–1 /cm²Defects per unit area
Die Area$A$10–800 mm²Size of single chip
Clustering Parameter$\alpha$0.5–5Degree of defect clustering
Learning Rate$b$0.2–0.4Yield improvement rate

14.3 Quick Reference Equations

Basic yield: $$Y = e^{-AD}$$

Industry standard: $$Y = \left(1 + \frac{AD}{\alpha}\right)^{-\alpha}$$

Total yield: $$Y_{\text{total}} = Y_{\text{gross}} \times Y_{\text{random}} \times Y_{\text{parametric}}$$

Die cost: $$\text{Cost}_{\text{die}} = \frac{\text{Cost}_{\text{wafer}}}{N \times Y}$$

Practical Implementation Workflow

1. Data Collection

2. Parameter Estimation

3. Spatial Analysis

4. Parametric Analysis

5. Model Integration

6. Trend Monitoring

7. Cost Optimization

yield modelingproduction yielddefect densitydie yieldwafer yieldyield management

Explore 500+ Semiconductor & AI Topics

From EUV lithography to CUDA optimization — search the full knowledge base or chat with our AI assistant.