Skip to main content

Answer ZH

必備知識

  1. 共變異數的定義:對於平均數為 μ\mu 的隨機變數 xx,其共變異數為 E[(xμ)(xμ)T]=E[xxT]μμT\mathbb{E}[(x - \mu)(x - \mu)^T] = \mathbb{E}[xx^T] - \mu\mu^T
  2. 平移定理 (Steiner's Translation Theorem) 或等價性質cov(x)=E[xxT]E[x]E[x]T\text{cov}(x) = \mathbb{E}[x x^T] - \mathbb{E}[x]\mathbb{E}[x]^T
  3. 核函數的性質
    • k~(u)du=1\int \tilde{k}(u) du = 1
    • uk~(u)du=0\int u \tilde{k}(u) du = 0 (零平均數)。
    • uuTk~(u)du=H\int u u^T \tilde{k}(u) du = H (來自公式 5.7,因為平均數為 0)。

逐步證明

μ^\hat{\mu}p^(x)\hat{p}(x) 的平均數(已在 (a) 部分推導出)。共變異數定義為:

Σ^=Ep^[(xμ^)(xμ^)T]=(xμ^)(xμ^)Tp^(x)dx\hat{\Sigma} = \mathbb{E}_{\hat{p}}[(x - \hat{\mu})(x - \hat{\mu})^T] = \int (x - \hat{\mu})(x - \hat{\mu})^T \hat{p}(x) dx

或者使用性質 cov(x)=E[xxT]E[x]E[x]T\text{cov}(x) = \mathbb{E}[xx^T] - \mathbb{E}[x]\mathbb{E}[x]^T

Σ^=xxTp^(x)dxμ^μ^T\hat{\Sigma} = \int x x^T \hat{p}(x) dx - \hat{\mu}\hat{\mu}^T

我們先計算二階動差項 xxTp^(x)dx\int x x^T \hat{p}(x) dx

  1. 代入 p^(x)\hat{p}(x)

    xxT(1ni=1nk~(xxi))dx=1ni=1nxxTk~(xxi)dx\int x x^T \left( \frac{1}{n} \sum_{i=1}^n \tilde{k}(x - x_i) \right) dx = \frac{1}{n} \sum_{i=1}^n \int x x^T \tilde{k}(x - x_i) dx
  2. 變數變換:u=xxiu = x - x_i,則 x=u+xix = u + x_i

    (u+xi)(u+xi)Tk~(u)du\int (u + x_i)(u + x_i)^T \tilde{k}(u) du

    展開 (u+xi)(u+xi)T=uuT+uxiT+xiuT+xixiT(u + x_i)(u + x_i)^T = uu^T + ux_i^T + x_i u^T + x_i x_i^T

  3. 逐項計算積分:

    (uuT+uxiT+xiuT+xixiT)k~(u)du\int (uu^T + ux_i^T + x_i u^T + x_i x_i^T) \tilde{k}(u) du
    • uuTk~(u)du=H\int uu^T \tilde{k}(u) du = H (根據零平均數的核函數共變異數定義)。
    • uxiTk~(u)du=(uk~(u)du)xiT=0xiT=0\int u x_i^T \tilde{k}(u) du = (\int u \tilde{k}(u) du) x_i^T = 0 \cdot x_i^T = 0
    • xiuTk~(u)du=xi(uTk~(u)du)=xi0=0\int x_i u^T \tilde{k}(u) du = x_i (\int u^T \tilde{k}(u) du) = x_i \cdot 0 = 0
    • xixiTk~(u)du=xixiTk~(u)du=xixiT1=xixiT\int x_i x_i^T \tilde{k}(u) du = x_i x_i^T \int \tilde{k}(u) du = x_i x_i^T \cdot 1 = x_i x_i^T

    因此,xxTk~(xxi)dx=H+xixiT\int x x^T \tilde{k}(x - x_i) dx = H + x_i x_i^T

  4. 加總結果:

    Ep^[xxT]=1ni=1n(H+xixiT)=H+1ni=1nxixiT\mathbb{E}_{\hat{p}}[x x^T] = \frac{1}{n} \sum_{i=1}^n (H + x_i x_i^T) = H + \frac{1}{n} \sum_{i=1}^n x_i x_i^T
  5. 計算共變異數:

    Σ^=Ep^[xxT]μ^μ^T\hat{\Sigma} = \mathbb{E}_{\hat{p}}[x x^T] - \hat{\mu}\hat{\mu}^T Σ^=H+1ni=1nxixiTμ^μ^T\hat{\Sigma} = H + \frac{1}{n} \sum_{i=1}^n x_i x_i^T - \hat{\mu}\hat{\mu}^T
  6. 整理為樣本共變異數形式: 回想樣本共變異數為 S=1n(xiμ^)(xiμ^)T=(1nxixiT)μ^μ^TS = \frac{1}{n}\sum (x_i - \hat{\mu})(x_i - \hat{\mu})^T = (\frac{1}{n}\sum x_i x_i^T) - \hat{\mu}\hat{\mu}^T。 因此:

    Σ^=H+(1ni=1nxixiTμ^μ^T)\hat{\Sigma} = H + \left( \frac{1}{n} \sum_{i=1}^n x_i x_i^T - \hat{\mu}\hat{\mu}^T \right) Σ^=H+1ni=1n(xiμ^)(xiμ^)T\hat{\Sigma} = H + \frac{1}{n} \sum_{i=1}^n (x_i - \hat{\mu})(x_i - \hat{\mu})^T \quad \blacksquare