Skip to main content

Answer

Prerequisites

  • Linearity of Expectation
  • Properties of Integrals
  • Probability Density Functions

Step-by-Step Derivation

  1. Start with the definition of the mean for the estimated distribution p^(x)\hat{p}(x): μ^=E_p^[x]=p^(x)xdx\hat{\mu} = \mathbb{E}\_{\hat{p}}[x] = \int \hat{p}(x) x dx

  2. Substitute the definition of p^(x)\hat{p}(x) from Equation (5.5): μ^=(1n_i=1nk~(xxi))xdx\hat{\mu} = \int \left( \frac{1}{n} \sum\_{i=1}^n \tilde{k}(x - x_i) \right) x dx

  3. Use the linearity of the integral to pull out the sum and the constant 1n\frac{1}{n}: μ^=1n_i=1nk~(xxi)xdx\hat{\mu} = \frac{1}{n} \sum\_{i=1}^n \int \tilde{k}(x - x_i) x dx

  4. Perform a change of variables in the integral for each term in the sum. Let u=xxiu = x - x_i, then du=dxdu = dx and x=u+xix = u + x_i: k~(xxi)xdx=k~(u)(u+xi)du\int \tilde{k}(x - x_i) x dx = \int \tilde{k}(u) (u + x_i) du

  5. Expand the expression and separate the integral into two parts: k~(u)udu+k~(u)xidu\int \tilde{k}(u) u du + \int \tilde{k}(u) x_i du

  6. Evaluate the first integral. According to Equation (5.6), the kernel has zero mean: k~(u)udu=0\int \tilde{k}(u) u du = 0

  7. Evaluate the second integral. Since k~(u)\tilde{k}(u) is a valid probability density function, it must integrate to 1: k~(u)xidu=xik~(u)du=xi1=xi\int \tilde{k}(u) x_i du = x_i \int \tilde{k}(u) du = x_i \cdot 1 = x_i

  8. Substitute these results back into the summation from step 3: μ^=1ni=1n(0+xi)=1ni=1nxi\hat{\mu} = \frac{1}{n} \sum*{i=1}^n (0 + x_i) = \frac{1}{n} \sum*{i=1}^n x_i

This completes the proof that the mean of the distribution p^(x)\hat{p}(x) is the sample mean of XX.