Pre-required Knowledge
- Definition of Expectation: The expected value of a random variable X with density p(x) is E[x]=∫xp(x)dx.
- Kernel Density Estimation (KDE): Formula (5.5).
- Properties of Integrals: Linearity (∫(af(x)+bg(x))dx=a∫f(x)dx+b∫g(x)dx).
- Change of Variables: Integration by substitution.
- Kernel Properties: The kernel k~(x) integrates to 1 (∫k~(x)dx=1) because it is a probability density, and it has zero mean (Eq 5.6).
Step-by-Step Proof
We want to calculate the expected value of the distribution p^(x).
-
Write down the definition:
Ep^[x]=∫xp^(x)dx
-
Substitute the definition of p^(x) (Eq 5.5):
Ep^[x]=∫x(n1i=1∑nk~(x−xi))dx
-
Interchange integration and summation (Linearity):
Ep^[x]=n1i=1∑n∫xk~(x−xi)dx
-
Perform change of variables:
Let u=x−xi, so x=u+xi and dx=du. As x goes from −∞ to ∞, so does u.
∫xk~(x−xi)dx=∫(u+xi)k~(u)du
-
Expand and separate the integral:
=∫uk~(u)du+∫xik~(u)du
=Ek~[u]∫uk~(u)du+xi1∫k~(u)du
-
Use Kernel Properties:
- From Eq (5.6), ∫uk~(u)du=0 (zero mean).
- Since k~ is a valid probability density function, ∫k~(u)du=1.
So,
∫xk~(x−xi)dx=0+xi⋅1=xi
-
Summate the results:
Substitute this back into the sum from step 3:
Ep^[x]=n1i=1∑nxi
This is exactly the sample mean of X.
μ^=n1i=1∑nxi■