-
Express the KDE:
The KDE is defined as p^(x)=n1∑i=1nhd1k(hx−xi).
-
Apply Variance to the KDE:
We want to compute the variance of p^(x) with respect to the random sample X={x1,…,xn}.
varX(p^(x))=varX(nhd1i=1∑nk(hx−xi))
-
Use Variance Properties for Independent Samples:
Since the samples x1,…,xn are drawn independently and identically distributed (i.i.d.), the variance of their sum is the sum of their variances: var(∑Zi)=∑var(Zi).
Also, extracting a constant c gives var(cZ)=c2var(Z).
varX(p^(x))=n2h2d1i=1∑nvarxi(k(hx−xi))
Since each xi∼p identically, all terms in the sum are equal. Let μ∼p be a dummy variable:
varX(p^(x))=n2h2dnvarμ∼p(k(hx−μ))=nh2d1varμ(k(hx−μ))
-
Apply the Variance Upper Bound (Eq 5.3):
Using the hint var(Z)≤E[Z2]:
varμ(k(hx−μ))≤Eμ[(k(hx−μ))2]
So we get:
varX(p^(x))≤nh2d1Eμ[(k(hx−μ))2]
-
Apply the Kernel Maximum Bound (Eq 5.4):
We can split the squared kernel into two terms and bound one of them using the maximum value of the kernel function. Let maxzk(z) represent the maximum output of the kernel (equivalent to maxx(k(x)) in the problem notation).
(k(hx−μ))2=k(hx−μ)⋅k(hx−μ)≤zmax(k(z))⋅k(hx−μ)
Substitute this inequality into the expectation:
Eμ[(k(hx−μ))2]≤Eμ[zmax(k(z))⋅k(hx−μ)]=zmax(k(z))⋅Eμ[k(hx−μ)]
Plugging this back into the variance inequality:
varX(p^(x))≤nh2d1zmax(k(z))Eμ[k(hx−μ)]
-
Relate Back to Expected Density E[p^(x)]:
Rearrange the right side to recover the form of E[p^(x)] derived in Part (a). We know k~(z)=hd1k(hz), so:
varX(p^(x))≤nhd1zmax(k(z))⋅Eμ[hd1k(hx−μ)]
We proved in (a) that Eμ[hd1k(hx−μ)]=EX[p^(x)]. Replacing this chunk:
varX(p^(x))≤nhd1zmax(k(z))EX[p^(x)]
Using the problem's notation maxx(k(x)) for the kernel maximum, we obtain:
varX(p^(x))≤nhd1xmax(k(x))E[p^(x)]