Answer
Pre-required Knowledge
- Linear Transformation of Gaussian: If , then follows .
- Sum of Independent Gaussians: If and are independent, then .
- Marginalization: .
Step-by-Step Answer
Part 1: Distribution of
-
Define : The latent function value is defined as a linear transformation of the parameters:
-
Apply Linear Transformation Property: We know the posterior of is . Using the linear transformation property (where is a row vector):
- Mean:
- Variance:
-
Result:
where and match the equations (3.51) and (3.52).
Part 2: Distribution of
-
Model Relationship: The observed output is the function value plus noise:
-
Sum of Independent Random Variables: We have the distribution of (from Part 1) and the distribution of (noise assumption). Since the new noise is independent of the past data (and thus ), the variable is the sum of two independent Gaussian variables.
-
Compute Moments:
- Mean:
- Variance:
-
Result:
This matches equation (3.53).