Problem 3.8(a) Answer
Pre-required Knowledge
-
Bernoulli Distribution: A discrete probability distribution of a random variable which takes the value 1 with probability and the value 0 with probability . The probability mass function is given by: for .
-
Independent and Identically Distributed (i.i.d.): We assume the samples in the dataset are drawn independently from the same distribution. If events and are independent, then . Generally, for independent samples , the joint probability is the product of individual probabilities:
-
Exponent Rules:
Step-by-Step Proof
-
Write down the likelihood of the dataset : Assuming the samples are i.i.d., the probability of observing the dataset given the parameter is the product of the probabilities of each individual sample.
-
Substitute the Bernoulli PDF: Substitute Eq. (3.30) () into the product.
-
Group the terms: Using the properties of exponents, we can separate the terms and the terms.
-
Apply product rule for exponents: Recall that .
-
Simplify the exponents: Let . This is the sum of the samples (number of successes/heads). The exponent for the second term is:
Here is the total number of samples.
-
Final Result: Substitute and back into the equation.
This matches Eq. (3.31).