Problem 3.8(e) Explanation
Why and ?
These are linear functions.
- is a line going from to . The area under the triange is . So it is a valid PDF. It puts most weight on .
- is a line going from to . Area is also 1. It puts most weight on .
Mapping to Pseudocounts
The crucial insight is writing the polynomial in the form .
- For , exponent of is 1. Since , . Exponent of is 0. Since , .
- Usually, we say counts success and counts failures.
- However, for MAP estimates, the "neutral" point is not (which is improper) or (Uniform).
- Wait, let's look at the MAP formula again:
-
If (Prior ): Result: . This looks like we added 1 success to the numerator, and 1 trial to the denominator. So: 1 Virtual Success, 0 Virtual Failures.
-
If (Prior ): Result: . This looks like we added 0 successes to the numerator, and 1 trial to the denominator. So: 0 Virtual Successes, 1 Virtual Failure.
-
Summary of Bias
- encodes a belief that "I have seen one success already".
- encodes a belief that "I have seen one failure already".
- Uniform (from previous parts) encodes "I have seen nothing? Or 1 of each?"
- MAP for Uniform (): . (0 added samples).
- So, relative to the Uniform MAP, adds a success, adds a failure.