Explain
Intuition
In part (b), we found the equation for the border between two classes: . While mathematically correct, it's a bit abstract. Part (c) asks us to rewrite this equation into a form that is much easier to visualize: .
Think of it like describing a wall. You can describe a wall by a complex mathematical formula, or you can simply say: "The wall passes exactly through this specific point (), and it faces this specific direction ()."
Let's break down what these two pieces mean:
1. The Direction ()
The vector is the "normal vector". Imagine standing on the border; is an arrow pointing straight out from the border, directly towards class . It tells us how the border is tilted. It's calculated by looking at the difference between the two class centers () and adjusting for the shape of the data ().
2. The Anchor Point ()
This is the really interesting part. is a specific point that sits exactly on the border. Let's look at its formula:
- The Midpoint: The first part is . This is exactly halfway between the center of class and the center of class . If both classes are equally common, the border passes right through this halfway point.
- The Shift: The second part involves . This is a "tug-of-war" based on which class is more popular (the priors).
- If class and class are equally likely, , so there is no shift. The anchor point stays in the middle.
- If class is much more common than class , the log term is positive. Because of the minus sign in the formula, the anchor point is pushed away from class and towards class .
Why does it shift towards the less common class? Imagine you are a doctor diagnosing a rare disease (class ) versus a common cold (class ). If a patient's symptoms are exactly borderline, you are more likely to guess the common cold, simply because it's much more common. By moving the decision border closer to the rare disease's center, you are effectively making the "territory" for the common cold larger. A data point has to be very close to the rare disease's center to be classified as such. The priors shift the border to give the more probable class a larger decision region.