Explain
Intuition
Imagine a map with two cities (class and class ). You want to draw a border line between them so that if you are on one side, you are closer to city , and on the other side, you are closer to city .
The decision boundary is exactly this border. It's the place where the "score" for belonging to class is exactly equal to the "score" for belonging to class . You are perfectly undecided.
Because our scoring system from part (a) is just a simple linear equation (like the equation for a flat surface), when we set the two scores equal to each other, the result is another linear equation. In geometry, a linear equation describes a flat surface called a hyperplane (a line in 2D, a plane in 3D, etc.).
Let's break down the two parts of this border equation, :
-
The Direction (): The vector determines the tilt or orientation of the border. It is calculated as .
- is an arrow pointing straight from the center of class to the center of class .
- Multiplying by adjusts this arrow based on how the data is spread out. If the data is stretched in a certain direction, the border will tilt to account for that stretch.
-
The Position (): The scalar determines where the border is placed along that direction. It has two components:
- The Midpoint: The term essentially places the border exactly halfway between the two class centers (adjusted for the data's shape).
- The Popularity Shift: The term shifts the border based on which class is more common. If class is much more common than class (), this term is positive, which pushes the border away from class and closer to class . This makes sense: if class is more likely overall, you want to claim more territory for it!