Output & Results
To apply the EM algorithm derived in part (a), we represent the data as an array , where each element refers to the number of hits for one city square. The number of samples for London is . For Antwerp, .
(Note: For computational viability and exact expectations, the "5 and over" bin is typically approximated as exactly 5, though an exact model might consider taking a truncated Poisson likelihood.)
Running the EM algorithm for and computing the log-likelihood (and the Bayesian Information Criterion, BIC, to prevent overfitting) gives the following typical results (exact outputs may slightly vary based on random initialization but will converge to these optima):
London
| Log-Likelihood () | BIC | Parameters () | |
|---|---|---|---|
| 1 | -728.71 | 1463 | |
| 2 | -728.71 | 1476 | |
| 3 | -728.70 | 1489 |
For London, increasing does not yield any meaningful improvement in the log-likelihood (the components collapse or merely replicate the global mean). Consequently, the BIC severely penalizes higher .
Antwerp
| Log-Likelihood () | BIC | Parameters () | |
|---|---|---|---|
| 1 | -830.70 | 1667 | |
| 2 | -748.02 | 1515 | |
| 3 | -747.78 | 1527 |
For Antwerp, stepping from to produces a massive increase in log-likelihood (). The BIC score reaches its global minimum at . Adding further components () yields diminishing returns and higher BIC scores.
Conclusion & Interpretation
London: Indiscriminate Bombing
For London, the optimal number of components is . The mixture model essentially collapses back into a single Poisson distribution with . This aligns with the mean equalling the variance () of the raw data. There is no evidence of specific targeting. The distribution of flying bombs across London squares perfectly mimics a purely random point pattern. The bombs fell indiscriminately.
Antwerp: Targeted Bombing
For Antwerp, the data exhibits severe overdispersion—the variance () greatly exceeds the mean (). The optimal mixture model strongly favors . The trained model reveals two distinct underlying distributions:
- Low-Hit Component (): ~66% of the city experienced a very low bomb hit rate. These were likely untargeted residential or suburban areas.
- High-Hit Component (): ~34% of the city experienced an aggressive hit rate, receiving nearly 10 times the bomb density of the low-hit regions.
There is heavy evidence of targeted bombing in Antwerp. The Germans likely aimed at specific infrastructural or strategic points (e.g., ports, railways)—leading to a dense concentration of hits in ~1/3 of the squares, leaving the remainder mostly unharmed.