Skip to main content

Answer

Prerequisites

  • Lagrange Multipliers
  • Derivatives: Specifically, product rule and derivative of xlogxx \log x.
    • ddx(xlogx)=1logx+x1x=logx+1\frac{d}{dx} (x \log x) = 1 \cdot \log x + x \cdot \frac{1}{x} = \log x + 1.

Solution

We maximize the objective function: f({πj})=j=1Kπj(Njlogπj)=j=1K(πjNjπjlogπj)f(\{\pi_j\}) = \sum_{j=1}^K \pi_j (N_j - \log \pi_j) = \sum_{j=1}^K ( \pi_j N_j - \pi_j \log \pi_j )

Subject to: j=1Kπj=1\sum_{j=1}^K \pi_j = 1

Step 1: Form the Lagrangian

L({πj},λ)=j=1K(πjNjπjlogπj)+λ(j=1Kπj1)L(\{\pi_j\}, \lambda) = \sum_{j=1}^K ( \pi_j N_j - \pi_j \log \pi_j ) + \lambda \left( \sum_{j=1}^K \pi_j - 1 \right)

Step 2: Take derivatives and set to zero

Take the partial derivative with respect to a specific πj\pi_j: Lπj=πj(πjNj)πj(πjlogπj)+πj(λπj)\frac{\partial L}{\partial \pi_j} = \frac{\partial}{\partial \pi_j} (\pi_j N_j) - \frac{\partial}{\partial \pi_j} (\pi_j \log \pi_j) + \frac{\partial}{\partial \pi_j} (\lambda \pi_j)

Using the product rule for πjlogπj\pi_j \log \pi_j: πj(πjlogπj)=1logπj+πj1πj=logπj+1\frac{\partial}{\partial \pi_j} (\pi_j \log \pi_j) = 1 \cdot \log \pi_j + \pi_j \cdot \frac{1}{\pi_j} = \log \pi_j + 1

Substituting back: Lπj=Nj(logπj+1)+λ\frac{\partial L}{\partial \pi_j} = N_j - (\log \pi_j + 1) + \lambda

Set to zero: Njlogπj1+λ=0N_j - \log \pi_j - 1 + \lambda = 0

Step 3: Solve for πj\pi_j in terms of λ\lambda

Rearrange the equation to isolate logπj\log \pi_j: logπj=Nj1+λ\log \pi_j = N_j - 1 + \lambda

Exponentiate both sides: πj=exp(Nj1+λ)\pi_j = \exp(N_j - 1 + \lambda) πj=exp(Nj)exp(λ1)\pi_j = \exp(N_j) \cdot \exp(\lambda - 1)

Step 4: Solve for the Lagrange multiplier term

Use the constraint πj=1\sum \pi_j = 1: j=1K[exp(Nj)exp(λ1)]=1\sum_{j=1}^K \left[ \exp(N_j) \cdot \exp(\lambda - 1) \right] = 1

Since exp(λ1)\exp(\lambda - 1) does not depend on jj, we can factor it out: exp(λ1)j=1Kexp(Nj)=1\exp(\lambda - 1) \sum_{j=1}^K \exp(N_j) = 1

Thus: exp(λ1)=1k=1Kexp(Nk)\exp(\lambda - 1) = \frac{1}{\sum_{k=1}^K \exp(N_k)}

Step 5: Substitute back to find πj\pi_j

Multiply exp(Nj)\exp(N_j) by the factor we just found: πj=exp(Nj)1k=1Kexp(Nk)\pi_j = \exp(N_j) \cdot \frac{1}{\sum_{k=1}^K \exp(N_k)} πj=expNjk=1KexpNk\pi_j = \frac{\exp N_j}{\sum_{k=1}^K \exp N_k}