Setup

We have a set of $N$ observations $(x_{i}, y_{i})$, and we want to find the parameters $a, b, σ$ such that $Y = a X + b + ϵ$, where $ϵ \sim \mathcal{N}(0, σ^{2})$ is Normally distributed noise with mean zero and standard deviation $σ$.

Solution

We will apply Maximum Likelihood Estimation. For this problem, that means finding the $\arg\max$ over $a, b, σ$ of the likelihood of the observations, which we simplify step by step below:

$$
\begin{aligned}
\arg\max \prod_{i} P [y_{i} \mid x_{i}]
&= \arg\max \log \Big(\prod_{i} P [y_{i} \mid x_{i}]\Big) && \text{as } \log \text{ is strictly increasing} \\
&= \arg\max \sum_{i} \log (P [y_{i} \mid x_{i}]) && \text{as } \log (\textstyle\prod z_{i}) = \sum \log (z_{i}) \\
&= \arg\max \sum_{i} \log \Big(\frac{1}{σ \sqrt{2 π}} e^{- \frac{1}{2} (\frac{a x_{i} + b - y_{i}}{σ})^{2}}\Big) && \text{as } P [y_{i} \mid x_{i}] = \frac{1}{σ \sqrt{2 π}} e^{- \frac{1}{2} (\frac{a x_{i} + b - y_{i}}{σ})^{2}} \\
&= \arg\max \Big(- N \log (σ) - \frac{N}{2} \log (2 π) - \frac{1}{2} \sum_{i} \Big(\frac{a x_{i} + b - y_{i}}{σ}\Big)^{2}\Big) && \text{as } \log (u v) = \log (u) + \log (v) \\
&= \arg\max \Big(- N \log (σ) - \frac{1}{2} \sum_{i} \Big(\frac{a x_{i} + b - y_{i}}{σ}\Big)^{2}\Big) && \text{as } - \frac{N}{2} \log (2 π) \text{ is a constant}
\end{aligned}
$$
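As a sanity check on the simplification above, the following minimal Python sketch compares the closed-form log-likelihood with a direct sum of log densities. The function names and the sample data are made up for illustration and are not part of the original derivation.

```python
import math


def normal_pdf(y, mean, sigma):
    """Density of a Normal distribution with the given mean and standard deviation."""
    return math.exp(-0.5 * ((y - mean) / sigma) ** 2) / (sigma * math.sqrt(2 * math.pi))


def log_likelihood(a, b, sigma, xs, ys):
    """Closed form: -N log(sigma) - N/2 log(2 pi) - 1/2 sum(((a x + b - y) / sigma)^2)."""
    n = len(xs)
    squared = sum(((a * x + b - y) / sigma) ** 2 for x, y in zip(xs, ys))
    return -n * math.log(sigma) - 0.5 * n * math.log(2 * math.pi) - 0.5 * squared


def log_likelihood_direct(a, b, sigma, xs, ys):
    """Sum of the log densities, computed term by term for comparison."""
    return sum(math.log(normal_pdf(y, a * x + b, sigma)) for x, y in zip(xs, ys))


# Hypothetical observations, roughly following y = 2x + 1.
xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.1, 2.9, 5.2, 6.8]

closed = log_likelihood(2.0, 1.0, 0.5, xs, ys)
direct = log_likelihood_direct(2.0, 1.0, 0.5, xs, ys)
print(abs(closed - direct) < 1e-9)
```

Both functions should agree up to floating-point rounding, since they evaluate the same quantity in two ways.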

To find the $\arg\max$, we set each partial derivative of the log-likelihood to zero. Starting with $b$:

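$$
\begin{aligned}
0 &= \frac{\partial}{\partial b} \log \prod_{i} P [y_{i} \mid x_{i}] \\
&= \frac{\partial}{\partial b} \Big(- N \log (σ) - \frac{1}{2} \sum_{i} \Big(\frac{a x_{i} + b - y_{i}}{σ}\Big)^{2}\Big) \\
\Rightarrow \quad 0 &= \frac{\partial}{\partial b} \Big(\sum_{i} \Big(\frac{a x_{i} + b - y_{i}}{σ}\Big)^{2}\Big) && \text{removing terms without } b \text{ and constants} \\
\Rightarrow \quad 0 &= \sum_{i} \frac{\partial}{\partial b} \big((a x_{i} + b - y_{i})^{2}\big) && \text{removing the constant factor } 1 / σ^{2} \\
&= \sum_{i} 2 (a x_{i} + b - y_{i}) \\
\Rightarrow \quad 0 &= \sum_{i} (a x_{i} + b - y_{i}) && \text{dividing both sides by 2} \\
&\Downarrow \\
a \sum_{i} x_{i} + N b &= \sum_{i} y_{i}
\end{aligned}
$$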

Doing the same for $a$:

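$$
\begin{aligned}
0 &= \frac{\partial}{\partial a} \log \prod_{i} P [y_{i} \mid x_{i}] \\
&= \frac{\partial}{\partial a} \Big(- N \log (σ) - \frac{1}{2} \sum_{i} \Big(\frac{a x_{i} + b - y_{i}}{σ}\Big)^{2}\Big) \\
\Rightarrow \quad 0 &= \frac{\partial}{\partial a} \Big(\sum_{i} \Big(\frac{a x_{i} + b - y_{i}}{σ}\Big)^{2}\Big) && \text{removing terms without } a \text{ and constants} \\
\Rightarrow \quad 0 &= \sum_{i} \frac{\partial}{\partial a} \big((a x_{i} + b - y_{i})^{2}\big) && \text{removing the constant factor } 1 / σ^{2} \\
&= \sum_{i} 2 x_{i} (a x_{i} + b - y_{i}) \\
\Rightarrow \quad 0 &= \sum_{i} (a x_{i}^{2} + x_{i} b - x_{i} y_{i}) && \text{dividing both sides by 2} \\
&\Downarrow \\
a \sum_{i} x_{i}^{2} + b \sum_{i} x_{i} &= \sum_{i} x_{i} y_{i}
\end{aligned}
$$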

Writing the 2 equations above in matrix form:

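$$
\begin{aligned}
\begin{pmatrix} \sum x_{i} & N \\ \sum x_{i}^{2} & \sum x_{i} \end{pmatrix} \begin{pmatrix} a \\ b \end{pmatrix} &= \begin{pmatrix} \sum y_{i} \\ \sum x_{i} y_{i} \end{pmatrix} \\
&\Downarrow \\
\begin{pmatrix} a \\ b \end{pmatrix} &= \begin{pmatrix} \sum x_{i} & N \\ \sum x_{i}^{2} & \sum x_{i} \end{pmatrix}^{- 1} \begin{pmatrix} \sum y_{i} \\ \sum x_{i} y_{i} \end{pmatrix} \\
&= \frac{1}{(\sum x_{i})^{2} - N \sum x_{i}^{2}} \begin{pmatrix} \sum x_{i} & - N \\ - \sum x_{i}^{2} & \sum x_{i} \end{pmatrix} \begin{pmatrix} \sum y_{i} \\ \sum x_{i} y_{i} \end{pmatrix}
\end{aligned}
$$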
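The closed-form solution can be sketched in a few lines of plain Python. This is a minimal illustration rather than a reference implementation: the name `fit_line` and the sample data are assumptions, and the code simply expands the inverse of the 2×2 matrix by hand.

```python
def fit_line(xs, ys):
    """Return (a, b) maximizing the likelihood of y = a x + b + Normal noise."""
    n = len(xs)
    sum_x = sum(xs)
    sum_xx = sum(x * x for x in xs)
    sum_y = sum(ys)
    sum_xy = sum(x * y for x, y in zip(xs, ys))

    # Determinant of [[sum_x, n], [sum_xx, sum_x]].
    det = sum_x ** 2 - n * sum_xx
    if det == 0:
        # Happens when all x values are equal: the line is not identifiable.
        raise ValueError("all x values are equal; cannot fit a line")

    # Rows of the inverse matrix applied to (sum_y, sum_xy).
    a = (sum_x * sum_y - n * sum_xy) / det
    b = (-sum_xx * sum_y + sum_x * sum_xy) / det
    return a, b


# Hypothetical noise-free observations on the line y = 2x + 1.
a, b = fit_line([0.0, 1.0, 2.0, 3.0], [1.0, 3.0, 5.0, 7.0])
print(a, b)
```

On noise-free data the fit should recover the slope and intercept used to generate the points. Production code would more commonly rely on a numerical library's least-squares routine instead of inverting the matrix by hand.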

Applying the formula above, we get the solution of the linear regression as a function of our observations; the inverse exists as long as the $x_{i}$ are not all equal. We can also calculate $σ$ if we want to know the estimated error of our predictions, but it is not needed to build the predictor itself.
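The note does not derive $σ$, but setting the derivative of the log-likelihood with respect to $σ$ to zero gives $σ^{2} = \frac{1}{N} \sum (a x_{i} + b - y_{i})^{2}$, the mean squared residual. A minimal sketch of that estimate, assuming $a$ and $b$ have already been fitted (the function name and data are illustrative):

```python
import math


def estimate_sigma(a, b, xs, ys):
    """Maximum likelihood estimate of sigma: root of the mean squared residual."""
    n = len(xs)
    squared_residuals = sum((a * x + b - y) ** 2 for x, y in zip(xs, ys))
    return math.sqrt(squared_residuals / n)


# Hypothetical observations scattered around y = 2x + 1.
xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.1, 2.9, 5.2, 6.8]
sigma = estimate_sigma(2.0, 1.0, xs, ys)
print(sigma)
```

Note that this estimate divides by $N$, as the maximum likelihood solution does, rather than by $N - 2$ as the unbiased variance estimate for a fitted line would.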

Miguel Torres Costa
