. The formula corresponds to finding the best parameters that map an input to an output by minimizing the L2 loss function. Variations of this might consider other Metric spaces in which to minimize Distance. For example,

Polynomial Regression: Ridge Regression: LASSO Regression: Logistic Regression: where