Supervised Learning
Models are trained on labeled data, where each input has a known correct output.
- We have a dataset with one or more "features" (or independent variables, or just variables) (X) and one or more results (y).
- We would like to create a function that given a new set of values for X will predict the value(s) of y.
It is called "supervised" learning because for the given data-set we know whay is the true result (aka. labels).
We can divide the supervised learning problems into two types:
-
Regression (continuous values)
-
Classification (discrete labels)
-
Input variables: Independent variables, aka. features
-
Output variables: Dependent variables, target, labels
The model is supposed to predict the output from the input.