MACHINE LEARNING IN COMPUTER VISION
There are problems in computer vision scenarios that can not be dealt with classical computer vision approach for example image classification:
In order to address this situation machine learning techniques are deployed.
THE PROBLEM OF IMAGE CLASSIFICATION
The problem of image classification relies on a classification algorithm that can deal with the huge variety of the input data, it’s impossible to handcraft such an algorithm so in order to address this problem machine learning is involved
MACHINE LEARNING FOR IMAGE CLASSIFICATION
Machine learning techniques deployed relies on a training phase in which the model is learned by a training set of images with provided labels and a test phase where model performance are tested
flowchart LR subgraph testing direction TB C[test set] D[test learnt model] C --> D end subgraph training direction TB A[training set] B[learn classification model] A --> B end training --> testing
The training and testing datasets can be defined as follows
Where are the given input feature (images) and are the true labels for the corresponding input feature
MODELING THE “LEARNING” CONCEPT
In machine learning the training phase can be seen as an optimization problem that aims to optimize an objective function which measures how good the prediction on the training set are
Where is called Loss function and measures how bad the prediction on the training set are so the lower the better is common practice to implement the Loss function as the average of the single images
UNDERFITTING AND OVERFITTING PROBLEM
When varying model complexity training and test error follow this curve
So with complex models the training error increases, this is called overfitting
REGULARIZATION
Regularization aims to reduce the test error without modifying the training error
The basic idea under this concept is that models with lower parameter tend to overfit less.
In order to implement such solution a regularize parameter is introduced to express a preference for smaller parameter values
where is an hyperparameter that determines the contribution of the regulator, popular choices of the regulator can be:
DATA AUGMENTATION
In order to artificially increase the size of datasets operation on the input image are performed without altering the label such as image rotation,crop,cutout