MACHINE LEARNING IN COMPUTER VISION

There are problems in computer vision scenarios that can not be dealt with classical computer vision approach for example image classification:

In order to address this situation machine learning techniques are deployed.

THE PROBLEM OF IMAGE CLASSIFICATION

The problem of image classification relies on a classification algorithm that can deal with the huge variety of the input data, it’s impossible to handcraft such an algorithm so in order to address this problem machine learning is involved

MACHINE LEARNING FOR IMAGE CLASSIFICATION

Machine learning techniques deployed relies on a training phase in which the model is learned by a training set of images with provided labels and a test phase where model performance are tested

flowchart LR
subgraph testing
direction TB
C[test set]
D[test learnt model]
C --> D
end
subgraph training
direction TB
A[training set]
B[learn classification model]
A --> B
end
training --> testing

The training and testing datasets can be defined as follows

D^{t r ain} = {(x^{i}, y^{i}) ∣ i = 1... N}

D^{t es t} = {(x^{i}, y^{i}) ∣ i = 1... M}

Where $x^{i}$ are the given input feature (images) and $y^{i}$ are the true labels for the corresponding input feature

MODELING THE “LEARNING” CONCEPT

In machine learning the training phase can be seen as an optimization problem that aims to optimize an objective function which measures how good the prediction on the training set $D^{t r ain}$ are

θ^{*} = a r g mi n_{θ \in Θ} (L (θ, D^{t r ain}))

Where $L (θ, D^{t r ain})$ is called Loss function and measures how bad the prediction on the training set are so the lower the better is common practice to implement the Loss function as the average of the single images

L (θ, D^{t r ain}) = \frac{1}{N} i = 0 \sum N L (θ, (x^{i}, y^{i}))

UNDERFITTING AND OVERFITTING PROBLEM

When varying model complexity training and test error follow this curve

So with complex models the training error increases, this is called overfitting

REGULARIZATION

Regularization aims to reduce the test error without modifying the training error

The basic idea under this concept is that models with lower parameter tend to overfit less.

In order to implement such solution a regularize parameter is introduced to express a preference for smaller parameter values

L (θ, D^{t r ain}) + λ L^{re g} (θ)

where $λ$ is an hyperparameter that determines the contribution of the regulator, popular choices of the regulator can be:

L^{re g} = L_{1} (θ) = ∣ θ ∣ = i \sum ∣ θ_{i} ∣

L^{re g} = L_{2} (θ) = ∥ θ ∥^{2} = i \sum θ_{i}^{2}

DATA AUGMENTATION

In order to artificially increase the size of datasets operation on the input image are performed without altering the label such as image rotation,crop,cutout

PREVIOUS NEXT

Explorer