DATA PREPROCESSING

Operations applied to raw data to make it suitable for machine learning models.

AGGREGATION

the process of combining two or more attributes into a single one; this can serve different goals:

  • data reduction
  • change of scale
  • more stable data
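As a rough illustration of aggregation, the sketch below combines seven daily attributes into one weekly total per store. The store names, the sales figures, and the `aggregate_weekly` helper are all made up for this example; they are assumptions, not part of the original notes.

```python
# Hypothetical data: each store has one sales attribute per day of the week.
daily_sales = {
    "store_a": [12, 30, 8, 25, 15, 40, 10],
    "store_b": [20, 5, 35, 10, 30, 12, 21],
}


def aggregate_weekly(rows):
    """Combine the seven daily attributes into a single weekly total."""
    return {store: sum(days) for store, days in rows.items()}


weekly_sales = aggregate_weekly(daily_sales)
print(weekly_sales)
```

Here seven attributes become one (data reduction), the unit moves from days to weeks (change of scale), and the weekly totals differ far less between stores than the individual daily values do (more stable data).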

SCALING

SAMPLING

the process of reducing the size of the dataset by selecting a representative subset (a sample) of its records
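One simple way to do this is simple random sampling without replacement. The sketch below uses only Python's standard `random` module; the `simple_random_sample` helper, the dataset of 1000 integers, and the sample size of 50 are assumptions chosen for illustration.

```python
import random


def simple_random_sample(records, n, seed=0):
    """Draw n distinct records uniformly at random (without replacement)."""
    rng = random.Random(seed)  # fixed seed so the draw is reproducible
    return rng.sample(records, n)


# Hypothetical dataset: 1000 records, represented here by their ids.
records = list(range(1000))
sample = simple_random_sample(records, 50)
print(len(sample))
```

Other schemes, such as sampling with replacement or stratified sampling, may be a better fit when some groups in the data are rare.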

DIMENSIONALITY REDUCTION

the process of reducing the number of features
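The notes do not name a specific technique. As one possible illustration, the sketch below applies principal component analysis (PCA) to reduce two features to one, using the closed-form principal axis of a 2x2 covariance matrix. The function names and the sample points are assumptions made for this example.

```python
import math


def first_principal_axis(points):
    """Return the mean and the unit direction of largest variance for 2D points."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    var_x = sum((x - mean_x) ** 2 for x, _ in points) / n
    var_y = sum((y - mean_y) ** 2 for _, y in points) / n
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in points) / n
    # Angle of the major axis of a symmetric 2x2 covariance matrix.
    theta = 0.5 * math.atan2(2 * cov_xy, var_x - var_y)
    return (mean_x, mean_y), (math.cos(theta), math.sin(theta))


def project_to_1d(points):
    """Replace each (x, y) pair with its coordinate along the principal axis."""
    (mean_x, mean_y), (dx, dy) = first_principal_axis(points)
    return [(x - mean_x) * dx + (y - mean_y) * dy for x, y in points]


# Hypothetical data: two perfectly correlated features (y = 2x).
points = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0), (4.0, 8.0)]
reduced = project_to_1d(points)
print(reduced)
```

Because the two features in this toy dataset are perfectly correlated, the single projected feature keeps all of the variance; on real data some information is usually lost.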

FEATURE SUBSET SELECTION

the process of selecting a subset of the available features, discarding redundant or irrelevant ones
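Many selection strategies exist. The sketch below shows one simple filter, assumed here purely for illustration: drop any feature whose variance does not exceed a threshold, since a constant column carries no information. The column names, values, and the `select_by_variance` helper are hypothetical.

```python
from statistics import pvariance


def select_by_variance(columns, threshold=0.0):
    """Keep only the features whose population variance exceeds the threshold."""
    return {
        name: values
        for name, values in columns.items()
        if pvariance(values) > threshold
    }


# Hypothetical dataset stored column by column.
columns = {
    "age": [23, 35, 41, 29],
    "country_code": [1, 1, 1, 1],  # constant, so it gets dropped
    "income": [30.0, 52.0, 61.0, 45.0],
}
selected = select_by_variance(columns)
print(list(selected))
```

A variance filter ignores the target variable; approaches that score features against the prediction task may select a different subset.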

FEATURE CREATION

the process of creating new features from existing data
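A small sketch of the idea, assuming records that carry `mass` and `volume` attributes: a new `density` feature is derived from the two existing ones. The field names, the values, and the `add_density` helper are invented for this example.

```python
def add_density(records):
    """Return copies of the records with a derived 'density' feature added."""
    return [
        {**record, "density": record["mass"] / record["volume"]}
        for record in records
    ]


# Hypothetical records with two original features.
objects = [
    {"mass": 10.0, "volume": 2.0},
    {"mass": 9.0, "volume": 3.0},
]
with_density = add_density(objects)
print(with_density)
```

The derived feature can be more useful to a model than either original attribute alone, for instance when the material of an object matters more than its size.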
