TYPE CONVERSIONS
procedure to convert data types of attributes, there are many purposes such as:
- algorithms that require numeric features
- classification targets for numerical values
- boolean feature for association rules discovery
Transforming in a numerical quantity the features that represent categories.
there are 2 types of encoders
NOMINAL TO NUMERIC CONVERSION
ONE HOT ENCODER
It transforms a feature with unique values in boolean feature each, if an object as the value for the feature than the corresponding new feature is set to
here a usage example of the scikit-learn implementation:
ORDINAL TO NUMERIC CONVERSION
ORDINAL ENCODER
It transforms a ordinal feature in a numeric one preserving the order by translate the values into consecutive integers, the user can specify the sequence (lexicographic order is default)
here a usage example of the scikit-learn implementation:
NUMERIC TO BINARY CONVERSION
BINARIZER WITH THRESHOLD
it transform numeric values to binary ones by using a threshold
DISCRETIZATION
transformation that turns continuous domains into discrete ones, there are many algorithms for the purpose, some are domain knowledge based other use thresholds
NUMERIC TO K VALUES CONVERSION
Number are discretized in a series of values from to