RULES GENERATION
The goal of this phase is, given an item-set to find all non empty subsets that the association rule have a where is a parameter defined from the designer
It’important to understand how to generate rules with an high from a given item-set
so from this formula we can say that for rules generated from the same item-sets:
this means that rules confidence is anti-monotone in relation of the number of elements that creates the antecedent
this characteristic can be used to prune the generation tree when there is a rule with a
INTERESTING RULE GENERATION METRICS
Confidence metrics can be insufficient in the rule generation process, other interesting metrics are also used
so given a rule in the form of we can compute the contingency table as follow
C | |||
---|---|---|---|
A | |||
from this table we can derive some interesting measures like
LIFT
lift is defined as
it indicates the ratio between rule true cases and independence, it tend to when the item-sets are independent
LEVERAGE
leverage is defined as
leverage indicates the number of additional cases, it tends to when the item-sets are independent
CONVICTION
conviction is defined as
it’s the ratio that occurs without if and where independent, higher value means that the rules is violated less often (in the assumption that the and are independent)