FREQUENT ITEM-SET GENERATION
This step aims to generate all possible with a .
BRUTE-FORCE APPROACH
For each possible item-set (called candidate) compute it’s by scanning the database.
This, approach which have a complexity of where:
- → total number of the transactions
- → average number of items within a transaction
- → number of frequent item-set candidate.
it’s extremely computational expensive.
There are other strategies that aims to reduce the computational cost of this operation such as:
- reducing the number of candidates by pruning (apriori algorithm)
- reducing the number of comparisons
BRUTE-FORCE APPROACH
The brute-force approach generates each item-set in the graph above. Then, it computes the sup and conf indexes values for every association rule generated by every item-set.
- Complexity (EXPENSIVE): O(NWM), where
Frequent item-set Generation Strategies
- Reduce the number of candidates M (Apriori Algorithm)
- Complete Search:
- Use pruning techniques to reduce M
- Reduce the number of comparisons NM
- Use efficient data structures to store the candidates or transactions
- No need to match every candidate against every transaction