FREQUENT ITEM-SET GENERATION

This step aims to generate all possible with a .

BRUTE-FORCE APPROACH

For each possible item-set (called candidate) compute it’s by scanning the database.

This, approach which have a complexity of where:

  • total number of the transactions
  • average number of items within a transaction
  • number of frequent item-set candidate.

it’s extremely computational expensive.

There are other strategies that aims to reduce the computational cost of this operation such as:

  • reducing the number of candidates by pruning (apriori algorithm)
  • reducing the number of comparisons

BRUTE-FORCE APPROACH

The brute-force approach generates each item-set in the graph above. Then, it computes the sup and conf indexes values for every association rule generated by every item-set.

  • Complexity (EXPENSIVE): O(NWM), where

Frequent item-set Generation Strategies

  • Reduce the number of candidates M (Apriori Algorithm)
    • Complete Search:
    • Use pruning techniques to reduce M
  • Reduce the number of comparisons NM
    • Use efficient data structures to store the candidates or transactions
    • No need to match every candidate against every transaction

PREVIOUS NEXT