frequent itemset generation

This step aims to generate all possible with a $sup \gt threshold$.

brute-force approach

For each possible item-set (called candidate) compute it’s $sup$ by scanning the database.

This, approach which have a complexity of $\mathcal{o}(NMW)$ where:

it’s extremely computational expensive.

There are other strategies that aims to reduce the computational cost of this operation such as:

brute-force approach

The brute-force approach generates each item-set in the graph above. Then, it computes the sup and conf indexes values for every association rule generated by every item-set.

frequent item-set generation strategies

Link map