12
Data Mining: Association Rule By: Thanh Truong

Data Mining: Association Rule By: Thanh Truong. Association Rules In Association Rules, we look at the associations between different items to draw conclusions

Embed Size (px)

Citation preview

Page 1: Data Mining: Association Rule By: Thanh Truong. Association Rules In Association Rules, we look at the associations between different items to draw conclusions

Data Mining: Association Rule

By: Thanh Truong

Page 2: Data Mining: Association Rule By: Thanh Truong. Association Rules In Association Rules, we look at the associations between different items to draw conclusions

Association Rules

In Association Rules, we look at the associations between different items to draw conclusions from.In sales, we look at purchases:Example in book Someone who buys bread is most likely buy

milk Someone who buys the book Database

System Concepts is quite likely also to buy the book Operating System Concepts.

Page 3: Data Mining: Association Rule By: Thanh Truong. Association Rules In Association Rules, we look at the associations between different items to draw conclusions

Association Uses

When a customer buys a book on-line shop may suggest associated books.

Page 4: Data Mining: Association Rule By: Thanh Truong. Association Rules In Association Rules, we look at the associations between different items to draw conclusions
Page 5: Data Mining: Association Rule By: Thanh Truong. Association Rules In Association Rules, we look at the associations between different items to draw conclusions

(cont…)

In grocery stores they can place associated items next to each other.ORThey can place at opposite ends of the aisle, with other associated items in between.The store can sell an item at a discounted price, but not the other.

Page 6: Data Mining: Association Rule By: Thanh Truong. Association Rules In Association Rules, we look at the associations between different items to draw conclusions

Association Notation

Association Rules are statement of the form {X1, X2,…, Xn} => Y

Means: If we find things in X, then we will most likely find Y

Page 7: Data Mining: Association Rule By: Thanh Truong. Association Rules In Association Rules, we look at the associations between different items to draw conclusions

Population & Instance

An association rule must have an associated population.The population consists of a set of instances.In the grocery example, Population may be all grocery-store

purchases Instances are the purchases itself

Page 8: Data Mining: Association Rule By: Thanh Truong. Association Rules In Association Rules, we look at the associations between different items to draw conclusions

Support

Support is the measure of what fraction of the population satisfies both the antecedent and the consequent of the rule.For example, if only .0001 percent of purchases include milk and screwdrivers, then, the support is low for milk => screwdrivers

If 50% of purchases of diapers include beer, then we would say the support is high.

Page 9: Data Mining: Association Rule By: Thanh Truong. Association Rules In Association Rules, we look at the associations between different items to draw conclusions

Confidence

Confidence is a measure of how often the consequent is true when the antecedent is true.bread => milk has a confidence of 80% if 80% of the purchases that include bread also includes milk.A rule with a low confidence is not meaningful.

Page 10: Data Mining: Association Rule By: Thanh Truong. Association Rules In Association Rules, we look at the associations between different items to draw conclusions

Other Types of Association

In statistical terms, we can look for correlations between items. So, even if purchases of bread is not correlated with cereal, it would not be reported, even if there was a strong association between the two.Assocation Rule: {bread, butter} => jamCorrelation: Someone who buys tea will not buy coffee

Page 11: Data Mining: Association Rule By: Thanh Truong. Association Rules In Association Rules, we look at the associations between different items to draw conclusions

(cont…)

Sequence associations: Time-series data, such as stock prices on a sequence of daysExample is the following rule “Whenever bond rates go up, the stock prices go down within 2 days”Using this will help make investment decisions.

Page 12: Data Mining: Association Rule By: Thanh Truong. Association Rules In Association Rules, we look at the associations between different items to draw conclusions

References

Database System Concepts, Fifth Edition, Silberschatz