21
www.decideo.fr/bruley Product Product Affinity Affinity Extract from various presentations: CRS, BUS 782, Aster Data … [email protected] January 2013 January 2013

Big Data and Product Affinity

  • View
    639

  • Download
    2

Embed Size (px)

DESCRIPTION

Analytics can offer a key possibility identifying which products are sold together, because that information can be used to influence targeted communication efforts, store layouts, and in-store promotions.

Citation preview

Page 1: Big Data and Product Affinity

www.decideo.fr/bruley

Product Product AffinityAffinity

Extract from various presentations: CRS, BUS 782, Aster Data …

[email protected]

January 2013January 2013

Page 2: Big Data and Product Affinity

www.decideo.fr/bruley

Assortment analysis & Management

Customer Analysis & Marketing

Promotion evaluation & management

Vendor & Supply Chain Management

Store Operations

No. Of Baskets Traffic Builder ID Traffic analysis

Frequency of visits Consumer Penetration

Traffic builder Opp. Consumer Penetration (with Cust.ID)

Traffic Builder ID

Traffic Analysis (manpower planning)

Av. Basket Metrics

Item Contribution Trx Builder ID Price Point

Contribution

Value, av. Purchase Discount behaviour Customer Modelling

Promotional Evaluation

Item Performance Store Performance evaluation

Av. Dist. # of Items & Depts

Trx Builder ID Item contribution “Variety driver”

Purchase Variety Behaviour

Customer Modelling

Promotion Evaluation (w/Cherry Picker)

AffinityAnalysis

Item Deletion Cross Sell Lost Sales Prevention Potential overstock

Co-marketing opportunities by customer

Promotion Evaluation Promo Item Selection Event Strategy Cross sell opportunity

Vendor Participation Co Merchandising Opportunities (visual merchandising)

Cherry Picker Item deletion Item contribution

Cherry Picking Behaviour

Consumer Profitability

Promotion Evaluation Promo item selection

Margin Protection Vendor participation

Price Point Price Point Identification

Price Elasticity

Promotional Pricing Price integrity Fraud detection Local pricing

Transaction analysis

Product quality (returns)

Fraud detection Fraud Detection Cashier productivity

Time of Day In-Store activities In-Store Activities Event Strategy (“Early Bird” opportunities)

Manpower planning In-store activities

Payment Type Payment influence Payment type relevance

Trx. Profiling Local Store Assortment

Pricing by segment

Customer Profiling Customer Modelling Propensity to Buy

Promotional Evaluation (behaviour change)

Item Performance Consumer/retailer

relevance of item

Store layout & visual merchandising

Product affinity analysis is one of the Product affinity analysis is one of the basket analysis techniquesbasket analysis techniques

Page 3: Big Data and Product Affinity

www.decideo.fr/bruley

Product Affinity Definition Product Affinity Definition

Identify which products are sold together and use that information to influence targeted marketing efforts, store layouts, and in-store promotions

Product Affinity enables an organization to detect product/service purchase patterns, linkages, and cross-sell opportunities in order to increase revenues. Results from this application will enable the organization to identify, with a high degree of accuracy, those customers most interested in specific products, services and product/service groupings

Page 4: Big Data and Product Affinity

www.decideo.fr/bruley

Affinity AnalysisAffinity Analysis

Affinity Analysis is a modeling technique based upon the theory that if you buy a certain group of items, you are more (or less) likely to buy another group of items.

The set of items a customer buys is referred to as an item set, and market basket analysis seeks to find relationships between purchases.

Typically the relationship will be in the form of a rule: Example:

– IF {beer, no bar meal} THEN {chips}

Page 5: Big Data and Product Affinity

www.decideo.fr/bruley

Product Affinity and Cross- SellingProduct Affinity and Cross- Selling

For instance, customers are very likely to purchase shampoo and conditioner together, so a retailer would not put both items on promotion at the same time. The promotion of one would likely drive sales of the other

A widely used example of cross selling on the internet with market basket analysis is Amazon.com's use of suggestions of the type:

– "Customers who bought book A also bought book B", e.g.

Page 6: Big Data and Product Affinity

www.decideo.fr/bruley

Product Affinity Analysis ProcessProduct Affinity Analysis Process

Historic market basket data and analyzes are used to build more effective marketing programs:

– Past customer purchase data is used to identify which products/services are acquired by which customer groups

– Predictive analytics is applied to this data to discover profiles of customers most likely to buy the products in each group

– These profiles are used to target those customers most likely to respond favorably to specific cross-sell campaign

– Pair-wise product associations are also determined to enable the constructed of offers featuring the purchase of these pair products

– Customer product dislikes are also identified so that company does not promote unwanted products

Benefits that can be realized from utilizing this solution:– Improve customer knowledge allowing company to better understand what their

customers are likely to buy and not buy.

– Increase revenue and decrease costs by identifying those customers most likely to respond to cross-sell campaigns

Page 7: Big Data and Product Affinity

www.decideo.fr/bruley

Behavior PredictionBehavior Prediction

This uses past consumer behavior to foresee the future behavior of their customers.

This analysis includes several variations.1. Propensity-to-buy analysis- understanding what a

particular customer might buy.2. Next Sequential Purchase- predicting the customers next

buy.3. Product Affinity Analysis- Understanding which

products will be bought with others. 4. Price elasticity modeling and dynamic pricing- determine

the best price for a given product.

Page 8: Big Data and Product Affinity

www.decideo.fr/bruley

Product Affinity = Link AnalysisProduct Affinity = Link Analysis

Aims to establish links (associations) between records, or sets of records, in a database

There are three specializations– Associations discovery– Sequential pattern discovery– Similar time sequence discovery

Page 9: Big Data and Product Affinity

www.decideo.fr/bruley

Link Analysis: Associations Discovery

Finds items that imply the presence of other items in the same event

Affinities between items are represented by association rules

– e.g. ‘When customer rents property for more than 2 years and is more than 25 years old, in 40% of cases, customer will buy a property. Association happens in 35% of all customers who rent properties’.

Page 10: Big Data and Product Affinity

www.decideo.fr/bruley

Link Analysis: Sequential Pattern Link Analysis: Sequential Pattern DiscoveryDiscovery

Finds patterns between events such that the presence of one set of items is followed by another set of items in a database of events over a period of time.

– e.g. Used to understand long-term customer buying behaviour

Page 11: Big Data and Product Affinity

www.decideo.fr/bruley

Link Analysis: Similar Time Sequence Link Analysis: Similar Time Sequence Discovery Discovery

Finds links between two sets of data that are time-dependent, and is based on the degree of similarity between the patterns that both time series demonstrate

– e.g. Within three months of buying property, new home owners will purchase goods such as cookers, freezers, and washing machines

Page 12: Big Data and Product Affinity

www.decideo.fr/bruley

Teradata SQL

SQL is better for:

•Standard transformations across every element in a table

•Standard aggregations using GROUP BY on tables

• sum(), max(), stddev()

•Dimensional Joins

•Set Filtering • Lookups, data pruning to limit a

table to a subset.

•Presentation formatting • For example, “get me top K counts

only”

SQL-MapReduce is better for:

•Custom Transformations • e.g. unstructured data, log extraction,

conditional manipulation

•Custom Aggregations

•Inter-row Analysis, like time-series

•Layered queries• Nested queries, sub-queries, recursive

queries

•Analysis that requires reorganization of data into new data structures

• Graph analysis, decision trees, etc.

Aster SQL-MapReduce

For Analytics: SQL or SQL-For Analytics: SQL or SQL-MapReduceMapReduce

Page 13: Big Data and Product Affinity

www.decideo.fr/bruley

Time Series AnalysisTime Series Analysis discover patterns in rows of sequential datadiscover patterns in rows of sequential data

Aster Data SQL/MR Approach• Single-pass of data

• Linked list sequential analysis• Gap recognition

Traditional SQL Approach• Full Table Scans

• Self-Joins for sequencing• Limited operators for ordered data

Purchase 1 Purchase 2 Purchase 3 Purchase 4

{user, product, time}Sales Transactions

Page 14: Big Data and Product Affinity

www.decideo.fr/bruley

Teradata Aster solution

Challenge• Identify correlations between

purchases made over time

With Aster Data• SQL-MapReduce for market

basket analysis indicates correlations between products

Impact• Move beyond “people who

bought this also bought” to time-based recommendations

Identify common product baskets of interestIdentify common product baskets of interest

Product

Catalog

userID EAN Author Store time

15682817 823201 JK Rowling 100 12:00 PM

16816193 123101 Shakespeare 105 1:45 PM

19825996 182191 Rick Riordan 201 3:00 PM

15528047 823201 Walter Isaacson 100 4:20 PM

In-Store Transactions

IPAddress EAN Author time

192.168.20.14 823201 John Grisham 12:00 PM

172.16.254.1 123101 Dostoevsky 1:45 PM

216.27.61.137 182191 Obama 3:00 PM

194.66.82.11 823201 Stephen King 4:20 PM

Online Transactions

item_no type EAN

12334 book 823201

13345 music --

21456 periodical --

82673 toy --

Cross-Channel Transactions

43M Customers Online Alone!

Page 15: Big Data and Product Affinity

www.decideo.fr/bruley

Basket Affinity: Retail Business Basket Affinity: Retail Business NeedNeed

Overview:For most retailers, Market basket affinity is a well known tool for cross-promotions and marketing.However, there is very little affinity known “outside” the basket. For example, there are many cases where the consumer will return to the store to get the additional item(s) they did not purchase initially.

Examples:Electronics retailer (Best Buy, Radio Shack, Fry’s):

– A Blue-Ray player is purchased online on a given date. The same customer visits the store next week to buy HDMI cables and a B-R disc.

Fashion Retailer (Target, Macy’s, J Crew):

– A customer purchases a dress and hand bag one week. Returns within a month to buy matching shoes.

With this sequential affinity analysis, the retailer can send very specific and timely email marketing, to drive traffic and increase revenue.

Page 16: Big Data and Product Affinity

www.decideo.fr/bruley

Challenge• Difficult to do in a relational DB due to the

sheer size of the combinatorial permutations of the various purchasing sequences.

• Requires good customer recognition via a credit card database or a customer loyalty card program.

With Teradata Aster• Use nPath/Sessionization to identify “super”

baskets within a time window. Tighter time window implies higher affinity.

• Run Basket Generator to identify the most frequent affinity items & subcategories.

Impact• Enables more accurate targeting of customer

needs; reduce direct marketing spend, increase revenue yield.

Overview of Cross-Basket AffinityOverview of Cross-Basket Affinity

TransID UserId Date/Time Item UPC

874143 10001 11/12/24 83321

543422 20001 11/12/28 73910

632735 30002 11/12/24 39503

452834 10001 11/12/30 49019

Transactional DB

Cross-Channel Transactions

X Customers X Marketing Campaigns

Retail EDW

UserId Address Phone

10001 10 Main St 555-3421

20001 24 Elm st 232-5451

30002 534 Rich 232-5465

Customer Loyalty

Item UPC Category Dept

83321 Heels Shoes-Womens

73910 Handbags Accessories

39503 Dresses Apparel-Womens

49019 Perfumes Cosmetics

Product/Item Hierachy

Date CampaignID UserId

11/12/24 3241 10001

11/12/28 2352 20001

11/12/24 3241 30002

11/12/30 2352 10001

Marketing/Promotions

Page 17: Big Data and Product Affinity

www.decideo.fr/bruley

Prepares multi-structured data•Stitches rows together by customer in a time-ordered view

Scans all records to produce complete set of sequences•No need to define patterns in advance•Fully parallelized for scalable performance using MapReduce where not feasible with SQL/SAS

Summarize sequential affinity output for business exploration•Rank order the most popular sequential purchase paths.

Cross-Basket Affinity ExampleCross-Basket Affinity Example

Step 2: run Basket Generator to identify frequent affinity items.

TransID UserId Date/Time Item UPC SuperSession

SeqNum

874143 10001 11/12/24 83321 101 1

452834 10001 11/12/30 49019 101 2

Step 1: nPath/ Sessionization to identify “super” baskets.

Aster MapReduce Platform

ProductUpcA

ProductUpcB

Support Confidence

Time Window

SequentialOrder

83321 49019 0.10 0.30 14 days true

73910 83321 0.11 0.25 7 days false

UserId Address Phone

10001 10 Main St 555-3421

20001 24 Elm st 232-5451

30002 534 Rich 232-5465

TransID UserId Date/Time Item UPC

874143 10001 11/12/24 83321

543422 20001 11/12/28 73910

632735 30002 11/12/24 39503

452834 10001 11/12/30 49019

Page 18: Big Data and Product Affinity

www.decideo.fr/bruley

Identifies the Cross-Basket Affinity Identifies the Cross-Basket Affinity ProductsProducts

The frequent sequence of purchased items identifies products B & C which are likely to be sold when a customer buys a certain product A.

– Leverage this Cross-Affinity analysis to run more targeted marketing campaigns; increase affinity purchases

– Personalized email offers yields higher customer retention and loyalty, and reduces churn.

Aster SQL-MR functions nPath/Sessionization and Basket Generators are key algorithmic differentiators; this process cannot be done in a scalable manner in a relational DB and/or SAS

Page 19: Big Data and Product Affinity

www.decideo.fr/bruley

Analyzing item price movements and its impact on:– Basket size over a long duration (6-10yrs) will

provide key insights into halo impact and affinity contribution for items

– Basket composition over a long duration (6-10yrs) will provide key insights into price bands for items

Analyzing Affinity of items over a long duration (6-10 yrs) will provide key insights into running better promotions, planogram and price planning of around affinity items

Affinity Use Case Affinity Use Case 1/31/3

Page 20: Big Data and Product Affinity

www.decideo.fr/bruley

Affinity Analysis •Analyzing Affinity of items over a long duration (6-10yrs) will provide key insights into running better promotions, planogram and price planning using items affinity •Time Frame: 8 Years, 1 Banner - Data Set: Transaction Data, Product hierarchyConsumer Migration •Analyzing declines in consumer segments over large timeframes.•Time Frame: 3 Years - Data Set: Transaction Data, Segment Data, Competitor Data, Pricing DataPricing Affinity•Analyzing item price movement and its impact on basket size and affinity of items over a long duration (6 years)•Data Set: Transaction Data, Price data - Time Frame: 6 YearsCompetitor Impact•Data Set: Transaction/Consumer/Competitor/Pricing Data, Unit_Inf - Time Frame: 8 YearsSocial Media •Integrating consumer online data (Social Media - Facebook) with existing transaction data and understand impact on consumer loyalty.•Data Set: Should be collected by vendor

Affinity Use Case Affinity Use Case 2/32/3

Page 21: Big Data and Product Affinity

www.decideo.fr/bruley

Data– ~ 8 years of transaction data (2004 up to Sep-2011)– 15 Billion baskets (or transactions)– 225 Stores– 367K Unique UPCs– 12 Categories: Alcohol, Cereal, Frozen – Ice Cream, Laundry

Detergent, Cheese (Shredded/Sliced/Chunk/Other), Paper Towels, Pizza & Shelf Stable Juice

Solution

– Aster SQL-MapReduce: Collaborative Filter

– Query Runtime: 48 minutes (4 Workers using Columnar)

Affinity Use Case Affinity Use Case 3/33/3