58
Which One is Different Data Mining and Forensic Analytics Bill Douglas

Which one is different data mining and forensic analytics

Embed Size (px)

DESCRIPTION

 

Citation preview

Page 1: Which one is different data mining and forensic analytics

Which One is DifferentData Mining and Forensic AnalyticsBill Douglas

Page 2: Which one is different data mining and forensic analytics

Cost Advisors’ Background

Founded in 1999

Mission: Improve our client’s business and the lives of our employees

Focus on Accounting Investigation and Forensics

Logo symbolizes partnership with our clients

© 2008 Cost Advisors, Inc. All rights reserved.

2

Page 3: Which one is different data mining and forensic analytics

Bill Douglas’ BackgroundPresident at Cost Advisors, Inc.

33 years experience

Management positions in Accounting, Sales, Marketing

CFO, IPO, 'Big 4' public accounting, business processes, recovery auditing, internal controls, fraud, internal auditing, Sarbanes-Oxley (SOX)

Financial project management at both large and small public companies

Volunteer Washington County Sheriff’s Dept. – Fraud Team

Frequent speaker and writer about Internal Controls, Fraud

© 2012 Cost Advisors, Inc. All rights reserved.

3

Page 4: Which one is different data mining and forensic analytics

Bill Douglas’ Background

Credentials and memberships:OR, CA, WA Certified Public Accountant (CPA)

Certified Internal Auditor (CIA)

Certified Fraud Examiner (CFE)

Certified in Financial Forensics (CFF)

Certified IT Professional (CITP)

OR Licensed Private Investigator (PI)

© 2012 Cost Advisors, Inc. All rights reserved.

4

Northwest Fraud Investigators Association

MULTNOMAH BAR

ASSOCIATION (Affiliate Member)

Page 5: Which one is different data mining and forensic analytics

Agenda

1. Data Mining Examples

2. Data mining you can do in Excel

© 2012 Cost Advisors, Inc. All rights reserved.

5

Page 6: Which one is different data mining and forensic analytics

1. Data Mining for Fraud

© 2012 Cost Advisors, Inc. All rights reserved.

6

What is CAATs?

Example #1 Accounting Queries

Example #2 Scanning Bank Statements

Example #3 Benford’s Law

Page 7: Which one is different data mining and forensic analytics

What is CAATs?

Computer Assisted Audit Tools (CAATs)

Examine 100% of transactions

Analysis available:Duplicates

Missing Records

Queries (meeting certain criteria)

Population summaries by field (pivot tables)

Population statistics

© 2012 Cost Advisors, Inc. All rights reserved.

7

Page 8: Which one is different data mining and forensic analytics

Data Sources

Import data from many sourcesExcel

Acrobat (.pdf)

Text Files (.txt, .doc)

Print files (.prn)

Hardcopy scans

© 2012 Cost Advisors, Inc. All rights reserved.

8

Page 9: Which one is different data mining and forensic analytics

.PRN File

© 2012 Cost Advisors, Inc. All rights reserved.

9

Page 10: Which one is different data mining and forensic analytics

Example #1 Accounting Queries

© 2009 Cost Advisors, Inc. All rights reserved.

10

Disbursements(Checks)

Vendor Master List

Employee Master List

Accounting System

Page 11: Which one is different data mining and forensic analytics

Data Mining Tool

Example #1 Accounting Queries

© 2009 Cost Advisors, Inc. All rights reserved.

11

Disbursements(Checks)

Vendor Master List

Employee Master List

Page 12: Which one is different data mining and forensic analytics

Example #1 Six Accounting Queries

© 2009 Cost Advisors, Inc. All rights reserved.

12

Vendors with same address as employeeVendors using SS# as EIN

Payee not on Vendor List Non-payroll, non-expense report, payments to employees

Duplicate Payments

Employees with no address

Disbursements(Checks)

Vendor Master List

Employee Master List

Page 13: Which one is different data mining and forensic analytics

Example #2One Set of Books?

© 2009 Cost Advisors, Inc. All rights reserved.

13

Victim’sAccounting System = Victim’s

Bank Statement

Page 14: Which one is different data mining and forensic analytics

Data Extraction - Review

© 2009 Cost Advisors, Inc. All rights reserved.

14

Disbursements(Checks)

Vendor Master List

Employee Master List

Accounting System

Page 15: Which one is different data mining and forensic analytics

Disbursements in Excel

© 2009 Cost Advisors, Inc. All rights reserved.

15

Disbursements(Checks)

Page 16: Which one is different data mining and forensic analytics

Example #2- Scanning Bank Statements

© 2009 Cost Advisors, Inc. All rights reserved.

16

=Electronic Comparison

Victim’sBank Statement

Disbursements(per Accounting

System)

Missing

Page 17: Which one is different data mining and forensic analytics

Example #3 -Benford’s Law

Frank Benford (1938), Simon Newcomb (1881)

Some leading digits occur more/less frequently in most data

© 2012 Cost Advisors, Inc. All rights reserved.

17

1 2 3 4 5 6 7 8 90.00%

5.00%

10.00%

15.00%

20.00%

25.00%

30.00%

35.00%

Probability

Leading Digit

Page 18: Which one is different data mining and forensic analytics

Example #3 -Benford’s Law

Compares expected amounts to actual amounts

There were 1,368 occurrences of amounts beginning with $250

© 2012 Cost Advisors, Inc. All rights reserved.

18

Page 19: Which one is different data mining and forensic analytics

Summary of CAATs

Data from any source

Every transaction can be tested (no sampling)

Many tests possible. Comparison examples:Within accounting files

Accounting records to bank statements

Actual records to expected values (Benford)

© 2012 Cost Advisors, Inc. All rights reserved.

19

Page 20: Which one is different data mining and forensic analytics

Agenda

1. Data Mining Examples

2. Data mining you can do in Excel

© 2012 Cost Advisors, Inc. All rights reserved.

20

Page 21: Which one is different data mining and forensic analytics

Goals and Assumptions

Do basic investigation yourself

1 hour spent here will save dozens (hundreds?) of hours at work

Assumptions:

Data is in Excel 2007 or 2010

Basic knowledge of Excel (info for advanced Excel too)

© 2012 Cost Advisors, Inc. All rights reserved.

21

Page 22: Which one is different data mining and forensic analytics

Data Mining in ExcelData Filters

Empty data fields

Conditional formattingDuplicates

Comparing two Excel filesPayee not on vendor list

Pivot TablesHigh-dollar vendors

Missing checks

PowerPivot

Reporting

© 2012 Cost Advisors, Inc. All rights reserved.

22

Page 23: Which one is different data mining and forensic analytics

Data Filters - Setting

© 2012 Cost Advisors, Inc. All rights reserved.

23

Page 24: Which one is different data mining and forensic analytics

Data Filters - Blanks

© 2012 Cost Advisors, Inc. All rights reserved.

24

Page 25: Which one is different data mining and forensic analytics

Data Filters - Others

© 2012 Cost Advisors, Inc. All rights reserved.

25

Page 26: Which one is different data mining and forensic analytics

Data Filters - Suggestions

© 2012 Cost Advisors, Inc. All rights reserved.

26

Blank invoice numbers

Employees or vendors with no address

Vendors using a social security instead of EIN

Odd characters at the end of the invoice number or check number (“.” “–” “a”)

Invoice numbers 100, 101, 1000 or 1001

Page 27: Which one is different data mining and forensic analytics

Data Filters - Clearing

© 2012 Cost Advisors, Inc. All rights reserved.

27

Page 28: Which one is different data mining and forensic analytics

Data Mining in ExcelData Filters

Empty data fields

Conditional formattingDuplicates

Comparing two Excel filesPayee not on vendor list

Pivot TablesHigh-dollar vendors

Missing checks

PowerPivot

Reporting

© 2012 Cost Advisors, Inc. All rights reserved.

28

Page 29: Which one is different data mining and forensic analytics

Conditional Formatting -

© 2012 Cost Advisors, Inc. All rights reserved.

29

Page 30: Which one is different data mining and forensic analytics

Conditional Formatting with Data Filter

© 2012 Cost Advisors, Inc. All rights reserved.

30

Page 31: Which one is different data mining and forensic analytics

Conditional Format & Filter - Result

© 2012 Cost Advisors, Inc. All rights reserved.

31

Page 32: Which one is different data mining and forensic analytics

Conditional Format - Suggestions

© 2012 Cost Advisors, Inc. All rights reserved.

32

Look for duplicates of:

Invoice date

Invoice number

Invoice amount

Vendor name

Page 33: Which one is different data mining and forensic analytics

Data Mining in ExcelData Filters

Empty data fields

Conditional formattingDuplicates

Comparing two Excel filesPayee not on vendor list

Pivot TablesHigh-dollar vendors

Missing checks

PowerPivot

Reporting

© 2012 Cost Advisors, Inc. All rights reserved.

33

Page 34: Which one is different data mining and forensic analytics

Comparing Excel Files – First Sheet

© 2012 Cost Advisors, Inc. All rights reserved.

34

Page 35: Which one is different data mining and forensic analytics

Comparing Excel Files – Second Sheet

© 2012 Cost Advisors, Inc. All rights reserved.

35

Page 36: Which one is different data mining and forensic analytics

Comparing Excel Files – Result

© 2012 Cost Advisors, Inc. All rights reserved.

36

These vendors are missing from the vendor master list

Page 37: Which one is different data mining and forensic analytics

Data Mining in ExcelData Filters

Empty data fields

Conditional formattingDuplicates

Comparing two Excel filesPayee not on vendor list

Pivot TablesHigh-dollar vendors

Missing checks

PowerPivot

Reporting

© 2012 Cost Advisors, Inc. All rights reserved.

37

Page 38: Which one is different data mining and forensic analytics

Pivot Table – Largest Vendors (step 1)

© 2012 Cost Advisors, Inc. All rights reserved.

38

Page 39: Which one is different data mining and forensic analytics

Pivot Table – Largest Vendors (step 2)

© 2012 Cost Advisors, Inc. All rights reserved.

39

Page 40: Which one is different data mining and forensic analytics

Pivot Table – Largest Vendors (step 3)

© 2012 Cost Advisors, Inc. All rights reserved.

40

Page 41: Which one is different data mining and forensic analytics

Pivot Table – Largest Vendors - Suggestions

© 2012 Cost Advisors, Inc. All rights reserved.

41

Look for unusual vendor names and names of employees (‘cash’, ‘petty cash’, <blanks>, ‘bank’, ‘credit card’, etc.)

Discuss vendor disbursement levels with management

Page 42: Which one is different data mining and forensic analytics

Pivot Tables – Missing Check #s

© 2012 Cost Advisors, Inc. All rights reserved.

42

Page 43: Which one is different data mining and forensic analytics

Data Mining in ExcelData Filters

Empty data fields

Conditional formattingDuplicates

Comparing two Excel filesPayee not on vendor list

Pivot TablesHigh-dollar vendors

Missing checks

PowerPivot

Reporting

© 2012 Cost Advisors, Inc. All rights reserved.

43

Page 44: Which one is different data mining and forensic analytics

What is PowerPivot

From Microsoft for Office (Excel) 2010

It’s Free

FeaturesTurns Excel into a relational database

Compresses data

Speeds recalculation

(DAX Reporting tool)

© 2012 Cost Advisors, Inc. All rights reserved.

44

Page 45: Which one is different data mining and forensic analytics

How to Get PowerPivot

64 bit Excel vs. 32 bit Excel

© 2012 Cost Advisors, Inc. All rights reserved.

45

Page 46: Which one is different data mining and forensic analytics

Menu

© 2012 Cost Advisors, Inc. All rights reserved.

46

Normal Tabs

PowerPivot Tabs

Page 47: Which one is different data mining and forensic analytics

Menu

© 2012 Cost Advisors, Inc. All rights reserved.

47

Normal Tabs

PowerPivot Tabs

Page 48: Which one is different data mining and forensic analytics

Pivot Fields from Multiple Tabs (Tables)

© 2012 Cost Advisors, Inc. All rights reserved.

48

Page 49: Which one is different data mining and forensic analytics

PowerPivot Compression, SpeedAccess Excel (native) PowerPivot

Compression 327MB 82MB 12MB

Recalculation ~ 30min < 30 seconds

Worksheet size ~ 2GB 1,048,576 rows Millions of rows~2GB

© 2012 Cost Advisors, Inc. All rights reserved.

49

Page 50: Which one is different data mining and forensic analytics

Data Mining in ExcelData Filters

Empty data fields

Conditional formattingDuplicates

Comparing two Excel filesPayee not on vendor list

Pivot TablesHigh-dollar vendors

Missing checks

PowerPivot

Reporting

© 2012 Cost Advisors, Inc. All rights reserved.

50

Page 51: Which one is different data mining and forensic analytics

Reporting – Set Print Area

© 2012 Cost Advisors, Inc. All rights reserved.

51

Page 52: Which one is different data mining and forensic analytics

Reporting – Setup (Header)

© 2012 Cost Advisors, Inc. All rights reserved.

52

Page 53: Which one is different data mining and forensic analytics

Reporting – Setup (Footer)

© 2012 Cost Advisors, Inc. All rights reserved.

53

Page 54: Which one is different data mining and forensic analytics

Reporting – Setup (Result)

© 2012 Cost Advisors, Inc. All rights reserved.

54

Page 55: Which one is different data mining and forensic analytics

Reporting - Duplicating Tabs

© 2012 Cost Advisors, Inc. All rights reserved.

55

Page 56: Which one is different data mining and forensic analytics

Reporting – Removing Meta Data

© 2012 Cost Advisors, Inc. All rights reserved.

56

Page 57: Which one is different data mining and forensic analytics

Reporting – encrypting for sending

© 2012 Cost Advisors, Inc. All rights reserved.

57

Be sure to save the workbook with a new name - append

“(encrypted)” to the filename

Page 58: Which one is different data mining and forensic analytics

For More Information

Cost Advisors, Inc.

503-704-3719

www.costadvisors.com

Download: ‘Embezzlement Response Guide’

© 2012 Cost Advisors, Inc. All rights reserved.

58