33
Copyright 2007, Information Builders. Slide 1 Large Database and What to Do Bigger is Better! Floria Hanspard-Foote

Large Database and What to Do Bigger is Better!

  • Upload
    jensen

  • View
    78

  • Download
    0

Embed Size (px)

DESCRIPTION

Large Database and What to Do Bigger is Better!. Floria Hanspard-Foote. Bigger Is Better Agenda. New Limits Estimating File Size ACCESS File Horizontal Partitioning Vertical Partitioning Intelligent USES. EMPLOYEE Lots of Employees. Bigger Is Better Employee File. EMP_ID - PowerPoint PPT Presentation

Citation preview

Page 1: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 1

Large Database and What to DoBigger is Better!

Floria Hanspard-Foote

Page 2: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 2

Bigger Is BetterAgenda

New LimitsEstimating File SizeACCESS FileHorizontal PartitioningVertical Partitioning

Intelligent USES

Page 3: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 3

EMPLOYEELots of Employees

Page 4: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 4

Bigger Is BetterEmployee File

EMP_ID LAST_NAME FIRST_NAME HIRE_DATE

EMP_ID LAST_NAME FIRST_NAME HIRE_DATE

DAT_INC PCT_INC SALARY JOBCODE

DAT_INC PCT_INC SALARY JOBCODE

PAY_DATE GROSSNET CHECK_NO

PAY_DATE GROSSNET CHECK_NO

BANK_NAME BANK_CODE BANK_ACCT EFFECT_DATE

I

I

Page 5: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 5

Bigger Is BetterLimits for FOCUS/FUSION files

Number of Pages: FUSION -- 512K pages FOCUS --

Release 7.2 512K pages 2gig Release 7.1 512K pages 2gig Release 7.0 256K pages 1gig

Number of Segments 64 Number of Indexes + text fields + segments 189 Number of Fields 3072 Segment Size (data + pointers) 3968 bytes

Page 6: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 6

Bigger is BetterEstimating File Size

Each Instance is comprised of Data and Pointers Data

An n bytes of Storage In 4 bytesDn.m 8 bytesFn.m 4 bytesPn.m (n <= 15) 8 bytesPn.m (n > 15) 16 bytesSmart Dates 4 bytesPLUS filler to pad to full word (4 bytes)

Page 7: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 7

Bigger is BetterEstimating File Size

Pointers 4 Bytes: type, page #, Word OffsetTypes:

Parent to Real ChildParent to KU ChildParent to KM ChildChild to ParentForward ChainDeleted, Free, End-of-Chain

Page 8: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 8

Bigger is BetterEstimating File Size

Index = I Internal Index – Updated as File is UpdatedPer Entry:

ValueAddress of Data Instance

Pages may be ½ full

Page 9: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 9

Bigger is BetterEstimating File Size

SEGNAME=EMPINFO, SEGTYPE=S1 FIELDNAME=EMP_ID, ALIAS=EID, FORMAT=A9, INDEX=I, $

FIELDNAME=LAST_NAME, ALIAS=LN, FORMAT=A15, $ FIELDNAME=FIRST_NAME, ALIAS=FN, FORMAT=A10, $ FIELDNAME=HIRE_DATE, ALIAS=HDT, FORMAT=YYMD , $ FIELDNAME=DEPARTMENT, ALIAS=DPT, FORMAT=A10, $ FIELDNAME=CURR_SAL, ALIAS=CSAL, FORMAT=D12.2M, $ FIELDNAME=CURR_JOBCODE, ALIAS=CJC, FORMAT=A3, $ FIELDNAME=ED_HRS, ALIAS=OJT, FORMAT=F6.2, $

Pointers: 3 – Parent – to – Child ; 1 – Chain = 16 bytes

Data: 63 Bytes + 1 “filler” = 64 bytes

Total: 80 bytes Instances/Page = 49

Page 10: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 10

Bigger is BetterEX CALCFILE

SEGMENT NUM FOCUS TOTAL TOTAL TOTAL

SEG /INDEX PER 4K BYTES POINTER DATA

NO NAME PAGE INSTANCES PAGES USED BYTES BYTES

--- ------- ---- --------- ----- ----- ------- -----

SEGMENT SPACE

==============

1 EMPINFO 49 10000 205 800000 160000 640000

2 FUNDTRAN 99 10000 102 400000 80000 320000

3 PAYINFO 141 50000 355 1400000 400000 1000000

4 SALINFO 99 1040000 10506 41600000 8320000 33280000

INDEX SPACE (ASSUMES HALF FULL PAGES)

==============

1 EMP_ID 151 10000 68 130000 40000 90000

4 CHECK_NO 151 1040000 6889 13520000 4160000 9360000

TOTAL 2160000 18125 57850000 13160000 44690000

Page 11: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 11

Bigger is BetterEX CALCFILE

SEGMENT NUM FOCUS TOTAL TOTAL TOTAL SEG /INDEX PER 4K BYTES POINTER DATA NO NAME PAGE INSTANCES PAGES USED BYTES BYTES --- ------- ---- --------- ----- ----- ------- ----- SEGMENT SPACE ============== 1 EMPINFO 49 100000 2041 8000000 1600000 6400000 2 FUNDTRAN 99 100000 1011 4000000 800000 3200000 3 PAYINFO 141 1000000 7093 28000000 8000000 20000000 4 SALINFO 99 10400000 105051 416000000 83200000 332800000 INDEX SPACE (ASSUMES HALF FULL PAGES) ============== 1 EMP_ID 151 100000 664 1300000 400000 900000 4 CHECK_NO 151 10400000 68876 135200000 41600000 93600000 TOTAL 22100000 184736 592500000 135600000 456900000

Page 12: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 12

Bigger is BetterACCESS File Points to the FOCUS Files

MASTERNAME EMPLOYEE

DATANAME 'EMPLOYEE FOCUS A'

MASTERNAME JOBFILE

DATANAME 'JOBFILE FOCUS F'

MASTERNAME EMPLOYEE

DATANAME 'EMPLOYEE FOCUS A'

MASTERNAME JOBFILE

DATANAME 'JOBFILE FOCUS F'

FILENAME=EMPLOYEE, SUFFIX=FOC,ACCESS=EMPLOYEE,$

FILENAME=JOBFILE ,SUFFIX=FOC, ACCESS = EMPLOYEE,$

Page 13: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 13

Horizontal Partitioning

Page 14: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 14

Bigger Is BetterHEmploye File

EMP_ID LAST_NAME FIRST_NAME HIRE_DATE

EMP_ID LAST_NAME FIRST_NAME HIRE_DATE

DAT_INC PCT_INC SALARY JOBCODE

DAT_INC PCT_INC SALARY JOBCODE

PAY_DATE GROSSNET CHECK_NO

PAY_DATE GROSSNET CHECK_NO

BANK_NAME BANK_CODE BANK_ACCT EFFECT_DATE

I

I

Page 15: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 15

Bigger is BetterACCESS File Points to the FOCUS Files

MASTERNAME HEMPLOYE DATANAME 'HEMPLOYE FOCUS A' LOCATION FUNDS DATANAME ‘FUNDS FOCUS B’ LOCATION SALS DATANAME ‘SALS FOCUS C’

MASTERNAME JOBFILE DATANAME 'JOBFILE FOCUS F'

Page 16: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 16

Bigger is BetterACCESS File Points to the FOCUS Files

FILENAME=HEMPLOYE, SUFFIX=FOC,ACCESS=EMPLOYEE,$ SEGNAME=EMPINFO, SEGTYPE=S1 FIELDNAME=EMP_ID, ALIAS=EID, FORMAT=A9, INDEX=I,$

. . .

SEGNAME=FUNDTRAN, SEGTYPE=U, PARENT=EMPINFO , LOCATION = FUNDS FIELDNAME=BANK_NAME, ALIAS=BN, FORMAT=A20, $

. . . SEGNAME=PAYINFO, SEGTYPE=SH1, PARENT=EMPINFO , LOCATION = HEMPLOYE FIELDNAME=DAT_INC, ALIAS=DI, FORMAT=MDYY , $

. . .

SEGNAME=SALINFO, SEGTYPE=SH1, PARENT=EMPINFO , LOCATION = SALS FIELDNAME=PAY_DATE, ALIAS=PD, FORMAT=MDYY , $

. . .

Page 17: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 17

Bigger is BetterACCESS File Points to the FOCUS Files

STATUS OF FOCUS FILE: HEMPLOYE FOCUS A ON 04/18/2001 AT 15.21.22 ACTIVE DELETED DATE OF TIME OF LAST TRANS SEGNAME COUNT COUNT LAST CHG LAST CHG NUMBER EMPINFO 12 04/18/2001 15.21.22 70 FUNDTRAN 00.00.00 PAYINFO 19 04/18/2001 15.21.22 70 SALINFO 00.00.00 *INDEXES* EMP_ID 04/18/2001 15.21.22 70 CHECK_NO 00.00.00 TOTAL SEGS 31 TOTAL CHAR 1148 TOTAL PAGES 3 LAST CHANGE 04/18/2001 15.21.22 70

Page 18: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 18

Bigger is BetterACCESS File Points to the FOCUS Files

STATUS OF FOCUS FILE: FUNDS FOCUS B1 ON 04/18/2001 AT 15.21.22 ACTIVE DELETED DATE OF TIME OF LAST TRANS SEGNAME COUNT COUNT LAST CHG LAST CHG NUMBER EMPINFO 00.00.00 FUNDTRAN 12 04/18/2001 15.21.22 70 PAYINFO 00.00.00 SALINFO 00.00.00 EMP_ID 00.00.00 CHECK_NO 00.00.00 TOTAL SEGS 12 TOTAL CHAR 384 TOTAL PAGES 1 LAST CHANGE 04/18/2001 15.21.22 70

Page 19: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 19

Bigger is BetterACCESS File Points to the FOCUS Files

STATUS OF FOCUS FILE: SALS FOCUS C1 ON 04/18/2001 AT 15.21.22 ACTIVE DELETED DATE OF TIME OF LAST TRANS SEGNAME COUNT COUNT LAST CHG LAST CHG NUMBER EMPINFO 00.00.00 FUNDTRAN 00.00.00 PAYINFO 00.00.00 SALINFO 70 04/18/2001 15.21.22 70 EMP_ID 00.00.00 *INDEXES* CHECK_NO 04/18/2001 15.21.22 70 TOTAL SEGS 70 TOTAL CHAR 2240 TOTAL PAGES 2 LAST CHANGE 04/18/2001 15.21.22 70

Page 20: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 20

VerticalPartitioning

Page 21: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 21

Bigger Is BetterUSEmploye File

EMP_ID LAST_NAME FIRST_NAME HIRE_DATE

EMP_ID LAST_NAME FIRST_NAME HIRE_DATE

DAT_INC PCT_INC SALARY JOBCODE

DAT_INC PCT_INC SALARY JOBCODE

PAY_DATE GROSSNET CHECK_NO

PAY_DATE GROSSNET CHECK_NO

BANK_NAME BANK_CODE BANK_ACCT EFFECT_DATE

I

I

USEMP FOCUS

USSALS FOCUSUSFUND FOCUS

Page 22: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 22

Bigger Is BetterCAEmploye File

EMP_ID LAST_NAME FIRST_NAME HIRE_DATE

EMP_ID LAST_NAME FIRST_NAME HIRE_DATE

DAT_INC PCT_INC SALARY JOBCODE

DAT_INC PCT_INC SALARY JOBCODE

PAY_DATE GROSSNET CHECK_NO

PAY_DATE GROSSNET CHECK_NO

BANK_NAME BANK_CODE BANK_ACCT EFFECT_DATE

I

I

CAEMP FOCUS

CASALS FOCUSCAFUND FOCUS

Page 23: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 23

Bigger Is BetterEUEmploye File

EMP_ID LAST_NAME FIRST_NAME HIRE_DATE

EMP_ID LAST_NAME FIRST_NAME HIRE_DATE

DAT_INC PCT_INC SALARY JOBCODE

DAT_INC PCT_INC SALARY JOBCODE

PAY_DATE GROSSNET CHECK_NO

PAY_DATE GROSSNET CHECK_NO

BANK_NAME BANK_CODE BANK_ACCT EFFECT_DATE

I

I

EUEMP FOCUS

EUSALS FOCUSEUFUND FOCUS

Page 24: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 24

Bigger is BetterACCESS File Points to the FOCUS Files

MASTERNAME HEMPLOYE DATANAME 'USEMP FOCUS A' LOCATION FUNDS DATANAME 'USFUND FOCUS A' LOCATION SALS DATANAME 'USSALS FOCUS A' DATANAME 'CAEMP FOCUS B' LOCATION FUNDS DATANAME 'CAFUND FOCUS B' LOCATION SALS DATANAME 'CASALS FOCUS B' DATANAME 'EUEMP FOCUS C' LOCATION FUNDS DATANAME 'EUFUND FOCUS C' LOCATION SALS DATANAME 'EUSALS FOCUS C'

Page 25: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 25

Bigger is BetterACCESS File Points to the FOCUS FilesMASTERNAME HEMPLOYE DATANAME 'USEMP FOCUS A' WHERE DEPARTMENT EQ ' PRODUCTION' OR 'MIS' ; LOCATION FUNDS DATANAME 'USFUND FOCUS A' LOCATION SALS DATANAME 'USSALS FOCUS A' DATANAME 'CAEMP FOCUS B WHERE DEPARTMENT EQ ‘CANADA’; LOCATION FUNDS DATANAME 'CAFUND FOCUS B' LOCATION SALS DATANAME 'CASALS FOCUS B' DATANAME 'EUEMP FOCUS C' WHERE DEPARTMENT EQ ‘EUROPE’; LOCATION FUNDS DATANAME 'EUFUND FOCUS C' LOCATION SALS DATANAME 'EUSALS FOCUS C'

Page 26: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 26

Bigger is BetterIntelligent Partitions

Without WHERE test in ACCESS FileEach Partition is read in turn IF/WHERE test applied to real data

WHERE test in ACCESS File IF/WHERE test applied before partition is

openedAlternative: Build USEs

Page 27: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 27

Bigger is BetterIntelligent USE – no ACCESS File

-SET &LIST =‘CANADA,EUROPE,MIS,PRODUCTION,ALL’;

-PROMPT &DEPARTMENT.(&LIST).ENTER DEPARTMENT OR ALL.

-IF &DEPARTMENT NE ‘ALL’ GOTO SMUSE;

-ALLUSE

USE

USEMP FOCUS A AS HEMPLOYE

USFUND FOCUS A AS FUNDS

USSALS FOCUS A AS SALS

CAEMP FOCUS B AS HEMPLOYE

CAFUND FOCUS B AS FUNDS

CASALS FOCUS B AS SALS

Page 28: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 28

Bigger is BetterIntelligent USE – no ACCESS File

EUEMP FOCUS C AS HEMPLOYE

EUFUND FOCUS C AS FUNDS

EUSALS FOCUS C AS SALS

END

-GOTO DOIT

--SMUSE

USE

-SET &PREF = IF &DEPARTMENT EQ ‘CANADA’ THEN ‘CA’ ELSE

- IF &DEPARTMENT EQ ‘EUROPE’ THEN ‘EU’ ELSE ‘US’;

&PREF|EMP FOCUS * AS HEMPLOYE

&PREF|FUND FOCUS * AS FUNDS

&PREF|SALS FOCUS * AS SALS

END

Page 29: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 29

Bigger is BetterIntelligent USE – no ACCESS File

-DOIT

JOIN CJC IN HEMPLOYE TO JOBCODE IN JOBFILE AS AJ

TABLE FILE HEMPLOYE

PRINT PAY_DATE BY EID BY JOB_DESC

-IF &DEPARTMENT EQ ‘ALL’ GOTO NOIF;

WHERE DEPARTMENT EQ ‘&DEPARTMENT’

-NOIF

END

Page 30: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 30

Bigger is BetterMaybe

Since each physical file can be 2 gig (512K pages) Up to 250 partitions (horizontal and vertical) Each partition can be on its own disk

MVS – span multiple packs Maximum size of a database is now 500 gigabytes of

data. Intelligent Partitioning will speed access, even if the size is not

the issue.Remember FOCSORT can now be 2 gigabytes also.

Page 31: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 31

Bigger is BetterBut there are Seven of Them

Which is better? One big database Seven smaller ones – intelligently partitioned

What’s the predominant access Data Maintenance - BIG database

Alternatives – Separate MASTERs for each smaller file; COMBINE or MAINTAIN

Reporting -- Seven smaller files

Page 32: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 32

Before You leave!

Be Sure To Visit Our Problem Isolation Debugging Tool Site

http://techsupport.informationbuilders.com/app/css_web_tool/default.htm

Page 33: Large Database and What to Do Bigger is Better!

Copyright 2007, Information Builders. Slide 33

QUESTIONS