Upload
kade-ellison
View
220
Download
0
Embed Size (px)
Citation preview
A presentation byW H Inmon
KIMBALL vs INMON
the essence of the difference between Inmon and Kimball
Inmon –there needs to be a singleversion of the truth
datawarehouse
datamart
datamart
datamart
datamart
integratedhistoricalgranular
sales
finance
marketing
mgmt
HRsingle versionof the truth
the essence of the difference between Inmon and Kimball
datawarehouse
datamart
datamart
datamart
datamart
integratedhistoricalgranular
sales
finance
marketing
mgmt
HRsingle versionof the truth
the question being answered – what is the single versionof the truth? what is corporate data?
the essence of the difference between Inmon and Kimball
Kimball – a data warehouse is the unionof all of the data marts
datamart
datamart
sales
finance
HR
data mart
a data mart is based onbusiness function – Ralph Kimball
the essence of the difference between Inmon and Kimball
datamart
datamart
sales
finance
HR
data mart
the question being answered – how quickly can Ibuild reports? how quickly can I do analysis?
over time the architectures have evolved
1990 2000 2010
Inmon –Single versionof the truth
Kimball –a union ofdata marts
Inmon –an architecturecorporate informationfactory
Kimball –conformed dimension
Inmon –DW 2.0,unstructured data
Kimball –a need forintegration
1990 2000 2010
Inmon –Single versionof the truth
Kimball –a union ofdata marts
Inmon –an architecturecorporate informationfactory
Kimball –conformed dimension
Inmon –DW 2.0,unstructured data
Kimball –a need forintegration
Kimball is today where Inmon was in 1990What has Kimball said to all of those people whofollowed his teachings in 1990?
the essence of the difference between Inmon and Kimball
1990 2000 2010 2020
Inmon –Single versionof the truth
Kimball –a union ofdata marts
Inmon –an architecturecorporateinformationfactory
Kimball –conformeddimension
Inmon –DW 2.0,unstructured data
Kimball –a need forintegration
Kimball –unstructured data belongs in a datawarehouse
prediction – in 2020 the Kimballites will “discover”that textual data belongs in a data warehouse
appl
mktg
sales
finance
mgmt
HR
Engineering
Production
datamarts appl
mktg
sales
finance
mgmt
HR
Engineering
Production
datamarts
Kimball Inmon
from an implementation perspective
appl
mktg
sales
finance
mgmt
HR
Engineering
Production
datamarts appl
mktg
sales
finance
mgmt
HR
Engineering
Production
datamarts
daily refreshment of data
each of these lines must be crossedat least once a day
appl
mktg
sales
finance
mgmt
HR
Engineering
Production
datamarts appl
mktg
sales
finance
mgmt
HR
Engineering
Production
datamarts
daily refreshment of data
mn
m n
m x n m + n
appl
mktg
sales
finance
mgmt
HR
Engineering
Production
datamarts appl
mktg
sales
finance
mgmt
HR
Engineering
Production
datamarts
daily refreshment of data
mn
m n
m x n m + nhow many programs have to bewritten? have to be maintained?
appl
mktg
sales
finance
mgmt
HR
Engineering
Production
datamarts appl
mktg
sales
finance
mgmt
HR
Engineering
Production
datamarts
daily refreshment of data
mn
m n
m x n m + n
which overnight batch processingwindow do you want?
appl
mktg
sales
finance
mgmt
HR
Engineering
Production
datamarts appl
mktg
sales
finance
mgmt
HR
Engineering
Production
datamarts
$1000
$32000
$1,009,087
$1000
$32000
$1,009,087
reconciliation
in which environment would you ratherdo reconciliation?
appl
mktg
sales
finance
mgmt
HR
Engineering
Production
datamarts appl
mktg
sales
finance
mgmt
HR
Engineering
Production
datamarts
in which environment would you rather adda new data mart?
star schema(Kimball)
relational baseddata warehouse(Inmon)
from an architectural perspective
star schema(Kimball)
relational baseddata warehouse(Inmon)
good for fast reportsnot a short term propositiongood for a system of record
as an end user I am confused…there are 17 data marts that have informationand I don’t know which one to go to. And theyall have different information
every time there is a new requirementI have to start from scratch. And thesedarn data marts are hard to maintain.I have to build a new one every timethere is a change in requirements
we have had data marts for five years now.We have 250 of them and only 10 of themare actually being used today……
I’ve got these auditors coming in and I don’thave any data that I trust that I can showthem……
with Kimball, the starschema is the architecture with Inmon, the relational
foundation is only the start ofthe architecture
Verycurrent
Interactive
Less thancurrent
Near line
Te xt to s ubj
Text id ......
Interna l, external
Textualsubjects
Capturedtext
Linkage
S im p lep oin te r
Current++
Integrated
Te xt to s ubj
Text id ......
Interna l, external
Textualsubjects
Capturedtext
Linkage
S im p lep oin te r
Transactiondata
Appl
Appl
Appl
Sum mary
Subj
Subj
Subj
Subj
Detailed Continuoussnapshotdata
Subj
Subj
Subj
Profiledata
Sum mary
Subj
Subj
Subj
Subj
Detailed Continuoussnapshotdata
Subj
Subj
Subj
Profiledata
Sum mary
Subj
Subj
Subj
Subj
Detailed Continuoussnapshotdata
Subj
Subj
Subj
Profiledata
O lder
Archival
Te xt to s ubj
Text id ......
Interna l, external
Textualsubjects
Capturedtext
Linkage
S im p lep oin te r
the Inmon approach is a FULL architectureleading to DW 2.0. And DW 2.0 is a truefull scale architecture
Verycurrent
Interactive
Less thancurrent
Near line
Te xt to s ub j
Text id ......
Interna l, external
Textualsubjects
Capturedtext
Linkage
S im p lep oin te r
Curren t++
Integrated
Te xt to s ub j
Text id ......
Interna l, external
Textualsubjects
Capturedtext
Linkage
S im p lep oin te r
Transactiondata
Appl
Appl
Appl
Sum mary
Subj
Subj
Subj
Subj
Detailed Continuoussnapshotdata
Subj
Subj
Subj
Pro filedata
Sum mary
Subj
Subj
Subj
Subj
Detailed Continuoussnapshotdata
Subj
Subj
Subj
Pro filedata
Sum mary
Subj
Subj
Subj
Subj
Detailed Continuoussnapshotdata
Subj
Subj
Subj
Pro filedata
O lder
Archival
Te xt to s ub j
Text id ......
Interna l, external
Textualsubjects
Capturedtext
Linkage
S im p lep oin te r
DW 2.0 supports some really importantarchitectural features – - the life cycle of data within the data warehouse - the accommodation for very large amounts of data - the recognition that cost is the ultimate limiting factor for a data warehouse - unstructured data as an essential component - metadata as an essential component
ask Kimball how he supports unstructured data?ask Kimball how he supports metadata?ask Kimball how he supports really large amounts of data?ask Kimball how he supports archival data?
corporate data
structured data unstructured data
the vast majority of corporate data is not structured
structured data unstructured data
Kimball
structured data unstructured data
Inmon
the Inmon architecture is complete;the Kimball architecture is not
Verycurrent
Interactive
Less thancurrent
Near line
Text to subj
Text id ......
In terna l, ex te rnal
Textualsubjects
Capturedtext
Linkage
S im p lep oin te r
Curren t++
Integ rated
Text to subj
Text id ......
In terna l, ex te rnal
Textualsubjects
Capturedtext
Linkage
S im p lep oin te r
Transactiondata
Appl
Appl
Appl
Sum mary
Subj
Subj
Subj
Subj
Detailed Continuoussnapshotdata
Subj
Subj
Subj
Pro filedata
Sum mary
Subj
Subj
Subj
Subj
Detailed Continuoussnapshotdata
Subj
Subj
Subj
Pro filedata
Sum mary
Subj
Subj
Subj
Subj
Detailed Continuoussnapshotdata
Subj
Subj
Subj
Pro filedata
O lder
Archival
Text to subj
Text id ......
In terna l, ex te rnal
Textualsubjects
Capturedtext
Linkage
S im p lep oin te r
Florida
South AmericaNYCChicagoHawaiiSao PaoloMexicoCanada
BermudaDenverCalgaryLos AngelesGold CoastFloridaMiamiSan FranciscoSeattle
Kimball
Inmon
Kimball only addresses one small partof architecture. Inmon addresses a muchmore comprehensive picture
datawarehouse
datamart
datamart
datamart
datamart
datamart
integratedhistoricalgranular
sales
financemarketing
mgmt
HR
how Inmon/Kimball fit together