23
BigData Architectures Daan Gerits Dasos

Big data architectures

Embed Size (px)

DESCRIPTION

Presentation given at @DamnData discussing different architecture types for BigData environments

Citation preview

Page 1: Big data architectures

BigData Architectures

Daan GeritsDasos

Page 2: Big data architectures

Volume

We already have that:

- NAS/SAN- High Performance Computing

IOIIOIIOIOIOIIIOIOOOOIOIOOIIOIIIIIOIIOIIOIOIOIOIOIOIIOIIOIOIOIIIOIOOOOIOIOOIIOIIIIIOIIOIIOIOIOIOIOIOIIOIIOIOIOIIIOIOOOOIOIOOIIOIIIIIOIIOIIOIOIOIOIOIOIIOIIOIOIOIIIOIOOOOIOIOOIIOIIIIIOIIOIIOIOIOIOIOIOIIOIIOIOIOIIIOIOOOOIOIOOIIOIIIIIOIIOIIOIOIOIOIOIOIIOIIOIOIOIIIOIOOOOIOIOOIIOIIIIIOIIOIIOIOIOIOIOIOIIOIIOIOIOIIIOIOOOOIOIOOIIOIIIIIOIIOIIOIOIOIOIOIOIOIIOIOIOIIIOIOOOOIOIOOIIOIIIIIOIIOIIOIOIOIOIOIOIIOIIOIOIOIIIOIOOOOIOII

Page 3: Big data architectures

Variety

We already have that:

- Meta-modeling- NAS/SAN

IOIIIOII IOII

IOIIIOIIIOII

IOIIIOIIIOII

Page 4: Big data architectures

Velocity

We already have that:

- Complex Event Processing

IOII OOIOII

OIIIOIII

Page 5: Big data architectures

But do you have all of that in 1 platform?

Page 6: Big data architectures

But How??

Page 7: Big data architectures

Architectures

(Thx Nathan Marz!)

Page 8: Big data architectures

Analytical Big Data

Analysis OrientedOptimize

Non-intrusive

Page 9: Big data architectures

Delta

DataSources

DistributedDatabase

Data Systems

AppsDashboards

IngestionEngine

Enrich

Page 10: Big data architectures

Delta

DataSources

DistributedDatabase

Data Systems

AppsDashboards

Flume, Sqoop,

Scribe, ... MR, Pig, Crunch, Mahout, ...

MR, Pig, Crunch, ...

Impala, Hive, ...

Page 11: Big data architectures

Delta

Analytical Big Data architecture for enriching mostly structured data with the goal to

optimize business processes.

Page 12: Big data architectures

Delta

DataSources

DistributedDatabase

Data Systems

AppsDashboards

IngestionEngine

Enrich

Overload!

Page 13: Big data architectures

Delta

Be write-heavy

orread-heavy

NOT both!

Page 14: Big data architectures

Operational Big Data

(Thx Nathan Marz!)

Focussed on Day-to-day business

Innovate(Non-)intrusive

Page 15: Big data architectures

Lambda

DataSources

RealtimeProcessing

FactStore

Batch View A

Batch View B

Batch View C

Realtime View A

Realtime View B

Realtime View C

Just In TimeCombiner

Apps

Dashboard

Reports

Page 16: Big data architectures

Lambda

DataSources

Storm

HDFS

ElephantDB

ElephantDB

ElephantDB

Cassandra*

Cassandra*

Cassandra*

Custom Code*

Apps

Dashboard

Reports

Page 17: Big data architectures

Lambda

Operational Big Data architecture for storing and processing

multi-structured and immutable data with the goal to

Innovate business

Page 18: Big data architectures

Technologies to use

Pick your stack!

Page 19: Big data architectures

Advice

Pilots, PoC, PoT, … do them!Be pragmatic, start skinnyIn Belgium: Variety > VolumeBe prepared to pivot on technologies

Page 20: Big data architectures

Questions?Thoughts?Ideas?Disagreements?...

[email protected]@daangerits

All images are used merely for illustrational means. In no way was it my purpose to violate any rights by using them.

Page 21: Big data architectures

BigData Architectures

Backup Slides

Page 22: Big data architectures

Volume

Variety Velocity

Page 23: Big data architectures

Lambda

Multi-structured

Un-structured

Re-structured