37
Middle Tier Scalability Current challenges and future directions DeWayne Filppi @dfilppi slideshare.net/dfilppi

Middle Tier Scalability - Present and Future

  • Upload
    dfilppi

  • View
    304

  • Download
    0

Embed Size (px)

DESCRIPTION

How the data explosion of recent years has spawned many new technologies, and role of in-memory techology currently and in light of advances in flash memory.

Citation preview

Page 1: Middle Tier Scalability - Present and Future

Middle Tier Scalability

Current challenges and future directions

DeWayne [email protected]/dfilppi

Page 2: Middle Tier Scalability - Present and Future

What are we here to discuss?

Making Sense of the Exploding Data World

The role of middleware to address scalability challenges

The role of middleware to address integration challenges

Page 3: Middle Tier Scalability - Present and Future

Making Sense of The Exploding Data World

Page 4: Middle Tier Scalability - Present and Future

GB

TB

PB

Dat

a Vo

lum

e

Yr Mo Day Hr Min Sec MS µS

Data MiningMachine Learning

Data Velocity

Data Warehouse High Throughput OLTP

Operational Intelligence

Exploratory Analytics

OLTP

Business Intelligence

Streaming

Capacity and Performance Drives New Data Management Technologies

Page 5: Middle Tier Scalability - Present and Future

Let’s Look at Tradeoffs of

Some Selected Solutions

Page 6: Middle Tier Scalability - Present and Future

SQL Queries

• Query: SQL • Semantics:

• CRUD• Aggregation• Projection• Partial update

• Performance: 100’s/Sec • Consistency: Transactional• Scaling: Mostly Scale-UP• Availability: Disk Based

Page 7: Middle Tier Scalability - Present and Future

NoSQL• Query: Proprietary but rich• Semantics:

• CRUD• Limited Aggregation

(Map/Reduce)• No Projection*• No Partial update*

• Performance: 1000s/Sec • Consistency: Eventual* • Scaling: Mostly Scale-Out• Availability: Based on replication

Page 8: Middle Tier Scalability - Present and Future

IMDG • Query: Propriety but rich• Semantics:

• CRUD• Aggregation API +

Map/Reduce• Projection (GigaSpaces)• Partial Update

(GigaSpaces)• Performance: 100k/sec• Consistency: Transactional • Scaling: Mostly Scale-Out• Availability: Replication

Page 9: Middle Tier Scalability - Present and Future

Key/Value

• Query: Key, Value• Semantics:

• Mostly Read• No Aggregation• No Projection• No Partial update

• Performance: 1M’s/sec • Consistency: Atomic*• Scaling: Mostly Scale-Out• Availability: Limited (varies quite substantially between implementations)

Page 10: Middle Tier Scalability - Present and Future

Stream Processing (Storm)

• Semantics– Event driven data processing

• Used for continuous updates– No need for a costly “SELECT

FOR UPDATE”

• Performance: 10’sM/sec updates

Spouts

Bolt

Page 11: Middle Tier Scalability - Present and Future

Common Assumption

Disk is the bottleneck

2010

Perf

orm

ance

1̂0

2000 2020

CPU Perform

ance = 100X PER DECADE

HDD Latency (Seek & Rotate) = Little Improvement

100X

10,000X

Source: GigaOM Research

Page 12: Middle Tier Scalability - Present and Future

Capacity and Performance Drives New Data Management Technologies

(Source: IDC, 2013)

Big Data (Hadoop)

NoSQL

In Memory, Stream Processing

RDBMS

Page 13: Middle Tier Scalability - Present and Future

There’s No One Size Fits All

Page 14: Middle Tier Scalability - Present and Future

A Typical App Looks Like This..

Front End Analytics

RT

Batch

STORM

The Data Flow Complexity

Page 15: Middle Tier Scalability - Present and Future

What if Disk Was no Longer the Bottleneck?

FLASH Closes the CPU to Storage Gap

Page 16: Middle Tier Scalability - Present and Future

Our Application Cloud Look Like This..

Front End

High Speed Data Store

(Using Flash/NVM)

Key/Value

SQL

Document

Graph

Transactional

Map/Reduce

Disk Becomes the new Tape

StreamBase

Common Data Store servingMultiple Semantics/API

Page 17: Middle Tier Scalability - Present and Future

We're not there yet ..

But..

Page 18: Middle Tier Scalability - Present and Future

We can use High Speed Data Bus for Integrating All of our Data Sources

Front End Analytics

RT

Batch

STORM

High Speed Data Bus(Built-In

Caching)

RT Transactional Data Access

Direct Access

RT Streaming

Hadoop Synch

MySQL Synch

Mongo Synch

Page 19: Middle Tier Scalability - Present and Future

High Speed Data Bus (Zoom In)

Page 20: Middle Tier Scalability - Present and Future

Data Grid Ideal Integration Nexus

• Transactional• HA – Self Healing• Horizontally scalable• FIFO (and partial FIFO) support• Queryable• Ultra high performance read/write

Page 21: Middle Tier Scalability - Present and Future

Designed for Transactional and Analytics Scenarios..

Homeland Security

Real Time Search

Social

eCommerce

User Tracking & Engagement

Financial Services

Page 22: Middle Tier Scalability - Present and Future

Typical NoSQL Integration

Page 23: Middle Tier Scalability - Present and Future

Storm Integration

http://ec2-54-89-152-83.compute-1.amazonaws.com:8090/web/

Page 24: Middle Tier Scalability - Present and Future

Many API’s – Same Data

Key/Value SQL Document Graph TransactionalMap/Reduce

Page 25: Middle Tier Scalability - Present and Future

Let’s take a closer look..

Page 26: Middle Tier Scalability - Present and Future

Nested Queries & Projections

Page 27: Middle Tier Scalability - Present and Future

Aggregations.

Page 28: Middle Tier Scalability - Present and Future

Fast Update …

Page 29: Middle Tier Scalability - Present and Future

Fifo/messaging support

@SpaceClass(fifoSupport=FifoSupport.OPERATION) public class Person { ... }

@EventDriven @Polling

public class SimpleListener {

@SpaceDataEvent public Data eventListener(Data event) { //process Data here }

Page 30: Middle Tier Scalability - Present and Future

Transactions support

Page 31: Middle Tier Scalability - Present and Future

So what?

• Data access not tied to store implementation.

• Middle tier grows as source of truth.• Simplifies data access as it grows• Can support strong consistency as

needed.• Provides HA platform for integration.

Page 32: Middle Tier Scalability - Present and Future

- 1KB object size and uniform distribution- 2 sockets 2.8GHz CPU with total 24 cores, CentOS 5.8, 2 FusionIO SLC PCIe cards RAID- YCSB measurements performed by SanDisk

No Read / 100% Write 100 % Read / No Write0

20

40

60

80

100

120

140

160

62

121

17

56

FDF-GigaSpaces on SSDs Stock GigaSpaces in DRAM

Assumptions: 1TB Flash = $2K; 1TB RAM = $20K

The Performance of RAM at a Cost/Capacity Closer to Disk

ZetaScale-GigaSpaces on SSDsStock GigaSpaces in DRAM

ZetaScale-GigaSpaces

Provides 2x – 3.6x Better TPS/$ 1:50 More Capacity

ZetaScale™ – XAP MemoryXtend

Capacity0

200

400

600

800

1000

1200

20

1000

XAP XAP Extend

1:50

242k Read/Sec

Page 33: Middle Tier Scalability - Present and Future

Take Aways

• Explosion of data has created an explosion of targeted technologies

• Many architected on “disk is slow”• Flash changing the equation.• In-memory tech best suited to take

advantage of flash• Continued blurring of in-memory

middleware and data storage.

Page 34: Middle Tier Scalability - Present and Future

Real World Example #1: Fraud Detection

Page 35: Middle Tier Scalability - Present and Future

Real World Example #2: Banking

Page 36: Middle Tier Scalability - Present and Future

Real World Example #3: Clinical Surveillance

Page 37: Middle Tier Scalability - Present and Future

Nati Shalom

Check out the slides on http://www.slideshare.net/dfilppi