18
Next Generation Cassandra Tom Wilkie Founder & VP Engineering, Acunu

Next Generation Cassandra

  • Upload
    acunu

  • View
    2.480

  • Download
    0

Embed Size (px)

DESCRIPTION

Talk by Tom Wilkie at Cassandra EU

Citation preview

Page 1: Next Generation Cassandra

Next Generation Cassandra

Tom WilkieFounder & VP Engineering, Acunu

Page 2: Next Generation Cassandra
Page 3: Next Generation Cassandra

Cassandra needs a team

• Installing, configuring and deploying

• Automating management operations

• Monitoring and alerting

• Testing and fixing bugs

• Tuning the garbage collector

• Tweaking the storage engine

$$

$$

$3

Page 4: Next Generation Cassandra

Acunu: your enterprise Cassandra Appliance

Commodity Hardware

Apache Cassandra

Acunu Storage Engine

Control Center

Data driven applications Web UI

Configured and tuned OS

4

Page 5: Next Generation Cassandra

Storage Engineaka Castle

Page 6: Next Generation Cassandra

Castle: Built for Big Data

• Storage engine optimized for large slow disks, many cores, Big Data workloads

• Enterprise density on commodity hardware

• Lightning disk rebuilds:10x faster than RAID

Acunu Kernel

Userspace

Doubling Arrays

arrays range

querieskey

insert

insertqueues

Bloom filters

x

user

spac

ein

terfa

ceke

rnel

spac

ein

terfa

cedo

ublin

g a

rray

map

ping

laye

rm

odlis

t btre

em

appi

ng la

yer

bloc

k m

appi

ng &

cach

eing

laye

r

"Extent" layerextent

allocatorfreespacemanager

btreerange

queries

key get

key insert

Version tree

Streaming interfacekey

insertkey get

bufferedvalue get

bufferedvalue insert

range queries

Cache

flusher

extent blockcache

prefetcher

In-kernel workloads

shared buffersasync, sharedmemory ring

Shared memory interfacekeys

values

Arrays

value arrays

btree

key get

arraysmanagement

merges

• Opensource (GPLv2, MIT for user libraries)

• http://bitbucket.org/acunu

• Loadable Kernel Module, targeting CentOS’s 2.6.18

• http://www.acunu.com/blogs/andy-twigg/why-

Castle

http://goo.gl/gzihe

6

Page 7: Next Generation Cassandra
Page 8: Next Generation Cassandra

0

1

2

3

4

5

RAID10, 8 Disks RAID5, 8 Disks RDA, 8 Disks RDA, 15 Disks

Re

bu

ild

Tim

e (

Ho

urs

)

Rebuild time

8

Page 9: Next Generation Cassandra

Control Center

9

Page 10: Next Generation Cassandra

Control Center

The new UI has been critical in helping us work out what is wrong in our code

“The new UI has been critical in helping us work out what is wrong in our code”

- Matt Miller, TellyBug

Page 11: Next Generation Cassandra

Breaking System

Page 12: Next Generation Cassandra

13,000 tests per day

1.1 million tests per release

3.5k machine days of testing

for each release

12

Page 13: Next Generation Cassandra

13

Short, unit tests

Data integrity tests

Distributed tests

Failure simulation

Tests for common, long running operations (bootstrap, decommission, backup, restore etc)

Resource usage tests (checking memory and cpu consumption)

Long running performance tests

Page 14: Next Generation Cassandra
Page 15: Next Generation Cassandra
Page 16: Next Generation Cassandra
Page 17: Next Generation Cassandra
Page 18: Next Generation Cassandra

“Turnkey Cassandra”

No need to master the internalsNo tuning required

No need for a committer on staff