
Chris Harris - charris@10gen.com

MongoDB in Production

HIGH AVAILABILITY SCENARIOS

Single Node

• Will have downtime
• If the node crashes, human intervention may be needed

How MongoDB Replication works

[Diagram: three replica set members (Member 1, Member 2, Member 3)]

• A set is made up of 2 or more nodes

How MongoDB Replication works

[Diagram: one member is elected PRIMARY; the others are SECONDARY]

• Election establishes the PRIMARY
• Data replication from PRIMARY to SECONDARY

How MongoDB Replication works

[Diagram: the PRIMARY goes DOWN; the surviving members negotiate a new master]

• PRIMARY may fail
• Automatic election of a new PRIMARY if a majority exists

How MongoDB Replication works

[Diagram: the failed member remains DOWN; a surviving member is the new PRIMARY]

• New PRIMARY elected
• Replica set re-established

How MongoDB Replication works

[Diagram: Member 2 comes back in RECOVERING state and syncs from the PRIMARY]

• Automatic recovery

How MongoDB Replication works

[Diagram: all three members healthy again, with one PRIMARY]

• Replica set re-established (see the shell sketch below)
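
To make the sequence above concrete, here is a minimal mongo shell sketch for standing up a three-member replica set; the set name and hostnames are hypothetical.

// Run against one of the nodes; hostnames are hypothetical.
rs.initiate({
  _id: "rs0",
  members: [
    { _id: 0, host: "node1.example.com:27017" },
    { _id: 1, host: "node2.example.com:27017" },
    { _id: 2, host: "node3.example.com:27017" }
  ]
})

// After the election completes, check which member is PRIMARY:
rs.status()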

DEMO

Replica Set 0

• Two nodes
• A network failure can partition the nodes; with no majority on either side, the whole system goes READ_ONLY

NOT RECOMMENDED!

Replica Set 1

[Diagram: two data-bearing nodes plus an Arbiter]

• Single datacenter
• Single switch & power
• Points of failure:
– Power
– Network
– Datacenter
– Two-node failure
• Automatic recovery of a single node crash (see the arbiter sketch below)
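
A minimal sketch of the arbiter setup, assuming an existing two-member set and a hypothetical hostname; the arbiter votes in elections but holds no data.

// Add an arbiter so a single data-node failure still
// leaves a 2-of-3 voting majority to elect a primary.
rs.addArb("arbiter.example.com:27017")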

Replica Set 2

[Diagram: two data-bearing nodes plus an Arbiter, spread across zones]

• Single datacenter
• Multiple power/network zones
• Points of failure:
– Datacenter
– Two-node failure
• Automatic recovery of a single node crash
• w=2 not viable: losing 1 data-bearing node means no writes can be acknowledged

Replica Set 3

• Single datacenter
• Multiple power/network zones
• Points of failure:
– Datacenter
– Two-node failure
• Automatic recovery of a single node crash
• w=2 viable, as 2 of 3 nodes stay online after a single failure (see the write-concern sketch below)
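
A minimal sketch of a w=2 write in the mongo shell of this era, using getLastError; the collection name and document are illustrative.

db.orders.insert({ item: "example", qty: 1 })

// Block until at least 2 members have the write (or 5 s elapse):
db.runCommand({ getLastError: 1, w: 2, wtimeout: 5000 })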

Replica Set 4

• Multi datacenter

• DR node for safety

• Can’t do a durable multi-data-center write safely, since there is only 1 node in the distant DC

Replica Set 5

[Diagram: members across three data centers, including a delayed member]

• Three data centers
• Can survive full data center loss
• Can do w = { dc : 2 } to guarantee the write reaches 2 data centers (see the tagging sketch below)
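
The { dc : 2 } form is shorthand for a tag-based write concern. A minimal sketch, with hypothetical hostnames, tag values, and mode name: tag each member with its data center, define a custom mode in getLastErrorModes, then reference that mode as the w value.

rs.initiate({
  _id: "rs0",
  members: [
    { _id: 0, host: "ny1.example.com:27017", tags: { dc: "ny" } },
    { _id: 1, host: "sf1.example.com:27017", tags: { dc: "sf" } },
    { _id: 2, host: "ln1.example.com:27017", tags: { dc: "london" } }
  ],
  settings: {
    // "multiDC" requires acknowledgement from members covering
    // 2 distinct values of the "dc" tag.
    getLastErrorModes: { multiDC: { dc: 2 } }
  }
})

db.orders.insert({ item: "example", qty: 1 })
db.runCommand({ getLastError: 1, w: "multiDC", wtimeout: 10000 })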

HARDWARE SIZING

[Diagram, built up over several slides: MongoDB memory-maps each collection and index into the virtual address space.
– The total mapped size is your virtual memory size (mapped).
– The portion currently held in physical RAM is your resident memory size.
– Everything else lives on disk and is paged in on access.
– The slides contrast access latencies of 100 ns and 10,000 ns.]
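
A quick way to see these numbers on a live instance, sketched in the mongo shell; mem.mapped and mem.resident are reported in MB, and the page-fault counter is platform-dependent (Linux).

var s = db.serverStatus()
print("mapped (virtual, MB): " + s.mem.mapped)
print("resident (MB):        " + s.mem.resident)
print("page faults:          " + s.extra_info.page_faults)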

Sizing RAM and Disk

• Working set
• Document size
• Memory versus disk
• Data lifecycle patterns
– Long tail
– Pure random
– Bulk removes

Figuring out the Working Set

> db.wombats.stats()
{
  "ns" : "test.wombats",
  "count" : 1338330,
  "size" : 46915928,                 // size of data
  "avgObjSize" : 35.05557523181876,  // average document size
  "storageSize" : 86092032,          // size on disk (and in memory!)
  "numExtents" : 12,
  "nindexes" : 2,
  "lastExtentSize" : 20872960,
  "paddingFactor" : 1,
  "flags" : 0,
  "totalIndexSize" : 99860480,       // size of all indexes
  "indexSizes" : {                   // size of each index
    "_id_" : 55877632,
    "name_1" : 43982848
  },
  ...
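
A rough sketch for sizing from these numbers: add storageSize and totalIndexSize across the collections you actively touch (here, every collection in the current database) to get an upper bound on the RAM needed to avoid page faults.

var totalBytes = 0
db.getCollectionNames().forEach(function (name) {
  var s = db.getCollection(name).stats()
  totalBytes += s.storageSize + s.totalIndexSize
})
print("data + indexes: " + (totalBytes / 1024 / 1024).toFixed(1) + " MB")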

Disk configurations

[Diagram, built up over three slides:
– Single disk: ~200 seeks / second
– RAID 0: each striped disk still delivers ~200 seeks / second
– RAID 10: ~400 seeks / second per mirrored pair (reads can be served by either mirror)]

SSD?

• Seek time of 0.1 ms vs 5 ms (200 seeks / sec => 10,000 seeks / sec)
• But expensive

Takeaway

• Know how important page faults are
– If you want low latency, avoid page faults
• Size memory appropriately
– To avoid page faults, fit everything in RAM: collection data + index data
• Provision disk appropriately
– RAID 10 is recommended
– SSDs are fast, if you can afford them

MONITORING

Monitoring Tools

• mongostat (see the sketch below)
• MMS! - http://mms.10gen.com
• munin, cacti, nagios - http://www.mongodb.org/display/DOCS/Monitoring+and+Diagnostics
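
A typical mongostat invocation, sketched with a hypothetical hostname; it prints one line per polling interval, and the mapped, res, and faults columns correspond directly to the sizing discussion above.

# Poll every 5 seconds; watch the mapped, res, and faults columns.
mongostat --host node1.example.com 5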

Monitoring is your friend!

SUMMARY

Typical Deployments

Use?   Set size   Data Protection   High Availability   Notes
 X     One        No                No                  Must use --journal to protect against crashes
       Two        Yes               No                  On loss of one member, surviving member is read only
       Three      Yes               Yes - 1 failure     On loss of one member, surviving two members can elect a new primary
 X     Four       Yes               Yes - 1 failure**   On loss of two members, surviving two members are read only
       Five       Yes               Yes - 2 failures    On loss of two members, surviving three members can elect a new primary

Summary

• Reads from Primary are always consistent

• Reads from Secondaries are eventually consistent

• Automatic failover if a Primary fails

• Understand your Working Set

• Monitoring is your Friend!
