NoSQL solutions

NoSQLMongoDB and Redis as alternatives to

traditional RDBMS

Then...

...and now

*This thing weighs less than 50g

Meaning of NoSQL

1970 = We have no SQL1980 = Know SQL2000 = No SQL!2005 = Not only SQL2014 = No, SQL

(slide adapted from @markmadsen)

MongoDB

● it is the “new MySQL”● Project started in 2007 by 10gen (now MongoDB Inc)● Cross-platform, open-source● 5th most used DBMS & most used Document Store*

(next DS CouchDB - 21st)* According to db-engines.com as of Oct 2014

Characteristics

● “It's really a hybrid database with features from a few different places.” (Gaetan Voyer-Perrault on Quora)

● Document Oriented but NO SCHEMA! ● Documents grouped in Collections● Binary JSON (BSON) format● Load Balancing (automated sharding, sharding key

can be user defined)● Replication (Replica Sets)● Automated failover

Characteristics - continued

● Primary and Secondary Indexes● JavaScript for UDF● MapReduce● Capped Collections● Aggregation Framework since 2.2● Ad-hoc Query Support

Caveats

Generic performance tips

● Use 64-bit OS● Lots of RAM, fast disks (was anyone expecting

something else?)● ensure that at least indexes + working set fit in RAM

(db.stats(), db.<coll>.stats()) - if not, you might want to try TokuMX

● Design for de-normalized data models

● Write-Concerns● Shard early● Fixed (or at least bounded) record size => better write

performance● Use short attribute names (reduces index & data size,

OFC!)● EXT4 or XFS

● virtualized server 8G RAM, 4 vCPU - no sharding, no replica sets

● 100 inserts/s , 130M doc collection WITH secondary index (avg doc size 0.6k)

● 20 inserts/s 3M doc collection WITH 18 secondary indexes (avg doc size 10k)

Use Cases

● Logs● Location Data (Mongo has built in Geospatial ops)● Account and User Profiles● Messaging● (complex) Config Data● http://www.mongodb.com/who-uses-mongodb (hint:

Expedia, Business Insider, The Weather Channel, Foursquare, eBay)

● Salvatore Sanfilippo (@antirez)● Started in 2009● Key-Value Store● 11th most used DBMS & most used KV Store* (next

KVS memcached - 19th)● Sponsored by Pivotal (spinoff EMC/VMware)* According to db-engines.com as of Oct 2014

Characteristics

● Holds all data in memory, persists on disk● Data Models

○ Strings/Blobs/Bit-Maps (not really Bitmaps)○ Hashtables○ Linked Lists○ Sets○ Sorted Sets

● HyperLogLog (+2.8.9 - trade accuracy for memory)● Master Slave Replication● High Availability (through Sentinel)

Characteristics - continued

● Redis Cluster in works (not production ready yet) - sharding ○ asynchronous replication○ does not guarantee strong consistency (may ‘forget’ writes)

● AOF sync - default 2s● Does not support secondary indexes● Pub/Sub mode since 2.0● Key expiry● Server scripting with Lua

● virtualized server 4G RAM, 1vCPU● +50k get/set per second (redis-benchmark)● only 128 queries out of 1165550375 over 10ms

(0.00001%)○ uptime_in_days:439○ used_memory_human:424.09M○ used_memory_peak_human:834.94M○ total_connections_received:1352935○ db0:keys=610884,expires=355397

● Use short key names (reduces data size, OFC!)● You can create secondary indexes (but you have to

maintain them, e.g. using SET)● You can have ad-hoc queries (actually is query) :

using SORT

Use Cases

● Cache● IPSS/IPC● Queue mechanisms (see e.g. Resque)● Log/Task buffers● Statistics and aggregation datastore● (anywhere you use memcached)● http://redis.io/topics/whos-using-redis (hint: Twitter,

GitHub, Snapchat, StackOverflow a.o.)

One size does NOT fit all!

NoSQL solutions

Technology

NoSQL Orientado à documentos. Apresentação NoSQL Problemas que levaram a utilização do NoSQL Quando utilizar NoSQL? Tipos Orientação à documentos MapReduce

HENRI TERHO USING NOSQL DATABASE SOLUTIONS IN MOBILE ... Terho.pdf · Keywords: mobile, android, noSQL, couchDB, eHealth In the coming future, the healthcare system will face severe

© 2013 Mellanox Technologies 1 NoSQL DB Benchmarking with high performance Networking solutions WBDB, Xian, July 2013

M. Grigorieva, M. Golosova€¦ · Database performance tests : SQL - NoSQL, NoSQL - NoSQL Technology evaluation tests results for NoSQL databases: MongoDB, HBase, Cassandra, Dremel,

MarkLogic Database – Only Enterprise NoSQL DB · 2017-01-29 · Marklogic is designed to handle the volume, variety, and velocity of Big Data like other NoSQL solutions, and has

NoSQL 프로그래밍 : 한 권으로 끝내는 NoSQL 솔루션 활용법

NoSQL Databases for Enterprises - NoSQL Now Conference 2013

Mom, I so wish Hibernate for my NoSQL database · Java Persistence (JPA) support for NoSQL solutions JP-QL queries are converted in native backend queries Hibernate Search as indexing

Consistent NoSQL data storage with ModeShape (NoSQL Matters 2013)

«NoSQL benchmarking v2.0. Исследование производительности современных NoSQL-решений»

PostSQL Using PostgreSQL as a better NoSQL - NoSQL Matters

How companies use NoSQL & Couchbase - NoSQL Now 2014

Build and Manage Hadoop & Oracle NoSQL DB Solutions- Impetus Webinar

NoSQL Tel Aviv Meetup#1: NoSQL Data Modeling

NOSQL PROBLEM LITERATURE REVIEW - … · NoSQL systems are viable solutions for applications that require scalable data repositories, which can easily scale out over multiple servers

Oracle NoSQL Database – A Distributed Key-Value Store · HPTS, October 24, 2011 Agenda • Oracle and NoSQL • Oracle NoSQL Database Architecture • Oracle NoSQL Database Technical

NOSQL - CRS4dassia.crs4.it/wp-content/uploads/2014/11/01_NOSQL.pdf · 2015-03-06 · NOSQL Origini e Significato NOSQL = NO a SQL NOSQL = Not Only SQL Il termine NOSQL fu introdotto

Analyse des solutions de spatialisation dans les graphes ...des bases de données NoSQL, la plateforme Neo4j a développé une base de données NoSQL graphe. L’avantage de la base

Q y // NoSQL’Road’Show,’Zurich’nosqlroadshow.com/dl/NoSQL-Road-Show/slides/nosql... · NoSQL,’NewSQL’and’Beyond ... •’OrientDB ’ •’NuvolaBase ... •’ScaleBase

NoSQL Now! NoSQL Architecture Patterns