
Data Distribution in Gluster

Niels de Vos <ndevos@redhat.com>

Software Maintenance Engineer, Red Hat UK, Ltd.

February, 2012

Agenda

● Terminology

● Striped Volumes

● Replicated Volumes

● Distributed Volumes

● Distributed Replicated Volumes

● Elastic Hashing

Terminology

● Brick: mountpoint used for data storage

● Volume: multiple Bricks combined

● Node: server running the Gluster Daemon; contains the Brick(s) and provides access to the Volume(s)

● Trusted Pool: the group of Nodes that have been joined together (via gluster peer probe) and trust each other

Terminology: Brick

● Mountpoint used for data storage

● Underlying physical storage for Volumes


Creating a Brick

# mkfs -t xfs -i size=512 /dev/sdb1
# mkdir -p /bricks/storage
# vi /etc/fstab
# mount /bricks/storage
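The fstab entry itself is not shown on the slide; a minimal line matching the device and mountpoint above might look as follows (the mount options are an assumption, not from the original):

# hypothetical /etc/fstab entry; tune the options for your environment
/dev/sdb1  /bricks/storage  xfs  defaults  0 0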

Terminology: Volume

● Multiple bricks combined

● Mountable by Gluster clients

[Diagram: brick-A and brick-B on a Node are combined into a Volume]
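As an illustration of "mountable by Gluster clients": once a volume is started, a client can mount it with the native FUSE client. The volume name and mountpoint here are hypothetical:

# mount -t glusterfs node1:/my-volume /mnt/gluster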

Terminology: Node

● Server running the Gluster Daemon

● Contains the Brick(s)

● Provides access to the Volume(s)

[Diagram: a Node containing brick-A and brick-B]

Striped Volumes

● Comparable with RAID-0

● Sparse files contain parts of the actual data

● Contents of the files are distributed over Bricks

● Losing one Brick results in loss of all the files

● No redundancy available (at the moment)

[Diagram: a File on the Mount point is split into data blocks; blocks 0, 2, 4 land on the Brick on node-1 and blocks 1, 3, 5 on the Brick on node-2; each brick holds a sparse file where x marks an allocated data block and the rest are unallocated/sparse]

Creating a Striped Volume

# gluster volume create my-striped-vol \
    stripe 2 \
    node1:/bricks/stripe node2:/bricks/stripe
# gluster volume start my-striped-vol
# gluster volume info my-striped-vol
Volume Name: my-striped-vol
Type: Stripe
Status: Started
Number of Bricks: 2
Transport-type: tcp
Bricks:
Brick1: node1:/bricks/stripe
Brick2: node2:/bricks/stripe
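Since each brick holds a sparse file, one way to see the striping on disk is to compare the apparent size with the actual allocation of the on-brick file. The file name is hypothetical, and the exact numbers depend on the stripe block size:

# du -h --apparent-size /bricks/stripe/bigfile   # full apparent size on each brick
# du -h /bricks/stripe/bigfile                   # only this brick's blocks allocated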

Replicated Volumes

● Comparable with RAID-1

● File duplication for redundancy

● Complete files available

[Diagram: a File on the Mount point is written in full to both the Brick on node-1 and the Brick on node-2; each brick holds a complete allocated physical copy of the file]

Creating a Replicated Volume

# gluster volume create my-replicated-vol \
    replica 2 \
    node1:/bricks/repl node2:/bricks/repl
# gluster volume start my-replicated-vol
# gluster volume info my-replicated-vol
Volume Name: my-replicated-vol
Type: Replicate
Status: Started
Number of Bricks: 2
Transport-type: tcp
Bricks:
Brick1: node1:/bricks/repl
Brick2: node2:/bricks/repl
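As a quick check that replication works, a file written through a client mount should appear on both bricks. The mountpoint and file name here are hypothetical:

# mount -t glusterfs node1:/my-replicated-vol /mnt/repl
# echo hello > /mnt/repl/file.txt
# ls /bricks/repl      # run on node1 and on node2: both list file.txt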

Distributed Volumes

● Just a Bunch Of Bricks (JBOB)

● Comparable with JBOD (Just a Bunch Of Disks)

● No duplication: each file is stored on only one Brick, so losing a Brick makes its files unavailable

● Complete files available

[Diagram: each File on the Mount point is stored in full (an allocated physical file) on exactly one of the Bricks, either on node-1 or on node-2]

Creating a Distributed Volume

# gluster volume create my-distributed-vol \
    node1:/bricks/dist node2:/bricks/dist
# gluster volume start my-distributed-vol
# gluster volume info my-distributed-vol
Volume Name: my-distributed-vol
Type: Distribute
Status: Started
Number of Bricks: 2
Transport-type: tcp
Bricks:
Brick1: node1:/bricks/dist
Brick2: node2:/bricks/dist
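Complementing the replication check above, files created through a client mount of a distributed volume each land on exactly one brick (mountpoint and file names hypothetical):

# mount -t glusterfs node1:/my-distributed-vol /mnt/dist
# touch /mnt/dist/file-{1..4}
# ls /bricks/dist      # on each node: only a subset of the four files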

Distributed Replicated Volumes

● Just a Bunch Of Bricks (JBOB)

● Comparable with JBOD (Just a Bunch Of Disks) and RAID-1

● File duplication for redundancy

● Complete files available

[Diagram: brick-A on node-1 and brick-A on node-2 form Replicated Volume 0; brick-B on node-1 and brick-B on node-2 form Replicated Volume 1; the Distributed Volume places each File from the Mount point on one of the two replica sets, each of which holds complete allocated physical copies]

Creating a Distributed Replicated Volume

# gluster volume create my-dist-repl-vol \
    replica 2 \
    node{1,2}:/bricks/dist-repl-A \
    node{1,2}:/bricks/dist-repl-B
# gluster volume start my-dist-repl-vol
# gluster volume info my-dist-repl-vol
Volume Name: my-dist-repl-vol
Type: Distributed-Replicate
Status: Started
Number of Bricks: 2 x 2 = 4
Transport-type: tcp
Bricks:
Brick1: node1:/bricks/dist-repl-A
Brick2: node2:/bricks/dist-repl-A
Brick3: node1:/bricks/dist-repl-B
Brick4: node2:/bricks/dist-repl-B
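The brick order matters: with replica 2, each consecutive pair of bricks in the create command forms one replica set. The shell brace expansion above produces exactly that ordering; written out, the create command reads:

# gluster volume create my-dist-repl-vol replica 2 \
    node1:/bricks/dist-repl-A node2:/bricks/dist-repl-A \
    node1:/bricks/dist-repl-B node2:/bricks/dist-repl-B

So Brick1 and Brick2 mirror each other (Replicated Volume 0), as do Brick3 and Brick4 (Replicated Volume 1), and each pair spans both nodes.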

Elastic Hashing & File Distribution

● Only known attribute is the path

● A hash based on the filename gets calculated, using the Davies-Meyer hashing function

● A hash has a finite number of results {0..N}

● A brick has a range of hash results

● Loop through the Bricks and check whether the hash result falls within a Brick's hash range (sketched in C after the flowchart below)

Distributed Hash Table Xlator

[Flowchart: a file's name is fed into hash = dht_hash_compute(); select the first brick; if brick.start <= hash && brick.end >= hash, the file I/O goes to that brick; otherwise select the next brick and test again]
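A minimal sketch of this lookup loop in C, modelled on dht_layout_search() but with simplified types; brick_t, the hash stub, and the example hash ranges are assumptions for illustration, not the real xlator API:

#include <stddef.h>
#include <stdint.h>
#include <stdio.h>

/* Hypothetical stand-in for the xlator's per-brick layout entry. */
typedef struct {
    const char *name;  /* brick identifier, e.g. "node1:/bricks/dist" */
    uint32_t start;    /* lowest hash value assigned to this brick */
    uint32_t end;      /* highest hash value assigned to this brick */
} brick_t;

/* Trivial placeholder hash; the real dht_hash_compute() uses the
   Davies-Meyer construction (see libglusterfs/src/hashfn.c). */
static uint32_t dht_hash_compute_stub(const char *filename)
{
    uint32_t h = 5381;
    while (*filename)
        h = h * 33 + (unsigned char)*filename++;
    return h;
}

/* Loop through the bricks and return the one whose hash range
   contains the filename's hash, as dht_layout_search() does. */
static brick_t *select_brick(brick_t *bricks, size_t count,
                             const char *filename)
{
    uint32_t hash = dht_hash_compute_stub(filename);

    for (size_t i = 0; i < count; i++) {
        if (bricks[i].start <= hash && bricks[i].end >= hash)
            return &bricks[i];  /* the file I/O goes to this brick */
    }
    return NULL;  /* no brick covers the hash: the layout has a gap */
}

int main(void)
{
    /* Two bricks splitting the 32-bit hash space in half. */
    brick_t bricks[] = {
        { "node1:/bricks/dist", 0x00000000u, 0x7fffffffu },
        { "node2:/bricks/dist", 0x80000000u, 0xffffffffu },
    };

    brick_t *b = select_brick(bricks, 2, "file.txt");
    if (b != NULL)
        printf("file.txt is stored on %s\n", b->name);
    return 0;
}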

Related sources:
xlators/cluster/dht/src/dht-layout.c: dht_layout_search()
libglusterfs/src/hashfn.c

Keep in Touch

● gluster-devel@nongnu.org (archive: http://lists.nongnu.org/archive/html/gluster-devel/)

● #gluster on Freenode

● twitter.com/RedHatStorage

Questions?
