28
CLUSTER CLUSTER COMPUTING COMPUTING

Cluster Computing

Embed Size (px)

Citation preview

Page 1: Cluster Computing

CLUSTER CLUSTER COMPUTINGCOMPUTING

Page 2: Cluster Computing

INTRODUCTIONINTRODUCTION A computer cluster is a group of tightly coupled computers that work together closely so that it can be viewed as a single computer.

Clusters are commonly connected through fast local area networks.

Clusters have evolved to support applications ranging from ecommerce, to high performance database applications.

Page 3: Cluster Computing

HISTORYHISTORY The first commodity clustering product was

ARCnet, developed by Datapoint in 1977. The next product was VAXcluster, released by

DEC in 1980’s. Microsoft, Sun Microsystems, and other leading

hardware and software companies offer clustering packages

Page 4: Cluster Computing

WHY CLUSTERS?WHY CLUSTERS?Price/Performance The reason for the growth in use of clusters is that

they have significantly reduced the cost of processing power.

Availability Single points of failure can be eliminated, if any

one system component goes down, the system as a whole stay highly available.

Scalability HPC clusters can grow in overall capacity because

processors and nodes can be added as demand increases.

Page 5: Cluster Computing

Contd…Contd…The components critical to the development of

low cost clusters are: Processors Memory Networking components Motherboards, busses, and other sub-systems

Page 6: Cluster Computing
Page 7: Cluster Computing

LOGICAL VIEW OF LOGICAL VIEW OF CLUSTERCLUSTER

Page 8: Cluster Computing

ARCHITECTUREARCHITECTURE A cluster is a type of parallel /distributed processing

system,which consists of a collection of interconnected stand-alone computers cooperatively working together a single, integrated computing resource.

A node: a single or multiprocessor system with memory, I/O facilities,

&OS generally 2 or more computers (nodes) connected together in a single cabinet, or physically separated & connected via a

LAN appear as a single system to users and applications provide a cost-effective way to gain features and benefits

Page 9: Cluster Computing

ARCHITECTUREARCHITECTURE

Page 10: Cluster Computing

COMPONENTSCOMPONENTS

1.Multiple High Performance Computers

a.PCs

b.Workstations

c.SMPs (CLUMPS)

d.Distributed HPC Systems

Page 11: Cluster Computing

Contd…Contd…2. State of the art Operating Systems

a. Linux (Beowulf)b. Microsoft NT (Illinois HPVM)c. SUN Solaris (Berkeley NOW)d. IBM AIX (IBM SP2)e. HP UX (Illinois - PANDA)

Page 12: Cluster Computing

3.High Performance Networks/Switches3.High Performance Networks/Switches

a. Ethernet (10Mbps),a. Ethernet (10Mbps),b. Fast Ethernet (100Mbps),b. Fast Ethernet (100Mbps),c. Gigabit Ethernet (1Gbps)c. Gigabit Ethernet (1Gbps)

e. ATMe. ATMf. Myrinet (1.2Gbps)f. Myrinet (1.2Gbps)

g. Digital Memory Channelg. Digital Memory Channelh. FDDIh. FDDI

Contd…

Page 13: Cluster Computing

Contd…Contd…4. Network Interface Card

a. Myrinet has NIC

5. Fast Communication Protocols and Services

a. Active Messages (Berkeley)

b. Fast Messages (Illinois)6. Cluster Middleware

a. Single System Image (SSI)

b. System Availability (SA) Infrastructure

Page 14: Cluster Computing

Contd…Contd…7. Parallel Programming Environments

and Toolsa. Threads (PCs, SMPs, NOW..)b. MPIc. Compilersd. RAD (rapid application development tools)e. Debuggersf. Performance Analysis Toolsg. Visualization Tools

Page 15: Cluster Computing

Contd…Contd…

8. Applications

a. Sequential

b. Parallel / Distributed (Cluster-aware app.)

Page 16: Cluster Computing

DIFFERENT KINDS OF DIFFERENT KINDS OF CLUSTERSCLUSTERS

High Performance (HP) Clusters Load Balancing Clusters High Availability (HA) Clusters

Page 17: Cluster Computing

Contd…Contd…HIGH PERFORMANCE CLUSTER

Start from 1994Donald Becker of NASA assembled this cluster.Also called Beowulf clusterApplications like data mining, simulations, parallel processing, weather modeling, etc

Page 18: Cluster Computing

Contd…Contd…

LOAD BALANCING CLUSER

PC cluster deliver load balancing performanceCommonly used with busy ftp and web servers with large client baseLarge number of nodes to share load

Page 19: Cluster Computing

Contd…Contd…

HIGH AVAILABILITY CLUSTER

Avoid single point of failureThis requires atleast two nodes - a primary and a backup.Always with redundancyAlmost all load balancing cluster are with HA capability

Page 20: Cluster Computing

ISSUES TO BE ISSUES TO BE CONSIDEREDCONSIDERED

Cluster Networking Cluster Software Programming Timing Network Selection Speed Selection

Page 21: Cluster Computing

Contd…Contd…

Cluster networking If you are mixing hardware that has different

networking technologies, there will be large differences in the speed with which data will be accessed and how individual nodes can communicate. If it is in your budget make sure that all of the machines you want to include in your cluster have similar networking capabilities, and if at all possible, have network adapters from the same manufacturer.

Page 22: Cluster Computing

Contd…Contd…Cluster Software You will have to build versions of clustering

software for each kind of system you include in your cluster.

Page 23: Cluster Computing

Contd…Contd…

Programming Our code will have to be written to support the

lowest common denominator for data types supported by the least powerful node in our cluster. With mixed machines, the more powerful machines will have attributes that cannot be attained in the powerful machine.

Page 24: Cluster Computing

Contd…Contd…

Timing This is the most problematic aspect of cluster.

Since these machines have different performance profile our code will execute at different rates on the different kinds of nodes. This can cause serious bottlenecks if a process on one node is waiting for results of a calculation on a slower node..

Page 25: Cluster Computing

Contd…Contd…

Network Selection There are a number of different kinds of

network topologies, including buses, cubes of various degrees, and grids/meshes. These network topologies will be implemented by use of one or more network interface cards, or NICs, installed into the head-node and compute nodes of our cluster.

Page 26: Cluster Computing

Contd…Contd…Speed Selection

No matter what topology you choose for your cluster, you will want to get fastest network that your budget allows. Fortunately, the availability of high speed computers has also forced the development of high speed networking systems. Examples are :

10Mbit Ethernet, 100Mbit Ethernet, gigabit networking, channel bonding etc.

Page 27: Cluster Computing

ConclusionConclusion Clusters are promising Solve parallel processing paradox New trends in hardware and software technologies

are likely to make clusters. Clusters based supercomputers (Linux based

clusters) can be seen everywhere !!

Page 28: Cluster Computing