Resource Management in the Greenplum Parallel Database

Preview:

DESCRIPTION

Siva Narayanan (siva@greenplum.com) Consultant Software Engineer, Query Processing, EMC Greenplum. Resource Management in the Greenplum Parallel Database. Why, you ask?. Problem. Finite resources - CPU/memory/IO/network Concurrent activity Different business value (Loads/Reports/Analytics) - PowerPoint PPT Presentation

Citation preview

Siva Narayanan (siva@greenplum.com)Consultant Software Engineer, Query Processing, EMC Greenplum

1

2Tuesday, April 25, 2023 2

3

Finite resources - CPU/memory/IO/network Concurrent activity

Different business value (Loads/Reports/Analytics)

Different system impact (Simple/Complex queries)

How can a DBA manage the system and keep everyone happy?

Tuesday, April 25, 2023 3

4

Determine business value of a query upon arrival Translate that to fair share of CPU and

Memory Resource reservation / Admission control

Are the resources available? Run-time resource allocation

Ensure that reservations are honored Adjust behavior as necessary

Tuesday, April 25, 2023 4

5Tuesday, April 25, 2023 5

6

Every query operator in a execution plan Continually measures its actual CPU usage

and compares it with fair share If it uses too much, it sleeps for a short

while Rinse, repeat

I/O and network bandwidths are similar

Tuesday, April 25, 2023 6

7

Every query operator in a execution plan Gets a portion of memory reserved for the

entire query Memory intensive operators vs not Re-use memory between blocking operators If data is too large, they spill

Net effect, every query uses up to its fair share

Tuesday, April 25, 2023 7

8

Resource management is a big problem with big data

Align resource allocation with business value

Greenplum Parallel Database has mechanisms for CPU and Memory

Tuesday, April 25, 2023 8

9

We’re hiring! siva@greenplum.com

Recommended