9
Siva Narayanan ([email protected]) Consultant Software Engineer, Query Processing, EMC Greenplum 1

Resource Management in the Greenplum Parallel Database

  • Upload
    aron

  • View
    53

  • Download
    1

Embed Size (px)

DESCRIPTION

Siva Narayanan ([email protected]) Consultant Software Engineer, Query Processing, EMC Greenplum. Resource Management in the Greenplum Parallel Database. Why, you ask?. Problem. Finite resources - CPU/memory/IO/network Concurrent activity Different business value (Loads/Reports/Analytics) - PowerPoint PPT Presentation

Citation preview

Page 1: Resource Management in the Greenplum Parallel Database

Siva Narayanan ([email protected])Consultant Software Engineer, Query Processing, EMC Greenplum

1

Page 2: Resource Management in the Greenplum Parallel Database

2Tuesday, April 25, 2023 2

Page 3: Resource Management in the Greenplum Parallel Database

3

Finite resources - CPU/memory/IO/network Concurrent activity

Different business value (Loads/Reports/Analytics)

Different system impact (Simple/Complex queries)

How can a DBA manage the system and keep everyone happy?

Tuesday, April 25, 2023 3

Page 4: Resource Management in the Greenplum Parallel Database

4

Determine business value of a query upon arrival Translate that to fair share of CPU and

Memory Resource reservation / Admission control

Are the resources available? Run-time resource allocation

Ensure that reservations are honored Adjust behavior as necessary

Tuesday, April 25, 2023 4

Page 5: Resource Management in the Greenplum Parallel Database

5Tuesday, April 25, 2023 5

Page 6: Resource Management in the Greenplum Parallel Database

6

Every query operator in a execution plan Continually measures its actual CPU usage

and compares it with fair share If it uses too much, it sleeps for a short

while Rinse, repeat

I/O and network bandwidths are similar

Tuesday, April 25, 2023 6

Page 7: Resource Management in the Greenplum Parallel Database

7

Every query operator in a execution plan Gets a portion of memory reserved for the

entire query Memory intensive operators vs not Re-use memory between blocking operators If data is too large, they spill

Net effect, every query uses up to its fair share

Tuesday, April 25, 2023 7

Page 8: Resource Management in the Greenplum Parallel Database

8

Resource management is a big problem with big data

Align resource allocation with business value

Greenplum Parallel Database has mechanisms for CPU and Memory

Tuesday, April 25, 2023 8

Page 9: Resource Management in the Greenplum Parallel Database

9

We’re hiring! [email protected]