The Fresh Breeze Memory Model Status: Linear Algebra and Plans

Jack Dennis, MIT CSAIL

Guang R. Gao, University of Delaware

Funded in part by NSF HECURA Grant CCF-0937832

Joshua Slocum, Xiaoxuan Meng, Brian Lucas

Problem: I/O Performance

Culprits:

• Large Units of Data Transfer

• Operating System Overhead/Noise

• Few Concurrent Transfers (OS Limits)

[Diagram labels: Managed by hardware; Managed by software (OS); Memory; File Memory]

Solution: Integrated Memory Hierarchy

Features:

• Many Concurrent Transactions – High Bandwidth

• Global Virtual Store

• Superior Basis for Security / Privacy

[Diagram labels: Managed by hardware; Memory; File Memory]

The Fresh Breeze Memory Model

• Chunks, e.g. 128 Bytes

• Fan-out as large as 16

• Write-Once then Read Only

• Cycle-Free Heap

[Diagram label: Master]

Arrays as Trees of Chunks

• Arrays: Three levels yield 4096 elements (longs)
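The array layout above can be sketched in Python. This is a minimal illustration, assuming 16 slots per chunk (a 128-byte chunk holding 16 eight-byte longs or handles). The names `Chunk`, `build_tree`, and `read_element` are hypothetical and not part of any Fresh Breeze interface. Write-once is modeled by sealing each chunk into an immutable tuple.

```python
from dataclasses import dataclass
from typing import Sequence, Tuple, Union

FAN_OUT = 16  # slots per chunk: 128 bytes / 8-byte long (assumed)


@dataclass(frozen=True)
class Chunk:
    """A write-once chunk: filled at creation, read-only afterwards."""
    slots: Tuple[Union[int, "Chunk"], ...]

    def __post_init__(self):
        if len(self.slots) > FAN_OUT:
            raise ValueError("a chunk holds at most 16 slots")


def build_tree(values: Sequence[int], levels: int) -> Chunk:
    """Build a tree of the given depth over values, leaves first."""
    if levels == 1:
        return Chunk(tuple(values))
    span = FAN_OUT ** (levels - 1)  # elements covered by each child
    children = [build_tree(values[i:i + span], levels - 1)
                for i in range(0, len(values), span)]
    return Chunk(tuple(children))


def read_element(root: Chunk, index: int, levels: int) -> int:
    """Follow one handle per level, using the base-16 digits of index."""
    node = root
    for level in range(levels - 1, 0, -1):
        node = node.slots[(index // FAN_OUT ** level) % FAN_OUT]
    return node.slots[index % FAN_OUT]


# Three levels cover 16 * 16 * 16 = 4096 elements.
root = build_tree(list(range(4096)), levels=3)
last = read_element(root, 4095, levels=3)
```

Because a parent chunk can only be created after its children exist and no chunk can be modified afterwards, a handle can never point back to an ancestor, which is one way to see why the heap stays cycle-free.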

Fresh Breeze System Vision

[Diagram labels: Many Core Processing Chips; L2 Cache; Switch; Main Memory: Associative Directories (AD) and DRAM Chips; Backing Store: Access Controllers (AC) and Flash Devices]

Unique Features of the Processor

• Cache Lines are 128-byte Chunks.

• Registers are tagged to indicate those holding handles of chunks.

• Hardware task scheduler: Active Task List and Pending Task Queue.

Spawn and Join

[Diagram labels: Master Task; spawn (n); Spawned Worker Tasks: Worker 0 ... Worker n-1, each ending in join_update; Join; Join_fetch; Continuation Task]
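The spawn and join pattern can be approximated in software as follows. This is only a sketch: the names `spawn` and `join_update` come from the slide, but the `JoinPoint` class, its fields, and the use of Python threads are assumptions made for illustration. The processor described here uses a hardware task scheduler instead.

```python
import threading
from concurrent.futures import ThreadPoolExecutor


class JoinPoint:
    """Collects one result per worker; runs the continuation when all arrive."""

    def __init__(self, n, continuation):
        self._lock = threading.Lock()
        self._slots = [None] * n
        self._remaining = n
        self._continuation = continuation
        self.result = None

    def join_update(self, worker_id, value):
        with self._lock:
            self._slots[worker_id] = value
            self._remaining -= 1
            last = self._remaining == 0
        if last:  # the final join_update releases the continuation task
            self.result = self._continuation(self._slots)


def spawn(n, worker, continuation):
    """Master task: start n workers, each ending with join_update."""
    join = JoinPoint(n, continuation)
    with ThreadPoolExecutor(max_workers=n) as pool:
        for i in range(n):
            pool.submit(lambda i=i: join.join_update(i, worker(i)))
    return join.result  # leaving the pool has waited for every worker


total = spawn(4, worker=lambda i: i * i, continuation=sum)
```

In this sketch the caller waits for the pool to drain so that the result can be returned. In a task-based system the master would simply end after the spawn, and the continuation would run as its own task once the last worker had reported.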

The Dot Product

5 levels: Vector length = 16^5 = 1,048,576

scalar result

Each Leaf Task: Dot Product of two 16-element vectors: 16 multiplies; 15 adds
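The tree-structured dot product can be sketched sequentially in Python. The function names are hypothetical, and each recursive call stands in for what would be a spawned task. A fan-out of 16 per level is assumed, matching the chunk size.

```python
FAN_OUT = 16


def leaf_dot(a, b):
    """Leaf task: 16 multiplies and 15 adds on two 16-element segments."""
    products = [x * y for x, y in zip(a, b)]
    total = products[0]
    for p in products[1:]:
        total += p
    return total


def tree_dot(a, b, levels):
    """Split both vectors 16 ways per level and sum the partial results."""
    if levels == 1:
        return leaf_dot(a, b)
    span = FAN_OUT ** (levels - 1)  # elements handled by each subtree
    return sum(tree_dot(a[i:i + span], b[i:i + span], levels - 1)
               for i in range(0, len(a), span))


def leaf_task_count(levels):
    """Number of 16-element leaf tasks in a tree with the given depth."""
    return FAN_OUT ** (levels - 1)


# A smaller three-level instance (4096 elements) keeps the example quick.
a = list(range(4096))
b = [2] * 4096
result = tree_dot(a, b, levels=3)
```

Under these assumptions a five-level tree over 16^5 elements has 16^4 = 65,536 leaf tasks, each performing 31 arithmetic operations.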

Linear Algebra: Three Algorithms

• Dot Product

• Matrix Multiply

• Fast Fourier Transform

Let’s consider the special characteristics of each.

Dot Product

Leaf Task: Dot Product of 16-element segments A and B

[Diagram labels: Segment A; Segment B; Multiplies; 31 Operations]

• No data reuse

• No intermediate data

• No chunks written

• Large volume of input data

Matrix Multiply

Leaf Task: Product of two 4-by-4 matrices

16 dot products of four-element vectors

[Diagram labels: Multiplies; Operations]

• Each input chunk used many times

• Result chunks written to memory

• No chunks written

• Relatively small input data
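The matrix-multiply leaf task can be sketched as 16 four-element dot products. The function name and the list-of-lists layout are assumptions. The operation counts are computed by the loop itself and are not taken from the slide.

```python
def matmul_4x4(a, b):
    """Leaf task: C = A * B computed as 16 four-element dot products."""
    n = 4
    c = [[0] * n for _ in range(n)]
    multiplies = adds = 0
    for i in range(n):
        for j in range(n):
            # Dot product of row i of A with column j of B.
            acc = a[i][0] * b[0][j]
            multiplies += 1
            for k in range(1, n):
                acc += a[i][k] * b[k][j]
                multiplies += 1
                adds += 1
            c[i][j] = acc
    return c, multiplies, adds


identity = [[1 if i == j else 0 for j in range(4)] for i in range(4)]
m = [[4 * i + j for j in range(4)] for i in range(4)]
product, multiplies, adds = matmul_4x4(identity, m)
```

Each row of A and each column of B takes part in four of the 16 dot products, which illustrates the reuse of input data noted above.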

Fast Fourier Transform

Leaf Task: Group of Four Butterfly Computations

[Diagram labels: Eight Data Samples; Four Twiddle Factors; Multiplies; 10 Operations; BFLY; Eight Results; One Butterfly; Four Butterflies]

• Log2 (n) stages

• Intermediate data

• Chunks written and read
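A group of four butterflies can be sketched with Python complex numbers. The radix-2 butterfly form and the pairing of sample k with sample k + 4 are assumptions, since the slide does not give the data layout. The function names are hypothetical.

```python
import cmath


def butterfly(a, b, w):
    """Radix-2 butterfly: one complex multiply and two complex adds."""
    t = w * b
    return a + t, a - t


def butterfly_group(samples, twiddles):
    """Leaf task: four butterflies over eight samples and four twiddle factors."""
    assert len(samples) == 8 and len(twiddles) == 4
    tops, bottoms = [], []
    for k in range(4):
        top, bottom = butterfly(samples[k], samples[k + 4], twiddles[k])
        tops.append(top)
        bottoms.append(bottom)
    return tops + bottoms  # eight results


# Twiddle factors for the last stage of an 8-point transform.
twiddles = [cmath.exp(-2j * cmath.pi * k / 8) for k in range(4)]
results = butterfly_group([1, 0, 0, 0, 1, 0, 0, 0], twiddles)
```

One plausible accounting for the 10 operations per butterfly is in real arithmetic: the complex multiply costs 4 multiplies and 2 adds, and the two complex adds cost 4 more adds.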

Simulated System Model

[Diagram labels: Processors: 16, 24, 32, 40; Switch; Memory; Up to 64 Independent Storage Units]

Simulation Modes

Non-Blocking:

A task executing a read simply waits for the operation to complete.

Models an L2 cache with short access time.

Blocking:

A task executing a read suspends to permit other tasks to run.

Models a main memory with long access time.
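The difference between the two modes can be illustrated with a toy single-processor model. This is not the simulator used for the results, and every parameter (`tasks`, `reads`, `compute`, `latency`, `switch_cost`) is a hypothetical quantity in abstract cycles. Each task issues `reads` reads and computes for `compute` cycles after each one.

```python
import heapq
from collections import deque


def non_blocking_time(tasks, reads, compute, latency):
    """Non-Blocking mode: each read stalls the processor until data arrives."""
    return tasks * reads * (latency + compute)


def blocking_time(tasks, reads, compute, latency, switch_cost):
    """Blocking mode: each read suspends the task so another task can run."""
    ready = deque((task, reads, False) for task in range(tasks))
    waiting = []  # min-heap of (wake_time, task, reads_left)
    now = 0
    while ready or waiting:
        if not ready:  # nothing runnable: idle until the next read returns
            now = max(now, waiting[0][0])
        while waiting and waiting[0][0] <= now:
            _, task, left = heapq.heappop(waiting)
            ready.append((task, left, True))
        task, left, has_data = ready.popleft()
        if has_data:  # consume the chunk delivered by the previous read
            now += compute
            left -= 1
        if left > 0:  # issue the next read and pay for the task switch
            now += switch_cost
            heapq.heappush(waiting, (now + latency, task, left))
    return now


short = (non_blocking_time(8, 4, 10, 2), blocking_time(8, 4, 10, 2, 5))
long_ = (non_blocking_time(8, 4, 10, 100), blocking_time(8, 4, 10, 100, 5))
```

In this toy model, suspending pays off when the access time is long compared with the switch cost, while for short access times the switch cost makes suspending slower than simply waiting.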

Estimated Performance: L2 Cache Model - NonBlocking

[Chart: three panels plotted against Number of Processors (16, 24, 32, 40), with curves A, B, and C for the problem sizes below]

Dot Product: A: 16^3, B: 16^4, C: 16^5

Matrix Multiply: A: 16 x 16, B: 32 x 32, C: 48 x 48

Fourier Transform: A: 1024, B: 2048, C: 4096

Estimated Performance: L2 Cache Model - NonBlocking

[Chart: three panels plotted against Number of Processors (16, 24, 32, 40), with curves for problem sizes A, B, and C]

Dot Product: A: 16^3, B: 16^4, C: 16^5

Matrix Multiply: A: 16 x 16, B: 32 x 32, C: 48 x 48

Fourier Transform: A: 1024, B: 2048, C: 4096

Findings

• Data locality of chunks works well.

• L1 Cache is not very important; only a buffer for input and result chunks of tasks.

• Task switching for chunk reads is costly; percolation would make a big difference.

Percolation: Ensuring that input chunks of a task have been retrieved before the task is scheduled.
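Percolation as defined above can be sketched as a scheduler that retrieves every input chunk before a task is placed on the ready queue. The class name, the `fetch` callback, and the dictionary standing in for local storage are all hypothetical. The sketch is synchronous, so it shows the ordering only and not the overlap of fetches with other work.

```python
from collections import deque


class PercolatingScheduler:
    """Holds each task back until all of its input chunks are local."""

    def __init__(self, fetch):
        self._fetch = fetch      # hypothetical: handle -> chunk data
        self._local = {}         # chunks already retrieved
        self._ready = deque()    # tasks whose inputs are all local

    def submit(self, task, input_handles):
        for handle in input_handles:  # retrieve inputs before scheduling
            if handle not in self._local:
                self._local[handle] = self._fetch(handle)
        self._ready.append((task, input_handles))

    def run_all(self):
        results = []
        while self._ready:
            task, handles = self._ready.popleft()
            # Every input is already local, so the task never suspends on a read.
            results.append(task(*[self._local[h] for h in handles]))
        return results


store = {"a": [1, 2], "b": [3, 4]}
scheduler = PercolatingScheduler(store.__getitem__)
scheduler.submit(lambda x, y: sum(p * q for p, q in zip(x, y)), ["a", "b"])
results = scheduler.run_all()
```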

Further Work

• Model a Three-Level or Four-Level Storage System

• Develop Hierarchical Work Stealing to improve Load Distribution for Massively Parallel Systems.

• Compiler Development: Automatic Mapping of Objects to Trees of Chunks.

• Expand Test Programs to include Transaction Processing and Database Operations.

Distributed Discrete-Event Simulation

• Accurate timing for interconnected communicating components.

• Packet Communication Architecture is a model for distributed systems especially amenable to efficient distributed simulation.

• UDel and MIT have begun a project to develop a PCA-based Simulation Sandbox for evaluating alternate program execution models (PXMs) for massively parallel computing.

Relevance to FSIO

• The Fresh Breeze Memory Model extends to 2^64 chunks; at 128 bytes per chunk that is 2^71 bytes, about 2.4 x 10^21 bytes or roughly 2,400 exabytes.

• The tree-structure is an excellent basis for advanced security and data object sharing.

• Can simplify check-point / restart.

• Seamless transition from “in-core” to “out-of-core” operation of arbitrary parallel programs.

• Provides ability to use any program as a component of new programs – including any parallel program.

Conclusion

• Making the best of many-core technology requires study of new program execution models (PXMs).

• The FSIO challenge can be met by integrating the file system into the system memory hierarchy as a global virtual store.

• The Fresh Breeze PXM demonstrates some of the benefits of departing from conventional system organization.

• More exploration of new PXMs is needed!
