Upload
kelly-woods
View
213
Download
0
Embed Size (px)
Citation preview
Commodity-SC Workshop, Mar00
Cluster-based Visualization
Dino Pavlakos
Sandia National Laboratories
Albuquerque, New Mexico
Commodity-SC Workshop, Mar00
High End Graphics Platforms
Years/Compute Performance
PC Graphics
SGI Graphics
103
107
108
109
106
2004200119993
Tflops10
Tflops100
Tflops
Poly
gon R
enderi
ng R
ate
(M
egapoly
s/S
eco
nd
)
Commodity-SC Workshop, Mar00
Rendering and Sorting
4 X 4Transform
Clipping Rasterizationand
Texture Mapping
Framebuffer
Analog RGBOutput
Sort FirstSort Middle
Sort Last
Polygon Rendering Pipe
Commodity-SC Workshop, Mar00
Tiled vs. Single / Composite Displays
Renderer
Renderer
Renderer
Renderer
Display(s)
Tiled
Renderer
Renderer
Renderer
Renderer
Display
Composite
Commodity-SC Workshop, Mar00
Data Exploration ArchitectureData Exploration Architecture
Reducedpolys/data
ComputeService
UserWork-station
SimulateCompress
DecompressUserdI
render
Vis.Service
Dec/CDDDdDI
Archive
RenderDec/CDI
DataService ImagesPolys/
Data
DataArchive
Big Data
Commodity-SC Workshop, Mar00
Tightly Coupled Compute, Data Services and Visualization
InfiniBand x12Link Speed:6 GB/s Bidirectional(3 GB/s each way)
Aggregate bandwidth across vertical plane:768 GB/s each way, with 256 (16 x 16) rows(exceeds I/O requirements)
16 rows
16 rows
Commodity-SC Workshop, Mar00
Visualization/Data Service Clusters @ SNL
Existing• 16-node SGI/320 NT graphics cluster (GigE)• 72 node Compaq/NT data service cluster
(ServerNet)
Coming• 64 node graphics cluster (ASCI V1 Corridor)• 8-16 node graphics cluster (open testbed)
Commodity-SC Workshop, Mar00
Cluster-based visualization issues
• Rendering scalability vs. interactive latency– Expect good results for rendering large data
– Getting high frame rates (e.g. 60Hz) harder
• Dynamic resource management• Desktop access to large resource• Parallel/Scalable IO• Parallel inter-process communication (runtime visualization
& data services, computational experiments)• Classified/Unclassified use
Commodity-SC Workshop, Mar00
Cluster-based Composite RenderingSimplistic Projection
• 16 node SGI/320 cluster• Peak 4 Million polygons/sec. per node• 16 x 4 = 64 Million polygons/sec. peak (perfect scaling)
• Assume – 64 Million-Polygon surface data
– Sort-last rendering (HW-accelerated)
– 1K x 1K Display
• Render 64M polygons in 1 sec.• Add .84 sec. composite time (Compaq NT cluster / ServerNet)
– gives 35M polygons/sec.
• Add .16 sec. composite time (ASCI Red)– gives 55M polygons/sec.
Commodity-SC Workshop, Mar00
Parallel Visualization Abstract Partitioning Model
DataRepository/
Buffer
DataRepository/
Buffer
DataInterface
SimulationCode
VisualizationModule
Data ServiceModule
...
...Control
Interface
(Parallel Disk,Shared Memory,Dist. MemoryBuffer, …)
AbstractDataSpace
Commodity-SC Workshop, Mar00
Do-It-All on C-Plant Instantiation
Node1
Node2
NodeN
Disk1
Disk2
DiskN
...
...
...
Comp 1
Comp 2
Comp N
AbstractDataSpace