Accelerating EMT simulation of large-scale power system using SW26010 many-core processors
Ying Chen, Laijun Chen
2019.08
Outline
• Challenges on EMT simulations of large-scale power system
• High Performance Simulation on Cloud Platform
• Architecture of Sunway26010 and Sunway-TaihuLight
• Accelerating EMT simulation on SW26010
• Conclusion
1. Challenges on EMT simulations of large-scale power system
2. High Performance Simulation on Cloud Platform
[Figure: cloud platform architecture linking the user, web server, data server, and computation servers (GPU-based computing nodes and Sunway SW26010)]
• Web server on Alibaba Cloud: create & manage simulations, get results
• Computation servers: GPUs and Sunway SW26010
• EMT simulations are accelerated by parallel computing
• All technical solutions are transparent to the end user
• Designed by Tsinghua, powered by Sunway
How can batched EMT simulations of large-scale AC/DC power systems be accelerated on such a new computation platform?
3. Architecture of Sunway26010 and Sunway-TaihuLight
• SW26010: Sunway 260-Core Processor
[Figure: SW26010 block diagram. Four core groups (Core Group 0 to 3) are connected by a network on chip (NoC); each core group contains one MPE, an 8*8 CPE mesh, a PPU, and an iMC attached to memory. Inside the 8*8 CPE mesh, each computing core has registers and an LDM; the labels also show a row communication bus, a column communication bus, a control network, a data transfer network, and a transfer agent (TA). Levels shown: memory level, LDM level, register level, computing level.]
• Sunway-TaihuLight: 10M-core system
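The core counts quoted on these slides can be checked with a few lines of arithmetic. The sketch below assumes four core groups per chip, each with one MPE and an 8*8 CPE mesh, as labelled in the SW26010 diagram. The node count of 40,960 is the publicly reported size of Sunway TaihuLight; it does not appear on the slides and is included here only as an assumption.

```python
# Core-count arithmetic for SW26010 and Sunway TaihuLight (illustrative only).
CORE_GROUPS_PER_CHIP = 4     # Core Group 0..3 in the slide diagram
MPES_PER_GROUP = 1           # one management core per core group
CPES_PER_GROUP = 8 * 8       # 8*8 CPE mesh per core group
TAIHULIGHT_NODES = 40_960    # assumption: publicly reported figure, not from the slides

cores_per_chip = CORE_GROUPS_PER_CHIP * (MPES_PER_GROUP + CPES_PER_GROUP)
total_cores = cores_per_chip * TAIHULIGHT_NODES

print(cores_per_chip)  # 260
print(total_cores)     # 10649600
```

Under these assumptions the chip has 260 cores and the full machine has roughly 10.6 million, which is consistent with the "260-Core" and "10M-core" labels.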
• HPC Performance Features
• Programming Model on TaihuLight
MPI + X (X = Sunway OpenACC or Athread)
• MPI: one MPI process runs on one management core (MPE).
• Sunway OpenACC: transfers data between main memory and local data memory (LDM), and distributes the kernel workload to the computing cores (CPEs).
• Athread: the threading library that manages threads on the computing cores (CPEs); the Sunway OpenACC implementation is built on it.
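The division of labour described above (one MPE process handing work to 64 CPEs, with data staged through each CPE's LDM) can be illustrated with a small planning routine. This is only a sketch in Python, not Athread or Sunway OpenACC code: the function name `plan_ldm_tiles`, the 64 KB LDM size, the 8-byte item size, and the two-buffer split are all assumptions made for illustration.

```python
from typing import List, Tuple

def plan_ldm_tiles(n_items: int,
                   n_cpes: int = 64,             # 8*8 CPE mesh
                   ldm_bytes: int = 64 * 1024,   # assumption: LDM capacity per CPE
                   item_bytes: int = 8,          # assumption: double-precision values
                   buffers: int = 2              # assumption: one input and one output buffer
                   ) -> List[List[Tuple[int, int]]]:
    """Return, for each CPE, the (start, stop) index tiles it would process.

    Each CPE gets one contiguous block of the array; the block is cut into
    tiles small enough that `buffers` copies of a tile fit in the LDM.
    """
    tile_items = ldm_bytes // (item_bytes * buffers)
    if tile_items <= 0:
        raise ValueError("LDM too small for a single item")

    base, extra = divmod(n_items, n_cpes)
    plan = []
    start = 0
    for cpe in range(n_cpes):
        count = base + (1 if cpe < extra else 0)   # spread the remainder evenly
        stop = start + count
        tiles = [(s, min(s + tile_items, stop))
                 for s in range(start, stop, tile_items)]
        plan.append(tiles)
        start = stop
    return plan

plan = plan_ldm_tiles(1_000_000)
print(len(plan), plan[0][:2])
```

On the real machine each tile would correspond to one transfer from main memory into LDM, one kernel invocation on the tile, and one transfer back; the plan only shows how the index space could be carved up.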
4. Accelerating EMT simulation on SW26010
• Hybrid-grained parallel solution – Multi-grained system decomposition
Inter-region Level
• Long-distance transmission line decomposition
Intra-region Level
• MATE-based decomposition and coordination
Computation Graph Level
• Homogeneous computation (e.g., state update of electrical components)
• Heterogeneous computation (e.g., complicated control system solution)
• Network solution (e.g., SparseLU, SparseFS/BS)
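The intra-region level can be illustrated with a deliberately tiny MATE (multi-area Thevenin equivalent) example. The sketch below assumes two subsystems that each consist of a single node with a conductance to ground and a current injection, joined by one resistive link; the function names and the numbers are hypothetical. A real implementation would work with sparse matrices per subsystem, but the coordination step has the same shape: each subsystem reports a Thevenin equivalent, the link current is solved, and each subsystem then updates its own voltages independently.

```python
def mate_link_current(g_a, i_a, g_b, i_b, r_link):
    """Link current from the Thevenin equivalents of the two subsystems."""
    e_a, z_a = i_a / g_a, 1.0 / g_a   # open-circuit voltage and impedance of A
    e_b, z_b = i_b / g_b, 1.0 / g_b   # same for B
    return (e_a - e_b) / (z_a + z_b + r_link)

def solve_with_mate(g_a, i_a, g_b, i_b, r_link):
    """Solve each subsystem on its own once the link current is known."""
    i_link = mate_link_current(g_a, i_a, g_b, i_b, r_link)
    v_a = (i_a - i_link) / g_a        # link current leaves subsystem A
    v_b = (i_b + i_link) / g_b        # and enters subsystem B
    return v_a, v_b, i_link

def solve_full_network(g_a, i_a, g_b, i_b, r_link):
    """Reference: solve the coupled 2x2 nodal equations by Cramer's rule."""
    y = 1.0 / r_link
    a11, a12, a21, a22 = g_a + y, -y, -y, g_b + y
    det = a11 * a22 - a12 * a21
    v_a = (i_a * a22 - a12 * i_b) / det
    v_b = (a11 * i_b - a21 * i_a) / det
    return v_a, v_b

v_a, v_b, i_link = solve_with_mate(2.0, 10.0, 4.0, 4.0, 0.25)
print(v_a, v_b, i_link)                              # 3.0 2.0 4.0
print(solve_full_network(2.0, 10.0, 4.0, 4.0, 0.25))  # (3.0, 2.0)
```

Both routes give the same node voltages, which is the point of the decomposition: only the small link equation couples the subsystems.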
[Figure: mapping of the multi-grained decomposition onto the hardware. Regional power grids (PG) are separated by long-distance transmission lines and assigned to supernodes (Supernode 1#, Supernode 2#). Within a supernode, core groups are coupled through the MATE protocol over the NoC. Within each core group, the E, N, and C blocks run on the MPE and CPEs.]
• Hybrid-grained parallel solution – Parallel processing of electrical system
Homogeneous Computation
• Solve Norton equivalent currents
• Basic fused multiply-add (FMA) instructions
$$
\begin{aligned}
I(t) &= G\,U(t) + I_{ne}(t) \\
I_{ne}(t) &= P\,I(t-\Delta t) + Q\,U(t-\Delta t) + I_C(t)
\end{aligned}
$$
Network Solution
• Sparse direct solver on SW26010
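The homogeneous computation lends itself to a short sketch. It assumes the Norton history current of every component is updated as `i_ne = p * i_prev + q * u_prev + i_c`, which is two multiply-add operations per component, and that the coefficients come from trapezoidal discretization. The helper names `trapezoidal_coefficients` and `update_norton_currents` are hypothetical.

```python
def trapezoidal_coefficients(kind, value, dt):
    """Return (g, p, q) for a branch modelled as i(t) = g*u(t) + i_ne(t).

    kind: "R" (ohms), "L" (henries) or "C" (farads); dt is the time step.
    """
    if kind == "R":
        return 1.0 / value, 0.0, 0.0          # no history term
    if kind == "L":
        g = dt / (2.0 * value)
        return g, 1.0, g                      # i_ne = i_prev + g*u_prev
    if kind == "C":
        g = 2.0 * value / dt
        return g, -1.0, -g                    # i_ne = -i_prev - g*u_prev
    raise ValueError(f"unknown component kind: {kind}")

def update_norton_currents(p, q, i_prev, u_prev, i_c):
    """Same multiply-add pattern for every component (element-wise)."""
    return [pk * ik + qk * uk + ck
            for pk, qk, ik, uk, ck in zip(p, q, i_prev, u_prev, i_c)]

# Three components with hand-picked coefficients.
i_ne = update_norton_currents(
    p=[1.0, -1.0, 0.0],
    q=[0.5, -2.0, 0.0],
    i_prev=[2.0, 1.0, 7.0],
    u_prev=[4.0, 3.0, 9.0],
    i_c=[0.0, 0.0, 1.5],
)
print(i_ne)  # [4.0, -7.0, 1.5]
```

Because every component runs the same arithmetic on different data, the loop maps naturally onto vector FMA instructions, with components split evenly across the CPEs.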
• Hybrid-grained parallel solution – Parallel processing of control system
1. Calculate current of photovoltaic cell (partial)
$$
I(t) = I_{ph} - I_s\left(e^{\frac{q\,U(t-\Delta t)}{AkT}} - 1\right)
$$
2. Maximum power point tracking (partial)
[Figure: MPPT block diagram with inputs P_PV and U_PV, delay blocks e^{-sT}, difference blocks, a division (N/D) block, and output u_ref]
[Figure: a) Example of calculation for non-linear features of electrical system and control system: computation decomposition of the photovoltaic cell expression into elementary operations. b) Formation of layered directed acyclic graph: directed graph (nodes 1 to 11) → breaking feedback loops → directed acyclic graph (DAG) → layering → layered DAG (LDAG). Legend: operation at T; feedback value at T − ΔT.]
Layered Directed Acyclic Graph (LDAG)
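The formation of a layered DAG can be sketched in two steps: break feedback loops, then group the remaining operations into layers whose members do not depend on each other. In the sketch below, feedback edges are detected as DFS back edges; this is an illustrative choice, and which edge of a loop is cut depends on the traversal order. The slides suggest that feedback values are taken from the previous step, so a cut edge would read a delayed value. The function names and the five-node example graph are hypothetical.

```python
from collections import defaultdict

def break_feedback(nodes, edges):
    """Split edges into forward edges and feedback (back) edges via DFS."""
    succ = defaultdict(list)
    for u, v in edges:
        succ[u].append(v)
    state = {n: 0 for n in nodes}   # 0 = unvisited, 1 = on DFS stack, 2 = done
    forward, feedback = [], []

    def visit(u):
        state[u] = 1
        for v in succ[u]:
            if state[v] == 1:       # edge closes a loop: treat as delayed feedback
                feedback.append((u, v))
            else:
                forward.append((u, v))
                if state[v] == 0:
                    visit(v)
        state[u] = 2

    for n in nodes:
        if state[n] == 0:
            visit(n)
    return forward, feedback

def layer_dag(nodes, edges):
    """Group nodes into layers; a node runs once all its inputs have run."""
    succ = defaultdict(list)
    indeg = {n: 0 for n in nodes}
    for u, v in edges:
        succ[u].append(v)
        indeg[v] += 1
    layer = [n for n in nodes if indeg[n] == 0]
    layers = []
    while layer:
        layers.append(layer)
        nxt = []
        for u in layer:
            for v in succ[u]:
                indeg[v] -= 1
                if indeg[v] == 0:
                    nxt.append(v)
        layer = nxt
    return layers

nodes = [1, 2, 3, 4, 5]
edges = [(1, 3), (2, 3), (3, 4), (4, 5), (5, 3)]   # 5 -> 3 closes a loop
forward, feedback = break_feedback(nodes, edges)
print(feedback)                  # [(5, 3)]
print(layer_dag(nodes, forward)) # [[1, 2], [3], [4], [5]]
```

Operations within one layer have no mutual dependencies, so they can be dispatched to different CPEs at the same time, with a synchronization point between layers.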
Heterogeneous Computation
• Form final control signals P(t), Q(t), G(t), I_C(t)
$$
\begin{aligned}
I(t) &= G\,U(t) + I_{ne}(t) \\
I_{ne}(t) &= P\,I(t-\Delta t) + Q\,U(t-\Delta t) + I_C(t)
\end{aligned}
$$
• Implementation & Case Study
Regional Grid
• 1260 buses (over 4000 nodes)
• 243 generators, including AVR and GOV
• 1572 AC transmission lines
• 2 UHVDC lines: 800 kV and 1100 kV
Decoupling Scheme
• 20 Regions (Decoupled by long lines)
• 4 MATE Regions (HVDC station)
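Decoupling by long lines relies on the wave travel time of a line being at least one simulation time step, so that the two ends only exchange values from earlier steps. The sketch below checks that condition. It assumes a lossless travelling-wave line model, a wave velocity equal to the speed of light (overhead lines are somewhat slower), and that the 50 µs in the case study is the simulation time step; the function names are hypothetical.

```python
SPEED_OF_LIGHT_KM_PER_S = 299_792.458   # upper bound on wave velocity

def travel_time_s(length_km, velocity_km_per_s=SPEED_OF_LIGHT_KM_PER_S):
    """Time for a wave to travel from one end of the line to the other."""
    return length_km / velocity_km_per_s

def can_decouple(length_km, dt_s, velocity_km_per_s=SPEED_OF_LIGHT_KM_PER_S):
    """A line can split two regions if its travel time covers one time step."""
    return travel_time_s(length_km, velocity_km_per_s) >= dt_s

def min_decoupling_length_km(dt_s, velocity_km_per_s=SPEED_OF_LIGHT_KM_PER_S):
    """Shortest line that still satisfies the decoupling condition."""
    return dt_s * velocity_km_per_s

DT = 50e-6   # assumption: 50 µs simulation time step
print(round(min_decoupling_length_km(DT), 2))  # about 14.99 km
print(can_decouple(300.0, DT))                 # True
print(can_decouple(10.0, DT))                  # False
```

Under these assumptions, any line longer than about 15 km qualifies at a 50 µs step, which is why long AC lines are natural cut points between regions.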
[Figure: case-study system diagram showing a generator station and a UHVDC system]
Time Cost per Step (50 µs)
Region type    Max Nodes                           Max Comp. Time
AC Region      309                                 21 µs
MATE Region    92 (total), 12 (Thevenin equiv.)    44 µs
Computing Resources
• 24 chips, 5200 cores (≤ 2.4% of one rack)
Conclusion
• The cloud-based simulation platform proves to be transparent and productive for EMT simulation users
• Using hybrid-grained parallel computing, Sunway processors and servers can accelerate EMT simulations of large-scale AC/DC power systems
Thank you!
Please find more on www.cloudpss.net