Upload
dangnhu
View
571
Download
10
Embed Size (px)
Citation preview
ANSYS FLUENT Performance
Intel Xeon 5675 Comparison
ANSYS FLUENT 12.1 to ANSYS FLUENT 13.0
Performance Testing Performed
at
QLogic NETtrack Developer Center
Agenda
• Partners in the Testing
• Test Configuration
• Performance Profile
• Difference in Performance Rating
• User and MPI Utilization Differences
• MPI Function Calls
• Message Rate
• NETtrack Developer Center
• ANSYS Customer Test Drive Program
Month DD, YYYYQLogic Confidential2
Partners in the Testing
• The information contained within this document was generated in QLogic’s
NDC laboratory
• Participating parties: ANSYS, Dell, Intel and QLogic
• Cluster Resources: NETTrack Developer Center (NDC) Dell C6100 – Sapphire II
• For more info regarding participants, please refer to the following sites:
www.qlogic.com, www.Dell.com, www.ansys.com, www.Intel.com
Month DD, YYYY3 QLogic Confidential
NDC – Test Configuration
QLogic Confidential4
Server: Dell PowerEdge C6100 (Sapphire-II)
CPU: 2 - Intel Xeon X5675 @ 3.07 GHz (Hex Core)
Mem: 48GB (12x4G) DDR3 @ 1333MHz
OS: RHEL 5.5 / Kernel 2.6.18-194.el5
App: ANSYS FLUENT 12.1.11 and 13.0
MPI: Platform MPI
HCA: QLE7340 - QDR InfiniBand
Stack: OFED 1.5.2
Switch: 12300 - 36-port QDR InfiniBand
F/W: 6.0.2.1.11
Settings: 4K MTU 1 Virtual Lane
Performance Profiles
Month DD, YYYYQLogic Confidential5
Performance Profile
ANSYS FLUENT 12.1 vs. ANSYS FLUENT 13.0
• 14% average performance improvement with ANSYS FLUENT 13.0 across the seven benchmark tests
• Greatest performance improvement shown with Sedan (k-eps Turbulence model with Pressure-Based Coupled Implicit solver). Significant gains due to new hybrid parallel implementation in ANSYS FLUENT 13.0
• Least performance gains shown with Truck_14M and Truck_111M benchmark tests, both tests use DES Turbulence model with Segregated Implicit solver
6 QLogic Confidential
0
2000
4000
6000
8000
10000
12000
14000
Aircraft_2M Sedan_4m Eddy_417k Truck_14M Truck_Poly_14M Truck_111m
ANSYS FLUENT Comparison16-nodes/192-cores
FLUENT r12.1
FLUENT r13.0
11%53%
7%
1% 13%
-2%
Profile: User and MPI Overhead
16-Nodes/192-Cores
• ANSYS FLUENT 13 achieves a 14%
better average rating across the
seven tests. In producing these better
results:
• Utilizes an average of 2% less time in
computing
• Increased MPI communications by an
average of 4%
QLogic Confidential7
Benchmark Name Usage v12.1.11 v13.0 % Change
aircraft_2m Rating 9818.2 10922.9 11%
User Time 39.33% 35.82% -9%
MPI time 60.67% 64.18% 6%
sedan_4m Rating 7912.1 12134.8 53%
User Time 49.62% 49.31% -1%
MPI time 50.38% 50.69% 1%
eddy_417k Rating 12272.7 13110.8 7%
User Time 30.51% 27.81% -9%
MPI time 69.49% 72.19% 4%
truck_14m Rating 1497.1 1511 1%
User Time 48.00% 38.51% -20%
MPI time 52.00% 61.49% 18%
truck_poly_14m Rating 1346.8 1525.7 13%
User Time 47.76% 40.41% -15%
MPI time 52.24% 59.59% 14%
truck_111m Rating 180.6 176.9 -2%
User Time 30.51% 44.99% 47%
MPI time 69.49% 55.01% -21%
Average Change Rating 31%
User Time -2%
MPI time 4%
Fluent Version
Profile: MPI Functions
Eddy_417
• The following are the most used calls– MPI_Iprobe (stat gathering), MPI_Isend,
MPI_Irecv, MPI_ALLreduce
• The following calls produce the most Overhead: MPI_Recv, MPI_Allreduce,
MPI_Barrier, MPI_Init, MPI_Waitall, MPI_Iprobe, MPI_Isend.
Month DD, YYYYQLogic Confidential8
ANSYS FLUENT 12.1.11 ANSYS FLUENT 13.0
Profile: MPI Functions
Truck_111
• The following are the most used calls– MPI_Iprobe (stat gathering), MPI_Isend,
MPI_Irecv, MPI_ALLreduce
• The following calls produce the most Overhead (73%): MPI_Recv, MPI_Allreduce,
MPI_Waitall.
Month DD, YYYYQLogic Confidential9
ANSYS FLUENT 12.1.11 ANSYS FLUENT 13.0
Profile: Message Rate
Eddy_417K & Truck_111M
Eddy_417K Model
• ANSYS FLUENT 13.0 sends 8% less messages at 192-cores
• Average message size at 192-cores is 15% smaller with 13.0
• 97% of the messages are less than 1K in size
Truck_111M Model
• ANSYS FLUENT 13.0 sends 57% less messages at 192-cores
• Average message size at 192-cores is 66% larger with13.0
• 98% of the messages are less than 65K in size
Month DD, YYYYQLogic Confidential10
Take a
Test
Dri
ve
QLogic NETtrack Developer Center
Test Drive Program
• Test Drive your models/simulations at the QLogic NETtrack Developer
Center
• Range of Configurations
• Nodes, Cores, Memory
• Interconnects
• Storage
• Determine the Optimal Performance Setup for your Business
Get Started – Contact us at [email protected]
Month DD, YYYY13 QLogic Confidential