Upload
georgia-martin
View
221
Download
1
Embed Size (px)
Citation preview
Oracle Confidential – Highly Restricted 1
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. | Oracle Confidential – Highly Restricted 2
Safe Harbor StatementThe following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, and timing of any features or functionality described for Oracle’s products remains at the sole discretion of Oracle.
Essbase and ExalyticsSubtitle
Kumar RamiayerSr. Director Essbase Development
Steve LiebermenschDirector, Product Management – Essbase
October 2, 2014
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. |
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. | 4
Introducing X4-4Subject to Change
• New Intel Processor – Designed for Oracle– Intel Xeon E7-8895 v2 – 60 Cores (4*15)– Up to 1.8X throughput increase with the same software configuration
• Clock Speed range between 2.8 - 3.6GHz– Manually controlled or automated
• Memory and disk resources identical to X3-4– 2 TB Memory, 2.4 TB Flash storage
10 20 30 40 50 600.00
5.00
10.00
15.00
20.00
25.00
30.00
2.8 GHz TPS WSM 2.26GHz
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. | 5
Exalytics X4-4• Uses “Designed for Oracle” Intel Processor – E7-8895 v2
– An “All-In-One” CPU - Another major step for Exalytics as an Engineered Systems
• Has as much capacity as the largest E7 Processor (15C)– Larger scalability and consolidation
• Runs Faster than any other Intel 4 Socket Processor (3.6 GHz)– Faster single threaded performance for Essbase
• Addresses key Exalytics use cases– When high scalability is required, opt for more cores with
reduced clock speed– When running batch processes with limited parallelism,
opt for less cores with higher clock speed15C 155W2.8GHz 37.5M
E7-8890 v2
10C 155W3.2GHz 37.5M
6C 155W3.4GHz 37.5M
E7-8891 v2 E7-8893 v2
15C 10C 6C 155W2.8GHz 3.2GHz 3.4GHz max Turbo Freq 3.6GHz 37.5M
E7-8895 v2
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. |
Exalytics Hardware Updates
Exalytics X3-42TB RAM, 2.4TB Flash
Exalytics X2-41TB RAM, 40 cores
Q1FY14 Q2FY14 Q3FY14 Q4FY14 Q1FY15
Exalytics T5-8, Sparc4TB RAM, 128 cores
Exalytics X4-42TB RAM, 60 cores
Q1FY15
PS5PS3 PS4
6
7Copyright © 2014, Oracle and/or its affiliates. All rights reserved. |
Software Updates
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. | 8
Oracle 12c DBIM on Exalytics
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. | 9
Oracle Database In-Memory and Exalytics
+ =Awesome In-Memory
HardwareAwesome In-Memory
DatabaseAnalyticsNirvana
Exalytics is the Fast Path to trying out DB 12c In-Memory Option
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. | 10
12c DB In-Memory Option Certification on Exalytics• Oracle Database and In-memory Option
– Installation on Exalytics– OBIEE 11.1.1.7 certification for 12c
• Summary Advisor• Aggregate Persistence• As a regular data source
• “In-memory Data Mart”• TimesTen in offered parallel to 12c IMDB
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. | 11
HFM Certification• HFM on Linux is coming with EPM PS4 • Certified to run on Exalytics
– Both bare metal and OVM configurations
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. | 12
Essbase Enhancements• Essbase becomes “Pure” in-memory engine as calculation will no
longer wait for I/O (background write)• In-Memory Aggregate Views for ASO • Improved Resource management and CPU utilization
– Thread management and thread based memory allocation– Fundamental improvement to Essbase infrastructure, will impact resource
consumption, stability and performance.
• Leverage X4-4 capabilities by improving scalability
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. | 13
Essbase Performance TPS Scalability Testing• Identical version of Essbase
– Based on an Hyperion Planning customer use case (BSO)
Exalytics X4 vs. X3
X3
X4
X3
X4
X3
X4
Exalytics X4 vs. X3
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. | 14
Exalytics T5-8 PerformanceSoftware/Hardware Optimizations
Script 1 Script 2 Script 3 Script 4 Script 5 Script 6 Script 7 Script 8 Script 9 Script 100
100
200
300
400
500
600
700
800
900
1000
1100
-10%
0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
65%
4%-2%
62%
82%
20%
-4%
70%
35%
57%
SPARC Exalytics – Essbase Calculations in Seconds
11.1.2.3.500 Upcoming Improvement %
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. | 15
Exalytics T5-8 PerformanceSoftware/Hardware Optimizations
FIXP32 CALP320
100
200
300
400
500
600
700
800
53%
54%
55%
56%
57%
58%
59%
60%
56%
59%
Exalytics T5-8 – Essbase Calculations in Seconds
11.1.2.3.500 Upcoming Improvement %
16Copyright © 2014, Oracle and/or its affiliates. All rights reserved. |
Customer Results
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. | 17
Why Exalytics• Pure performance
– Improve the uptime of my system– Increase the volume of data I can process during my update window– Allow users to iterate more times– Allow users to iterate over more data
• Total Cost of Ownership– Server consolidation– Reduced deployment time
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. |
Hyperion Performance with ExalyticsExamples of high performance gains with X improvement factor
Operation Artifact DB Current time Exalytics Desired X Factor
Calc Script TFCCalc1 Fin 8.0 1.2 7
Restructure Fin 120.0 6.4 10 19
Export Fin 12.0 2.8 4
Restructure Fin 480.0 15.2 10 32
Calc Script AggFocus Fin 0.7 0.1 0 13
Calc Script TCalcFC Cap 45.0 8.2 6
Calc Script TFCAgg WrkF 45.0 15.8 3
Calc Script ISAGGAll Fin 120.0 4.7 25
Calc Script CvAggall Cap 180.0 14.7 12
Calc Script WfAggAll WrkF 600.0 22.3 27
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. |
Hyperion Performance with ExalyticsSelected examples of high performance gains
Current Exalytics X improvement
Total Process 8.65 5.59 1.55
AggLMAct 6.26 .73 8.56
AggLM 19.28 2.32 8.31Clear Upper Blocks
Calc 17.21 14.51 1.4
Restructure 2.17 .72 3.02
MyAggAll 20.00 11.08 1.81
Note: Total Process is in hours; all other times are minutes.
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. |
Exalytics POC ResultsScenario # 6 – Query Testing
Single Query Current Exalytics X factor % Improve
France Unit 46.7 5.6 8.3 87.9%
North America Cons 21.4 17.9 1.2 16.5%
Global Cons
127.0 125.1 1.0 1.5%
Mexico Cons 59.0 36.0 1.6 39.0%
Avg 25 Current Exalytics X factor % Improve
France Unit 82.6 18.0 4.6 78.2%
North America Cons 50.0 21.7 2.3 56.6%
Global Cons
567.0 157.0 3.6 72.3%
Mexico Cons 95.0 54.0 1.8 43.2%
Longest 25 Current Exalytics X factor % Improve
France Unit 97.1 19.3 5.0 80.1%
North America Cons 52.0 21.9 2.4 57.9%
Global Cons
768.0 162.0 4.7 78.9%
Mexico Cons
126.0 60.0 2.1 52.4%
-
20.0
40.0
60.0
80.0
100.0
120.0
140.0
-
100.0
200.0
300.0
400.0
500.0
600.0
- 100.0 200.0 300.0 400.0 500.0 600.0 700.0 800.0 900.0
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. |
Exalytics POC ResultsScenario # Load / Concurrency Operation Type Notes on settings/fix
statements/special cases, etc…Current time (hh:mm:ss) Exalytics (hh:mm) x Faster % Improve
1 1 Cube Aggregation
Aggregation process based on query tracking (based on 8 queries)
0:0:35 0:0:13 2.7 62.9%
2 11 cube Rebuild Build/DataLoadLaunch 11 Rebuilds from the LaunchPad
0:41:19 0:24:54 1.7 39.7%
3 20 cube Rebuild Build/DataLoadLaunch 20 Rebuilds from the LaunchPad
2:20:33 1:29:08 1.6 36.6%
4 30 cube Rebuild Build/DataLoadLaunch 30 Rebuilds from the LaunchPad
2:46:37 1:34:26 1.8 43.3%
5 5 Cube AggregationAggregation based on size (200% growth factor)
1:41:57 1:14:08 1.4 27.3%
5a 20 cube AggregationAggregation based on size (200% growth factor)
1:53:54 1:12:46 1.6 36.1%
1 2 3 4 5 5a0
2000
4000
6000
8000
10000
12000
Current timeExalytics
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. | 22
Exalytics vs. ExalyticsSPARC vs. Linux or T vs. X
X series T series
Linux Solaris
60 core Intel 128 core SPARC
2TB RAM 4TB RAM
3.6GHz max speed 3.6GHz
2 IB, 4 10GBE, 2 16GB FC 4 IB, 4 10GBE, 4 8GB FC
2.4 TB Flash 6.4 TB Flash
Platform specific optimizations
Oracle Confidential – Highly Restricted 23Copyright © 2014, Oracle and/or its affiliates. All rights reserved. |
Design Ethos
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. | Oracle Confidential – Highly Restricted 24
Developing Software for Exalytics• Exalytics first
– In place block writes
• Exalytics restricted– FixParallel
• Exalytics only– Chip/OS based optimizations
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. |
Engineering for Exalytics – Large Memory Challenge
Platform and processor
Number of Sockets
Memory Speed (MHz)
Local Memory (ns)
1-HOP remote (ns)
2-HOP remote (ns)
2S WSM-EP 3.3GHz
2 1333 72 119 NA
4S WSM-EX 2.4GHz
4 1067 117 176 NA
8S WSM-EX 2.4GHz
8 1067 162 193 235
2S SNB-EP 2.9GHz
2 1600 76 126 NA
4S SNB-EP 2.7GHz
4 1033 87 152 204
Socket S1
10 cores with L1 and L2 cache
250 GB Memory
Socket S1
10 cores with L1 and L2 cache
250 GB Memory
Socket S1
10 cores with L1 and L2 cache
250 GB Memory
Socket S1
10 cores with L1 and L2 cache
250 GB Memory
Memory
CPU
CPU
CPU
CPU
CPU
CPU
SMP
syst
ems
Mul
ti-So
cket
Lar
ge M
emor
y N
UM
A sy
stem
• Symmetric Multi-processing (SMP) System guarantees uniform memory latency for all CPU, but total memory is limited
• Multi-socket systems are needed for supporting large memory, but the memory latency is not uniform
• Complexity shifts from hardware to software• In-memory analytics require all data in memory for processing
• How does the software utilize all the memory but all work faster?
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. |
Engineering for Exalytics – NUMA Challenges• Two engineering solutions
– Use local socket memory in “critical parts of the software” by using thread affinity– Use padding to avoid false sharing – align important memory structures to cache lines
• Example -- use thread-affinity below to pin thread to a socket
// Suppose thread is scheduled initially in Socket S1. Now memory gets allocated in Socket S1
int * p = (int*)malloc(1024*sizeof(int));
// Compute p[i] and store value in memory – latency 117 ns
p[i] = a * b + 43
// perform I/O – thread yields
Read_From_Disk_Storage(24);
// Now do computation. Thread wakes up – but in a CPU that belongs to different socket – S2
// Accessing memory p[i] is now “remote” and it costs 176 ns
T = p[i] * 32;
// Insert the following code before I/O to // pin the thread to current CPU pthread_setaffinity_np(…)
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. |
Engineering for Exalytics – High CPU CORE count Challenge• Exalytics Software Use Cases
– Large number of users performing fast and small workloads– Small number of users or single user performing a large highly parallelizable workload– Large and Legacy code and scripts that run sequentially but require faster CPU
• Engineering Solutions Adopted– Basic principles of semaphores, mutual exclusion and synchronization don’t help– Any locking is bad and leads to poor utilization of CPU cores – Lockless algorithms based on Intel hardware instructions (compare and swap) were
designed and implemented– We need shared data structures for doing any useful applications, but sharing cannot
be done using typical semaphores use lockless algorithms to reduce contention
Oracle Confidential – Highly RestrictedCopyright © 2014, Oracle and/or its affiliates. All rights reserved. |
Q&A
Copyright © 2014, Oracle and/or its affiliates. All rights reserved. | Oracle Confidential – Highly Restricted 29