Upload
angelica-garrison
View
234
Download
0
Tags:
Embed Size (px)
Citation preview
How Oracle Database 10g Revolutionizes Availability and Enables the Grid
Juan LoaizaVice President, Systems Technologies
Oracle Corporation
Session id: 40164
3
From High Quality Parts to High Quality Systems Traditionally Low Cost = Low Quality High quality systems were built by combining
high quality, high cost parts – Mainframe model Oracle enables a new model Oracle combines high volume inexpensive
processors and storage to produce a high quality system
Unbreakable Inexpensive Systems
4
Low Cost Fault Tolerance
Grid ClustersGrid ClustersLow Cost Fault ToleranceLow Cost Fault Tolerance
Computer Computer FailuresFailures
Data Data FailuresFailures
System System ChangesChanges
Data Data ChangesChanges
UnplannedUnplannedDowntimeDowntime
PlannedPlannedDowntimeDowntime
5
Commercial Grids and Availability
Grid pools standard low cost nodes and modular disk arrays
Perfect for RAC HA Failover can happen to
any node on the grid Grid load balancing will
redistribute load over time
Designed to Tolerate Failures
7
New Economics forData Protection & Recovery
Disk Based RecoveryDisk Based Recovery
Computer Computer FailuresFailures
Data Data FailuresFailures
System System ChangesChanges
Data Data ChangesChanges
UnplannedUnplannedDowntimeDowntime
PlannedPlannedDowntimeDowntime
Trade cheap disk spacefor expensive downtime
8
New World: Disk Based Data Recovery
Disk economics are close to tape
Disk is better than tape– Random access to any data
We rearchitected our recovery strategy to take advantage of these economics
– Random access allows us to backup and recover just the changes to the database
Backup and Recovery goes from hours to minutes
2000’s - 200 GB
1980’s - 200 MB
1000x increase
9
Resiliency using Low Cost Storage
Computer Computer FailuresFailures
Data Data FailuresFailures
System System ChangesChanges
Data Data ChangesChanges
UnplannedUnplannedDowntimeDowntime
PlannedPlannedDowntimeDowntime
Storage FailureStorage Failure
Human ErrorHuman Error
CorruptionCorruption
Site FailureSite Failure
Four Failure Types
10
Data Mirroring with ASM ASM mirrors data across
inexpensive modular storage arrays
No additional logging or expensive NVRAM to recover mirrors
– Database logging recovers mirrors
Automatically remirrors when disk or array fails
Designed to tolerate failures
Failure Resiliency using Low Cost Storage
11
Collapsing the Cost of Human Error
Computer Computer FailuresFailures
Data Data FailuresFailures
System System ChangesChanges
Data Data ChangesChanges
UnplannedUnplannedDowntimeDowntime
PlannedPlannedDowntimeDowntime
Storage FailureStorage Failure
Human ErrorHuman Error
CorruptionCorruption
Site FailureSite Failure
12
Human Error
Goal is to quickly analyze and repair – For Localized damage
Need surgical analysis and repair Example – deleted wrong order
– For Widespread damage Need complete back-out to avoid long downtime Example – batch job deletes this month’s orders
Human Errors
Other Downtime
Single Biggest Cause of Downtime
13
Flashback Time Navigation Flashback Query
– Query all data at point in time
Flashback Versions Query– See all versions of a row between
two times– See transactions that changed the
row
Flashback Transaction Query– See all changes made by a
transactionTx 1
Tx 2
Tx 3
Select * from Emp AS OF ‘2:00 P.M.’ where …
Select * from Emp VERSIONS BETWEEN ‘2:00 PM’ and ‘3:00 PM’ where …
Select * from DBA_TRANSACTION_QUERY where xid = ‘000200030000002D’;
14
Flashback Database
A new strategy for point in time recovery Flashback Log captures old versions of
changed blocks– Think of it as a continuous backup– Replay log to restore DB to time– Restores just changed blocks
It’s fast - recover in minutes, not hours It’s easy - single command restore
Flashback Database to ‘2:05 PM’
“Rewind” button for the DatabaseData Files Flashback
Log
New BlockVersion
Disk Write
Old BlockVersion
15
Flashback Error Correction Recovery at all levels Database Level
– Flashback Database restores the whole database to time
Uses Flashback Logs Table Level
– Flashback Table restores rows in a set of tables to time
Uses UNDO in database– Flashback Drop restores a
dropped table or a index Recycle bin for DROPs
Row Level– Restore individual rows
Uses Flashback Query
Order
Database
Customer
16
Flashback for All UsersEND USER• Flashback Query• Flashback Versions Query
DEVELOPER• Flashback Versions Query• Flashback Transaction Query• Flashback Table
DATABASE ADMIN• Flashback Database• Flashback Drop
SYSTEM ADMIN• Data Guard
17
Revolution in Recovery
Flashback Revolutionizes Recovery– Operates on just the changed data– Time to correct error equals time to make error
Minutes instead of hours
Flashback is Easy– Single command instead of complex procedure
Correction Time = Error Time + f(DB_SIZE)
18
Prevention & Recovery of Corruptions
Computer Computer FailuresFailures
Data Data FailuresFailures
System System ChangesChanges
Data Data ChangesChanges
UnplannedUnplannedDowntimeDowntime
PlannedPlannedDowntimeDowntime
Storage FailureStorage Failure
Human ErrorHuman Error
CorruptionCorruption
Site FailureSite Failure
20
Flash Recovery Area
Fully automatic disk based backup and recovery
– Set and Forget
Nightly incremental backup rolls forward recovery area backup
– Changed blocks are tracked in production DB
Full scan is never needed– Dramatically faster (20x)– Blocks validated to prevent
corruption of backup copy
Use low cost ATA disk array for recovery areaTwo Independent Disk Systems
Flash RecoveryArea
Nightly ApplyValidatedIncremental
WeeklyArchiveTo Tape
DatabaseArea
21
Low Cost No Compromise Disaster Recovery
Computer Computer FailuresFailures
Data Data FailuresFailures
System System ChangesChanges
Data Data ChangesChanges
UnplannedUnplannedDowntimeDowntime
PlannedPlannedDowntimeDowntime
Storage FailureStorage Failure
Human ErrorHuman Error
CorruptionCorruption
Site FailureSite Failure
22
Existing Site Recovery Tradeoffs
User can delay log apply to protect from user errors but:– Failover takes hours– Reports run on hours old data
After failing over to standby, production DB must be rebuilt– Production has updates that did not get to standby
ProductionDatabase
StandbyDatabase
TransactionShipping
4 HourDelayApply
ReportingOn Delayed
Data
23
Low Cost No Compromise Disaster Recovery
Flashback DB removes need to delay apply of logs to correct errors Flashback DB removes the need to reinstantiate primary on failover Real-time log apply enables real-time reporting on standby Data Guard works transparently across GRID clusters
– Standby can use fewer CPU resources than primary
Flashback Log
ProductionDatabase
StandbyDatabase
TransactionShipping
(Real Time Apply)
NoDelay
ReportingOn Real Time
Data
Flashback Log
Some Nodes Used for
Other Computing
24
Highest Data Protection Lowest Cost
Combine the Features to
Achieve Any Level of Data Protection
Dramatic Advances in Ease of Use
ASM Mirroring Storage Failure
Protection
Flashback Human Error
Protection
Data Guard Site Failure ProtectionFlash
Recovery Area
Corruption Protection
27
Allow any change to the system with no downtime
Online ReconfigurationOnline Reconfiguration
Goal
No Cost System Changes
Rolling UpgradesRolling Upgrades
Computer Computer FailuresFailures
Data Data FailuresFailures
System System ChangesChanges
Data Data ChangesChanges
UnplannedUnplannedDowntimeDowntime
PlannedPlannedDowntimeDowntime
28
No Cost System Changes – Capacity on Demand
CPU– Add/remove CPUs on SMP online
Cluster Nodes– Add/remove cluster nodes online– No data movement needed
Memory– Grow and shrink shared memory
and buffer cache online– Auto tuning of memory online
Disk– Add/remove disks online– Automatically rebalance– Move datafiles
29
Rolling Patch Upgrade using RAC
Initial RAC Configuration Clients on A, Patch B
OraclePatch
Upgrades
OperatingSystem
Upgrades
Upgrade Complete
HardwareUpgrades
Clients Clients
Clients on B, Patch A
Patch
1 2
34
A B A B
A BA B
B
APatch
30
Rolling Release Upgrade using Data Guard
Initial SQL Apply Config
Run mixed to testSwitch to B, upgrade A
Upgrade node B to X+1
Major Release
Upgrades
Patch SetUpgrades
Cluster Software & Hardware Upgrades
Clients
LogsShip
Version X Version X
1
Clients
2
Clients
LogsShip
3
Clients
LogsShip
4
UpgradeLogsQueue
X X+1
X+1 XX+1X+1
31
Online RedefinitionOnline RedefinitionEvolution without InterruptionEvolution without Interruption
Competitive pressures demand continual change
Need to change data with no interruption to the application
– location, format, indexing, or even definition
Goal
No Cost Data Changes
Computer Computer FailuresFailures
Data Data FailuresFailures
System System ChangesChanges
Data Data ChangesChanges
UnplannedUnplannedDowntimeDowntime
PlannedPlannedDowntimeDowntime
34
Maximum Availability Architecture (MAA)
Operational Practices are key– Technology alone is not enough
MAA is a blueprint for achieving HA & DR
– Tested, validated, and documented best practices Database, Storage, Cluster,
Network 10 person year effort
– otn.oracle.com/deploy/availability
M.A.A.How to Prevent, Tolerate, & Recover From Outages
Maximum Availability = Unbreakable Architecture + Best Practices
35
Highest Availability at Lowest Cost
Highest Availability– Fault Tolerant Clusters– Flashback Error Correction– Automated Disk Backup– No Compromise Disaster Recovery– Rolling Upgrades– Online Redefinition
At Lowest Cost– Low Cost Grid servers– Low Cost Modular Storage Arrays– Automated & Simple to Use
Oracle10g is Unbreakable & Inexpensive
36
Next StepsHigh Availability Sessions from Oracle
11:00 AM
How Oracle Database 10g Revolutionizes Availability and
Enables the Grid
3:30 PM
Oracle Recovery Manager (RMAN) 10g: Reloaded
5:00 PM
Proven Techniques for Maximizing Availability
8:30 AM
Oracle Database 10g - RMAN and ATA Storage in Action
11:00 AM
Oracle Data Guard: Maximum Data Protection at Minimum Cost
1:00 PM
Oracle Database 10g Time Navigation: Human-Error Correction
4:30 PM
Data Guard SQL Apply: Back to the Future
Wednesday in Moscone Room 304Tuesday in Moscone Room 304
For More Info On Oracle HA Go To http://otn.oracle.com/deploy/availability/
37
Next StepsHigh Availability Sessions from Oracle
8:30 AM in Moscone Room 304
Oracle Database 10g Data Warehouse Backup and Recovery:
Automatic, Simple, Reliable
8:30 AM in Moscone Room 104
Building RAC Clusters over InfiniBand
Thursday
For More Info On Oracle HA Go To http://otn.oracle.com/deploy/availability/
Real Application Clusters
Data Guard
Database Backup & Recovery
Flashback Recovery
LogMiner, Online Redefinition, and Cross Platform Transportable
Tablespaces
Database HA Demos All Four DaysIn The Oracle Demo Campground
AQ&Q U E S T I O N SQ U E S T I O N SA N S W E R SA N S W E R S
39
New Oracle Database 10g HA Features
Clusters Portable Clusterware Cluster file system for
Linux & Windows Automated Patching
Data Guard SQL Apply Support for Longs Support for multi-byte
CLOBs and NCLOBs Support for Index
Organized Tables Simplified zero data
loss failover Real time apply allows
real time reporting Zero downtime
instantiation
Rolling Upgrades Rolling Upgrades Using
Data Guard SQL Apply
Online Redefinition Support of Unique
Indexes One Step Cloning of
Dependent Objects Columns can be
Populated Using Sequences & Sysdate
Signature Based Dependency Tracking Using Synonyms
Online Segment Shrink
Data Guard Generic Data Guard Broker
support for RAC Named Data Guard
Configurations Real Time Apply Flashback Standby
Database Flashback
Reinstantiation Improved Recovery
Parallelism
40
New Oracle Database 10g HA Features
Flash Backup & Recovery
Automated Management of B&R Disk Space
Simplified Backup Using Image Copy
Change Aware Incremental Backups
Incrementally Updated Backups
Compressed archive logs
Tuning Improved Recovery
Parallelism Faster Instance Startup
& Cache Warm
Flashback Flashback Drop Flashback Row History Flashback Table Flashback Transaction
History Flashback Database Better map of time to
SCN for flashback query
LogMiner Automated Specification
of Logs to Mine Support for Shared
Server Configurations Fine Grained
Supplemental Logging
Backup & Recovery Simplified Recovery
Through Resetlogs Restore Tolerates Missing
Backups Proxy Backup of Archives Automated TSPITR
Instantiation Full DB Begin Backup Automated Backup
Channel Failover Simplified RMAN
cataloging of backup files Automated File Creation
during Recovery Drop Database Rename Tablespace