ASH and AWR on DB12c

Preview:

Citation preview

© 2015 Delphix. All Rights Reserved. Private & Confidential.

Performance Data in Database 12cKnowledge is Power

AWR and ASH with EM13c

Kellyn Pot’Vin-Gorman, Technical Intelligence ManagerOctober 2016

Who am I?

• Optimization- Tune for Time or You’re Wasting Time.• Know your goal(s)• Set a stopping point, avoid OTD, (Obsessive Tuning Disorder)• Do NOT assume. Always do the research and have data behind findings.

Stay on the Path…

• ASH= Active Session History• AWR= Automatic Workload Repository• Introduced in Oracle 10g• Evolution to statspack, requests for performance reporting improvements.• “Always on” approach to performance metrics with requirement of non-

locking collection process.• Requires Management Diagnostic Pack License from Oracle.

Brief History

The Location in EM12c For Some of Today’s Presentation…

Always on with default intervals of 1hr snapshots and 8 days retention. Should have at least 60 days of retained data. Desire more? Have an AWR Warehouse. Interval increase? Use this during workload testing, otherwise, take a manual

snapshot:EXEC DBMS_WORKLOAD_REPOSITORY.create_snapshot;

Automatic Workload Repository, (AWR) Reports

Buffer writes one direction,we read the other!

• To inspect a database level issues, for both a small window of time to extended window.

• Extensive information in report, knowing HOW to parse through the report to achieve goal is important.

• Via EM12c, the report is offered in HTML format and will be environment aware, (single instance, RAC, Exadata.)

• Different reports available from the command line when running from the $ORACLE_HOME/rdbms/admin directory and can be generated in HTML or TXT format.

AWR Reports Are Best Used For..

Wasting Time….

Rarely is there value in this section. As long is everything contains high percentages, move on.

AWR- Top 10 Foreground

•CPU is expected and should be the majority of time.•CPU processing can be extensive though, (still needs to be investigated)•Anything under 10% commonly is disregarded.•Understand what each wait event definition is

Top SQL

Focus on Elapsed Time, but…Displays Top SQL by:CPUIOGetsReads, etc…

Full SQL Statements

Linked from Top SQL Lists in HTML report via SQL_ID links.Quick reference when needed.

SGA “Thrashing”

Why PGA is Important

What is an optimal vs. 1 or (M)ulti-pass executions?

Why Can’t I Achieve 100%??

You shall not pass! (optimal, so 90%, that is… )

Percentages and Amounts of Reads are Important

Top two objects correspond to SQL statements in the top IO and most likely top SQL by elapsed time.

What is a Direct Physical Read?

•Inefficient SQL and objects that have high quantity of row changes involved.•Adds significant pressure in RAC environment, too!

Un-optimized vs. Optimized?

Percentages are low per object, (under 10%, which is a good sign for any database!)This is an exadata, so it means they are either not in the buffer cache or the smart flash cache, which means un-optimized”.This is “somewhat” expected and don’t panic unless you see high percentages.

Initial Transaction Locks

10% rule applies here, too!ALTER TABLE <name> initrans <xx>;

RAC Interconnect Exchange

•Data from V$SYSMETRIC_HISTORY •High Exchange rate can signal and issue.

RAC Cache Interconnect Stats

Transfer Rate Between RAC Nodes

• Excellent for identifying specific issues in database.• Identifies not just the top SQL, but code.• Shows top wait events by sample time.• Don’t confuse samples with AWR snapshots.• Should not be used to track # of executions.

ASH Reports

Buffer writes one direction,We read the other!

Running ASH Report from Cloud Control• ASH is by time, not snapshot.• Set start date and time.• End date and time• Generate report

HTML Format ASH

Main ASH Info

Top SQL, Top Sessions

Top SQL Details

Top Parallel, Top DB Files

ASH Report- Use Case

@$ORACLE_HOME/rdbms/admin/ashrpt.sql; Report Format: Text Performance Issue during day, need to know what’s going on! Run ASH Report from the command line with SQL*Plus:

“Interesting Part”

Finally!

Select * from table(dbms_xplan.display_awr(‘43mp3mjufgnkg’));

AWR and ASH from the Command Line Interface

All DBAs should know how to do this!

$ORACLE_HOME/rdbms/admin/awrrpt.sql;$ORACLE_HOME/rdbms/admin/ashrpt.sql;

$ORACLE_HOME/rdbms/admin/awrsqrpt.sql;Less Known AWR Reports:awrinfo.sql General AWR Infoawrddrpt.sql Comparison report between snapshotsawrblmig.sql Migrates pre-11g baseline data into 11g Baseline tables.awrgrpt.sql RAC Aware AWR Report.

Running Reports, Command Line

Snapshot Interval Information Basic Info on Instances and Nodes No User or Application Schema info. Space Usage by SYSAUX WRH$ and Non- AWR Objects, ordered by size Snapshot info and if any errors. Advisor Tasks

AWR Info Report

AWR General Information Report

ASH Info Report

@$ORACLE_HOME/rdbms/admin/ashrpt.sql; Report Format: Text or HTML

SQL_ID Specific AWR Report

• More defined reporting• No need to pull full report• Detail on waits that are of interest• Join to non-AWR objects

• Examples and Ideas…

Querying ASH Data Directly

• SAMPLE_ID- This is a unique identifier within an ASH sample.• SAMPLE_TIME- A unit of time used by Active Session History, (not to be confused with DB_TIME)• USER_ID- Identifier for a user that’s executing the session.• SESSION_ID- Same as the SID or Session ID and can be used to join to SID in other views/tables.• SESSION_STATE- What was the state of the session when ASH recorded the sample.• ON CPU/WAITING- The two session states in Active Session History. ON CPU is active, vs.

Waiting, which is self-explanatory.• EVENT- Type of event that the session is currently active or waiting on.• TIME_WAITED- How long the session has been waiting if waiting.• WAIT_TIME- Confusing- but this is populated by any wait time if the session is currently active and

for the previous waits.• SQL_ID- The unique identifier for the SQL statement being executed.• SQL_CHILD_NUMBER-The cursor child number.

V$ACTIVE_SESSION_HISTORY

Select ROUND(RATIO_TO_REPORT(SUM(1)) OVER () * 100 ,2) PERCENTAGE,ash.session_type SESS_TYPE, session_state STATUS, decode(nvl(sql_id,'-1'),'-1','nonsql','sql') SQL_TYPE, count(distinct to_char(session_id)|| to_char(session_serial#)) SESS_CNTfrom v$active_session_history ashwhere sample_time > sysdate - 30/(24*60) and ( ( ash.session_state = 'ON CPU’ ) or ( ash.session_type != 'BACKGROUND' ))group by ash.session_type, ash.session_state, decode(nvl(sql_id,'-1'),'-1','nonsql','sql')order by count(*)/

Session Averages

Session Avg. Output• Note the % of Background processes

Inspecting Whatselect * from (select ash.SQL_ID , ash.SQL_PLAN_HASH_VALUE Plan_hash, aud.name type, sum(decode(ash.session_state,'ON CPU',1,0)) "CPU", sum(decode(ash.session_state,'WAITING',1,0)) "WAITING", sum(decode(ash.session_state,'WAITING', decode(wait_class, 'User I/O',1,0),0)) "IO WAIT" , sum(decode(ash.session_state,'WAITING', decode(wait_class, 'User I/O',1,0),0)) "IO" , sum(decode(ash.session_state,'WAITING', decode(wait_class, 'Concurrency',1,0))) "CONCURRENCY" , sum(decode(ash.session_state,'WAITING', decode(wait_class, 'Application',1,0))) "Application" , sum(decode(ash.session_state,'ON CPU',1,1)) "TOTAL"from v$active_session_history ash, audit_actions aud where SQL_ID is not NULL and ash.sql_opcode=aud.action and ash.sample_time > sysdate - &minutes /( 60*24)group by sql_id, SQL_PLAN_HASH_VALUE , aud.name order by sum(decode(session_state,'ON CPU',1,1)) desc) where rownum < 5;

10 Min. View of Waits by SQL_ID• Choose Time in Minutes To Review, (10 in our

example)• SQL_ID and Plan Hash Value Shown• Waits for CPU, Wait, IO Wait and others.

Quantity of Events Occurred Over Small Amounts of Time

Col event for a50select event, count(1)from v$active_session_historywhere sample_time between to_date('21-FEB-14 01.43.00 PM','dd-MON-yy hh:mi:ss PM') and to_date('21-FEB-15 01.53.00 PM','dd-MON-yy hh:mi:ss PM')group by eventorder by event;

Results, Where to Focus?

Transaction Wait Detailselect to_char(sample_time,'HH:MI') st, substr(event,0,20) event, ash.session_id sid, mod(ash.p1,16) lm, ash.p2, ash.p3, nvl(o.object_name,ash.current_obj#) objn, substr(o.object_type,0,10) otype, CURRENT_FILE# fn, CURRENT_BLOCK# blockn, ash.SQL_ID, BLOCKING_SESSION bsidfrom v$active_session_history ash, all_objects owhere event like 'enq: TX%' and o.object_id (+)= ash.CURRENT_OBJ# and sample_time > sysdate - 10/(60*24)order by sample_time;

Transaction Lock Output What TX row locks are occurring!

Knowing What’s in the ASH BufferDeters from making assumptions on what data is being

queried.Know your samples!

Wait Events Across Nodes

Query top 10 SQL_ID’s in the last 10 minutes?

SQL_ID and CPU Usage

IO Waits by Object from ASH

SQL Text with ASHSQL for most recent five minutes of sample data from ASH

SQL ResultsSQL_ID, SQL Text, Sample Time that Process was captured

in.

Graphing From the CLI, via Kyle Hailey-

Formatting and Setupaccept hours prompt "hours (default 12) : " default 12 select &hours f_hours from dual;select 3600 f_secs from dual;select &v_secs f_samples from dual;select 30 f_graph from dual;select to_char(to_date(tday||' '||tmod*&v_secs,'YYMMDD SSSSS'),'DD-MON HH24:MI:SS') tm, samples npts,total/&samples aas, substr(substr(substr(rpad('+',round((cpu*&v_bars)/&samples),'+') || rpad('-',round((waits*&v_bars)/&samples),'-') || rpad(' ',p.value * &v_bars,' '),0,(p.value * &v_bars)) || p.value || substr(rpad('+',round((cpu*&v_bars)/&samples),'+') || rpad('-',round((waits*&v_bars)/&samples),'-') || rpad(' ',p.value * &v_bars,' '),(p.value * &v_bars),10) ,0,30) ,0,&v_graph)graph,total,cpu, waits from (

URL to Kyle Hailey’s Original, Fully Formatted Query

select to_char(sample_time,'YYMMDD')tday , trunc(to_char(sample_time,'SSSSS')/&v_secs) tmod , sum(decode(session_state,'ON CPU',1,decode(session_type,'BACKGROUND',0,1))) total , (max(sample_id) - min(sample_id) + 1 ) samples , sum(decode(session_state,'ON CPU' ,1,0)) cpu , sum(decode(session_type,'BACKGROUND',0,decode(session_state,'WAITING',1,0))) waits from v$active_session_history where sample_time > sysdate - &v_hours/24 group by trunc(to_char(sample_time,'SSSSS')/&v_secs), to_char(sample_time,'YYMMDD')union all select to_char(sample_time,'YYMMDD')tday, trunc(to_char(sample_time,'SSSSS')/&v_secs) tmod , sum(decode(session_state,'ON CPU',10,decode(session_type,'BACKGROUND',0,10))) total , (max(sample_id) - min(sample_id) + 1 ) samples, sum(decode(session_state,'ON CPU' ,10,0)) cpu , sum(decode(session_type,'BACKGROUND',0,decode(session_state,'WAITING',10,0))) waitsfrom dba_hist_active_sess_history where sample_time > sysdate - &v_hours/24 and sample_time < (select min(sample_time) from v$active_session_history) group by trunc(to_char(sample_time,'SSSSS')/&v_secs), to_char(sample_time,'YYMMDD')) ash, v$parameter pwhere p.name='cpu_count'order by to_date(tday||' '||tmod*&v_secs,'YYMMDD SSSSS'); **Thanks to Kyle Hailey for this great graph via the CLI

Pivot the Wait Events

Digging into History• DBA_HIST_ACTIVE_SESS_HISTORY

SNAP_ID SAMPLE_ID SAMPLE_TIME SESSION_ID USER_ID SQL_ID WAIT_CLASS SESSION_STATE PGA_ALLOCATED

Process InformationSELECT * FROM ( SELECT /*+ PARALLEL */ count(*) AS count, user_id, program, module, sql_id FROM SYS.DBA_HIST_ACTIVE_SESS_HISTORY WHERE sample_time > TO_DATE('19-FEB-2014 03.00.00 PM','dd-MON-yy hh:mi:ss PM') AND sample_time < TO_DATE('19-FEB-2014 08.00.00 PM','dd-MON-yy hh:mi:ss PM') AND program LIKE 'oracle@%' GROUP BY user_id, program, module, machine, sql_id ORDER BY count(*) desc)WHERE rownum <= 20/

Results of Process History

Tyler Muth ASH Mining Query

ASH Mining Output

Additional Options:• Physical Read Averages• Physical Writes, (Max/Averages)• Redo Info• Login Info• Hard Parsing, etc.

Best Practice When Querying ASH Data

Keep it Simple and don’t reinvent the wheel. Again- samples are an alias for time, not for counts. Understand what is valuable and compare to packaged

reports. Be aware on RAC of node specific data. Take care when querying Obj#, File# and Block#, (still issues

in different versions…) Check the time that is available in buffer, don’t assume!

• One More way to identify performance issues.• Monitoring view ease for those less familiar with database performance.

SQL Monitor, EM12c Style

SQL Monitor Dashboard

Status of Statement

Wait EventsDegree of Parallelism

SQL_ID

SQL Text

• Drill down to specific statement within SQL Monitor will display offload efficiency per statement.

Exadata and Offloading

Full Detail of SQL Execution

View Report

SET LONG 1000000SET LONGCHUNKSIZE 1000000SET LINESIZE 1000SET PAGESIZE 0SET TRIM ONSET TRIMSPOOL ONSET ECHO OFFSET FEEDBACK OFF

SELECT DBMS_SQLTUNE.report_sql_monitor( sql_id => '5vh6y3b7tnv8r', type => 'TEXT', report_level => 'ALL') AS reportFROM dual;

SQL Monitor Report via the Command Line Interface

Text Output of SQL Monitor

One of the Best & Least Used Features in Enterprise Manager: Search SQL

Problem Query

4v2tsp8dz0nhn is our SQL_IDGo to the EM Console, (Example is EM12c)

We Have the SQL_ID, What Next?

Choose AWR Snapshots, (change Time Period), AWR Baselines and put SQL_ID

Search SQL Interface

SQL_ID link for SQL Details Split up by tabs for Cursor, AWR, Baselines and SQL Tuning Sets Plan Hash Value Elapsed Time

Click on Search

Snapshot Data

Using the information provided by Search SQL, locate the correct plan hash value to create a profile from.

Identify

SQL Details

AWR Report or Run ADDM Report

Baseline Impact?

Third Tab contains Baseline Information and links to verify if implemented.

• Tyler Muth: http://tylermuth.wordpress.com/• Kyle Hailey, John Beresniewicz, Graham Wood: http://ashmasters.com/• Mine- “For the Love of ASH and AWR” http

://dbakevlar.com/2011/02/for-the-love-of-awr-and-ash/• Using AWR Reports:

http://dbakevlar.com/2015/01/working-with-awr-reports-from-em12c/• How to Use an ASH Report:

http://dbakevlar.com/2015/02/how-to-use-an-ash-report-and-why/• SQL ID Specific Performance Information:

http://dbakevlar.com/2015/05/sql-id-specific-performance-information/

AWR/ASH Links/Blogs

Q&A

Thank you!