96
© 2010 Wellesley Information Services. All rights reserved. Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring Andy Pedisich Technotics

Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

  • Upload
    lindsey

  • View
    39

  • Download
    7

Embed Size (px)

DESCRIPTION

Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring. Andy Pedisich Technotics. In This Session. Let’s talk about your servers Are they struggling to keep up or not even breaking a sweat? - PowerPoint PPT Presentation

Citation preview

Page 1: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

© 2010 Wellesley Information Services. All rights reserved.

Admin Aspirin — Reduce Performance Headaches Through Proper Domain MonitoringAndy PedisichTechnotics

Page 2: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

2

In This Session ...

• Let’s talk about your servers Are they struggling to keep up or not even breaking a sweat? Do you have too many users on a server or could you easily

consolidate the ones you have? Are they having problems, but you’re just waiting for a help

desk ticket to come in before you find out what’s wrong?• This session takes you behind the scenes to show you how you

can be a more powerful administrator using data that’s always there, but a little hard to find

Page 3: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

3

What We’ll Cover …

• Understanding the fundamentals of statistic collection• Exploring the native Notes monitoring system• Taking advantage of server activity logging and trending• Exposing message tracking’s capabilities• Using the Domino Configuration Tuner• Dominating your domain with DDM• Customizing STATREP.NSF to pull out hidden data• Analyzing statistics for proactive problem solving• 10 statistics to monitor for top performance and stability• Wrap-up

Page 4: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

4

The Two Things Needed

• Two things are required for statistics collection: The Collect task must be running on any server that is

designated to collect the statistics Not all servers should run the Collect task

The EVENTS4 database must have at least one Statistics Collection document Statistics should be collected centrally on one or two servers

so that the data is easy to get to Stats should be collected every hour to be effective

EVENTS4 should be the same replica on all servers in the domain

Page 5: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

5

We Know What the Replica ID Should Be for EVENTS4

• Replica ID of system databases, such as EVENTS4.NSF, are derived from the replica ID of the address book

Database Replica ID

NAMES.NSF 852564AC:004EBCCF

CATALOG.NSF 852564AC:014EBCCF

EVENTS4.NSF 852564AC:024EBCCF

ADMIN4.NSF 852564AC:034EBCCF Notice that the first two numbers after the colon for the

EVENTS4.NSF replica are 02 Make sure that EVENTS4.NSF is the same replica ID

throughout the domain by opening it and putting it on your desktop

Page 6: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

6

Want to Add Every EVENTS4.NSF to Your Desktop?

• Add this code to a button on your toolbar This is courtesy of Thomas Bahn

www.assono.de/blog• This code will prompt you to pick the servers that have the

database you want on your desktop Then it will prompt for the name of the database

And open it on all the servers you’ve selected

_names := @Subset(@MailDbName; 1) : "names.nsf"; _servers := @PickList([Custom]; _names; "Servers"; "Select servers"; "Select servers to add database from"; 3); _db := @Prompt([OkCancelEdit]; "Enter database"; "Enter the file name and path of the database to add."; "log.nsf");@For(   n := 1;   n <= @Elements(_servers);   n := n + 1;@Command([AddDatabase]; _servers[n] : _db) )

Page 7: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

7

A Required Design, but No Required Name

• There has to be a STATREP.NSF on every server It is used by the server to store monitoring data

It must be designed using the Statrep5.ntf Monitoring Results template Its default title is Monitoring Results

• But you don’t have to use one of those for your statistic collection repository Create your own collection points and give the database a

unique name

City Collecting Server Monitoring Results DB New York USNYAdmin1 USStatrep.nsf

Amsterdam EUNeHub01 EUStatrep.nsf

Page 8: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

8

What We’ll Cover …

• Understanding the fundamentals of statistic collection• Exploring the native Notes monitoring system• Taking advantage of server activity logging and trending• Exposing message tracking’s capabilities• Using the Domino Configuration Tuner• Dominating your domain with DDM• Customizing STATREP.NSF to pull out hidden data• Analyzing statistics for proactive problem solving• 10 statistics to monitor for top performance and stability• Wrap-up

Page 9: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

9

Event Monitoring Details

• Event monitors are set in the EVENTS4 database• Event generators

Specify what built-in event shouldbe monitored

• Event handlers Specify the action that Domino

takes when an event generator occurs• Here’s the high-level concept of monitoring:

Setup an event generator to capture an important moment Use an event handler to be notified about it

You can also use event handlers to send you a notification of anything that appears on the console or in a LOG.NSF

Page 10: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

10

Event Generators Are Your Friends

• Event generators are pre-set groups of categories to monitor• Some of the more useful are:

Database generators Triggers when the ACL in the address book is changed

Statistical generators Trigger for low disk space, too many messages waiting

Domino server response generator Triggers when servers are not reachable

Task status generator Triggers when HTTP task is down

Page 11: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

11

An Example: Database Event Generator

• The ACL of NAMES.NSF should be monitored for changes in every Notes domain Once properly set, the ACL of NAMES.NSF should rarely

change! All kinds of bells and whistles should go off when it does

We’ll talk about notification in a moment Here’s how to set up the monitoring of the ACL

Select New Database Event Generator

Page 12: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

12

Setting Up the Database Generator

• Select Names.nsf You can choose either a single

server, such as the administration server for the address book, OR

All servers in the domain• I like to pick all servers in the domain

Admins won’t get away with anything!

But I do get a storm of messages when an ACL change occurs Every server tells me about

the change

Page 13: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

13

Another Event Generator Example

• Domino Server Response Event Generator Checks connectivity/port status of server’s network

• One server checks others by sending a probe It’s a good idea to try opening Names.nsf

If you can’t open Names.nsf, then something is wrong!

Page 14: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

14

Event Handlers Set Up Notification

• When configuring an event generator, you can click the button on the final tab for the Event Handler Wizard Simple to do and guarantees it will work the way you want it to The wizard will walk you through the process of creating an

event notification method for the event generator you just made

Page 15: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

15

What We’ll Cover …

• Understanding the fundamentals of statistic collection• Exploring the native Notes monitoring system• Taking advantage of server activity logging and trending• Exposing message tracking’s capabilities• Using the Domino Configuration Tuner• Dominating your domain with DDM• Customizing STATREP.NSF to pull out hidden data• Analyzing statistics for proactive problem solving• 10 statistics to monitor for top performance and stability• Wrap-up

Page 16: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

16

Turning on Activity Logging

• Activity logging is switched on in server configuration documents Select all the server tasks you would like to use to produce

activity logging data If you want to see activity trends, enable all tasks except

Domino.MAIL At a minimum, you must enable Domino.Notes.Session

and Domino.Notes.Database

Page 17: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

17

Check Points and Prime Shift

• Leave the default checkpoint interval at 15 minutes It seems to be sufficient

• We’re surprised some of these are checkboxes since you really need them all for activity logging Log checkpoint at midnight is required for both activity logging

and activity trends Log checkpoint for prime shift is required for activity trends

Enter whatever you consider as your prime shift• And voila! Activity logging is started!

Page 18: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

18

Activity Logging

• Activity logging causes Domino to write information to the LOG.NSF about the tasks you are interested in Expect the log to grow to about 2 GB The data is kept in a few hidden views in a form that is not

meant for humans to see IT is waiting there for us to access in two ways:

Activity analysis Activity trending

Page 19: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

19

Activity Trends Are Very Useful

• You can enable the activity trends collector in the same configuration document that you used to set up activity logging If you activate it in the default config document it creates an

ACTIVITY.NSF database on every server in the domain Then it collects data from the log every night at 3:23 AM

You can run the task manually at the console using Load trends

Page 20: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

20

Activity Data Rocks!

• Activity trending gives you an unbelievable amount of data about your environment such as: How your protocols are used How your servers are utilized during prime time Growth rates for databases

Page 21: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

21

Inactivity Is Important Too

• Inactivity is also tracked Lets you rid your systems of dead mail files and applications

that are truly no longer being used A little bit of investigation using Activity.nsf will probably

save you gigabytes of disk space And you won’t be wasting time backing up obsolete

databases

Page 22: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

22

But Wait, There’s More …

• Activity trending also provides information about user activity It’s probably more information than you need for everyday

administration But it’s great for capacity planning or when you have a server

consolidation to accomplish

Page 23: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

23

Catalog Must Be Enabled

• Make sure there are no errors when the activity trends are rolled out Check the Run Log for details about the collection process

• The catalog task must run on each server you want to include in activity trending You’ll see an error message in the Run Log if there is no

catalog on the server

Page 24: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

24

Demo

Page 25: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

25

What We’ll Cover …

• Understanding the fundamentals of statistic collection• Exploring the native Notes monitoring system• Taking advantage of server activity logging and trending• Exposing message tracking’s capabilities• Using the Domino Configuration Tuner• Dominating your domain with DDM• Customizing STATREP.NSF to pull out hidden data• Analyzing statistics for proactive problem solving• 10 statistics to monitor for top performance and stability• Wrap-up

Page 26: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

26

Message Tracking

• Allows administrators to track all messages Users can track their own messages

• Shows delivery status and current disposition Read/Unread, Deleted

• Configurable Track message subject Don’t track messages from certain users

• Great tool to use when users complain of “lost” email!

Page 27: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

27

Enabling Message Tracking

• From the Server Configuration document Configuration tab in the Administration client Go to the Router/SMTP – Message Tracking tab Complete fields as needed

• Tracking Interval controls how often data is written to the message tracking database The default interval is 15 minutes, which is a great place to start

Page 28: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

28

Access to Message Tracking

• The Access Settings fields are used to limit who can track other users’ messages Include administrators in this field

Maybe Help Desk?• Make sure that LocalDomainServers is included if you need to

track across servers

Page 29: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

Message Tracking Reports

• Daily/weekly/monthly options for all reports: Message volume Largest messages Top 25 next hops Top senders and receivers

By count By size

Message status Delivered/not delivered/in

process

29

Page 30: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

30

Caution Using MTC on iSeries

• Servers have seen the MTC task take up large amounts of CPU Up to >50% on iSeries systems Appears to be related to full text indexing of the MTSTORE.NSF

• To remediate this problem Stop MTC task Delete and recreate FTI on mtdata/mtstore.nsf Start MTC task

This can be done on all platforms I’ve seen the problem on Wintel and UNIX

Page 31: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

31

What We’ll Cover …

• Understanding the fundamentals of statistic collection• Exploring the native Notes monitoring system• Taking advantage of server activity logging and trending• Exposing message tracking’s capabilities• Using the Domino Configuration Tuner• Dominating your domain with DDM• Customizing STATREP.NSF to pull out hidden data• Analyzing statistics for proactive problem solving• 10 statistics to monitor for top performance and stability• Wrap-up

Page 32: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

32

Domino Configuration Tuner

• Domino Configuration Tuner (DCT) produces a list of items scanned and provides complete explanations of each item It’s a fast and easy way to take stock of your domain

Page 33: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

33

Domino Configuration Tuner (cont.)

• Domino Configuration Tuner (DCT) compares your server settings to an IBM catalog of Best Practices DCT looks at settings in the Domino Server documents, the

NOTES.INI file, and advanced database properties It also looks for worst practices

All servers in your domain can be evaluated at once

Page 34: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

34

Batteries Not Included

• The DCT is not included with the Domino install files• You can download the latest and greatest version here:

http://www-01.ibm.com/support/docview.wss?rs=463&uid=swg24019358

• IBM updates the design and rules that the application uses on a regular basis Make sure you use the Check for Updates before you scan

your systems

Page 35: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

35

Run Scan and Wait for the Results

• Clicking on Run New Scan will start the tuner which will access your domain’s address book to pull out a list of servers Select the servers you want the tool to scan

You’ll also have the opportunity to manually add servers and optionally create a name for the scanning

This is a very valuable free asset from IBM

Page 36: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

36

What We’ll Cover …

• Understanding the fundamentals of statistic collection• Exploring the native Notes monitoring system• Taking advantage of server activity logging and trending• Exposing message tracking’s capabilities• Using the Domino Configuration Tuner• Dominating your domain with DDM• Customizing STATREP.NSF to pull out hidden data• Analyzing statistics for proactive problem solving• 10 statistics to monitor for top performance and stability• Wrap-up

Page 37: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

37

DDM Details

• DDM provides a single location where administrators can access issues that are affecting multiple servers and databases Reduces the risk of unplanned downtime

• The DDM database is the central repository of all monitoring data This can be data collected by probes that you can configure Result messages from event generators that you configured in

previous releases of Notes Results for routine checks that run as part of specific server

tasks, such as the router or replicator

Page 38: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

38

Configure DDM for Centralized Data Collection

• DDM.NSF has the most value when it is the central repository for all issues It will contain all of the issues that come from all of the servers

• There is no collection hierarchy set up by default If your collection hierarchy looks like the screen shot below, the

collection hierarchy has not been configured yet

Page 39: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

39

Aggregate Data Centrally

• A DDM server collection hierarchy lets you aggregate the data onto a key server or servers This must be configured in the EVENTS4.NSF

• The simplest hierarchy is to configure one server to collect from all servers in the domain

Page 40: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

40

Rolling Up the Data

• DDM data roll-up propagates the probe results up the DDM server collection hierarchy Data roll-up is accomplished using Domino’s selective

replication to transport the data The replication formulas are created automatically when you

define your DDM server collection hierarchy• Collection replication occurs about every five minutes

It’s hard coded and can’t be changed

Page 41: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

41

Address Issues by Severity Level

• Looking at issues by severity gives you the chance to deal with the most important issues first They are broken out by severity category

Page 42: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

42

Looking at a Failure

• This simple event in DDM tells us what the problem was and when it occurred And it offers a possible solution

This example shows the basics of DDM

Page 43: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

43

Analysis Probes

• Configure DDM using probe documents in EVENTS4.NSF Otherwise known as the Monitoring Configuration database

• A probe is a check, or set of checks, configured to run against one or more servers, databases, and services A probe returns status and results to the Domino Domain

Monitoring Database – DDM.NSF The DDM.NSF is created automatically when an R7/R8 server

starts You can create multiple probes for each feature area

• Just remember that you don’t need probes to use DDM!

Page 44: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

44

What’s in These Probe Documents?

• Probe type and probe subtype For example, Security is a probe type

One of its probe subtypes is Best Practices This combination of probe type and probe subtype creates

a Security probe

Page 45: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

45

Extra Information About the Probe Is Provided

• These probe documents also contain a general description of the probe, its purpose, and its intended use

Page 46: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

46

Demo of Domino Domain Monitoring

Page 47: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

47

Plenty of Cool Probes

• R8 gives us 58 default DDM probes to work with R7 gives us 48 – still plenty to get us started

• You can get probing as soon as R7/R8 is up• Just plug in your server info to get DDM started

You can also create new probe documents Define and customize your own probes

Page 48: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

48

Zeroing in on Probes

• We’re going to focus on three probes that have a high value in almost every Domino domain: Application probes Web probes Security probes

Page 49: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

49

Agents Are Tracked by Application Probes

• Application probes monitor agents None of these probes can be scheduled

They are all real-time monitors• Application probes have the following subtypes:

Agents behind schedule Agents ranked by CPU usage Agents ranked by memory usage Long-running agents

Page 50: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

50

Web Probes

• Web probes provide results as a Domino Event document in the DDM database

• There are two subtypes: Configuration probe

Verifies the accuracy of a Web server’s configuration Best Practices probe

Compares servers’ configurations against a known good server

• The default schedule for these probes is once a month

Page 51: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

51

The Best Practice DDM Report

• A DDM document details the values that don’t match the Best Practice values recommended by Lotus And provides information on how to resolve any issues

Could be corrections to server functionality, server performance, or server security

This helps improve overall security and Web performance

Page 52: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

52

The Five Security Check Probes

• Best Practices Compares your settings to Lotus recommended settings

• Configuration Compares settings to those in a specified “good” Server doc

• Database ACL Monitors access that groups and individuals have

• Database Review Reviews the security properties for a specified database

• Security Review Generates a report on the security settings specified in the

Specifics tab of the probe document

Page 53: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

53

The DDM Database Has More Than What’s on the Surface

• The DDM DB lets you review probable causes and possible solutions for reported DDM events And it lets you assign them to an administrator for resolution You can open a link to the appropriate database from which

you can resolve a reported event

Page 54: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

54

Explaining the Issue

• In some cases, the event will give details on the problem and provide a link to help you get to the place where the problem occurred One document might represent several occurrences of the

same event

Page 55: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

55

Resolving Problems

• In some cases a Choose Solution button is available Use it to help you automatically implement a particular solution

to resolve an event

Page 56: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

56

Using the Assign Button

• You can assign the event to a team member and add comments about the task using the Assign button Or you can simply assign the event to yourself

Page 57: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

57

What We’ll Cover …

• Understanding the fundamentals of statistic collection• Exploring the native Notes monitoring system• Taking advantage of server activity logging and trending• Exposing message tracking’s capabilities• Using the Domino Configuration Tuner• Dominating your domain with DDM• Customizing STATREP.NSF to pull out hidden data• Analyzing statistics for proactive problem solving• 10 statistics to monitor for top performance and stability• Wrap-up

Page 58: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

58

Why Would the STATREP.NSF Need Customization?

• STATREP.NSF database is surprisingly lacking in value• There are eight views of statistical data

Calendaring & Scheduling and Clusters views lack any substantial information

The Communications view strictly shows dial-up stats Fortunately we can add views to help us significantly

Page 59: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

59

Everything Is Everywhere

• Keep in mind that every single statistic that is generated is contained in every document in the Monitoring Results database You just can’t see it all because it’s not in views or documents

And views are the most important place to have it because that’s where it gives you the ability to compare samples And analyze trends

Page 60: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

60

Agents No Longer Have Secrets

• Our modified AD2K10STATREP.NTF template has lots of new views to keep you informed

• Are you wondering how many minutes the agent manager used running resource-hungry agents? Now you have a view to help you see it hour by hour

No more guesswork about server conditions

Page 61: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

61

Caveats About the Admin 2010 version of STATREP.NSF

• Some views expose the following statistics Agent.Daily.UsedRunTime and Agent.Hourly.UsedRunTime

This stat generated the agent runs in seconds• Some versions of Domino produce this stat as a text field, others

as a numeric field A formula converts it to a numeric field

This might not be necessary in your domain @If(@IsAvailable(Agent.Hourly.UsedRunTime);

(@TextToNumber(@LeftBack(Agent.Hourly.UsedRunTime;8))/60);“N/A”)

The formula also converts the statistic from seconds to minutes

Page 62: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

62

One More Caveat

• A few views display disk utilization statistics such as: Platform.LogicalDisk.2.AvgQueueLen.Avg

• Disk statistic names vary from platform to platform AIX and iSeries systems can have much longer device names Even in the Wintel platform they can be listed as:

Logical disks Physical disks

• Be sure to check to see how it is represented in your domain You might find it necessary to customize all disk views for your

own environment

Page 63: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

63

What We’ll Cover …

• Understanding the fundamentals of statistic collection• Exploring the native Notes monitoring system• Taking advantage of server activity logging and trending• Exposing message tracking’s capabilities• Using the Domino Configuration Tuner• Dominating your domain with DDM• Customizing STATREP.NSF to pull out hidden data• Analyzing statistics for proactive problem solving• 10 statistics to monitor for top performance and stability• Wrap-up

Page 64: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

64

Transactions Per Hour and Concurrent Users Per Hour

• These can be a very important statistics if you are thinking about consolidation Our customized Monitoring Results database lets you export

this and other data for analysis in your favorite spreadsheet• The idea is to compare your servers statistically

To find out: How many users are hitting these systems concurrently? How many transactions are these servers turning over?

• We want to want these things on a PER HOUR basis IBM’s STATREP.NSF doesn’t have this info, it must be extracted

from the cumulative statistics

Page 65: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

65

Start by Going to the Export Transactions/Users View

• Analysis starts with Export Transactions/Users view Do not hesitate to add new views to Monitoring Results

I don’t change the old ones, I just add new ones• Note that Server.Trans.Total is a cumulative stat

And the Trans/Hour column is blank We have a custom formula to apply to this column after the

data is exported into the spreadsheet you use

Page 66: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

66

Next, Export View to CSV File

• Export the contents of the view to a CSV file Call the file is C:\delme.csv so you can find it quickly

It’s overwritten each time I do an export Make sure you include the View titles

The import is fast

Page 67: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

67

Next, Open the Special Spreadsheet

• Start your spreadsheet (in this case it’s Microsoft Excel) and open the spreadsheet on the CD Admin2010MasterFormulaStatExports.xls

• The spreadsheet contains the formulas that will help to break down server activity into per hour averages Don’t worry about the #value errors

Then open the C:\delme.csv file

Page 68: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

68

We’re into MS Excel for the Analysis

• Next, we open the C:\delme.csv in Excel Excel knows we want to import it because it’s a CSV file

It opens quickly with no further prompts

Page 69: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

69

The Data Is Now in Excel

• The view brought it in sorted by Server and Collection Time Remember, we’d like to see the number of transactions

per hour With the way this spreadsheet is set up, it’s pretty easy to

construct a formula where we simply: Subtract the last hour’s number of transactions from this

hour’s transactions to get the number per hour

Page 70: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

70

Tricky Calculations – Server Restarts and Stuff

• Except sometimes when servers are restarted Then the cumulative stats start over

• Or when the next server starts being listed in the statistics You have to be careful not to subtract without paying attention

to these things

Page 71: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

71

Special Formulas to the Rescue

• To cope with the anomalies in the way the data is listed, I built a few fairly straightforward formulas you can use on your spreadsheets They are in the master formula spreadsheet

Just copy it from the cell

Page 72: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

72

Insert the Copied Cells

• Move to the delme.csv spreadsheet• Then use the Insert menu to insert the copied cells into your

spreadsheet Move the cells to the right or down to get them out of the way

You’ll be copying the proper formula into your spreadsheet• Copy that formula down your entire column of data

Save your spreadsheet as an XLS

Page 73: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

73

Create a Pivot Table or Its Equivalent

• We’re going to make a Pivot Table with our data The Pivot Table will take our data and let us easily manipulate it

and graph it Select all the data, including the column titles, and use the

menu to select PivotTable and PivotChart Report

Page 74: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

74

Take Defaults

• If you’re new at this, just take the default answers for the questions Excel asks

Page 75: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

75

Drag Server to the Column Tops

• It drops you into the Pivot Table function where you have a field list to drag and drop into the table Drag Server to the column top and Hour to the row

names column

Page 76: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

76

Drag the Data to the Center of the Table

• Drag the transaction per hour data to the table It defaults to the “Count of Trans/Hour”

But you’ll want to change it to Average, and format it to look nice too

Page 77: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

77

There You Have It

• You now have a nice breakdown of the average number of transactions per hour, per server

Page 78: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

78

Graphing Your Results

• This is where it really gets cool Just click on the Chart Wizard

And …

Page 79: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

79

Bingo, You Have an Instant Chart

• Stacked bar isn’t what we want, but that was quick!

Page 80: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

80

Line Graph Coming

• Use the icon on the right to change graph types A line graph is quite effective, most of the time

Page 81: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

81

Here’s the Line Graph You Ordered

• Simple, fast, and straightforward This is an average of transactions per hour

Page 82: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

82

Average Number of Concurrent Users/Hour

• This is an extremely valuable stat, especially when consolidating However, when a user connects, they stay connected

And are not dropped until they are inactive for four hours This makes it impossible to track actual concurrency

• Use this NOTES.INI server parameter to prevent idle sessions from taking up valuable resources Server_Session_Timeout = 30

• Sets number of minutes of inactivity after which a server automatically terminates network and mobile connections

Users will not have to re-enter a password if they become active after the time limit

• The minimum recommended setting is 30-45 minutes

Page 83: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

83

Change the Field List Easily

• It’s easy to remove the field Trans/Hour off the chart and replace it with the Average of Users

Page 84: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

84

Demonstration: Exporting Data and Creating Pivot Table

Page 85: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

85

What We’ll Cover …

• Understanding the fundamentals of statistic collection• Exploring the native Notes monitoring system• Taking advantage of server activity logging and trending• Exposing message tracking’s capabilities• Using the Domino Configuration Tuner• Dominating your domain with DDM• Customizing STATREP.NSF to pull out hidden data• Analyzing statistics for proactive problem solving• 10 statistics to monitor for top performance and stability• Wrap-up

Page 86: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

86

Disk Statistics

• Disk.D.Free Track whatever drive holds your data Monitor based on a percentage or an actual value

Depends on the size of the drive Larger drives are sometimes better to track by actual

space left rather than a percentage because the percentage you’d want free is very small

Page 87: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

87

Critical Mail Statistics

• Mail.dead Mail that cannot be delivered and cannot be returned to

the sender Watch for greater than 100

• Mail.Waiting Number of outgoing mail messages waiting to be either

delivered locally or transferred in MAIL.BOX Useful for detecting problems with the mail Router

Depends on the server SMTP server might watch for > 200 NRPC mail server might watch for > 50

Page 88: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

88

Critical Mail Statistics (cont.)

• Mail.TotalPending May be more reliable than Mail.Waiting which depends on the

router task If the router is not running, Mail.Waiting is not updated

The Mail.TotalPending count includes both dead and held mail The Mail.Waiting figure does not include these items

Updated by the server task to reflect the current number of messages that are pending delivery

It is updated by the server task every five minutes and its value therefore becomes “stale” Threshold depends on type of server

SMTP threshold will again be higher than NRPC server

Page 89: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

89

Critical Mail.Box Statistics

• Mail.Mailbox.Accesses Total number of times that threads accessed any mailbox on

the server• Mail.Mailbox.AccessConflicts

Total number of times a thread attempting to access a mailbox had to wait because the number of concurrent threads exceeded the number of mailboxes configured

(Mail.Mailbox.AccessConflicts / Mail.Mailbox.Accesses) x 100 Should be less than 2

Page 90: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

90

Platform Disk Statistics

• The disk specification will vary by server• Platform.LogicalDisk.1.AvgQueueLen

AvgQueueLen the average number of both read and write requests that were queued for all logical disks on all physical disks during the sample interval Should not consistently rise above 2

• Platform.LogicalDisk.1.PctUtil PctUtil percent of time the drives are busy reading or writing

Watch for disks constantly hitting above 80% • Track both of these statistics in Notes

Follow up with performance monitoring on the OS level

Page 91: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

91

Important Cluster Statistics

• Replica.Cluster.SecondsOnQueue The number of seconds that the last database spent waiting in

the replication queue before being replicated Generally this should be under 20

• Replica.Cluster.WorkQueueDepth Number of items in the replication queue

Should be under 10• We’ll talk more about this during the session on Domino

clustering

Page 92: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

92

Bonus 11th Statistic for Transaction Logging

• Database.RM.SinceStartup.Critical.Log.Times Shows how many times your system ran critically log full

Should be zero If non-zero, consider increasing log size or number of times

that Domino filled the allocated log space If greater than zero, consider increasing log size If circular and >0, the system has a long-running transaction

If this number creeps into the hundreds, your server is probably going to crash soon

Page 93: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

93

What We’ll Cover …

• Understanding the fundamentals of statistic collection• Exploring the native Notes monitoring system• Taking advantage of server activity logging and trending• Exposing message tracking’s capabilities• Using the Domino Configuration Tuner• Dominating your domain with DDM• Customizing STATREP.NSF to pull out hidden data• Analyzing statistics for proactive problem solving• 10 statistics to monitor for top performance and stability• Wrap-up

Page 94: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

94

Resources

• Description of HTTP statistics for a Lotus Domino server www-1.ibm.com/support/docview.wss?uid=swg21207314

• How is the mail delivery latency formula calculated? http://www-01.ibm.com/support/docview.wss?uid=swg21279936

• Download location for Domino Configuration Tuner http://www-01.ibm.com/support/docview.wss?rs=463&

uid=swg24019358• Lotus Education on Demand: Domino Domain Monitoring (DDM)

http://www-01.ibm.com/support/docview.wss?uid=swg27007060

Page 95: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

95

7 Key Points to Take Home

• Make sure all EVENTS4.NSF are the same replica ID• Use activity trending to build ACTIVITY.NSF and discover

databases that aren’t being used• Use the Domino Configuration Tuner to discover settings that

improve performance and security• Create a collection hierarchy for DDM in the EVENTS4.NSF

Monitoring Configuration database• You don’t have to run probes to take advantage of Domino

Domain Monitoring• Add views to your Monitoring Results database to pull out hidden

information• Use SERVER_SESSION_TIMEOUT=30 to ensure idle users don’t

take up valuable server resources

Page 96: Admin Aspirin — Reduce Performance Headaches Through Proper Domain Monitoring

96

Your Turn!

How to contact me:Andy Pedisich

[email protected]