31
Technology Exploitation and the Bottom Line An Introduction to Web Analytics for Performance Analysts and Capacity Planners Anna Long Web Analytica SM

Cmg10 Web Analytics Pres Am Long

Embed Size (px)

DESCRIPTION

This CMG \'10 presentation discusses web analytics topics of interest to performance analysts and capacity planners.

Citation preview

Page 1: Cmg10 Web Analytics Pres   Am Long

Technology Exploitation

and the Bottom Line

An Introduction to Web Analytics for Performance Analysts and Capacity Planners

Anna Long Web AnalyticaSM

Page 2: Cmg10 Web Analytics Pres   Am Long

1 - 2

Agenda

Web Analytics Definition Current Web Analytics Technologies Web Analytics Applied to Marketing Applying Web Analytics to Performance

Management and Capacity Planning

Page 3: Cmg10 Web Analytics Pres   Am Long

Defining Web Analytics

The Web Analytics Association (WAA) Definition:

Web Analytics is the measurement, collection, analysis and reporting of Internet data for the purposes of understanding and optimizing Web usage. [“Web Analytics Definitions” – WAA Standards Committee, 2008]

Page 4: Cmg10 Web Analytics Pres   Am Long

Current Web Analytics Technologies

Major technologies in use today: Web server logging ( such as Microsoft IIS logging, Apache

Access logging) Page tagging (such as Adobe SiteCatalyst, Google Analytics)

Other technologies in use: Network Appliances (such as Pion Systems by Atomic Labs) Tag Management Systems (such as Ensighten Manage)

Page 5: Cmg10 Web Analytics Pres   Am Long

Web Server Logging – How Does it Work?

Web Servers such as Apache or Microsoft IIS record activity as requests are made and served.

Web servers provide general-purpose logging at a very detailed level.

Data must be cleaned and organized to make it usable for web analytics.

Page 6: Cmg10 Web Analytics Pres   Am Long

Web Server Logging – A Log Record Example*

204.243.130.5 - - [26/Feb/2001:15:34:52 -0600] "GET / HTTP/1.0" 200 8437 "http://xyz.com/crawler?category=dimensional+modeling" "Mozilla/4.5 [en] (Win98; I)"

IP Address

Timestamp

Request

Status and Bytes

Referring Page

*Example log record from Sweiger2002

Page 7: Cmg10 Web Analytics Pres   Am Long

Web Server Logging – Processing Complexities

One page request generates multiple log records Log records may be ordered by completion time, not

initiation time Records for different page requests can be interleaved in log Request for page components may be completed by multiple

servers Web 2.0 technologies such as AJAX (Asynchronous

JavaScript and XML) add further complexity

User activity can be difficult to map accurately with web server log records:

Page 8: Cmg10 Web Analytics Pres   Am Long

Web Server Logging –Making Sense of It All

The granularity of recorded activity is frequently non-optimal, either too fine or too coarse.

IP addresses do not always map to unique visitors. Server logs lack visibility into client-side activity.

Several characteristics of web server logging limit its usefulness for analyzing website user activity:

Page 9: Cmg10 Web Analytics Pres   Am Long

Web Page Tagging –How Does It Work?

User clicks on a link on a search engine results webpage

Web server delivers requested webpage with tagging code (usually a JavaScript snippet)

JavaScript tag execution creates cookies and sends logging info to a web analytics server

Logged data is stored in a web analytics server and retrieved for dashboards and reporting queries

Page 10: Cmg10 Web Analytics Pres   Am Long

Web Page Tagging –A JavaScript Tag Example

<script type="text/javascript">

  var _gaq = _gaq || [];  _gaq.push(['_setAccount', 'UA-12345-1']);  _gaq.push(['_trackPageview']);

  (function() {    var ga = document.createElement('script'); ga.type = 'text/javascript'; ga.async = true;    ga.src = ('https:' == document.location.protocol ? 'https://ssl' : 'http://www') + '.google-analytics.com/ga.js';    var s = document.getElementsByTagName('script')[0]; s.parentNode.insertBefore(ga, s);  })();

</script>

Account Number

*Example JavaScript Tag from Google2010e

Page 11: Cmg10 Web Analytics Pres   Am Long

Web Page Tagging –Implementation Complexities

Several characteristics of web page tagging complicate its successful application for analyzing website user activity: Complex architectures (such across multiple

subdomains or third-party ecommerce subsystems) can be difficult to tag for accurate data collection

Proper maintenance of high-volume tagging for a major online property can be overwhelming (especially when considering tagging of webpages and in-bound links)

Page 12: Cmg10 Web Analytics Pres   Am Long

Web Page Tagging – Making Sense of It All

Tag execution can cause webpages to hang (using asynchronous snippets in place of synchronous snippets can help)

Users can disable JavaScript execution (their visits will be invisible to tag monitoring)

Users can delete cookies (their visits will be incorrectly categorized)

User activity can be difficult to map accurately with web page tagging records:

Page 13: Cmg10 Web Analytics Pres   Am Long

How Web Analytics is Used in Marketing

developing customer profiles managing online marketing campaigns improving conversion

Even though web analytics technologies have drawbacks, they still provide useful results. For example, marketing groups have been very successful using analytics for these activities:

Page 14: Cmg10 Web Analytics Pres   Am Long

Customer Profiling

Segmentation of the customer population Persona development RFM (Recency/Frequency/Monetary Value) Analysis

Web analytics can support the activities of customer profiling, which enable marketing to tailor offerings to particular customer needs.

Page 15: Cmg10 Web Analytics Pres   Am Long

Measuring Response for Online Marketing Campaigns

Each in-bound “call to action” link can be uniquely tagged. Search engine advertising programs such as Google

AdWords add extensive in-bound link tagging to identify the characteristics of the visit and visitor

Analytics software collects this data, which the web analyst process and analyzes with the following aims:

Identify which campaigns are most and least successful Determine which aspects of successful campaigns to replicate Determine whether to revise less successful campaigns or pull the

plug

Online marketing makes extensive use of web analytics.

Page 16: Cmg10 Web Analytics Pres   Am Long

Improving Conversion

Conversion is the event where a website visitor becomes a customer.

The process through which conversion takes place is called a “conversion funnel”.

The conversion process is viewed as a leaky funnel, meaning some people abandon the process, i.e., they “leak out” at each step along the way.

Marketers attempt to improve conversion by reducing the leakage.

Improving conversion is a major goal of online marketing.

Page 17: Cmg10 Web Analytics Pres   Am Long

Sample Conversion Funnel(Part 1)

Page 18: Cmg10 Web Analytics Pres   Am Long

Sample Conversion Funnel(Part 2)

Page 19: Cmg10 Web Analytics Pres   Am Long

Conversion FunnelAnother View

Graphic courtesy of WAA colleague Bob Russotti

Page 20: Cmg10 Web Analytics Pres   Am Long

Web Analytics andPerformance Management /

Capacity Planning

Managing the impact of web analytics on performance and capacity requirements

Applying web analytics to enrich performance and capacity planning activities

As a performance analyst or capacity planner, your involvement with web analytics can fall into two categories:

Page 21: Cmg10 Web Analytics Pres   Am Long

Web Analytics –How It Can Impact Performance

Placement of JavaScript snippets Planning for in-house analytics data

collection/storage Selecting sampling strategies

As a performance analyst or capacity planner, you may be asked to consult in areas such as these:

Page 22: Cmg10 Web Analytics Pres   Am Long

Placing JavaScript Snippets – No Single Optimal Solution

Snippet placement can cause problems no matter where in the page code it is placed:

Placement anywhere on a webpage can slow rendering if the analytics server is down.

Placement at the bottom of webpage code can cause data to be lost when visitor leaves a page before the snippet executes.

Placement at the top of webpage code can cause passing of custom variables to fail.

Page 23: Cmg10 Web Analytics Pres   Am Long

Bringing Web Analytics In-House – To SAAS or Not to SAAS?

Plan for storage if bringing collected data in-house (Google recommendation: minimum of 10 GB for each million pageviews (but plan for much more))

Architect/plan for servers and connectivity if bringing collection servers in-house

Most major web analytics tools offer a SAAS version, but some organizations choose to keep their web analytics data or the entire collection process in-house. With in-house, there are performance/capacity challenges to address:

Page 24: Cmg10 Web Analytics Pres   Am Long

Sampling All That Data – Your Judgment Call

Instrumenting a subset of the website (not recommended)

Collecting a subset of tracked events Querying a subset during analysis (if must sample

this approach is most desireable)

To stay within performance, resource, or budget restrictions, some organizations resort to data sampling. Several sampling approaches are commonly applied:

Page 25: Cmg10 Web Analytics Pres   Am Long

We’re From Web Analytics – We’re Here to Help!

Diagnosing problems Alerting for anomalies Developing benchmark workloads Developing forecasts

While web analytics can add complexity to performance and capacity management processes, it can also provide insights to enhance performance and capacity planning activities:

Page 26: Cmg10 Web Analytics Pres   Am Long

Diagnosing Performance Problems

When a performance problem cannot be reproduced in a testbed

When performance monitors do not have the granularity or reach to “see” into a web application

When a performance problem is caused by a particular mix of configuration/user characteristics not readily isolated by other tools

When traditional performance monitoring/testing tools fall short in tackling web-based problems, web analytics data can provide insights not available from other tools. For instance:

Page 27: Cmg10 Web Analytics Pres   Am Long

Alerting For Anomalies

Start by setting up an alert “ladder” in your web analytics tool

If receiving alerts, you can dig into the web analytics data to learn characteristics of the activity spike: Peak referrers Peak geographic regions Time zone rolling

Web analytics tools can be configured to support proactive performance management by alerting for situations that could build into performance problems.

For example, managing demand when a video goes viral:

Page 28: Cmg10 Web Analytics Pres   Am Long

Developing Benchmark Workloads

Ecommerce system example: one million purchases per day does not equate to one million visitors per day.

Web analytics data can reveal conversion funnel patterns of ecommerce to help you produce a more realistic visitor pattern.

Web analytics data can also identify customer segments so each can be represented with its own unique characteristics.

Web analytics data can provide key insights to make simulated workloads a more accurate representation of operational demand.

Page 29: Cmg10 Web Analytics Pres   Am Long

Developing Forecasts

Many of the features that make web analytics so useful for benchmark development also apply for forecasting.

Caution: need to remember that data from page tagging undercounts the volume of activity. Use it for revealing trends and identifying customer segments. Augment that trend analysis with analysis of web server log data to estimate volumes.

The historical perspective gained from web analytics data and tools can help you develop better forecasts of future system demand.

Page 30: Cmg10 Web Analytics Pres   Am Long

Conclusions

Web analytics tools play an important role in the information technology architecture of virtually any organization with a major web presence.

While web analytics tools traditionally support marketing, they can also be applied much more widely to manage resources.

Web analytics tools can support the work of performance analysts and capacity planners, enhancing their work products and making their projects more successful.

Web Analytics… they’re not just for marketing any more!

Page 31: Cmg10 Web Analytics Pres   Am Long