Icpe2015 weiyi shang (1)

Automated Detection of Performance Regressions Using Regression Models on

Clustered Performance Counters

Weiyi Shang, Ahmed E. HassanQueen’s University, Kingston, Canada

Mohamed Nasser, Parminder FloraBlackBerry, Waterloo, Canada

What is a performance regression?

Old version New version

Does the new version have worse performance than the old version?

How to detect performance regression?

Old Version

New version

requests

Performance counters

Large software systems generate large amounts of performance counters

Performance engineers pick counters to compare using T-test

Picking the wrong counters will not detect performance regression!

Are these counters significantly

different between old and new

versions?

Performance engineers build a model using all counters

Target Counter Independent Counters

CPUMemory I/O read I/O write

Target counter

Select target counter

Build model

Verify model

Model Prediction error

Target counter is selected by experience!

Model-based performance regression detection

Selecting a wrong target counter will fail to detect performance regression

Memory-related performance regressions will not be detected by this model!

Memory has a low correlation with CPU

I/O write has a high correlation with CPU

Our approach

Target counter

Build model

Verify model

Reduce counters

Reduced counters

Clustercounters

Clusters of counters For each cluster

Build model

Verify model

Reduce counters

ClusterCounters

Removing zero variance counters

Removing redundant counters

Available CPU cores:4, 4, 4, …4, 4

I/O write byte/sec= a*I/O write op/sec +b*CPU

Step 1: Reduce counters

Step 2: Cluster counters

Hierarchical clustering Dendrogram cutting

CPU I/O Memory CPU I/O Memory

Build model

Verify model

Reduce counters

ClusterCounters

Step 3: Select target counter

Build model

Verify model

Reduce counters

ClusterCounters

CPU from old version

CPU from new version

Kolmogorov–Smirnov test

P-VALUE

We select the counter that has significant difference between two versions with highest certainty.

We select the counter with

smallest p-value

Step 3: Build model Select target counter

Build model

Verify model

Reduce counters

ClusterCounters

We build a linear regression model.

+ c+ ba

Step 4: Verify model

Linear regression models for each cluster

Calculate prediction error

New version counters left in cluster 1

Linear regression model for cluster 1

… …

Mean prediction error

< threshold

> threshold

Build model

Verify model

Reduce counters

ClusterCounters

Case study: subject systems

DELL DVD Store

Enterprise Application

CPU overhead Memory overhead I/O overhead

Removing text index Removing column index

Real-life performance regression

Can our approach detect performance

regressions?

AccuracyApplicability

How many target counters does our

approach pick?

Is our approach better than traditional

approaches?

Comparison

Our approach picks a small number of target counters

DELL DVD Store

Our approach picks 2 to 4 target counters.

Performance engineers cannot pick target counters based on experience

Picked target counters are different across runs of our approach.

regressions?

approach pick?

approaches?

Comparison

Our approach picks a small number of target counters.

regressions?

approach pick?

approaches?

Comparison

Our approach can detect performance regressions

DELL DVD Store

Max prediction error4% to 11% without regression

24% to 1622% with regression

Our approach is not heavily impacted by the choice of threshold value.

Without performance regression

9% Max prediction error

With performance regression

916%Max prediction error

regressions?

approach pick?

approaches?

Comparison

Our approach can detect performance regressions and is not heavily impacted by threshold value.

regressions?

approach pick?

approaches?

Comparison

Our approach outperforms picking one target counter to build a model

DELL DVD Store

With performance regression

Building one model using memory as target counter may fail to detect performance regression

T-test does not perform well in detecting performance regressions.

DELL DVD Store

3 to 6 counters are flagged

There are a large number of counters with significant differences in the T-test results even though no regressions exist.

Enterprise Application

32% counters are flagged

regressions?

approach pick?

approaches?

Comparison

Our approach out performs traditional approaches.

How to detect performance regression?

Old Version

New version

requests

Performance counters

Target counter

Build model

Verify model

Target counter is selected by experience!

Model-based performance regression detection

Our approach

Target counter

Build model

Verify model

Reduce counters

Reduced counters

Clustercounters

Clusters of counters For each cluster

regressions?

approach pick?

approaches?

Comparison

Our approach out performs traditional approaches.

weiyishang.wordpress.comswy@cs.queensu.ca

Icpe2015 weiyi shang (1)

Documents

Xia, Shang, & Zhou Dynasties Xia, Shang, & Zhou Dynasties

Chu weiyi pcp_ppp_slideshow_pdf

Shang Dynasty Gn

Liying Yu, Weiqi Tang, Weiyi He, Xiaoli Ma, Liette Vasseur ... · PDF filePUBLISHED VERSION Liying Yu, Weiqi Tang, Weiyi He, Xiaoli Ma, Liette Vasseur, Simon W. Baxter, Guang Yang,

The Shang Dynasty

Dinastia shang

Glossary - Springer978-1-4039-8313-8/1.pdf · Lao Tzu Lao Zi Lao Tsu ... Shang Yang (Lord Shang) Shang Jun Shang Yang Shanxi Shansi Shanhsi ... Tai KungTao Dao Tao Te De Te 234 Glossary

Continuous validation of performance test workloadszmjiang/publications/asej2016_syer.pdf · 2016. 4. 4. · B Mark D. Syer mdsyer@cs.queensu.ca Weiyi Shang swy@cs.queensu.ca Zhen

Automated Approaches for Reducing the Execution Time of ... · An Automated Approach for Recommending When to Stop Performance Tests. Hammam M. Alghamdi, Mark D. Syer, Weiyi Shang

Source: gsill on slideshare...Geography Shang Dynasty 1766-1122 B.C. Ruled by Shang dynasty @ Final Shang capital YONG State under Shang control Thais Culture group Desert China's

Shang Salcedo Place

Continuous validation of performance test workloadszmjiang/publications/asej2016_syer.pdf · B Mark D. Syer mdsyer@cs.queensu.ca Weiyi Shang swy@cs.queensu.ca Zhen Ming Jiang zmjiang@cse.yorku.ca

NEOLITHIC CHINA: BEFORE THE SHANG DYNASTYg380/3.7-Neolithic-2010.pdf3.7 NEOLITHIC CHINA: BEFORE THE SHANG DYNASTY. The problem of the origins of the Shang polity . The Shang state

1 Opinion Retrieval from Blogs Wei Zhang, Clement Yu, and Weiyi Meng (2007 CIKM)

MORPHOLOGICAL FILTERS - wayne-weiyi-chen.github.iowayne-weiyi-chen.github.io/dip/p1-5.pdf · information digging. In the first part of this problem, we will perform shrinking and

Shang Han Lun

Química general shang

Shang dynasty - Wikipedia, the free encyclopediawildehistory.weebly.com/uploads/1/6/7/0/16706304/shang_dynasty... · Literal meaning Shang dynasty Shang dynasty ... However during

The Shang Dynasty - Equality Charter Middle Schoolecmshistory.weebly.com/uploads/8/8/1/0/88105734/china1.pdfThe Shang Dynasty The Shang d ynasty held power from around 1600 to 1046

Ase2010 shang