117
EXCEL STATISTICAL FUNCTIONS T-Tests Chi-Squared Average Z-Test Weighted Average Percentile Probabilities Standard Normal Distribution Rank y=mx+b Regression

EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

EXCEL STATISTICAL FUNCTIONS

T-T

ests

Chi-Squared

Av

erag

eZ-Test

Weighted Average

Percentile

Probabilities

Standard Normal Distribution Ran

k

y=mx+b

Regression

Page 2: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Average vs. Weighted Average

AverageWeighted

Average

Page 3: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Average Price

Transaction

DateItem

Price Per

Unit

Units

Sold

5/7/2007 Coffee $ 20 500

6/7/2007 Coffee $ 25 750

7/6/2007 Coffee $ 35 200

8/9/2007 Coffee $ 30 300

Total: 20+25+35+30 = 110

Average Price: 110 / 4 = 27.50

Sum of Prices / Count of Prices

Page 4: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Weighted Average Price

Date ItemPrice Per

Unit

Units

Sold

5/7/2007 Coffee $ 20 500

6/7/2007 Coffee $ 25 750

7/6/2007 Coffee $ 35 200

8/9/2007 Coffee $ 30 300

500+750+200+300 = 1,750

Weighted Average Price: 44,750 / 1,750 = $25.57

∑(Price per Unit * Units Sold) /∑ Units Sold

= 10,000

= 18,750

= 7,000

= 9,000

44,750

XXX

X+

+

Page 5: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

=SumProduct(Array1,Array2)

Multiplies corresponding numbers in two or more different ranges together then sums them.

Date ItemPrice Per

Unit

Units

Sold

5/7/2007 Coffee $ 20 500

6/7/2007 Coffee $ 25 750

7/6/2007 Coffee $ 35 200

8/9/2007 Coffee $ 30 300

→ 20*500 = 10,000

→ 25*750 = 18,750

→ 35*200 = 7,000

→ 30*300 = 9,000

44,750Array 1 Array 2

Page 6: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Wt. Avg. with

SumProduct() Exercise

“Weighted_Avg_SumProduct” sheet

Page 7: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Wt. Avg. with

SumProduct() Solution

“Weighted_Avg_SumProduct” sheet

Page 8: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Summations

10

1

2i

Page 9: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Summations

Sum(12 , 22 , 32 , 42 , 52 , 62 , 72 , 82 , 92 , 102) = 385

10

1

2iStartingNumber

EndingNumber Mathematical

Operation

Page 10: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Using Sum() as an Array

Press Control + Shift + Enter to make Excel Process cells in a range one at a time.

Sum(A1:A10^2)

Sum(A12,A22,A32,A42,A52,A62,A72,A82,A92,A102)

Sum(12,22,32,42,52,62,72,82,92,102)

Sum(1,4,9,16,25,36,49,64,81,100) 385

Page 11: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Summation Exercises

“Summations” sheet

Page 12: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Summation Solutions

“Summations” sheet

Be sure to press Control + Shift + Enter after typing or editing the formulas.

Page 13: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Factorials in Excel

!)!1(

!1

kn

kn

Page 14: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Factorial Exercise using

Combinations

You have an unlimited supply of Coconuts and Pineapples. How many different ways can you fill a bowl which will contain exactly 3 items?

Page 15: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Factorials Examle: Combinations

(With Replacement, Order Not Important)

!)!1(

!1

kn

kn

n = The number of different items available (2).K = The number of items selected (3).

Page 16: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Factorials (!)

4 * 3 * 2 * 1 4! = = 24

6 * 5 * 4 * 3 * 2 * 16! = = 720

=Fact(4)4! = = 24

=Fact(6)6! = = 720

Page 17: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Factorial Exercise with

Combinations

“Factorials” sheet

Page 18: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Factorial Exercise with

Combinations: Solution

“Factorials” sheet

Page 19: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Frequency

Nu

mb

er

Occ

urr

ing

Grade

F D C B A

7

6

5

4

3

2

1

Page 20: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

>89.99 and <=100 ---------> A

>79.99 and <=89.99 ------> B

>69.99 and <=79.99 ------> C

>59.99 and <=69.99 ------> D

<= 59.99 ---------------------> F

We wish to determine the number of students getting an A, B, C, D and F based on the conditions below.

Page 21: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

=Frequency(Data Array, Bins Array)

This is the range of values you are analyzing.

This is your grouping.

The values listed are the maximums for the group and are inclusive.

In this example the groupings are:

>89.99 and <=100>79.99 and <=89.99>69.99 and <=79.99>59.99 and <=69.99<=59.99

Page 22: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

=Frequency(Data Array, Bins Array)

You must highlight your output area prior to typing the formula.

When done typing, you must press: Control +Shift + Enter

Page 23: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

=Frequency(Data Array, Bins Array)

You must:• Highlight F11:F15 First• Type the function• Press Control + Shift + Enter

“Frequency” sheet

Page 24: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

=Frequency(Data Array, Bins Array)

You must:• Highlight First• Press Control + Shift + Enter

Page 25: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Rank and Percentile

(Internal Ranking)

Page 26: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

RankRanking Scores

Rank.Eq()Ties Given the Same First Rank

Rank.Avg()Ties Given the Same Average Rank

Page 27: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

=Rank.Eq(Score,Range,Order)

Value being ranked

Set of values being ranked

0 Lower scores have lower ranking

1 Lower scores have higher rankings

=Rank.Avg(Score,Range,Order)

Value being ranked

Set of values being ranked

0 Low scores have low ranking

1 Low scores have higher rankings

Page 28: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Rank.Eq() Exercise

Rank our student’s scores in descending order (i.e Lower scores are assigned the higher ranks.)

“Rank_and_Percent” sheet

Page 29: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Percent Rank

Expressing a Score as a Percent of Scores it is Greater Than

PercentRank.Inc(Range of all Scores , Specific Score to be Ranked)The "Inc" indicates that the ranks include 0 and 100 in its calculation. The outcome is that the highest score gets 100% indicating that 100% of the scores are less than this score in question.

PercentRank.Exc(Range of all Scores , Specific Score to be Ranked)The "Exc" indicates that the ranks exclude 0 and 100. The outcome is that the highest score might get a 95% but it does not express the score as being better than 100% of the other scores. This is a more commonly used method of expressing percentiles.

Page 30: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

PercentRank.Exc() Exercise

PercentRank.Exc(Array,k)

Specific ScoreSet of Scores

“Rank_and_Percent” sheet

What Percent did of students did each student do better than?

Page 31: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Percentile

Returns the Sore for a Given Percentile (i.e. What is the minimum score to be in the 90th percentile?)

Percentile.Inc(Range of Scores, Specific Score)This returns the score associated with a given breakpoint. For example, you wanted to know what test score begins the 90th percentile. The "Inc" indicates that 0 and 100 percent are included when calculating the score associated with the percent given.

Percentile.Exc(Range of Scores, Specific Score)Just like "Percentile.Inc" except that it excludes 0 and 100 percent.

Page 32: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Percentile.Exc() Exercise

What is the minimum score to be in the 90th percentile?

“Rank_and_Percent” sheet

Page 33: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

CountIF() Exercise

What is the number of our students who are in the local 90th percentile?

“Rank_and_Percent” sheet

Page 34: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

CountIF(Criteria Rang , “Criteria”)

This returns the number of cells in your criteria range that matches your criteria. Note that you criteria must be in quotes unless you are referring to cell addresses.

Examples:• Number of cells with numbers greater or equal to 10: =CountIf(B2:B100,”>=10”)• Number of cells with numbers equal to the number in cell K11: =CountIf(B2:B100,K11)• Number of cells with numbers greater than the number in cell K11: =CountIf(B2:B100,”>”&K11)• Number of cells containing just “USC”: =CountIf(B2:B100,”USC”)• Number of cells containing “USC” and other characters: =CountIf(B2:B100,”*USC*”)• Number of cells in the range that are not blank: =CountIf(B2:B100,”<>”)• Number of cells in the range that are blank: =CountIF(B2:B100,””)• Number of cells in the range that are greater than average: =CountIF(B1:B100,”>”&Average(B2:B100))

If using a function here, the function must return the answer for a range of cells, not a single cell. For example, IsText(B2:B100) would not work because IsText() can only look at one cell at a time.

Page 35: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

CountIF() Exercise

What number of our students are in the local 90th percentile?

“Rank_and_Percent” sheet

Page 36: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Count() & Average() Exercise

What percent of our students are in the 90th percentile? What is the average score for our students?

Page 37: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Solutions to Rank.Eq(), PercentRank.Exc()

& Percentile.Exc() Exercises

“Percent_and_Rank” sheet

Page 38: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Next… we wish to compare our student scores to the National

Scores. To do that, we must first verify our scores are normally

distributed.

Page 39: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Normal Distribution

Page 40: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Creating a Histogram

“Rank_and_Percent” sheet

Creating a Histogram: Data – Data Analysis - Histogram

Page 41: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Standard Deviation

How spread out from the average is the data?

If your data is normally distributed then you can use Standard Deviation to see how spread out the data is.

Page 42: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Standard Deviation

How spread out from the average is the data?

For data that is normally distributed: 68% of your data will be within 1 standard deviation from the average. 95% of your data will be within 2 standard deviation from the average. 99% your data will be within 3 standard deviation from the average.

Page 43: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Standard Deviation

How spread out from the average is the data?

Stdev.P(Range)

Stdev.S(Range)

Standard Deviation of a Population

Standard Deviation of a Sample

“Normal_Distribution” sheet

Page 44: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Standard Deviation

How spread out from the average is the data?

“Rank_and_Percent” sheet

Our average is 50.7 and our standard deviation is 21. This means that:

68% of our students are within 21 points of the average (i.e. 29.7 to 71.7). 95% of our students are within 42 points (2 x 21) of the average (i.e. 8.7 to 92.7). 99% of our students are within 63 points (3 x 21) of the average (i.e. -12.3 to 113.7).

Note because of the thinness of the tails at 3 standard deviations, you should think of this as 0 to 100.

Page 45: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

How do our Local Test Score Compare

with National Test Scores?

To compare different curves, you can use the Standard Normal Distribution…

Page 46: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Standard Normal Distribution

The Standard Normal Distribution is the Normal Distribution adjusted so the mean falls on zero.

The image above shows test scores (0-150) plotted on a histogram. The data is normally distributed.

Page 47: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Standard Normal Distribution

and Z Scores

The data points are converted to Z-Scores and represent the number of standard deviations from the mean using this formula:

𝑍 𝑆𝑐𝑜𝑟𝑒 =𝐷𝑎𝑡𝑎 𝑃𝑜𝑖𝑛𝑡 − 𝐴𝑣𝑒𝑟𝑎𝑔𝑒

𝑆𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝐷𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛 𝑓𝑜𝑟 𝑡ℎ𝑒 𝑆𝑎𝑚𝑝𝑙𝑒

For example: • A Z-Score of 0 indicates that the score is the same

as the mean. • A positive Z-Score of 1.62 means that the data

point is 1.62 standard deviations above the mean.

• A negative Z-Score means it is below the mean.

Standardize(Data point, Mean, Standard Deviation)or

Page 48: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Standard Normal Distribution

and Z Scores

Z-Scores allow you to:

• Compare scores from different normal distributions.

• Combine scores from data with different means and standard deviations.

Page 49: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Standard Normal Distribution

Z Scores as Probabilities

Z-Scores are often converted to probabilities percentages to make them easier to understand. Z-scores are expressed as a 0% to 100% probability of occurring.

For example, a test score of 85 might turn out to have a z-score of 1.62 which in turn might have a corresponding probability of 94%. This indicates that getting a score of 85 means you did better than 94% of the population. It also can be taken to mean that if you randomly sample a score, there is a 94% probability that it will be less than 85.

Score of 85

Z Score of 1.62

94% Probability

Page 50: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Norm.Dist()

Return a Probability for a Given Data Point

Page 51: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Norm.Dist() Exercise

Convert Data Point to Probability Percent

The National Mean is 65 with a Standard Deviation of 15

Where do our students fall in the National Percentiles?

“Rank_and_Percent” sheet

Page 52: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

“Rank_and_Percent” sheet

Norm.Dist(Data point , National Mean, National Standard deviation, Cumulative?)Returns a

probability for a given data point.

Cumulative is set to either True or False: True - Excel returns the percentage of students whose scores were less than yours.False - Excel returns the percentage of students who have the same score as yours.

Norm.Dist() Exercise

Convert Data Point to Probability Percent

The National Mean is 65 with a Standard Deviation of 15

Where do our students fall in the National Percentiles?

Page 53: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

“Rank_and_Percent” sheet

Solution

Norm.Dist() Exercise

Convert Data Point to Probability Percent

The National Mean is 65 with a Standard Deviation of 15

Where do our students fall in the National Percentiles?

Page 54: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Norm.Inv()

Return a Data Point for a Given Probability

Page 55: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Norm.Inv() Exercise

Return Data Point for a Given Probability

The National Mean is 65 with a Standard Deviation of 15. What is the cutoff score for our students to be in the National 90th Percentile?

“Rank_and_Probability” sheet

Norm.Inv(Probability , National Mean , National Standard Deviation)Returns a data point

for a given probability

Page 56: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

CountIF() and Count() Exercise

Compare Local Test Scores with National

How many of our students are in the National 90th Percentile?What percent of our students are in the National 90th Percentile?

“Rank_and_Probability” sheet

Page 57: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Understanding Cumulative in Norm.Dist() Exercises

“Probability_Exercise” sheet

• What is the probability that the student will be exactly 70 inches tall?

• What is the probability that the student will be 70 inches tall or less?

• What is the probability that the student will be between 70 to 74 inches tall?

• What is the probability that the student will be greater than 70 inches tall?

A new student is being sent to a height minded Volleyball coach to be on his team. Google tells him that in the USA, the average height of a man is 69.1" with a standard deviation of 2.9".

Page 58: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Norm.Dist(DataPoint , Mean, Standard Deviation , Cumulative)

• What is the probability that the student will be exactly 70 inches tall?(Cumulative False)

• What is the probability that the student will be 70 inches tall or less?(Cumulative True)

• What is the probability that the student will be between 70 to 74 inches tall? (Cumulative True)

• What is the probability that the student will be greater than 70 inches tall?(Cumulative True)

70

Page 59: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Norm.Dist() Exercise

“Probability_Exercise” sheet

A new student is being sent to a height minded Volleyball coach to be on his team. Google tells him that in the USA, the average height of a man is 69.1" with a standard deviation of 2.9".

Solution

Page 60: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

T-Test

Used to determine how similar two sets of data are. More specifically, they test the hypothesis that the means of the two sets of data do not differ (i.e. the null hypothesis). In other words, are they different simply due to chance or over time will they continue to be different?

Requirements:• Standard Deviation of the Population is

unknown.• Sample Size is less than 30 (n<30)• Data is Normally Distributed

Page 61: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

T-Tests

(Comparing Two Means)

Interpreting the Results• T-Test results are typically expressed in probabilities (p).• Because it T-Tests evaluates the hypothesis that there is no real

difference, the smaller the p value the more likely it is that there is a true difference between the means. A 95% level of confidence is typically used so:

A p-value less than .05 indicates that the data sets do differ and not just do to randomness

Page 62: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

T-Tests

(Comparing Two Means)

Factors to Consider: Number of TailsOne Tailed: testing is in just one direction e.g. The mean of sample A is less than the mean of sample B

Two Tailed: testing is in both directionse.g. Testing that the mean of sample A is less than or greater than the mean of sample B.

Page 63: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

T-Tests

(Comparing Two Means)

Factors to Consider: Paired vs. UnpairedPaired: Each value in a sample is paired with a value in another sample. Each sample has the same number of data points. Often used in before and after comparison.

• Testing a person’s reflexes before and after drinking alcohol.• Weighing each individual before and after taking a new

weight loss drug.

Page 64: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

T-Tests

(Comparing Two Means)

Factors to Consider: Paired vs. UnpairedUnpaired: data is gathered by randomly sampling each set of data separately under different conditions. The size of each group does not have to be equal.

• Comparing the performance of two different stocks.• Comparing the battery life of batteries from two different

suppliers.

Page 65: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

T-Tests

(Comparing Two Means)

Factors to Consider: Paired vs. UnpairedUnpaired: data is gathered by randomly sampling each set of data separately under different conditions. The size of each group does not have to be equal.

Unpaired can have either an equal or unequal population variance. Because you generally don’t know the population variance, Unequal variances is used much more often.

Page 66: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

T-Tests

(Comparing Two Means)

Factors to Consider: One-Sample vs. Two-SampleOne-Sample: You wish to compare your data to a known value or population mean.

• Measuring a school’s test scores against the national average.• Measuring birth weight in a certain city against a mean taken

50 years ago in the same city.

Page 67: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

T-Tests

(Comparing Two Means)

Factors to Consider: One-Sample vs. Two-SampleTwo-Sample: The means was sampled independently from two different population groups.

• Comparing the mean of people on a new weight loss drug against the mean of a control group taking a placebo.

• Comparing the battery life of batteries from two different suppliers.

Page 68: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

T-Tests with the data Analysis Tookpak

Page 69: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

T-Test: Example 1

A researcher gave one group a drug to lower toxins in their cells and another

group a placebo. She wishes to test the hypothesis that the amount of toxins in

the group treated with the drug will be less than the group given the placebo.

Number of Tails?

One or Two Sample?

Paired or Unpaired?

“Ttest_Ex1” Sheet

One - Because we are testing that the difference will be one direction (less than), this is a one-tailed test.

Two Sample - Because the data comes from two independent samples (drug group and placebo group).

Unpaired- Two separate groups are measured. Also, we don't know the variance of the two populations so we will assume they are unequal.

Page 70: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

T-Test: Example 1

A researcher gave one group a drug to lower toxins in their cells and another group a

placebo. She wishes to test the hypothesis that the amount of toxins in the group treated

with the drug will be less than the group given the placebo.

Because P is less than .05 (.029), we reject the null hypothesis. The difference is not due to randomness and the drug does reduce the amount of toxins in the blood.

Page 71: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

T-Test: Example 2

A company wishes to see if a new drug it has developed for diabetes has any effect on

people's weight. They weighed several individuals before giving them the drug and then

again after giving them the drug.

Number of Tails?

One or Two Sample?

Paired or Unpaired?

“Ttest_Ex2” Sheet

Two- Because we are testing for any difference (less than or greater than).

Two - Because we are the data comes from two samples, it is a two-sample test.

Paired – We are weighting the same individual(s) both before and after the test

Page 72: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

T-Test: Example 2

A company wishes to see if a new drug it has developed for diabetes has any effect on

people's weight. They weighed several individuals before giving them the drug and then

again after giving them the drug.

“Ttest_Ex3” Sheet

Because P is les than .05 (.0000103), we reject the null hypothesis. The drug does affect people’s weight.

Page 73: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

T-Test: Example 3

"TV's R Us" is opening a store in Sunnydale, CA and wishes to see if the town’s average

differs from the national average of 290 minutes per day. The data from the 15 residents

they surveyed is below.

Number of Tails?

One or Two Sample?

Paired or Unpaired?

“Ttest_Ex3” Sheet

Two- Because we are testing for any difference (less than or greater than).

One (Sort of)- We are comparing our data with a national average so this is a one sample test; however, the Data Analysis Tookpakdoesn’t do have a one sample test so we must make a column of 290 and specify a Two Sample test.

Unpaired- Two separate groups are measured. Also, we don't know the variance of the two populations so we will assume they are unequal.

Page 74: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

T-Test: Example 3

"TV's R Us" is opening a store in Sunnydale, CA and wishes to see if the town’s average

differs from the national average of 290 minutes per day. The data from the 15 residents

they surveyed is below.

“Ttest_Ex3” Sheet

Because P is les than .05, we reject the null hypothesis. Sunnydale’s viewing does differ from the national average.

Page 75: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

T.Test() Function

T.Test(Array1,Array2,Tails,Type)

Tails has Two Choices1 for a one tailed test.2 for a two tailed test.

Type has Three Choices:1 for Unpaired and Unequal Variance.2 for Unpaired and Equal Variance.3 for paired

Page 76: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

T.Test() Function (Optional)

Page 77: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Z-Test (When N>30)

Like a T-Test this is used to determine how similar two sets of data are. When the number of data points is greater than 30, use a Z-Test rather than a T-Test.

Requirements:• Variance of Samples should be close (F-Test).• Does not have to be normally distributed.• Samples sizes don’t have to be the same size

but should be close. • Data points should be independent of each

other.

Note that if you know the standard deviation of the population you should always use a Z-Test regardless of the sample size.

Page 78: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Z-Test (n>30)

Steps in Excel

Computer the Variance of both data setsVar.S(Range)

Perform a Z-TestData Analysis Toolpak

Page 79: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Z-Test (n>30)

Exercise 1

A company manufacturing “Easy-Start” file logs wants to know if a new manufacturing treatment increases the log's burning time. They have tested 41 of their old longs and 35 of their new logs and have recorded the burning times in the table shown to the right. They now wish to see if the difference they are seeing in the two means is due to randomness or the new logs actually do burn longer.

“Ztest_Ex1” Sheet

Page 80: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Z-Test (n>30)

Exercise 1

Step 1: Find the Variance for both the Old and New logs.

=VAR.S(A5:A45)And

=VAR.S(B5:B39)

“Ztest_Ex1” Sheet

Page 81: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Z-Test (n>30)

Exercise 1

Step 2: Perform a Z-Test to see if the samples differ

Because our P value for a one-tail test (0.00913) is less than 0.05, we will reject the null hypothesis that the means are the same. Or in other words, the difference in our means is not due to randomness and the new logs do burn longer than the older logs.

Page 82: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Linear Regression

Cause and Effect of Two Variables

Page 83: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Linear Regression - Slope

Rise Over Run

The slope of a line tells you how much a dependent variable (Sales) changes given the values of an independent variable (Advertising Costs) and is the RISE / RUN.

Slope = (Y2-Y1)/(X2-X1) = (230-200)/(20-10) = 30/10 = 3

A slope of 3 indicates that for every dollar of advertising we spend, our sales increase by 3 dollars.

Slope(Independent Range Y, Dependent Range X)

Page 84: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Linear Regression – Slope Exercise 1

For each dollar of advertising we spend, how much do our sales go up?

Use Excel’s Slope() functionIt uses the Least Squares Method (Line of Best Fit) to find the slope using multiple data points

“Slope_and_Intercept” Sheet

Page 85: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Linear Regression – Slope Exercise 1

For each dollar of advertising we spend, how much do our sales go up?

“Slope_and_Intercept” Sheet

SolutionFor every dollar of advertising we spend, our sales go up $2.86

Page 86: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Linear Regression – Y Intercept

The Y-Intercept is where your lines intersects the Y axis and tells you what the value of your Y coordinate would be if its corresponding X coordinate were zero. In this example, it tells us what sales would be if we spent nothing on advertising?

Intercept: y = mx+b

Where:y = the y coordinate of any data point on the line.x = the corresponding x coordinate for the y coordinate given above.m = the slope of the line.b = the y-intercept.

Intercept(Independent Range Y, Dependent Range X)

Page 87: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Linear Regression – Intercept Exercise 1

If we didn’t spend anything on advertising, what would our sales be?

Use Excel’s Intercepet() function to find the Y Intercpet.

Intercept(Independent Range Y, Dependent Range X)

“Slope_and_Intercept” Sheet

Page 88: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Linear Regression – Intercept Exercise 1 Solution

If we didn’t spend anything on advertising, what would our sales be?

“Slope_and_Intercept” Sheet

SolutionIf we spent nothing on advertising our sales would be $179,192

Page 89: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Plotting The Line

Page 90: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Plotting The Line

Meaning of R SquaredR2 is the square of the correlation and represents the percentage that the independent variable (advertising) "explains" the dependent variable (sales). Or in other words, checks to see how much advertising affects sales. It is always positive and between 0 and 1 with 1 being a 100% correlation and 0 being no correlation.

In this example R2 is .8548 (or 85.5%) which is a pretty strong correlation indicating that advertising does indeed affect sales. It also means there is some other unknown variable affecting 14.5% of sales.

Page 91: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Forecasting Future Values

(Linear Regression)

Hard

ware

Sale

s

New Home Startups

800K 900K 600K 400K 425K 500K

2.25M

2.0M

1.75M

1.5M

1.25M

1M

20

05

20

06

20

07

20

08

20

09

20

10

Regression Formula - Predicts a single data point.Forecast() - Predicts a single data point.Trend() - Predicts multiple data points.

Page 92: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Exercise 1: Predicting a Y Value Using Slope,

Y-Intercept, and the corresponding X Value

𝒀 𝑽𝒂𝒍𝒖𝒆 = Slope * X value + Y Intercept Y=mX+b

Forecast sales for the first three months of the next year based on sales for the first half using Y=mX+b

“Slope_and_Intercept” Sheet

Page 93: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Exercise 1: Predicting a Y Value Using Slope,

Y-Intercept, and the corresponding X Value

“Slope_and_Intercept” Sheet

Solution

Forecast sales for the first three months of the next year based on sales for the first half using Y=mX+b

Page 94: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Forecasting a Y Value using Forecast()

“Forecast” Sheet

Forecast(x, Known y's, known x's)

X is the known matching independent variable for the dependent variable Y you are trying to predict.

Known Y's are the known dependent variables. (Hardware sales in this example.)

Known X's are the known independent variables. (New Home Start-ups in this example.)

We believe that the number of new home start-ups in our state has a strong impact on our sales and is a good indicator of what our hardware sales will be. Predict what our Hardware Sales will be in 2014.

Page 95: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Exercise 2: Forecasting a Y Value using Forecast()

“Forecast()” Sheet

Forecast(x, Known y's, known x's)

X is the known matching independent variable for the dependent variable Y you are trying to predict.

Known Y's are the known dependent variables. (Hardware sales in this example.)

Known X's are the known independent variables. (New Home Start-ups in this example.)

SolutionWe believe that the number of new home start-ups in our state has a strong impact on our sales and is a good indicator of what our hardware sales will be. Predict what our Hardware Sales will be in 2014.

Page 96: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Trend() Exercise 1: Using Trend() to Predict Multiple Values

“Trend_Ex1” Sheet

=Trend(Known Y's, Known X's, New X's, Constant T/F)

Known Y'sThese are the historic dependent variables (e.g Sales for the 1st

half of the year.)

Known X'sThese are the historic independent variables (e.g Ad Costs for the 1st

half of the year.)

New X'sThese are the independent variables we plan to use (e.g Ad Costs for the 2nd half of the year.)

Constant (Optional)If True or omitted, b is calculated normally. If False, b is set to 0 and m-values are adjusted so y=mx.

Trend() is an array function so:

a. Highlight where you want your results (C9:C14).b. Type your formula.c. Press Control + Shift + Enter

Page 97: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Trend() Exercise 1: Using Trend() to Predict Multiple Values

“Trend_Ex1” Sheet

=Trend(Known Y's, Known X's, New X's, Constant T/F)

Trend() is an array function so:

a. Highlight where you want your results (C9:C14).b. Type your formula.c. Press Control + Shift + Enter

Solution

Page 98: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Trend() Exercise 2: Using Trend() For Multiple Regression

“Trend_Ex2” Sheet

=Trend(Known Y's, Known X's, New X's, Constant T/F)

If the array of known Y values (E3:E13) are in a single column, then Trend() interprets each column of known X values (A3:D13) as a separate variable.

ExerciseIn this example we have an office building we wish to sell but we are unsure of what our asking price should be. To help us determine our asking price, we have gathered data on office buildings similar to ours and in our area that have sold recently (A3:E13).

Page 99: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Trend() Exercise 2: Using Trend() For Multiple Regression

“Trend_Ex2” Sheet

=Trend(Known Y's, Known X's, New X's, Constant T/F)

=Trend(E3:E13,A3:D13,A17:D17)

Page 100: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Chi-Square Test

Used to test how likely it is that observed data is due to chance. Example, we randomly asked 44 men and 44 women whether they preferred red or white wine. As the table to the right shows, it appears that men prefer red and women prefer white but is this true or just random chance?

Page 101: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

About Chi-Square

• Chi-Square Tests the Null HypothesisChi-square tests the null hypotheses (that there is no relationship between two measured phenomena) against our data. Therefore, you will need two data sets to compare: expected and actual. A value of less than .05 indicates the null hypotheses is weak and our assumed results is not due to randomness or in other words, or data does appear to reflect reality.

• Difference Between Chi-Square and a T or Z TestChi-Square tests the null hypotheses about a relationship between two (usually) variables. (e.g. do men and women have a preference for red wine, white wine, or beer?) The t-test tests the null hypothesis between two means. (e.g. whether boys and girls in kindergarten have the same average height.

Page 102: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Using Chi-Square

• Use Chi-Square with Categorized Data Only(i.e. Data that has been counted and divided into categories)

• Do not use Chi-Square with uncategorized continuous data such as height, weight, or scores. (However, you can use Chi-Square if you have categorized your ranges. For example: Count of ages 0-25, 26-50, 50-75, etc.)

Page 103: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Using Chi-Square

• Do Not use with Derived Data, Percentages, or Ratios(e.g. estimations, assumptions, averages or some other statistical manipulation)

• Do Not use Chi-Square if any Category is less than 5

• Variables must be Mutually Exclusive(e.g. a person cannot like both red and white wine. It is one or the other)

• Data in the Set Should Never be Excluded (e.g. if you asked people if they liked red wine, white wine or Champaign, you must use all three.

Page 104: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Exercise 1: Chi-Square

We asked 44 men and 44 women if they preferred red or white wine. Our results are shown in the table to the right.

“Chi_Square_2x2” Sheet

Page 105: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Exercise 1: Chi-Square

Testing our data involves three steps:

Step A: Categorize the Results

Step B: Build a Model of Expected Data Representing the Null Hypothesis

Step C: Perform Chi-Square Test

“Chi_Square_2x2” Sheet

Page 106: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Exercise: Chi-SquarePart A: Categorize the Results Using CountifS()

“Chi_Square_2x2” Sheet

1. =COUNTIFS($A:$A,E$3,$B:$B,$D4)

2. Copy and Total

=CountifS(Criteria Range1, Criteria1, Criteria Range2, Criteria2)

Page 107: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Exercise: Chi-SquarePart B: Build a Model of Expected Data Representing the Null Hypothesis

“Chi_Square_2x2” Sheet

Use the same column and row totals as above. We had 44 males and 44 females and 38 people liked red and 50 people linked white wine.

The Null Hypotheses states that there is no preference so we need the male to female figures to reflect that. In E11 type the formula below then copy.

=E$13*$G11/$G$11

Page 108: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Exercise: Chi-SquarePart C: Determine Chi-Square using Chisq.Test()

“Chi_Square_2x2” Sheet

=ChiSq.Test(E4:F5,E11:F12)

=ChiSq.Test(Actual Range, Expected Range)

.000107 is less than .05 indicating that the null hypothesis is weak and our results are not due to random chance.Or in other words, the assertion that more women preferred white wine while more men preferred red wine is valid and not due to chance.

Page 109: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

CorrelationIs there really a correlation between our X and Y variables?

Page 110: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Correlation

Is there really a correlation between our hardware sales and the number of new housing starts?

=Correl(Array1,Array2)

-1 -.75 -.5 -.25 0 .25 .5 .75 1

PerfectCorrelation

InverseCorrelation

NoCorrelation

Forecast_Ex2 sheet

Page 111: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Correl(Array1,Array2)

Forecast_Ex2 sheet

Correlation Exercise 1:Determine if there really is a correlation between our hardware sales and new home start ups.

Page 112: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Correl(Array1,Array2)

Forecast_Ex2 sheet

Solution to Correlation Exercise 1There is nearly a perfect correlation. New home start ups is an excellent predictor of the sales we can expect in our hardware store.

Page 113: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Correl(Array1,Array2)

Is there a correlation between Interest rates and average home price in the last few years?

Is there a correlation between the age of an automobile and its average resale value?

Page 114: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Interval Estimation Around Mean

How close to the Average of a Sample can we Expect Data to be?

Page 115: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Interval Estimation about Means

One day at your manufacturing plant you wrote down the number of sprockets made by 60 randomly selected workers. You would like to compute the following:• The average for the day.• The standard deviation of the sample.• How close to the sample mean the population mean might be expected to fall.

Confidence.T(Alpha, Standard Deviation of the Sample, Sample Size)

When you don’t have the Standard Deviation of the Population, use Confidence.T() and the Standard Deviation of the Sample.

Page 116: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Interval Estimation about Means

“Confidence” sheet

Page 117: EXCEL STATISTICAL ANALYSISinfo.marshall.usc.edu/dept/training/Documents/... · 2014-11-11 · Percent Rank Expressing a Score as a Percent of Scores it is Greater Than PercentRank.Inc(Range

Interval Estimation about Means

“Confidence” sheet