3
Presentation Title Goes Here presentation subtitle. Data Transformation Leilani Nora Assistant Scientist Crop Research Informatics Laboratory International Rice Research Institute :: color, composition, and layout 2 Assumptions in the ANOVA ANOVA is valid only when the following assumptions are met: Additive Effects Independence of errors - Assured by proper randomization Homogeneity of variance Normal distribution :: color, composition, and layout 3 Assumptions in the ANOVA Most common violation in the ANOVA is the homogeneity of variances. If this assumption is not met, most often the normality and additive assumptions are also violated. Transforming the data often homogenizes the variances and consequently solves the problem of non-additive effect and non-normality of variances. :: color, composition, and layout 4 Data Transformation Commonly used transformations Logarithmic Transformation Square-root Transformation Arc Sine Transformation

Powerpoint presentation - Data Transformation

Embed Size (px)

DESCRIPTION

Powerpoint presentation - Data Transformation

Citation preview

Page 1: Powerpoint presentation - Data Transformation

Presentation Title Goes Here…presentation subtitle.

Data Transformation

Leilani Nora

Assistant Scientist

Crop Research Informatics Laboratory

International Rice Research Institute

:: color, composition, and layout2

Assumptions in the ANOVA

ANOVA is valid only when the following assumptions

are met:

• Additive Effects

• Independence of errors

- Assured by proper randomization

• Homogeneity of variance

• Normal distribution

:: color, composition, and layout3

Assumptions in the ANOVA

Most common violation in the ANOVA is the homogeneity of

variances. If this assumption is not met, most often the

normality and additive assumptions are also violated.

Transforming the data often homogenizes the variances and

consequently solves the problem of non-additive effect and

non-normality of variances.

:: color, composition, and layout4

Data Transformation

Commonly used transformations

• Logarithmic Transformation

• Square-root Transformation

• Arc Sine Transformation

Page 2: Powerpoint presentation - Data Transformation

:: color, composition, and layout5

Replace with 100-25/nFor obs = 100%

Replace with 25/nFor obs = 0%

Arcsin(sqrt(y/100))Other ranges

No transformation30 – 70

Sqrt(100-Y)80 – 100

Sqrt(Y)0 – 20

% whiteheads

% survival

Percentage data

Tiller no.

100 grain weight

Percent filled grains

Panicle number

No transformationYield and yield

components

ExampleTransformationVariable (Y)

:: color, composition, and layout6

Log(Y+1)Some obs. < 10

Log(Y)All obs. > 10

No. of insects per plot

No. of egg masses per

plant

Whole numbers that

cover a wide range

Sqrt (Y+.5)Some obs. < 10

Sqrt (Y)All obs. > 10

No. of infested plants

per plot

No. of insects caught

in traps

No. of weeds per plot

Small whole numbers

ExampleTransformationVariable (Y)

:: color, composition, and layout7 :: color, composition, and layout8

PLOT OF LS RESIDUALS AGAINST FITTED VALUES

..............................................................

: :

: :

: * :

: * :

8. -: * :

: :

: * * :

: 2 * * * * :

: 2 * * :

0. -: * * * ** :

: * * * * * :

: ** * * ** :

: * * :

: * :

-8. -: * :

: :

: :

: :

: * :

-16. -: :

:............................................................:

: : : : : :

-5. 0. 5. 10. 15. 20.

Page 3: Powerpoint presentation - Data Transformation

:: color, composition, and layout9

A satisfactory residual plot should give this overall impression

:: color, composition, and layout10

A pattern or trend in the

distribution of points in the

residual plot may be

encountered indicating

transformation is needed.

:: color, composition, and layout11

PLOT OF LS RESIDUALS AGAINST FITTED VALUES

..............................................................

: :

: :

: * :

: * :

8. -: * :

: :

: * * :

: 2 * * * * :

: 2 * * :

0. -: * * * ** :

: * * * * * :

: ** * * ** :

: * * :

: * :

-8. -: * :

: :

: :

: :

: * :

-16. -: :

:............................................................:

: : : : : :

-5. 0. 5. 10. 15. 20.

before transformation

:: color, composition, and layout12

PLOT OF LS RESIDUALS AGAINST FITTED VALUES

..............................................................

: * :

: * :

: * :

: ** :

6. -: * * *:

: * * * * :

: * * * :

: * * * * * * :

: * * * :

0. -: * * * :

: * * :

: * * 2 :

: * * :

: * * * :

-6. -: * * :

: * * * * * :

: * :

: * :

: * :

-12. -: :

:............................................................:

: : : : : :

47.5 52.5 57.5 62.5 67.5 72.5

after transformation