SASによる中間解析のデザインと解析消費関数（O‘Brien-Fleming型とPocock型） SEQDESIGNプロシジャの出力内容の解説消費関数の種類，中間解析の回数・時期について

1

SASによる中間解析のデザインと解析

浜田知久馬東京理科大学

Interim Analysis using SAS

Chikuma HamadaTokyo University of Science

要旨

V9.2から中間解析を計画するためのSEQDESIGNプロシジャと，解析するためのSEQTESTプロシジャが実験版として

追加された．

本発表では，中間解析の数理と，この２つのプロシジャが出力する内容についてチュートリアルとして解説を行う．

キーワード：SEQDESIGN,SEQTEST,中間解析，消費関数 2

2

内容中間解析の数理情報時間の定義中止基準（中間解析なし，有効中止のみ，

無効中止のみ，有効・無効中止）消費関数（O‘Brien-Fleming型とPocock型）SEQDESIGNプロシジャの出力内容の解説消費関数の種類，中間解析の回数・時期についての検討

SEQTESTプロシジャの出力内容の解説条件付き検出力と予測検出力生存時間をエンドポイントとした中間解析 3

症例数設計のペンタゴンα，β，σ，Δ，N

4

Δ：効果の大きさ

σ：ばらつき

N：（情報時間I）

α β

3

中間解析の６つのステップ1)You specify the statistical details of the design, including the null and alternative

hypotheses, a test statistic , the Type I and II error probability levels, a stopping criterion, the total number of stages, and the relative information level at each stage.

2)You compute the boundary values for the trial based on the design specifications. The sample size required at each stage is also computed for the specified hypothesis test.

3)At each stage, additional data with the required sample sizes are collected. The data available at each stage include the data collected at the current stage as well as the data collected at previous stages.

4)At each stage, the available data are analyzed with a procedure. 5)At each stage, you compare the test statistic with the corresponding boundary values.

Either the trial is stopped to reject or accept the hypothesis, or the trial is continued to the next stage. If the trial continues to the final stage, the hypothesis is either rejected or accepted.

6)After the trial stops, you compute parameter estimates, confidence limits for the parameter, and a -value for the hypothesis test. 5

中間解析の６つのステップ

1)中間解析のデザインを決める条件の設定

検定統計量，帰無仮説(H0) ，対立仮説(H1)，

α，βエラー，中止基準，

中間解析の回数・時期

2)SEQDESIGNプロシジャによる症例数と，

棄却限界値を計算しSASデータセットに

おとす

3)各解析の段階でのデータを得る6

4

中間解析の６つのステップ

4)各解析の段階でのデータをプロシジャで解析

し結果をデータセットにおとす

5) 2)の棄却限界値と，４）の解析結果をSEQTESTプロシジャで比較し，試験の中止・

続行を判断

6) 中止の場合は，中間解析を考慮した

p値，推定値，信頼区間の計算

7

Input:Alpha,Beta,Alternative Reference,Number of Stages,Stopping Criterion,…

Group Sequential Design (SEQDESIGN)

Sample sizes

Group Sequential Test(SEQTEST)

Continues

Test Statistics

Data

LIFETEST

Boundaries(Critical Values)

Trial?

Stops

Analysis(SEQTEST):Parameter Estimates,p-valuesConfidence Limits

中間解析のステップ

8

5

中間解析のデザインのポイント１）想定条件

反応変数の型・指標，片側・両側，α，β ，Δ，σ２）中止基準

有効中止，無効中止，有効・無効中止

３）中間解析の方法・消費関数

O‘Brien-Fleming型とPocock型４）中間解析の回数と時期

9

SEQDESIGNプロシジャ棄却限界値と症例数を出力

10

中間解析の情報時間I情報時間Iの例

– 暦時間

– イベント数

– 患者数

0 1 2 3 4 5(年)

60% 100%

0 200 400(ｲﾍﾞﾝﾄ)

50% 100%

0 300 1000(人)

30% 100%

生存時間の情報

6

中間解析の数理

MLE:

],[

)1,(:),1,(:

::2,

::

2

121

2211

21

21

の

情報時間効果の大きさ

変量正規分布～

最終統計量中間統計量

kk

kkkkk I

ZIZ

IIZZCov

INZINZ

IZZ

ZZ

11

中間解析と最終解析は正の相関（Z2はZ１を含む）


221

2

1

21

2211

21

1],[

)1,(:),1,(:

2,

IIII

I

Cov

ININ

IZIZ

k

kkkkk


12

7

中間解析の数理(スコア統計量S）

)),((:,

][

],[

),(:),,(:2,

log

121212

121

11

212

121

222111

21

IIIINSSSSS

SVI

IIIISSCov

IINSIINSSS

IZd

LdS kkk

は独立


13

正規分布の場合

22

2

22

2

2

2

2

2

log

)(log2

)(log

2)(exp

21

Nd

LdI

yNNyNd

LdS

yL

yL

i

i

定数

14

8

一標本の母平均の検定（σ2既知）

2

2

0

21

00

I

)1,(),(

),(

1:

nI

Nn

Ny

nNy

yn

y

Hn

ii

～

～

15

一標本の母平均の検定

),(),,(

,

1,1

2,

2

2

2

1

2

1

022011

122

111

21

21

NN

NN

yy

yN

yyN

y

yyN

ii

N

ii

～～


16

9


2

1

2

121

2

2

202

222

1

1

201

111

]Z,Z[

)1,(:Z

)1,(:Z

NN

IICov

IN

N

yI

IN

N

yI

17


2

1

212

21

21

2

1

2

122

1

1

1

111

]Z,Z[

Z

Z

212

11

NN

NNNCov

Nyyy

NyyN

Nyy

NyyN

NNN

NN

LLL

LL

18

Z１

Z2

10


2

0212

22

0111

1

2022

22

2

202

222

2011

21

1

201

111

21 1

S,

1

S

)(ZS

)(ZS

NyN

NyN

yNN

N

yI

yNN

N

yI

N

ii

N

ii

19


最も単純な１標本の母平均の検定の問題

帰無仮説H0： μ=0，分散既知： σ2=1

中間解析は１回，情報時間Iが半分で実施

N1＝N， N2＝２N

Ｘ1, Ｘ2 , ・・・, Ｘ2N ：独立にN（μ，12)

20

11


NNINNINNNN

xxNN

xNxNyxy

xN

xxN

x

NxN

Nii

N

ii

2,))2(1,(),1,(

2,

1,1)1,(:

2211

21

2121

22111

2

12

11

～～

21


707.0][][

],[

707.021],[

2/)(2/)(2

)1,2(),1,(

21

,1

21

21

2

121

21212

11

21

2

2

221

1

11

ZVZVZZCovr

NNZZCov

xxNxxNZ

xNZ

NNZNNZ

yN

N

yZyN

N

yZ

～～

22

Z2

Z1

12

23

条件付き検出力（σ2=1，N1=N2=N)

中間解析の結果が得られたという

条件付（ Z1 ）で対立仮説の下，

最終解析で有意差が出る確率

Pr（Z2>ｃ2｜H1 ，z1 ）

が小さいときは無効中止する．

1221

21

2

22

1

11

2 |1,,2

HNNN

xNxx

NNNxN

NxN

Z ～

23

条件付き検出力（σ2=1，N1=N2=N)

Nzc

HNzcNN

x

HN

xcN

x

HcNxxzHcZ

12

1122

11

22

1221

1122

21

2Pr

2Pr

2Pr),|Pr(

既知確率変数

24

13

帰無仮説の下でのZの分布(3D表示)

中間解析 Z1

最終解析

Z 2 次元正規分布

の

2

707.021r

25

確率密度

帰無仮説の下でのZの分布（等高線）

中間解析 Z1

最終解析

Z 2

26

確率密度

平均０，分散１相関係数0.707の２次元正規分布

14

有効域

続行

無効域

27

Z統計量に基づく棄却域（上側検定）

Z2Z１

情報時間（I)

消費関数の模式図

情報時間（時間分数）

消費α

Pocock型

O'Brien-Fleming型

１回目の

中間解析

2回目の

中間解析

最終

解析

中間解析の時期・回数の変更に柔軟に対応

28

15

想定１標本のｔ検定（薬剤投与前後の検査値の変化）標準偏差SD：1（既知）帰無仮説H0：μ=0対立仮説H1：μ=0.5(Δ=0.5)α：上側2.5% β： 10% (検出力: 90%)中間解析１回：有効・無効中止（O‘Brien-Fleming型のα消費関数）（Pocock型のβ消費関数）情報時間の半分で実施

29

SEQDESIGN プロシジャの文法

PROC SEQDESIGN Statement

出力等の指定

DESIGN Statement

中間解析の方法，条件の指定（複数可）

SAMPLESIZE Statement

解析対象変数の条件の指定

30

16

SEQDESIGNのプログラムods graphics on;

proc seqdesign altref=0.5 errspend

pss( cref= 0 0.5 1) stopprob( cref= 0 0.5 1)

plots=( asn power errspend);

OneSidedErrorSpending:

design method(alpha)=errfuncobf alpha=0.025

method(beta)=errfuncpoc beta=0.10

nstages=2 alt=upper stop=both;

samplesize model=onesamplemean

(mean=0.5 stddev=1);

ods output Boundary=bound_t;run;31

Design Information

Statistic Distribution Normal

Boundary Scale Standardized Z

Alternative Hypothesis Upper

Early Stop Accept/Reject Null

Method Error Spending

Boundary Key Both

Alternative Reference 0.5

Number of Stages 2

Alpha 0.025

Beta 0.1

Power 0.9

Max Information (Percent of Fixed Sample) 110.3272

Max Information I 46.37016 N

Null Ref ASN (Percent of Fixed Sample)

ASN: Average Sample Number

65.68762H0のNの期待値

Alt. Ref ASN (Percent of Fixed Sample) 90.94031 H1のNの期待値 32

17

Method Information

Boundary Method Alpha Beta Error Spending AlternativeReference

DriftH1の下でのZ統計量Function

Upper Alpha

Error Spending

0.02500 . Approx. O'Brien-Fleming

0.5 3.404782

Upper Beta Error Spending

. 0.10000 Approx. Pocock 0.5 3.404782

Powers and Expected Sample SizesReference = CRef * (Alt. Reference)

CRef Power Sample Size

Percent Fixed-Sample

0.0000 （μ=0) 0.02500 65.6876

0.5000 （μ=0.25) 0.37961 87.8001

1.0000 （μ=0.5) 0.90000 90.9403

0.5*√I 33

pss( cref= 0 0.5 1)

Expected Cumulative Stopping ProbabilitiesReference = CRef * (Alt. Reference)

CRef ExpectedStopping Stage

Source Stopping Probabilities

Stage_1 Stage_2

0.0000 （μ=0) 1.191 Reject Null α 0.00153 0.02500

0.0000 1.191 Accept Null 0.80770 0.97500

0.0000 1.191 Total 0.80922 1.00000

0.5000（μ=0.25) 1.592 Reject Null 0.03930 0.37961

0.5000 1.592 Accept Null 0.36906 0.62039

0.5000 1.592 Total 0.40837 1.00000

1.0000 （μ=0.5) 1.649 Reject Null 0.28943 0.90000

1.0000 1.649 Accept Nullβ 0.06201 0.10000

1.0000 1.649 Total 0.35144 1.00000

期待中止ステージ 1.191=1×0.80922+2×(1-0.80922）34

stopprob( cref= 0 0.5 1)

18

Boundary Information (Standardized Z Scale)Null Reference = 0

_Stage_ Alternative Boundary Values

Information Level Reference Upper

Proportion Actual Upper Beta Alpha

1 0.5000 23.18508 2.40754 0.86944 2.96259

2 1.0000 46.37016 3.40478 1.89189 1.89189

Error Spending Information

_Stage_ InformationLevel

Cumulative Error Spending

Upper

Proportion Beta Alpha

1 0.5000 0.06201 0.00153

2 1.0000 0.10000 0.0250035

40478.346.370.5

40754.291.230.5

NE[Z]

Sample Size Summary

Test One-Sample Mean

Mean 0.5

Std. Dev. 1

Max Sample Size 46.37016

Expected Sample Size (Null Ref) 27.60828

Expected Sample Size (Alt. Ref) 38.2219

Sample Sizes (N)One-Sample Z Test for Mean

_Stage_ Fractional N Ceiling N

N Information N Information

1 23.19 23.1851 24 24.0000

2 46.37 46.3702 47 47.000036

19

有効域

続行

無効域

37

Z統計量に基づく棄却域

boundaryscale=ｓｔｄｚオプション

N

y2

0Z

中間解析最終解析情報時間（I)

Z

38

boundaryscale=mle オプション

平均値に基づく棄却域

y

20

39

boundaryscale=score オプション

スコア統計量（Z×√Iに基づく棄却域

N

y2

0S

40

boundaryscale=pvalue オプション

p値に基づく棄却域

dxx2

exp21p

2

z

無効域

有効域

21

0.06201

0.10000

0.00153

0.02500

41

Pocock型の

β消費関数

O‘Brien-Fleming型の

α消費関数

42

期待症例数１段階に対する％

有効中止の可能性大

無効中止の可能性大

μ=0.5

22

43

検出力

μ=0.5

SEQDESIGNによるデザインの結果

無効

有効

：最終解析

継続

無効中止

有効中止

：中間解析

中間解析なし

:89189.1:89189.1

4747)(

:2.9625986994.0:86994.0:2.96259

24)24(

43)N:(47,24

2

2

222

1

1

1

111

21

ZZ

yZN

ZZZ

yZN

NN

44

23

限界値(critical value)とエラーの消費

中止基準中間下限中間上限最終上限

A最終のみーー 1.95996α=0.02500

B有効中止ー 2.96259α=0.00153

1.96860α=0.02450

C無効中止 0.86592β=0.06201

ー 1.88372α=0.02980

D有効中止＋無効中止

0.86994β=0.06201

2.96259α=0.00153

1.89189α=0.02925

45

A αエラーの領域（中間解析なし）

αエラーの領域

46

中間解析 Z1

最終解析

Z 2 1.95996

24

B αエラーの領域（有効中止のみ）

47

αエラーの領域1.96860

2.96259

Z1

Z2

C αエラーの領域（無効中止のみ）

48


0.86592

Z1

Z2

25

D αエラーの領域（有効・無効中止あり）

49


0.86994 2.96259

Z1

Z2

無効中止ありの有意水準

正規分布の両側５%点は1.96（有意水準0.05)

Z2の棄却限界値は1.89189（有意水準0.058506）

中間解析を行うと有意水準が緩められる？

Z2が1.96を超えても， Z１が0.86994未満のときは

無効中止で有意にならない

50

26

帰無仮説と対立仮説のZの分布

51

Z2

Z1

(0,0)

(2.408,3.405)＋

H0の下で青の領域に入る確率はα（2.5%）H1の下で青の領域に入る確率は1-β(90%)

NE[Z]

Nを増やしたときのZの期待値

帰無

N1=23.19, N2=46.37

52

46.370.5]E[Z

91.230.5]E[Z

E[Z]

2

1

N

H0の下で青の領域に入る確率は2.5%H1の下で青の領域に入る確率は90%

対立

Z1

Z2

27

両側検定（有意水準5%)の場合ods graphics on;proc seqdesign altref=0.5 errspendpss( cref= 0 0.5 1) stopprob( cref= 0 0.5 1)

plots=( asn power errspend);TwoSidedErrorSpending: design method(alpha)=errfuncobf

method(beta)=errfuncpocnstages=2 alpha=0.05 alt=twosidedstop=both;

samplesize model=ONESAMPLEMEAN(mean=0 stddev=1);

ods output Boundary=bound_t;run; 53

両側検定（有意水準5%)の場合

Boundary Information (Standardized Z Scale)Null Reference = 0


Information Level Reference Lower Upper

Proportion

Actual Lower Upper Alpha Beta Beta Alpha

1 0.5000 23.2198 -2.40935 2.40935 -2.96259 -0.87540 0.87540 2.96259

2 1.0000 46.43959 -3.40733 3.40733 -1.89093 -1.89093 1.89093 1.89093

54

28

両側検定の場合

55

-2.96259

-0.87540

0.87540

2.96259 1.89093

-1.89093

有効域

有効域

無効域

続行

続行

情報時間（N)

Z

消費関数

Pocock

O’Brien-Flemingガンマパワー

56

29

消費のタイプ

Pocock浪費家給料日前はいつもスッカラカン趣味は博打

O‘Bｒien倹約家趣味は貯金

57

ハイリスク・ハイリターンローリスク・ローリターン

消費関数

パターン α消費関数 β消費関数

１ (PP) Pocock Pocock

２ (PO) Pocock O’Brien-Fleming

３ (OP) O’Brien-Fleming Pocock

４ (PP) O’Brien-Fleming O’Brien-Fleming

58

30

４つのデザインの比較

OneSidedErrorSpending1pp: design method(alpha)=errfuncpoc method(beta)=errfuncpocnstages=2 alpha=0.025 alt=upper stop=both;OneSidedErrorSpending2po: design method(alpha)=errfuncpoc method(beta)=errfuncobfnstages=2 alpha=0.025 alt=upper stop=both;

OneSidedErrorSpending3po: design method(alpha)=errfuncobf method(beta)=errfuncpocnstages=2 alpha=0.025 alt=upper stop=both;OneSidedErrorSpending4oo: design method(alpha)=errfuncobf method(beta)=errfuncobfnstages=2 alpha=0.025 alt=upper stop=both;

59

消費関数の型

β Pocock

β OｂF

α Pocock

α OｂF

60

31

検出力

61

期待症例数

１PP2PO 3OP

４OO

62

32

１ Pocock Pocock(PP)

63

中間解析を行ったために有意にならない

情報時間（N)

α Pocock β Pocock

Z

2 Pocock ObF(PO)

64

情報時間（N)

α Pocock β ObF

Z

33

3 ObF Pocock(OP)

65

情報時間（N)

α ObFβ Pocock

Z

4 ObF ObF(OO)

66

中間解析を行わない場合と変わらない

α ObFβ ObF

情報時間（N)

Z

34

考察消費関数

１）Pocock（ハイリスク・ハイリターン）

期待症例数（↓ ↓）最大症例数（↑ ↑）

２）O’Brien-Fleming（ローリスク・ローリターン）

期待症例数（↓ ）最大症例数（↑）

３）α消費関数をPocock型にするとH1の下での期待症例数が減少

４）β消費関数をPocock型にするとH0の下での期待症例数が減少

67

中間解析の回数１回(nstages=2）

68

情報時間（N)

Z

35

中間解析の回数２回(nstages=3）

69

情報時間（N)

Z

中間解析の回数３回(nstages=4）

70

情報時間（N)

Z

36

中間解析の回数４回(nstages=5）

71

情報時間（N)

Z

消費関数

72

37

期待症例数

中間解析１回２回

３回４回

73

期待症例数

中間解析なし

考察中間解析回数

１）中間解析回数を増加させると期待症例数は

減少．

２）中間解析回数を増加させると最大症例数は

増加．

３）中間解析を３回以上行っても，期待症例数，最大症例数は

あまり変わらない．．

74

38

中間解析の時期 25% info=cum(0.25,1)

75

情報時間（N)

Z


76

情報時間（N)

Z

39


77

情報時間（N)

Z

消費関数

75%

75%

50%

25%

78

40

期待症例数

75%

25%

50%

79

考察中間解析の時期

１）中間解析を後期(0.75)に行っても，期待症例数の減少は小さい．

２）中間解析を早期(0.25)に行っても，有効中止は望めないのでH1の下で期待症例数は100%より大きくなる．

３）中間解析の時期は真中(0.5)付近が望ましい．

80

41

生存時間解析で必要な統計量

81

4

4)

41

log

イベント数

イベント数（情報量

イベント数

効果の大きさ：

IV

Z

I

IVbV

HRb

中間解析の条件の設定

生存時間をログランク検定（帰無仮説：対数ハザード比0）対立仮説ハザード比（HR)： 0.5: Δ=log(0.5)=- 0.69315対照群Cのハザード：0.35 薬剤群Dのハザード：0.175

α：片側2.5% β： 10% (検出力: 90%)登録期間：２年追跡期間：５年

情報時間の半分で中間解析１回：有効・無効中止

エラーの消費

O‘Brien-Fleming型のα消費関数Pocock型のβ消費関数

82

42

83

想定する生存時間曲線

登録期間

生存割合

追跡期間

対照群Cハザード：0.35

薬剤群Dハザード：0.175

Δ=log(0.5)=- 0.69315

83

SEQDESIGNのプログラムods graphics on;proc seqdesign altref=0.69315 errspendpss( cref= 0 0.5 1) stopprob( cref= 0 0.5 1)

plots=( asn power errspend);OneSidedErrorSpending: design method(alpha)=errfuncobf alpha=0.025

method(beta)=errfuncpoc beta=0.10nstages=2 alt=upper stop=both;

samplesize model=twosamplesurvival(nullhazard=0.35 acctime=2 foltime=5);

ods output Boundary=bound_surv;run;ods graphics off;SEQDESIGNでは基準群を第2群に設定する

84

43

試験デザインの基本情報Design Information


Boundary Scale 棄却限界値の統計量 Standardized Z

Alternative Hypothesis 対立仮説の方向 Upper

Early Stop 中止基準 Accept/Reject Null

Method 中間解析の方法 Error Spending

Boundary Key 棄却域構成する上で制御する過誤 Both

Alternative Reference 対立仮説 0.69315

Number of Stages 2

Alpha 0.025

Beta 0.1

Power 0.9


Max Information 24.12815 イベント数/4

Null Ref ASN (Percent of Fixed Sample) 65.68762 H0の平均イベント数

Alt. Ref ASN (Percent of Fixed Sample) 90.94031 H１の平均イベント数

85ASN: Average Sample Number 期待症例数

Method Information

Boundary Method Alpha Beta Error Spending AlternativeReference対数HR

DriftH1の下でのZ統計量

Function

Upper Alpha

Error Spending

0.02500 . Approx. O'Brien-Fleming

0.69315 3.404782

Upper Beta Error Spending

. 0.10000 Approx. Pocock 0.69315 3.404782

Powers and Expected Sample SizesReference = CRef * (Alt. Reference)

Cref対数ＨＲに対する効果をC倍

Power Sample Size

Percent Fixed-Sample

0.0000 （帰無） 0.02500 65.6876

0.5000 0.37961 87.8001

1.0000 （対立） 0.90000 90.9403

対数HR*√I

86

44

各ステージの中止確率Expected Cumulative Stopping ProbabilitiesReference = CRef * (Alt. Reference)

CRef ExpectedStopping Stage

Source Stopping Probabilities

Stage_1 Stage_2

0.0000 1.191 Reject Null 0.00153 0.02500

0.0000 1.191 Accept Null 0.80770 0.97500

0.0000 1.191 Total 0.80922 1.00000

0.5000 1.592 Reject Null 0.03930 0.37961

0.5000 1.592 Accept Null 0.36906 0.62039

0.5000 1.592 Total 0.40837 1.00000

1.0000 1.649 Reject Null 0.28943 0.90000

1.0000 1.649 Accept Null 0.06201 0.10000

1.0000 1.649 Total 0.35144 1.00000

期待中止ステージ 1.191=1*0.80922+2*(1-0.80922)

87

棄却限界値とエラーの消費Boundary Information (Standardized Z Scale) Null Reference = 0


Information Level Reference Upper

Proportion Actual Events Upper Beta Alpha

1 0.5000 12.06408 48.25631 2.40754 0.86944 2.96259

2 1.0000 24.12815 96.51262 3.40478 1.89189 1.89189


_Stage_ Information Level Cumulative Error Spending

Upper

Proportion Beta Alpha

1 0.5000 0.06201 0.00153

2 1.0000 0.10000 0.0250088

45

Boundary Valuesとエラーの消費中間解析上側：2.92569 下側：0.86944

06201.0))1,4075.2((

00153.0))1,0((

3.40478912.40.6931524.128150.69315

4075.2473.30.6931512.064080.69315

IlogZ

0.8694492569.2

NPDF

NPDF

HR

89

Sample Size Summary

Test Two-Sample Survival

Null Hazard Rate 0.35

Hazard Rate (Group A) 0.175

Hazard Rate (Group B) 0.35

Hazard Ratio 0.499999

log( Hazard Ratio) -0.69315

Reference Hazards Alt. Ref

Accrual Rate 年あたりの登録数 63.35757(126.7151/2)

Accrual Time 登録期間 2

Follow-Up Time 追跡期間 5

Total Time 登録期間＋追跡期間 7

Max Number of Events 96.51262

Max Sample Size 最終までの症例数 126.7151

Expected Sample Size (Null Ref) 126.7151

Expected Sample Size (Alt. Ref) 126.7151 90

46

解析時点と症例数

Numbers of Events (D) and Sample Sizes (N)Two-Sample Log-Rank Test

_Stage_

Fractional Time Ceiling Time

D Time N N(Grp 1)

N(Grp 2)

Information

Time

N

1 48.26 2.923 126.7 63.36 63.36 12.0641 3 127

2 96.51 7.000 126.7 63.36 63.36 24.1282 7 127

イベント数症例数 91

棄却限界値の図示

有効棄却域

無効棄却域

続行

92

VZ

Z

47

エラーの消費

0.06201 0.00153

0.10000 0.02500

0.00153

0.025000.06201

0.10000

93

Pocock型β消費関数

O‘Brien-Fleming型α消費関数

薬剤効果と期待症例数

対数HRに対する比

無効中止の可能性大

有効中止の可能性大

94

期待症例数

48

薬剤効果と検出力

対数HRに対する比 95

検出力

デザインの結果

有効最終解析

継続

無効中止

有効中止

　：期待イベント数：

年年中間解析の時期：

　　：期待イベント数：

年群計目標症例数：　

年年　　追跡期間：登録期間：

:89189.1:2.9625986994.0

:86994.0:2.96259

17.9):D30.448.3(C()(2.923

41.1):D55.496.5(C63.25/ )128126.7(2

52

2

1

1

1

ZZ

ZZ

96

49

棄却限界のSASデータセット化データセットbound_surv

_Stage_ _InfoProp_

_Info_ Events NullRef AltRef_U

Bound_UB

Bound_UA

1 0.5000 12.0640 48.2563 0 2.40754 0.86944 2.96259

2 1.0000 24.1281 96.5126 0 3.40478 1.89189 1.89189

97

シナリオシナリオ中間解析最終解析

１）継続有効

２）継続無効

３）有効中止 ―

４）無効中止 ―

98

50

仮想的な中間解析データdata year_1;HR=0.5;do TrtGp=1 to 2;do i=1 to 50;event=1;if TrtGp=1 then year=ranexp(4989)/0.35;if TrtGp=2 then year=ranexp(4989)/(0.35*hr);ctime=1+2*ranuni(4989);if year >= ctime thendo;year=ctime;event=0;end;

output;end;end; 99

中間解析は計画通りにはいかない

LIFETESTによる解析

proc lifetest data=year_1 notableplots=(s);

time year*Event(0);

strata TrtGp/test=logrank;

ods output HomStats=outs;

ods output LogHomCov=outv;

run;100

51

仮想的な中間解析結果（3年後）

101

ログランク Z=2.22871 p=0.0258

対照群

薬剤群

LIFETESTによる解析結果

102

打ち切りと非打ち切り値の数の要約

層 TrtGp 全体死亡打ち切りパーセント打ち切り

1 1 50 27 23 46.00

2 2 50 17 33 66.00

Total 100 44 56 56.00

層に対しての同等性の検定

検定カイ 2 乗自由度 Pr > Chi-Square

ログランク 4.9672 1 0.0258

順位統計量

TrtGp ログランク

1 7.3221

2 -7.3221

目標症例数は128例だったのに100例しか登録できなかったけど大丈夫？ I=44/4=１１

52

データセットlogrank_1の加工data logrank_1;merge outs outv;keep Variable _Scale_

_Stage_ StdErr Estimate;_Scale_='Score';_Stage_= 1;StdErr=sqrt( _1);Estimate=LogRank;Variable='TrtGp';if trtgp=1 then output;proc print;title 'Stage 1 Test Statistics';run; OBS Variable Estimate StdErr _Scale_ _Stage_

1 TrtGp 7.3221 3.2854 Score 1

6784.02854.3/3221.7/log 22SEEstimateHR 103

SEQTESTによる中間解析棄却限界：SEQDESIGN 解析：LIFETESTods graphics on;

proc seqtest boundary=bound_surv

parms(testvar=TrtGp)= logrank_1

nstages=2

order=stagewise

condpower predpower

plots =all;

ods output test=test_surv_1;run;

ods graphics off; 104

53

Design Information

BOUNDARY Data Set BOUND_SURV（SEQDESIGNの結果）

Data Set LOGRANK_1 （LIFETESTの結果）





Number of Stages 2

Alpha 0.025

Beta 0.0979 （計画時は0.10)

Power 0.9021


Max Information 24.1281543

Null Ref ASN (Percent of Fixed Sample) 64.06755

Alt. Ref ASN (Percent of Fixed Sample) 91.94305 105

エラーの消費

106

情報時間が0.4473のときに行うのでエラーの消費は減少

54

中間解析結果の判定（継続）Test Information (Standardized Z Scale)Null Reference = 0

_Stage_

Alternative

Boundary Values Test

Information Level Reference

Upper TrtGp

Proportion Actual Upper Beta Alpha Estimate Action

1 0.4473 10.79361 2.27725 0.67285 2.99668 2.22871 Continue

2 1.0000 24.12815 3.40478 1.90605 1.90605 .

107

0.86944 2.96259

1.89189 1.89189

計画段階の棄却限界

中間解析結果の判定（継続）

108

Z

情報時間（E/4)

55

中間解析の結果（継続）

109

0.9002797572.0

:99668.22.2287167285.050744.0HR0.67838,logHR

17.9):D30.448.3(C(17):D27(C44

31002

1

予測検出力：

条件付き検出力：　

継続


　：イベント数：　

年中間解析の時期：　

例登録年　　登録期間：　

Z

110

Conditional Power（条件付き検出力）CONDPOWERオプション

試験開始時点：最終解析で有意になる確率90%

Z1とZ2が未知

中間開始時点：最終解析で有意になる確率は？

Z1は既知，Z2が未知

Z1の条件付きで

Z2が最終的に有意になる確率が

低ければ試験を無効中止

Z2

Z1

56

Conditional Power（条件付き検出力）

Conditional Power Information （条件付き検出力）Reference = CRef * (Alt. Reference)

StoppingStage

MLElogHR中間解析

Reference 最終解析までのZの増分

ConditionalPower

Ref CRef

1 0.67838 MLE(中間) 0.9787 0.97247

1 0.67838 Null（帰無） 0 0.28816

1 0.67838 0.5000 0.76015

1 0.67838 Alternative 1.0000 0.97572

1 0.67838 1.5000 0.99940111

条件付き検出力（Conditional Power）

112

Z1+想定

対立帰無

検出力

57

２種類の検定と信頼区間CIHR：薬剤群に対する対照群のハザード比

H0：HR<1，（ｌogHR<0）を水準αで検定

H１：HR>1，（ｌogHR>0）

α水準の下側CIが0を含まなければ有意

（有効中止）

H0：HR>2，（ｌogHR>log2）を水準βで検定

H１：HR<2，（ｌogHR<log2）

β水準の上側CIがlog2を含まなければ有意

（無効中止）113

Predictive Power Information

StoppingStage

MLE Predictive Power（予測検出力）

1 0.67838 0.90027

114

α1水準の下側信頼区間H0: logHR=0

β1水準の上側信頼区間H0: logHR=0.69

H0もH1も棄却されないので続行

H0

H1

情報時間（E/4)

58

仮想的な最終解析結果（7年後)

115

ログランクZ=2.45250p=0.0142

対照群

薬剤群

ログランク検定の結果

打ち切りと非打ち切り値の数の要約

層 TrtGp 全体死亡打ち切りパーセント打ち切り

1 1 50 43 7 14.00

2 2 50 36 14 28.00

Total 100 79 21 21.00

116

順位統計量

TrtGp ログランク

1 10.639

2 -10.639

層に対しての同等性の検定

検定カイ 2 乗自由度 Pr > Chi-Square

ログランク 6.0148 1 0.0142

目標イベント数は97なのに，79しかなかったけど大丈夫？

59

117

Design Information

BOUNDARY Data Set WORK.BOUND_SURV

Data Set WORK.LOGRANK_2





Number of Stages 2

Alpha 0.025

Beta 0.15653 （計画時は0.10)

Power 0.84347


Max Information 18.8179875

Null Ref ASN (Percent of Fixed Sample) 69.59215

Alt. Ref ASN (Percent of Fixed Sample) 89.83464

判定結果(有効）Test Information (Standardized Z Scale)Null Reference = 0

_Stage_ Alternative

Boundary Values Test

Information Level Reference

Upper TrtGp

Proportion

Actual Upper Beta Alpha Estimate Action

1 0.5736 10.79361 2.27725 0.68336 2.99668 2.22871 Continue

2 1.0000 18.81799 3.00686 1.93781 1.93781 2.45250 Reject Null

118

60

最終解析の判定結果(有効）

119

Z

情報時間（E/4)

120

Parameter EstimatesStage-Wise Ordering

Parameter Stopping Stage MLE p-Value forH0:Parm=0

MedianEstimate

Lower 95% CL

TrtGp 2 0.565357 0.0076 0.564985 0.18271

下側信頼区間が０を含まないのでH0を棄却

61

最終解析の結果

121

有効


　：イベント数：　

年中間解析の時期：　

例登録年　　登録期間：　

:2.452501.9378156916.0HR0.56535,logHR

41.1):D4.5596.5(C(36):D4379(C

71002

2Z

疑問

ログランク検定のP値と結果が

異なるのはなぜ？

Median Estimateとは何か？

122

62

ｐ値=0.0076

123

2.45250

0.68336

積分領域

この領域の確率は0.76%7574.05736.0r

Z1

Z2

2.45250

この領域の確率は50%

)log,(log 21 IHRIHR

Median Estimate(0.5649859)の下での分布

Z1

Z2

124

63

95%下限(0.18271)の下での分布

125

この領域の確率は5%

2.45250

Z1

Z2

一標本Z検定σ=1，H0：μ=0

126

Nx

N

xz1)1,0(N

ｐ値

)1,( NxN

)1,)96.1(( NxN

0.50

0.025

z

64

中間解析で有効中止

Z=4.11414p<.0001

対照群

薬剤群

127

128

Test Information (Standardized Z Scale)Null Reference = 0

_Stage_ Alternative Boundary Values Test

Information Level Reference Upper TrtGp


1 0.3544 8.552168 2.02705 0.29149 3.06694 4.11414 Reject Null

2 1.0000 24.12815 3.40478 1.92563 1.92563 .

Z

情報時間（E/4)

65

129


Parameter StoppingStage

MLE p-Value forH0:Parm=0

MedianEstimate

Lower 95% CL

TrtGp 1 1.406829 <.0001 1.406829 0.84437

下側信頼区間が０を含まないのでH0を棄却

情報時間（E/4)

中間解析で無効中止

130

Z=0.09980p=0.53795

対照群

薬剤群

66

131

Test Information (Standardized Z Scale)Null Reference = 0

_Stage_ Alternative Boundary Values Test

Information Level Reference Upper TrtGp


1 0.5528 13.3379 2.53146 1.03325 2.65174 0.09980 Accept Null

2 1.0000 24.12815 3.40478 1.90004 1.90004 .

Z

情報時間（E/4)

132


Parameter StoppingStage

MLE p-Value forH0:Parm=0

MedianEstimate

Lower 95% CL

TrtGp 1 0.027326 0.4603 0.027326 -0.42306

上側信頼区間が対立仮説を含まないのでH1を棄却

情報時間（E/4)

情報時間（E/4)

67

LOG画面の出力

• WARNING: The CONDPOWER option is not applicable after the group sequential trial stops.

• WARNING: The PLOTS=CONDPOWER option is not applicable after the group sequential trial stops.

• WARNING: The PREDPOWER option is not applicable after the group sequential trial stops.

133

イベント数が20（情報時間が5)data info;

input _Stage_ _Info_;datalines;

1 5;

ods graphics on;proc seqtestboundary=bound_survboundarykey=alphadata=infonstages=2boundaryscale=stdzerrspend; 134

計画段階のα，β Beta Alpha

0.06201 0.00153

0.10000 0.02500

68

イベント数が20（情報時間が5)

135

Z

情報時間（E/4)

136

症例数設計のペンタゴンα，β，σ，Δ，N

136

Δ：効果の大きさ

σ：ばらつき

N：（情報時間I）

α β

69

boundarykey=alphaError Spending Information

_Stage_ Cumulative Error Spending

Information Level Upper

Proportion Actual Beta Alpha

1 0.2072 5 0.02236 0.00063

2 1.0000 24.12815 0.08702 0.02500

137

boundarykey=betaError Spending Information




1 0.2072 5 0.02570 0.00052

2 1.0000 24.12815 0.10000 0.02075

138

70

boundarykey=both





1 0.2175 5 0.02570 0.00063

2 1.0000 22.98994 0.10000 0.02500

139

PROC SEQDESIGN文のオプションOption Description Design ParametersALTREF= specifies alternative reference

BOUNDARYSCALE= specifies statistic scale for the boundary

MAXINFO= specifies maximum information level

Table Output

ERRSPEND displays cumulative error spending at each stage

PSS displays powers and expected sample sizes

STOPPROB displays expected cumulative stopping probabilities

Graphics Output

PLOTS=ASN displays expected sample sizes plot

PLOTS=BOUNDARY displays detailed boundary plot

PLOTS=COMBINEDBOUNDARY displays combined boundary plot

PLOTS=ERRSPEND displays error spending plot PLOTS=POWER displays powers plot 140

71

DESIGN文のオプションOption Description Design Parameters

ALPHA= specifies Type I error probability level

ALT= specifies type of alternative hypothesis

BETA= specifies Type II error probability level

BOUNDARYKEY= specifies type of error probability to maintain

INFO= specifies information levels NSTAGES= specifies number of stages

STOP= specifies condition for early stopping

Boundary Methods

METHOD= specifies methods for boundary values 141

SAMPLESIZE文で解析可能な指標Option Description One-Sample Models

ONESAMPLEMEAN specifies one-sample test for mean

ONESAMPLEFREQ specifies one-sample test for binomial proportion

Two-Sample Models

TWOSAMPLEMEAN specifies two-sample test for mean difference

TWOSAMPLEFREQ specifies two-sample test for binomial proportions

TWOSAMPLESURVIVAL specifies log-rank test for two survival distributions

Regression Models

REG specifies test for a regression parameter

LOGISTIC specifies test for a logistic regression parameter

PHREG specifies test for a proportional hazards regression parameter 142

72

SEQTEST文のオプションOption Description Input Data Sets

BOUNDARY= specifies data set for boundary information

DATA= specifies data set for parameter estimates and information levels

PARMS= specifies data set for parameter estimates and standard errors

Boundaries

BOUNDARYADJ= specifies boundary adjustment for information levels

BOUNDARYKEY= specifies boundary key to maintain Type I and II error probability levels

BOUNDARYSCALE= specifies boundary scale

ERRSPENDMIN= specifies minimum error spending for the boundaries

MAXINFO= specifies maximum information NSTAGES= specifies number of stages Test Variable

DATA( TESTVAR= ) specifies test variable in DATA= data set

PAMRS( TESTVAR= ) specifies test variable in PARMS= data set 143

SEQTEST文のオプションOption Description

CITYPE= specifies types of confidence interval Table OutputCONDPOWER displays conditional powers ERRSPEND displays cumulative error spending at each stage PREDPOWER displays predictive powers PSS displays powers and expected sample sizes

RCI displays repeated confidence intervals

STOPPROB displays expected cumulative stopping probabilities Graphics OutputPLOTS=ASN displays expected sample sizes plot PLOTS=CONDPOWER displays conditional powers plot PLOTS=ERRSPEND displays error spending plot PLOTS=POWER displays powers plot PLOTS=RCI displays repeated confidence intervals plot

PLOTS=TEST displays boundary plot with test statistics 144

73

参考文献

1)Wald A : Sequential Analysis. Wiley, New York, 1947

2) Pocock SJ : Group sequential methods in the design and analysis of clinical trials. Biometrika64:191–199, 1977

3)村岡了一，小出俊彦，青木保典ほか：臨床試験のための中間解析―基礎から実施するまで―.サイエンティスト社，東京，2004

4)Lan KKG, DeMets DL : Discrete sequential boundaries for clinical trials. Biometrika 70: 659–663, 1983

145

Documents

SASによる 中間解析のデザインと解析消費関数（O‘Brien-Fleming型とPocock型） SEQDESIGNプロシジャの出力内容の解説 消費関数の種類，中間解析の回数・時期について

SASによる中間解析のデザインと解析消費関数（O‘Brien-Fleming型とPocock型） SEQDESIGNプロシジャの出力内容の解説消費関数の種類，中間解析の回数・時期について