36
Symposium 1 A framework and approaches to develop an in-house CAT with freeware and open sources. Tetsuo Kimura (Niigata Seiryo University) Kyung (Chris) T. Han (Graduate Management Admission Council) Michal Kosinski (University of Cambridge) Kojiro Shojima (The National Center for University Entrance Examinations in Japan

A framework and approaches to develop an in-house CAT with freeware and open sources

Embed Size (px)

DESCRIPTION

Symposium at IACAT 2012 Sydney

Citation preview

Page 1: A framework and approaches to develop an in-house CAT with freeware and open sources

Symposium 1A framework and approaches to develop an in-house CAT with freeware and open sources.

Tetsuo Kimura (Niigata Seiryo University)

Kyung (Chris) T. Han (Graduate Management Admission Council)

Michal Kosinski (University of Cambridge)

Kojiro Shojima (The National Center for University Entrance Examinations in Japan

Page 2: A framework and approaches to develop an in-house CAT with freeware and open sources

CAT is greedy! CAT likes big pool!

Page 3: A framework and approaches to develop an in-house CAT with freeware and open sources

The outline of the symposiumFramework to develop a CAT

(Thompson & Weiss, 2011)

Introduction of freeware and open sources for CAT

development

Approaches to develop an in-house CAT with freeware and

open sources

ExametrikaR package ltm

Moodle UCATConcerto

SimulCATR package catR

Page 4: A framework and approaches to develop an in-house CAT with freeware and open sources

The framework to develop a CAT

Step Stage Primary work1 Feasibility, applicability,

And planning studiesMonte Carlo simulation; business case evaluation

2 Develop item bank content or utilize existing bank

Item writing and review

3 Pretest and calibrate item bank

Pretesting; item analysis

4 Determine specifications for final CAT

Post-hoc or hybrid simulations

5 Publish live CAT Publishing and distribution; software development

Framework Proposed by Thompson & Weiss (2011)

Page 5: A framework and approaches to develop an in-house CAT with freeware and open sources

The three stages of CAT development

Pretesting & Item Analysis: Construction of Item Bank

Simulating CAT with Existing Item Bank: Determine specifications

Implementing CAT: Publishing a CAT on a software

ExametrikaR package ltm

Moodle UCATConcerto

SimulCATR package catR

Page 6: A framework and approaches to develop an in-house CAT with freeware and open sources

Pretesting & Item Analysis: Construction of Item Bank

Pretesting

Item analysis: calibration, elimination of misfit &

equating

More pretests with new items and anchored items

Item bank

Calibrated items

Anchored items

Page 7: A framework and approaches to develop an in-house CAT with freeware and open sources

Simulating CAT with Existing Item Bank: Determine specifications

Simulating CAT

Examining: Item selection rules,

Item exposure,Stopping rules, etc.

Determine CAT specifications

Item bank

Calibrated items

Page 8: A framework and approaches to develop an in-house CAT with freeware and open sources

Implementing CAT: Publishing a CAT on a software

Specify CAT AlgorithmOn a CAT Software

Implementing CAT

Examine CAT Results

Item bank

Calibrated items

Page 9: A framework and approaches to develop an in-house CAT with freeware and open sources

The outline of the symposiumFramework to develop a CAT

(Thompson & Weiss, 2011)

Introduction of freewares and open sources for CAT

development

Approaches to develop an in-house CAT with freewares and

open sources

ExametrikaR package ltm

Moodle UCATConcerto

SimulCATR package catR

Page 10: A framework and approaches to develop an in-house CAT with freeware and open sources

The three stages of CAT development

Pretesting & Item Analysis: Construction of Item Bank

Simulating CAT with Existing Item Bank: Determine specifications

Implementing CAT: Publishing a CAT on a software

ExametrikaR package ltm

Moodle UCATConcerto

SimulCATR package catR

Page 11: A framework and approaches to develop an in-house CAT with freeware and open sources

Exametrika

Page 12: A framework and approaches to develop an in-house CAT with freeware and open sources

The three stages of CAT development

Pretesting & Item Analysis: Construction of Item Bank

Simulating CAT with Existing Item Bank: Determine specifications

Implementing CAT: Publishing a CAT on a software

ExametrikaR package ltm

Moodle UCATConcerto

SimulCATR package catR

Page 13: A framework and approaches to develop an in-house CAT with freeware and open sources

R package: ltm

• ltm: Latent Trait Models under IRT– Dimitris Rizopoulos  

• This R package provides a flexible framework for IRT analyses for dichotomous and polytomous data under a Marginal Maximum Likelihood approach. The fitting algorithms provide valid inferences under Missing At Random missing data mechanisms.http://rwiki.sciviews.org/doku.php?id=packages:cran:ltm

• ltm: An R Package for Latent Variable Modeling and Item Response Theory Analyses. 2006, Journal of Statistical Software, 17(5), 1-25. http://www.jstatsoft.org/v17/i05/

Page 14: A framework and approaches to develop an in-house CAT with freeware and open sources

ltm: Available Features• Descriptives:

– samples proportions, missing values information, biserial correlation of items with total score, pairwise associations between items, Cronbach’s α, unidimensionality check using modified parallel analysis, nonparametric correlation coefficient, plotting.

• Dichotomous data: – Rasch Model, Two Parameter Logistic Model, Birnbaum’s

Three Parameter Model, and Latent Trait Model up to two latent variables (allowing also for nonlinear terms between the latent traits).

Page 15: A framework and approaches to develop an in-house CAT with freeware and open sources

ltm: Available Features• Test Equating:

– Alternate Form Equating (where common and unique items are analyzed simultaneously) and Across Sample Equating (where different sets of unique items are analyzed separately based on previously calibrated anchor items).

• Plotting: – Item Characteristic Curves, Item Information Curves, Test

Information Functions, Standard Error of Measurement, Standardized Loadings Scatterplot (for the two-factor latent trait model), Item Operation Characteristic Curves (for ordinal polytomous data), Item Person Maps.

Page 16: A framework and approaches to develop an in-house CAT with freeware and open sources

ltm: Available Features• Polytomous data:

– Graded Response Model and Generalized Partial Credit Model.

• Goodness-of-Fit: – Bootstrap Pearson χ2 for Rasch and Generalized Partial

Credit models, fit on the two- and three-way margins for all models, likelihood ratio tests between nested models (including AIC and BIC criteria values), and item- and person-fit statistics.

• Factor Scoring: – Empirical Bayes (i.e., posterior modes), Expected a Posteriori

(i.e., posterior means), Multiple Imputed Empirical Bayes, and Component Scores for dichotomous data.

Page 17: A framework and approaches to develop an in-house CAT with freeware and open sources

ltm:examples

Page 18: A framework and approaches to develop an in-house CAT with freeware and open sources

The outline of the symposium

Pretesting & Item Analysis: Construction of Item Bank

Simulating CAT with Existing Item Bank: Determine specifications

Implementing CAT: Publishing a CAT on a software

ExametrikaR package ltm

Moodle UCATConcerto

SimulCATR package catR

Page 19: A framework and approaches to develop an in-house CAT with freeware and open sources

SimulCAT

Page 20: A framework and approaches to develop an in-house CAT with freeware and open sources

The outline of the symposium

Pretesting & Item Analysis: Construction of Item Bank

Simulating CAT with Existing Item Bank: Determine specifications

Implementing CAT: Publishing a CAT on a software

ExametrikaR package ltm

Moodle UCATConcerto

SimulCATR package catR

Page 21: A framework and approaches to develop an in-house CAT with freeware and open sources

R package: catR

• catR : Latent Trait Models under IRT– David Magis & Gilles Raîche

• This R package catR was developed to perform adaptive testing with as much flexibility as possible, in an attempt to provide a developmental and testing platform to the interested user.

• Random Generation of Response Patterns under Computerized Adaptive Testing with the R Package catR. Journal of Statistical Software, 48(8), 1-31. http://www.jstatsoft.org/v48/i08/.

Page 22: A framework and approaches to develop an in-house CAT with freeware and open sources

catR: Available Features• The item bank can be provided by the user previously

calibrated according to the 4PL model or any simpler logistic model, or randomly generated from parent distributions of item parameters.

• The package proposes– several methods to select the early test items, several methods

for next item selection– different estimators of ability (maximum likelihood, Bayes modal,

expected a posteriori, weighted likelihood), – three stopping rules (based on the test length, the precision of

ability estimates or the classification of the examinee).

• The output can be graphically displayed.

Page 23: A framework and approaches to develop an in-house CAT with freeware and open sources

catR:example

Page 24: A framework and approaches to develop an in-house CAT with freeware and open sources

The outline of the symposium

Pretesting & Item Analysis: Construction of Item Bank

Simulating CAT with Existing Item Bank: Determine specifications

Implementing CAT: Publishing a CAT on a software

ExametrikaR package ltm

Moodle UCATConcerto

SimulCATR package catR

Page 25: A framework and approaches to develop an in-house CAT with freeware and open sources

Moodle UCAT

UCAT: Rasch-based CAT program written in BASIC (Linacre, 1987)

http://www.rasch.org/memo69.pdf

Moodle UCAT: converted into PHP so that CATs can be administered on a major open source LMS, Moodle

(Kimura, Ohnishi & Nagaoka, 2012)

Page 26: A framework and approaches to develop an in-house CAT with freeware and open sources

26

Development StatusCAT setting window

• Ending conditions• Logit to unit conversion• Logit bias

CAT administration window• Set item difficulty individually or category by category• Set student’s ability individually or as a whole

Administer CAT and provide result individuallyRetrieve CAT processes and results

Recalibration of item difficulty & estimate ability

Unit = Logit×10 + 100

Moodle UCAT beta ver.

Under Development for Ver.1 to be released in late August 2012

Page 27: A framework and approaches to develop an in-house CAT with freeware and open sources

CAT Algorithm: Initial Ability Estimation

27

UCAT Moodle UCAT

Lower Limit (LL) =

  AVG(D) - (0.5+0.5*RND)

Upper Limit (UL) = LL + 1

      

B0 = AVG(D) - 0.5*RND AVG(D): average item difficulty RND: random value between 0 & 1 B0 : initial ability

Assign each student’s initial ability in the CAT administration window based on other test results or intelligently one by one, or as a whole.

Page 28: A framework and approaches to develop an in-house CAT with freeware and open sources

CAT Algorithm: Ability (B) Estimation

28

UCAT / Moodle UCAT

the number of successes

probability of success of a student of ability Bm on the i-th dministered item of difficulty Di

)(

)(

1

11

1

)1(

DiBm

DiBm

mi

m

imimi

m

imim

mm

e

ep

PP

PRBB

:mR

:miP

Page 29: A framework and approaches to develop an in-house CAT with freeware and open sources

CAT Algorithm: Standard Error (SE) Estimation

UCAT / Moodle UCAT

m

imimi

m

PPSE

1

1

)1(

1

Page 30: A framework and approaches to develop an in-house CAT with freeware and open sources

CAT Algorithm: Item Selection

30

UCAT / Moodle UCATNext item will be selected randomly between LL and UL

score when he next (m-th) answer will be wrong

If no item found between LL & UL , use the closest.

m

imimi

m

imimi

m

imim

m

PpLLUL

Pp

PRBLL

1

1

11

)1(

1

)1(

:1mR

Ability estimate when the next answer will be wrong

Ability estimate when the next answer will be correct

Page 31: A framework and approaches to develop an in-house CAT with freeware and open sources

CAT Algorithm: Ending ConditionUCAT / Moodle UCAT

Prescribed number of itemPrescribed SEBoth number of item and SEAll item

Page 32: A framework and approaches to develop an in-house CAT with freeware and open sources

CAT Algorithm: Item Selection (logit bias)

32

Moodle UCATLL and UL can be adjusted by adding logit value to the Logit bias box in the CAT setting window

BiasULULBiased

BiasLLLLBiased

_

_

Positve logit value decrease the chance of answer correct

Negative logit value increase the chance of answer correct

Page 33: A framework and approaches to develop an in-house CAT with freeware and open sources

Moodle UCAT demo

Page 34: A framework and approaches to develop an in-house CAT with freeware and open sources

The outline of the symposium

Pretesting & Item Analysis: Construction of Item Bank

Simulating CAT with Existing Item Bank: Determine specifications

Implementing CAT: Publishing a CAT on a software

ExametrikaR package ltm

Moodle UCATConcerto

SimulCATR package catR

Page 35: A framework and approaches to develop an in-house CAT with freeware and open sources

Concerto

Page 36: A framework and approaches to develop an in-house CAT with freeware and open sources

Questions & Answers

• Tetsuo Kimura (Niigata Seiryo University)tetsuo.kmr<AT>gmail.com

• Kyung (Chris) T. Han (Graduate Management Admission Council)

khan<AT>gmac.com• Michal Kosinski (University of Cambridge)

mk583<AT>cam.ac.uk• Kojiro Shojima (The National Center for University

Entrance Examinations in Japan)shojima<AT>rd.dnc.ac.jp