1
Helena Dedic 1,2 , Steven Rosenfield 1,2 & Nathaniel Lasry 2,3,4 Results Let Y tl (l = 1, 2, 3, 4) be observed variables, where each variable is a response to the l-th FCI question at time t (t = 0, 1). FCI responses seen as a vector Y that has components Y tl . Five possible responses to each question, so particular values of Y tl can be y tl = 1, 2, 3, 4, 5, given by a student on a pre‑test (t = 0) and on a post‑test, (t = 1) and may also depend on another variable, Z, the respondent’s institution. The model assumes that responses depend on a single discrete latent (i.e., unobserved) underlying variable X t that can take on different values, x t (x t = 1, 2, ..., C). Using the notation Y = y to refer to an arbitrary response pattern, we denote the conditional probability of observing y when Z = z as P(Y = y | Z = z). The fundamental idea of this method is that P(Y = y | Z = z) is equal to the weighted average of the conditional probabilities of observed responses y from a member of class x, P(Y = y | X = x), with P(X 0 = x 0 | Z = z) and P(X 1 = x 1 |(X 0 = x 0 ), (Z = z)) acting as weights. Assuming that observed responses are independent of each other within each class and when they were measured, we obtain 2 Latent Markov Chain: Main Idea Method SAMPLE Students (N = 2275) enrolled in introductory mechanics courses in three distinct institutions (NU1 = 213; NU2 = 1560; and NU3 = 502) were given the FCI at the beginning and at the end of the semester. DESIGN • Four FCI questions studied using Markov Chain Modeling: q04, involves a collision between a truck and a car, and is labelled ‘collision’; q15 involves a car pushing a truck while speeding up, and is labelled ‘speeding upq16 involves a car pushing a truck at cruising speed, and is labelled ‘cruisingq28 involves a student A pushing a student B, and is labelled ‘students’. ASSUMPTIONS • When a response is selected, one of the five following schemas is activated : N (Newtonian), D1 (dominance), D2 (dominance), PO (dominance) and NF (NetF). • Student responses to the four questions are not independent of each other. Student responses to test items are not independent of each other. Example: if we were to survey subjects about the variety of foods they prefer, the pattern of responses might depend on an underlying latent (i.e., unobserved) variable such as socio-economic status. The pattern of responses (food preferences) may also depend on the age of respondents. In addition, food preferences may also evolve with time and hence responses also depend on time t . Latent Markov Chains assume that responses depend on a single discrete latent (i.e., unobserved) underlying variable X t which describe response patterns of distinct groups of respondents. The object of the analysis is to find these groups. In our example, analysis might reveal Background According to di Sessa and Redish, students do not recall a formulated theory, but rather formulate a theory on the spot from p‑prims or reasoning primitives. These are statements that are neither incorrect nor correct (e.g., closer implies stronger’). Reasoning primitives are resources that are activated when thinking about a problem. If a set of resources is frequently activated together, or as a sequence, or in certain contexts, then the probability of activation increases. When the probability of activation of a set/sequence of resources is large, then the set/sequence becomes a cognitive unit called a schema. HYPOTHESIS : the probability of selecting a particular FCI response depends on the probability of activation of a schema that leads to this response. Latent Markov Chain: The Math A Story of Hierarchy •Students using PO schema are in classes at the “bottom” of the hierarchy (C6 and C7). Students with consistent use of the ‘dominance’ schema, D 1 or D 2 are in latent class C5. Members of class C5 are likely use N more often after instruction (ie move up in the hierarchy). Some C5 students are possibly using NF schema like C2 and C4 (a combined probability of 0.39 of ending up in C2 or C4). Use of D 1 or D 2 declines above C4 with increased frequency for using Newton’ Laws. •While use of NF schema appears in higher classes (C2, C4), it is still hard for students to get out of C2. •Students who consistently use Newtonian schema on the pretest have 11% probability to move in C2. Some Wrong Answers Are More Right Than Others ! •Hierarchy of schemas for most to least wrong: PO D1 D2 NF NOT ladder through each; CAN JUMP OVER certain schema types on the

Are All Wrong FCI Answers Equivalent?

  • Upload
    rendor

  • View
    34

  • Download
    0

Embed Size (px)

DESCRIPTION

2. Latent Markov Chain: The Math. Background - PowerPoint PPT Presentation

Citation preview

Page 1: Are All Wrong FCI Answers Equivalent?

Helena Dedic1,2, Steven Rosenfield1,2 & Nathaniel Lasry2,3,4

Results

Let Ytl (l = 1, 2, 3, 4) be observed variables, where each variable is a response to the l-th FCI question at time t (t = 0, 1). FCI responses seen as a vector Y that has components Ytl. Five possible responses to each question, so particular values of Ytl can be ytl = 1, 2, 3, 4, 5, given by a student on a pre‑test (t = 0) and on a post‑test, (t = 1) and may also depend on another variable, Z, the respondent’s institution. The model assumes that responses depend on a single discrete latent (i.e., unobserved) underlying variable Xt that can take on different values, xt (xt = 1, 2, ..., C). Using the notation Y = y to refer to an arbitrary response pattern, we denote the conditional probability of observing y when Z = z as P(Y = y | Z = z). The fundamental idea of this method is that P(Y = y | Z = z) is equal to the weighted average of the conditional probabilities of observed responses y from a member of class x, P(Y = y | X = x), with P(X0 = x0 | Z = z) and P(X1 = x1 | (X0 = x0), (Z = z)) acting as weights.

Assuming that observed responses are independent of each other within each class and when they were measured, we obtain

2

Latent Markov Chain: Main Idea

MethodSAMPLEStudents (N = 2275) enrolled in introductory mechanics courses in three distinct institutions (NU1 = 213; NU2 = 1560; and NU3 = 502) were given the FCI at the beginning and at the end of the semester.

DESIGN• Four FCI questions studied using Markov Chain Modeling: •q04, involves a collision between a truck and a car, and is labelled ‘collision’; •q15 involves a car pushing a truck while speeding up, and is labelled ‘speeding up’•q16 involves a car pushing a truck at cruising speed, and is labelled ‘cruising’ •q28 involves a student A pushing a student B, and is labelled ‘students’.

ASSUMPTIONS• When a response is selected, one of the five following schemas is activated :N (Newtonian), D1 (dominance), D2 (dominance), PO (dominance) and NF (NetF).• Student responses to the four questions are not independent of each other.

Student responses to test items are not independent of each other. Example: if we were to survey subjects about the variety of foods they prefer, the pattern of responses might depend on an underlying latent (i.e., unobserved) variable such as socio-economic status. The pattern of responses (food preferences) may also depend on the age of respondents. In addition, food preferences may also evolve with time and hence responses also depend on time t . Latent Markov Chains assume that responses depend on a single discrete latent (i.e., unobserved) underlying variable Xt which describe response patterns of distinct groups of respondents. The object of the analysis is to find these groups. In our example, analysis might reveal that there are three groups of respondents, those who prefer: expensive exotic foods (e.g., Kobe beef); home‑cooked affordable food; inexpensive prepared foods. In this case, Xt has three values. This means, ONE latent variable but THREE classes

BackgroundAccording to di Sessa and Redish, students do not recall a formulated theory, but rather formulate a theory on the spot from p‑prims or reasoning primitives. These are statements that are neither incorrect nor correct (e.g., ‘closer implies stronger’). Reasoning primitives are resources that are activated when thinking about a problem. If a set of resources is frequently activated together, or as a sequence, or in certain contexts, then the probability of activation increases. When the probability of activation of a set/sequence of resources is large, then the set/sequence becomes a cognitive unit called a schema.

HYPOTHESIS: the probability of selecting a particular FCI response depends on the probability of activation of a schema that leads to this response.

Latent Markov Chain: The Math

A Story of Hierarchy•Students using PO schema are in classes at the “bottom” of the hierarchy (C6 and C7).•Students with consistent use of the ‘dominance’ schema, D1 or D2 are in latent class C5. Members of class C5 are likely use N more often after instruction (ie move up in the hierarchy). Some C5 students are possibly using NF schema like C2 and C4 (a combined probability of 0.39 of ending up in C2 or C4).•Use of D1 or D2 declines above C4 with increased frequency for using Newton’ Laws.•While use of NF schema appears in higher classes (C2, C4), it is still hard for students to get out of C2.•Students who consistently use Newtonian schema on the pretest have 11% probability to move in C2.

Some Wrong Answers Are More Right Than Others !•Hierarchy of schemas for most to least wrong: PO D1 D2 NF •NOT ladder through each; CAN JUMP OVER certain schema types on the way to Newtonian thinking!