143
Page | 1 INDEX Lesson No. Topic Page No Background: Medical educational Research 1 Objectives of the workshop 3 Contributors and Acknowledgements 5 1 Introduction to Educational Research 6 2 Educational Techniques: Trends, Utility & Effectiveness 18 3 Research Methodology: Outline of Qualitative, Quantitative & Mixed Research Designs & Methods 23 4 Quantitative Methods: Data Collection, Questionnaire Preparation 46 5 Qualitative & Mixed methods in Educational Research 70 6 Focus Group Discussions: Participatory and Non- Participatory Techniques of Qualitative Data Collection in Medical Education 78 7 Descriptive & Inferential Statistics 83 8 Areas of Research in Medical Education & Ethics 123 9 Qualitative Techniques and Computer aided Analysis 127 10 Research Project Proposals Prepared by Participants 133 List of Participants 140 Teaching & Learning Methods for the Revised MBBS Curriculum 142

Workshop Report Final

Embed Size (px)

Citation preview

Page 1: Workshop Report Final

Page | 1

INDEX

Lesson No.

Topic Page No

Background: Medical educational Research 1

Objectives of the workshop 3

Contributors and Acknowledgements 5

1 Introduction to Educational Research

6

2 Educational Techniques: Trends, Utility & Effectiveness

18

3 Research Methodology: Outline of Qualitative, Quantitative & Mixed Research Designs & Methods

23

4 Quantitative Methods: Data Collection, Questionnaire Preparation

46

5 Qualitative & Mixed methods in Educational Research

70

6 Focus Group Discussions: Participatory and Non-Participatory Techniques of Qualitative Data Collection in Medical Education

78

7 Descriptive & Inferential Statistics

83

8 Areas of Research in Medical Education & Ethics

123

9 Qualitative Techniques and Computer aided Analysis

127

10 Research Project Proposals Prepared by Participants

133

List of Participants 140

Teaching & Learning Methods for the Revised MBBS Curriculum

142

Page 2: Workshop Report Final

Page | 2

Report Workshop

Medical Educational Research: Concepts and Methodologies

Background: The basic issue is not whether our students and our medical colleges are

better than those of a generation ago, but whether the quality of today‟s

education is sufficient to meet tomorrow‟s demands, which will be infinitely

more complex than those of the past or the present. In other words, it is not

a matter of whether, through research, we can prove that our medical

institutions are better, but whether, through research and implementation,

we can make them good enough.

Medical Education has been conventionally taught and learnt in an

inductive way and has been considered as difficult to both impart and

imbibe. Already, there have been some major breakthroughs in education

towards accommodation of individual differences, improved learning theory

and practice, better tests and measurements, more effective counselling and

guidance, use of new media, team teaching, evaluation of teacher

effectiveness, and in other areas. Still, many areas have gone with out

investigation and many “good” teachers have been so busy with the twin

problems rising from combined pressures of the explosion of knowledge and

rapid increases in enrolment of medical students.

Basically we need to know a great more deal about how people learn. If we

are to attract and prepare the best possible teachers, we must learn what

kind of person makes a good teacher, what his motivations, attitudes, and

values are likely to be. Stated in more general terms, we need to know how

to maintain quality and enhance it, how to guarantee that the educational

programmes of the future will stimulate the fullest development of every

individual in spite of the raising costs of providing for longer periods of

duration.

At present we have laboratories, Classrooms, and teachers and patients to

initiate research in education. The addition of teaching machines

(computers), multimedia and television, self instructional material etc lead

to experiment the teaching and learning procedures ultimately to find out

Page 3: Workshop Report Final

Page | 3

the most effective processes. This means we move rapidly and judiciously in

trying out new technologies and constantly evaluating them. By persistent

refinement of the process and tools, we should consistently improve the

product. This calls for research, not guess work.

The total research effort requires adequate planning, prompts

communication and should cover all levels of educational curriculum. The

present medical research has the following setbacks.

1 It has focused too much on what happens to cohort of students and

less on the individual cognitive, emotional and attitudinal changes that

occur during the course and how these affect learning.

2 There were no attempts to be systematic in the efforts to rate the two

styles of learning. In this golden age of evidence-based medicine, where were

the calls for active comparator trials of teaching methods? There are no

trials of the use of placebo teaching, let alone "sham" lectures. Could not the

lecturers/tutors be blinded? What about random allocation of academic low

performers to different teaching methods? As to long-term follow-up, has

anyone given consideration to the comparison of patient satisfaction when

treated by doctors trained by the different styles of teaching?

3 There is no evidence available to either refute or support the major

curricular reforms embarked on by Many Indian Medical Institutions

4 The suggested curricular reform should be carefully researched and

evaluated. A major barrier to this is lack of funding available for such

research despite the possible „consequences for the future of our profession

and our patients'.

Research efforts are of little value unless the educational enterprise is able

to put the findings into practice. Many Medical Institutions in Maharashtra

are devoid of sufficient resources for current programmes and are unable to

implement the recommendations

Page 4: Workshop Report Final

Page | 4

Topic Objectives

Introduction to Educational Research

1. Sensitize the issue on Medical educational Research 2. Understand the types of Research 3. Delineate the necessity of Documenting the

evidence on effectiveness of techniques

Educational Techniques: trends, Utility & Effectiveness

1. Describe the various techniques of Teaching and learning

2. Understand the limitations of each method 3. Outline the basic learning concepts in adults

Tea Break

Research Methodology: outline of Qualitative & quantitative and Mixed Research Designs & Methods

1. Describe the educational research proposal preparation

2. Enumerate the General research designs adopted for qualitative and quantitative data.

3. Able to outline the Experimental research 4. Understand the importance of quasi-experimental

and single-case designs in educational research

Quantitative Methods: Data Collection, Questionnaire preparation

1. Describe the various categories of data(Nominal, ordinal, interval & Ratio)

2. Understand the sampling procedures used in educational research.

3. Devise a questionnaire for data collection and data quality checks

Lunch Break

Qualitative & Mixed Methods in Education:

1. Understand the types of qualitative research (phenomenology, ethnography, grounded theory and case study)

2. Describe the various qualities (SWOT) of qualitative research

3. Understand the data collection tools and techniques for qualitative data(questionnaire, Interview, Focus group, Observation ).

4. Devise a suitable tool for educational data collection

Tea Break

Focus Group Discussions and Non participatory and Participatory techniques of qualitative data collection in education

1. Describe the merits and demerits of Focus group discussions

2. Understand the method of conducting Focus group and small group Discussions.

3. Understand the method of Participatory Learning Appraisal(PLA)

4. Describe the concepts of non participatory educational techniques like Objective structured examinations/assessments

Descriptive & Inferential Statistics:

1. Understand the concepts in Frequency distribution, Measures of central tendency, measures of variability.

2. Describe the various sampling distributions and Procedures used in educational research ( purposive, opportunistic, critical case)

3. Able to conduct Hypothesis testing. 4. Understand the concept of t-test, Analysis of

Variance and Chi-square tests.

Page 5: Workshop Report Final

Page | 5

Areas of Research in Medical Education & Methods

1. Understand the areas in Education and Learning requiring research

2. Delineate the Priority areas of educational research in India

3. Describe the Ethical problems involved in Educational Research

Analysis of qualitative data: Use of computers & Software

1. Familiar with the basic operational and data handling techniques with SPSS software

2. Understand the techniques used in qualitative data analysis software

3. Able to analyse data collected through interviews and focus group discussions

Tea Break

Group Work: Preparation of Research Projects in 1)Innovative Teaching techniques 2) Academic Assessment 3) Impact evaluation 4) Economic assessment

Dr Amol Dongre Dr S P Rao Dr Pradeep Borle Dr J V Dixit

Lunch Break

Presentation and Discussion of Research Projects

All Faculty

Tea Break

POST TEST and Concluding Session

Page 6: Workshop Report Final

Page | 6

Contributors:

1 Amol Dongre M. G. Institute of Medical Sciences, Sevagram

2 Jagannath Dixit Government Medical College, Aurangabad

3 Pradeep Borle BJ Medical College, Pune

4 Surya Prakasa Rao SBH Government Medical College, Dhule

The authors wish to acknowledge the following experts for their invaluable support

Dr Hemant Apte, Anthropological Society on India

Dr Payal Bansal, Maharashtra University of Health Sciences

Dr Avinash Supe, KEM Medical College

Dr Biranjan JR, Government Medical College, Dhule

Dr Patil , ACPM medical College, Dhule

Dr Haribhai Patel, Ahmedabad

The Authors acknowledge the generous financial support rendered by the National

Academy of Medical Sciences, New Delhi

Page 7: Workshop Report Final

Page | 7

Lesson I: Introduction to Educational Research

Objectives:

1 Sensitize the issue on Medical educational Research

2 Understand the types of Educational Research 3 Delineate the necessity of Documenting the evidence on effectiveness of

techniques

Lesson Outline:

1 Existing Medical Colleges and The Role of Medical Educational Units

2 Scientific Methods: Inductive and Deductive. Qualitative, Quantitative & Mixed Research Designs. Why Educational Research

in Medicine is not taking off. Why to study educational Research: Newer methods of teaching & learning; newer methods of Evaluation/Assessment. Research Wheel

3 Objectives of Educational Research. Enumerate the Various learning

techniques (Adult Learning, Student Autonomy, Computer Assisted

Learning, Web based Learning) and the assessment methods ( OSCE, OSLER, Mini CEX, Case Based Discussion, Portfolio, Mini Source

Feedback, 360 degrees and client satisfaction

By separating teaching from learning, we have teachers who do not listen and students who do not talk"

Based on Palmer P (The Courage to Teach. Jossey Bass, 1998) Remembering the famous Rudyard Kipling‟s five brave men, in medical

education, the Curriculum, Content conveniently classified as Must Know

and Desirable fall into the category of What to teach. In the When to teach

category, identify the subjects to be taught in the pre, para and clinical

years of medical education. In the traditional compartmental approach,

subjects like Anatomy, Physiology and Biochemistry are imparted in the pre

clinical years; Pathology, Microbiology, Pharmacology and Forensic Medicine

in the Para clinical years and the Medicine, Paediatrics, Surgery,

Orthopaedics, Obstetrics & Gynaecology, ENT, Ophthalmology and

Community Medicine in the clinical years. The teaching of the above

subjects will be at the hospital, Out Patient department/ In patient

Page 8: Workshop Report Final

Page | 8

department, Class rooms, practical halls and in the community form the

Where to teach category. In the How to teach category, there are various

methods of teaching and learning including the Lecture Discussion,

practicals, tutorials, seminars, Problem Based Learning, Small group

teaching, Projects, puzzles and case based learning etc.

The 300 odd medical schools in India are fortunately equipped with medical

education Units (MEU). Thanks to the untiring efforts of Medical Council of

India and their inspectors. Mere existence of these units would not be able

to usher in required sea changes in medical education. Because most

medical schools in India, currently are experiencing difficulties in providing

the right quality and quantity of educational experiences as the curricula

have failed to respond to the needs of the community and country. The

pedagogic shift from traditional approach to a need-based approach requires

a fundamental change of the roles and commitments of educators, planners

and policymakers. Teachers of health professional education are to be well-

informed of the trends and innovations and utilize these to increase

relevance and quality of education to produce competent human resources.

The main functions of Medical Educational unit are as follows

1. The MEU should create a culture of educational research. 2. It should keep the faculty aware of the ongoing research in the field

3. It should generate publications and resources in medical education 4. It should identify and facilitate the teaching learning needs of the

students 5. It should provide instructional design 6. It should focus on newer learning technologies such as simulation and

e-learning 7. It should develop guidelines for student evaluation and curriculum

development

8. It should emphasize Faculty Development

In its document on graduate regulations Medical Council of India

emphasized that Medical Education Units/ Departments be established in

all medical colleges for faculty development and providing learning resource

material to teachers. "The Edinburgh Declaration" of World Federation for

Medical Education (WFME) and "Tomorrow's Doctors" of General Medical

Page 9: Workshop Report Final

Page | 9

Council (GMC) of UK outlined a number of specific strategies to guide

reforms and bring need-based changes in medical education. The Edinburgh

Declaration, now translated into all major languages, has been very widely

adopted as basis for reform of medical education. Most of the medical

schools in India have traditional, teacher-centred and hospital-based

training with a few exceptions only. Educational innovations and

experiments are not quite evident in India.

Newer methods of Learning and teaching are being introduced in the

west and the Asian countries are eager to globalise these methods. These

include PBL, Student centred teaching, Modular teaching.

………………….etc. Changing learning styles such as application of adult

learning principles, student autonomy, self learning, experimental

learning, reflective learning, computer assisted learning, distance

learning, e- and web based learning, use of skill learning laboratories

Innovative curriculum models such as problem based curriculum,

integrated curriculum, competency based curriculum and hybrid

curriculum

New evaluation methods such as Objective structured long examination

record (OSLAR), Objective structured clinical examination (OSCE) &

Objective structured practical examination (OSPE), Case evaluation

exercise (Mini CEX), Case based discussion (CbD), Portfolio, Multi source

feedback, 360 degrees, Videoing consultation, Patient satisfaction

questionnaire.

Even the assessment methods are revolutionised. The age old systems are

being replaced by objective assessments leading to transparency and

improving the quantity in terms of passed out graduates. However, all these

techniques need rigorous scrutiny and it is necessary to provide evidence of

their superior utility over the traditional methods. Unravelling the truths

about adult learning lead to more specific insights into the student learning.

Medical colleges as basic Educational Institutions should be able to cater

the following basic functions.

1 Administration

Page 10: Workshop Report Final

Page | 10

2 Clinical Care

3 Research and

4 Education

The function of Education is the foremost and should dominate all other

functions. In practice, medical colleges are obsessed and preoccupied with

functions pertaining to administration and clinical care. The function of

research has taken a back seat. Education reforms and novel educational

techniques are never attempted to be implemented.

Why Research in Medical education?

To this day, tradition and intuition continue to be the prime guiding

principles of education. However, just like medicine, education should be

grounded as much as possible in the best evidence we can find. We

acknowledge the parallel between evidence-based medicine and the

importance of best evidence in medical education. Education should use

research methods that are geared to the idiosyncrasies of the domain of

education and, unlike much of medical research; the education research

neither can nor should always use controlled experimentation as the method

of preference. Such research in education can use both quantitative and

qualitative research methods and combinations thereof.

In recent years, political systems, epidemiological and demographic

patterns, micro-economic strategies, technology, and health care systems

have undergone profound changes. To cope with these changes, educational

institutions around the world have been increasingly confronted with the

challenge of making their curricula more meaningful and relevant to the

needs of the time to produce doctors oriented to the real needs of the

community. Many authorities highlighted the need for reorientation of

medical education and suggested strategies for direction of such changes.

In summary, the reasons for conducting research in medical education are

as follows:

• Become Research Literate • Medicine is a high stake education • Education in age of accountability

Page 11: Workshop Report Final

Page | 11

• Evidence based approach to education • Improve critical thinking skills

• Read & Critically evaluate published research • Research in medical education is feasible

• Learn to design and conduct research

However, in India, medical education research has not taken off. The apathy

towards this area can be mainly due to the paucity of funds & lack of

resources like absence of Teaching Development Grants, long career in

medical education (almost 10 years to become eligible to teach), not so

attractive option for the clinical scientists ( at present there are only handful

of academicians engaged in educational research), Ivory Tower approach

(feeling and following the age old traditional teaching and learning methods

as the best), Lack of awareness/ apathy towards learning & education

among teaching faculty, medical education research not so attractive for

both basic or clinical scientists and low impact factor for the journals in

medical education.

Impact Factor: It is, devised by Eugene Garfield, a measure of the citations

to science and social science journals. It is frequently used as a proxy for

the relative importance of a journal within its field. Impact factors are

calculated each year for those journals which it indexes, and the factors and

indices are published in the Journal Citation Reports. The impact factors for

various medical journals calculated for 2007 are as follows.

• Medical Education 2.1 • Academic Medicine 1.9

• Medical Teacher .8 • BMJ 7.0 • Lancet 22.0

There are now 18 International Medical Education Journals and the three

major journals in medical education are Medical Education, Academic

Medicine and Medical Teacher.

Is it true that a physician must be a good teacher?

Yes. Doctors have to teach their patients how to get well. They have a

responsibility to teach and educate the members of community how to stay

well. They have additional burden to teach their colleagues all that they

Page 12: Workshop Report Final

Page | 12

learnt. And if they choose to become medical teacher, they have the

responsibility for medical students.

Although we would never allow a patient to be treated by an untrained

doctor or nurse, we often tolerate professional training being delivered by

untrained teachers. Traditionally students were expected to absorb most of

their medical education by attending timetabled lectures and ward-rounds,

moving rapidly from one subject to the next in a crowded curriculum. Our

junior doctors learnt by watching their seniors in between endless menial

tasks. In recent years the importance of active, self directed learning in

higher education has been recognised. Outcome led structured programmes

for trainees are being developed in the face of reduced working hours for

both the learners and teachers. These all constitute new challenges for

teachers in medicine of all levels of seniority.

Throughout the world there is great interest in developing a set of

qualifications for medical teachers, both at the elementary “teaching the

teacher” level and as part of progressive modular programmes leading to

formal certification. In addition to acquiring new qualifications and

standards, teachers also need access to literature resources that describe

essential components in medical education and supply tips and ideas for

teaching.

What is expected from MBBS doctor now?

Medical Council of India expects that Graduate students to undertake the

responsibilities of a physician of first contact who is capable of looking

after the preventive, promotive, curative & rehabilitative aspect of

medicine. In addition to clinical competencies, students must develop

generic competencies or transferable personal skills essential to their roles

as health professionals, which include bio-ethics and communication skills,

interpersonal skills, problem-solving ability, decision-making capability,

management and organization skills, working in team, IT skills and doctor-

patient relationship.

Why to change the present system of teaching and learning?

The trends in present day medical education are as follows

1 Education for Capability

Page 13: Workshop Report Final

Page | 13

2 Community Oriented Medical Education

3 Self directed/ Learner centered Learning

4 Problem Based Learning (PBL) and Task Based Learning (TBL)

5 Integration and Early Clinical Contact

6 Continuing Professional Development

7 Unity Between Education and Practice

8 Evidence Based Medical Education/Best Evidence Medical

Education(BEME)

9 Communication and Information Technology

What is the situation of Medical Education Research in India?

More than a dozen peer reviewed journals are available for research in

medical education. However, Indian authors‟ contribution to these journals

is miniscule. Hence it is necessary to inculcate the methodology of

educational research among the faculty of medical educators and promote

the evidence based teaching and learning methods. Until now medical

teachers are confined to the domain of clinical care and devoted less

importance to medical education research

Unfortunately, it has been reported that the majority of published studies

and dissertations on medical education are seriously flawed, containing

analytical and interpretational errors. Some of these flaws have arisen from

ill conceived statistical concepts, inappropriate research methodology both

qualitative and quantitative, deep rooted beliefs of various erroneous

"mythologies" about the nature of research and from a failure,

unwillingness, or even refusal to recognize that analytical and

interpretational techniques that were popular in previous decades no longer

reflect best practices and, moreover, may now be deemed inappropriate,

invalid, or obsolete.

The present understanding in medical education is such that the

standard/traditional methods on instruction/ teaching are ineffective. It has

been emphatically proved that learning among medical students is passive

rather than active. There is now enough evidence to prove that Traditional

Methods do not stimulate critical thinking, Creative thinking and

Page 14: Workshop Report Final

Page | 14

Collaborative Problem Solving. The explorative knowledge in Cognitive

Psychology proved beyond doubt these facts. The emphasis should be on

Andragogy rather than Pedagogy. It is to be kept in mind that the present

teaching is in an environment of Internet expansion/explosion.

Government of India and Medical Council of India revamped the curriculum

for MBBS course in a draft circulated to all medical colleges in India during

2007. The draft has rightly incorporated the various newer teaching

methods and encouraged to use newer assessment methods. The teaching

methods suggested are as follows:

Teaching Methods: • Lectures • Structured interactive sessions

• Small group discussion a) Demonstrations. b) Tutorials.

c) Seminars. d) Problem Based Learning.

• Focus group discussion (FGD)

• Projects • Participatory learning appraisal (PLA) • Video clips

• Written case scenario • Self learning tools Interactive learning

• e-modules • Skills Labs • Preparation of scientific article

Assessment Methods:

• MCQ • SAQ • OSCE

• OSLER • MiniCEX • Case Based Discussion

• Multi Source Feedback 360 Degrees • Client Satisfaction

It has been observed that medical scientific research is obsessed with

Quantitative methods. The emphasis on Observatory & Explanatory

(Interpretive) studies is minimal. It is time now to adopt Research Approach

exploring Cultures & Subjectivities.

Objectives of Educational Research:

Page 15: Workshop Report Final

Page | 15

1. Exploration: This is done to generate ideas about something.

2. Description: This is done to describe the characteristics of something

or some phenomenon.

3. Explanation: This is done to show how and why (causality)a

phenomenon operates as it does.

4. Prediction: The advanced sciences make much more accurate

predictions than the newer social and behavioral sciences.

5. Influence: The application of research results to impact the world.

Although we would never allow a patient to be treated by an untrained

doctor or nurse, we often tolerate professional training being delivered by

untrained teachers. Traditionally students were expected to absorb most of

their medical education by attending timetabled lectures and ward-rounds,

moving rapidly from one subject to the next in a crowded curriculum. Our

junior doctors learnt by watching their seniors in between endless menial

tasks. In recent years the importance of active, self directed learning in

higher education has been recognised. Outcome led structured programmes

for trainees are being developed in the face of reduced working hours for

both the learners and teachers. These constitute present new challenges for

teachers in medicine of all levels of seniority.

Types of Research:

1 Basic & Applied Research: • Basic Research: generate fundamental knowledge & theoretical

understanding basic human and other natural processes

eg. Process of cognitive priming • Applied Research: Answer practical questions to provide immediate

solutions eg. Effectiveness of two approaches to counselling

BASIC…………………………MIXED………………………….APPLIED

2 Evaluation Research: • Formative: for programme improvement

• Summative: programme summary judgments & decision to continue It can be further subdivided into the following components

Page 16: Workshop Report Final

Page | 16

• Needs assessment, which ask this question: Is there a need for this type of program?

• Theory assessment, which asks this question: Is this program

conceptualized in a way that it should work? • Implementation assessment, which asks: Was this program

implemented properly and according to the program plan?

• Impact assessment, which asks: Did this program have an impact on its intended targets?

• Efficiency assessment, which asks: Is this program cost effective? 3 Action Research: It focuses on the solving practitioner‟s local

problem. Through action research, investigators will be able to constantly

observe students for patterns and think about ways to improve instruction,

classroom management etc.

4 Orientational Research: It is mainly based on the critical theory. The

focus is on some form of inequality, discrimination, or stratification in

society. Some areas in which inequality manifests itself are large differences

in income, wealth, access to high quality education, power, and occupation.

A good researcher‟s basic quality is the ability to reason. There are two kinds

of reasoning namely deductive and inductive.

Deductive reasoning (i.e., the process of drawing a specific conclusion from

a set of premises). In this approach of formal logic, a conclusion from

deductive reasoning will necessarily be true if the argument form is valid

and if the premises are true.

Inductive reasoning (i.e., reasoning from the particular to the general). The

conclusion from inductive reasoning is probabilistic. It is based on the

assumption that the future might not resemble the present.

Common Assumptions in Medical Education Research: • World out there that can be studied

• World is unique. Some of it is regular and predictable. But most of it is Dynamic and complex

• Researchers can examine/ study the unique, regular and complex world

• Researchers follow agreed norms/ practices

• It is possible to distinguish between good & poor research • Science can not provide answers to All questions.

Page 17: Workshop Report Final

Page | 17

Scientific Methods: There are many scientific methods. The two major

methods are the inductive method and the deductive method.

• The deductive method( Quantitative technique) involves the following

three steps:

1. State the hypothesis (based on theory or research literature).

2. Collect data to test the hypothesis.

3. Make decision to accept or reject the hypothesis.

• The inductive method(Qualitative technique) also involves three steps:

1. Observe the world.

2. Search for a pattern in what is observed.

3. Make a generalization about what is occurring.

Diagramatically, these two methods are represented as below.

Qualitative Vs Quantitative Design*: *= Based on Assumption that people have meaningful experiences that can be interpreted

Qualitative Research

Collect

More Data Tighter

Specification of Question

Page 18: Workshop Report Final

Page | 18

Quantitative Research: Based on Assumption that Random Events are

Predictable. Any application of science includes the use of both the

deductive and the inductive approaches to the scientific method either in a

single study or over time. The inductive method is as “bottom up” method

that is especially useful for generating theories and hypotheses; the

deductive method is a “top down” method that is especially useful for testing

theories and hypotheses. This is called “Research Wheel”.

Page 19: Workshop Report Final

Page | 19

Lesson II: Educational Techniques: trends, Utility & Effectiveness

Objectives:

1 Describe the various techniques of Teaching and learning 2 Understand the limitations of each method 3 Outline the basic learning concepts in adults

Lesson Outline:

1 The traditional methods of Teaching & Learning. Newer Methods of Learning: Self Directed Learning; Problem

Based Learning; AV Media; Role Play; Focussed Discussions; Group work. e-Modules; skill labs; Participatory Learning Appraisal (PLA);case Studies; Algorithms; workshops; Projects; seminars; Portfolio based; Virtual classrooms etc.

2 Eight Principles of Adult Learning: Characterstics of Adult Learners. 3 Techniques: Teacher oriented, Interactive and Independent Techniques of

teaching & Learning.

Learning: Learning is a process which results in a relatively permanent

change in the behaviour of the learner. This change can be in the way of

thinking, feeling or doing and is reflected in the acquisition of knowledge

and skills and the development of attitudes by the learner. Learning is an

outcome of one‟s interactive experience with the environment. Learning is an

active and continuous process

Theories of Learning:

Conditioning theories- these explain learning process in terms of stimuli and responses

Theories of connectionism- these explain learning in terms of the

formation and strengthening of bonds or neural connections between stimuli and responses (law of readiness, exercise, effect and

belongingness) Field theory Learning models- Ausbel‟s advance organizer model, inquiry training

model Levels of Learning:

Signal learning Stimulus response learning Chaining

Verbal chaining

Page 20: Workshop Report Final

Page | 20

Discrimination learning Concept learning

Rule learning Problem solving

Principles of Learning: Goal setting

Relevance of learning experience Motivation Personal nature of learning

Active involvement of learners Meaning orientation

Application of knowledge Realistic learning Facilitative instructional sequence

Feed back Teaching: It is a process which facilitates learning by encouraging learners

to think, feel and do. The learning experience results in the acquisition of

knowledge and skills and development of attitudes. The role of a Teacher

can be identified as a Manager, Communicator, Self-learner, Research

worker and a Role model

Adult Learning Principles: Adult learning is a process whereby persons,

whose major social roles are characteristic of adult status, undertake

systematic and sustained learning activities to acquire desirable changes in

knowledge, attitudes, values or skills. It involves a complex interaction

among psychological, personal, social and environmental factors that

influence how an adult participates and learns. When new facts, ideas or

concepts are presented to adults; They think dialectically and contextually;

look for embedded logic; apply working intelligence and common sense and,

form opinions and judgments based on their existing cognitive framework

Critical reflection is a characteristic of adult learning

How to incorporate adult Learning Principles: Training programmes should be relevant Adults need high levels of motivation

Adults need high levels of involvement Adults need a variety of experiences Adults have personal concerns

Adults need positive feedback Adults have plenty of past experiences

Adults have variable educational orientation A good lecturer is a text book plus personality…..

Page 21: Workshop Report Final

Page | 21

- Flexner But, all too often, the personality is missing and the lecture becomes

“a process by which information is transferred from the notes of the lecturer to the notes of the student without going through the minds

of either!” - Sir Joseph Bancroft

Exercise I

Enlist the merits and demerits of lecture as a method of teaching and

learning

Classification of teaching-learning experiences Control based-classification: This classification divides T-L methods according to the person(s)

controlling the activity. – Teacher-controlled T-L activities: Lecture, symposium, team

teaching, demonstration, bedside clinics, etc. – Learner-Controlled activities : Free-group discussion, project

work and self-learning methods like self-study, programmed

instruction etc. Group size based classification: This classification is more useful to a teacher to plan according to the

student strength. – Large group methods: Can take care of any number of students.

Example : Lecture, panel discussion, symposium. – Small group methods: Are useful for up to 30 learners.

Example: Group discussion, seminar, workshop, bedside

clinics, demonstration, field visit. – Individual T-L methods: Attend to one student only and permit

individual learning.

– Example : Counseling, project work, assignment, computer assisted learning and self-study.

“We all know that 50% of what we teach to our students is right. But no one knows which 50%.” Various Methods:

Symposium: It is a series of prepared talks given by a few experts (2 to 5) on many aspects of a topic or problem under a chair person

The talk should be short and to the point There is no discussion among speakers Audience is passive unless question/reaction time is allowed

Merits: Symposium: It is a series of prepared talks given by a few experts (2 to

5) on many aspects of a topic or problem under a chair person

The talk should be short and to the point There is no discussion among speakers

Audience is passive unless question/reaction time is allowed Demerits

Formal atmosphere

Page 22: Workshop Report Final

Page | 22

Passive audience

The Panel A group of four or more persons sit with a moderator in front of an

audience; they hold an orderly and logical conversation on an assigned topic

Each member makes an opening remark for 3 to 5 minutes before

exchanging ideas Each member has a special knowledge or holds a particular view of

the topic

Merits Identifies and explores a problem or issue from many angles

Audience can understand various aspects of the issue Frequent change of speaker and view point maintains attention and

interest of audience

Establishes informal contact with the audience Demerits

Panelists may not cover all aspects of the problem and may over emphasize only certain aspect

Skilled moderator is necessary to ensure logical and balanced

coverage by the panel Audience is passive unless some question time is permitted

Team Teaching It has evolved since late 50s‟ with the objectives of improving the

quality of teaching by utilizing better talents and skills of a team of teachers. The team may act in four styles:

– Relay style of team teaching

– Team teaching in the same period (like a symposium) – Ability based team teaching – Specialization based team teaching

Small Group Methods:

Group discussion Seminar Tutorial

Demonstration Practical/bedside teaching/field work

Role play Workshop

Varieties of Group Discussion Controlled discussion Free group discussion

Buzz group Brain storming

Syndicate T-group

Page 23: Workshop Report Final

Page | 23

Individual Methods Reading

Programmed learning Project

Individual assignment Conference Counseling

Simulation Always Remember This Span of concentration ---- 7 minutes

Span of attention ----------- 57 seconds 20% is FORGOTTEN in 3 days.

70% is FORGOTTEN in 7 days Exercise II

Select appropriate T-L method – Convince a woman to use copper T

– Inform about side effects of a medicine to patients – Educate about drug policy of India – How to inject BCG vaccine

– Preparing a patient for operation – Autoclaving – Vaccination campaign

A teacher can never truly teach, unless he is still learning himself.

RABINDRANATH TAGORE

Page 24: Workshop Report Final

Page | 24

Lesson III:

Research Methodology: Outline of Qualitative, Quantitative & Mixed Research

Designs & Methods

Objectives:

1. Describe the educational research proposal preparation 2. Enumerate the General research designs adopted for qualitative and

quantitative data. 3. Able to outline the Experimental research 4. Understand the importance of quasi-experimental and single-case designs in

educational research

Lesson Outline:

1 Learning to select a research topic and preparing a research proposal. Sources of research ideas. Review of Literature. Literature search through web. Statement of Research Problem and purpose. Framing of Research Questions. Hypothesis. Research Proposal.

2 Observatory and Experimental and quasi experimental designs. Experimental Approach. Independent & Dependent Variables. Controlling Confounding Variables. Random assignment. Matching. Counter balancing. Post-test and Pre-test design. Pretest-posttest control-group design. Factorial design. Repeated measures design.

3 Validity of tools and designs. Internal validity and threats to internal validity. External Validity. Methods to improve validity.

4 Quasi-experimental and single case designs. Non-equivalent Comparison Group Design. Interrupted Time Series Design. Regression Discontinuity Design. Single –Case experimental Design and Multiple Baseline Design.

Educational Research in general, and medical education in particular, are

not research disciplines per se, with their own specialised theories and

methodologies; rather, they are fields of inquiry of potential interest in

multiple disciplines involving Psychology, anthropology, statistics and

epidemiology. In medical education, research is conducted to study links

between teaching factors and learning outcomes. However, presently,

thousands of studies that have been published in medical education are

best characterized as a diverse collection of bits and pieces with no unifying

mechanism of inquiry. The advent of statistical methods has acquired

Page 25: Workshop Report Final

Page | 25

credibility to experiments in education. Pearson (chi-squared goodness-of-fit

distribution in 1900), Gosset‟s (Student-t in 1908), Fisher‟s (significance

testing in 1925), Neyman and Person‟s (null-hypothesis testing in 1933) and

Yates(magnitude of the effects in 1951) are significant contributors to the

advancement of statistics in medical education.

The Core purpose of experimental paradigms is the establishment of

causality. The panacea of experimental research is Randomised Controlled

Trial (RCT) is based on the Mill‟s Method of Difference. It asserts that if two

situations differ in only one respect and an effect is observed in one

situation but not the other, then, it can be concluded that the effect was due

to the factor that is different. The laudable goal of medical education is Best

Evidence Medical Education (BEME).

There are currently three major research paradigms (a perspective based on

a set of assumptions, concepts, and values that are held by researchers) in

education. They are quantitative research, qualitative research, and mixed

research.

1. Quantitative research – research that relies primarily on the collection

of quantitative data.

2. Qualitative research – research that relies on the collection of

qualitative data.

3. Mixed research – research that involves the mixing of quantitative and

qualitative methods or paradigm characteristics.

The differences between these methods are depicted as below.

Page 26: Workshop Report Final

Page | 26

Characteristics of Qualitative, Quantitative & Mixed Research

Quantitative Mixed Qualitative

Scientific Method

Deductive ”Top Down”

Deductive & Inductive

Inductive “Bottom-up”

Human Behaviour view

Regular, Predictable

Somewhat predictable

Fluid, dynamic, situational, contextual,

personal

Objectives Describe, explain, predict

Multiple Describe, Explore, discovery

Focus Narrow-angle, specific hypothesis

testing

Multi-lens focus

Wide-angle & Deep angle

Observation Study behaviour under control

Behaviour in more than one

context

Behaviour in natural

Nature of reality

Objective Realism & Pragmatic

Subjective, Personal

Data Collection

Quantitative data Multiple forms Qualitative data

Nature of data

Variables Mixture of variables,

words, images

Words, images, categories

Data Analysis Statistical relationship

Quantitative & qualitative

Search patterns, themes, holistic features

Results Generalizable Corroborated findings may

generalize

Particular finding multiple perspectives

Final Report form

Statistical Report Assorted & Realistic

Narrative, Direct quotations from

participants

Quantitative Research Methods: The basic building blocks of quantitative

research are variables. Variables (something that takes on different values

or categories) are the opposite of constants (something that cannot vary).

The types of variables are classified according to the measurement and the

role they play.

Variable Type Key Characteristics Example

Level of Measurement Categorical Non-quantitative

measurement scale used to categorize, label, classify, name, or identify variables. It classifies groups or

Place of birth, college name, personality type, gender (male, female).

Page 27: Workshop Report Final

Page | 27

types.

Ordinal This enables to make ordinal judgments (i.e., judgments about rank order). The distance between the levels may not be equal.

Rank in the class, Grades, Malnutrition grade, Socio-economic class

Interval This has the characteristics of rank order and equal intervals (i.e., the distance between adjacent points is the same). It does not possess an absolute zero point.

Intelligent Quotient, Fahrenheit temperature

Ratio This is a scale with a true zero point. It also has equal intervals , rank order, and ability to mark a value with a name.

number correct, weight, height, response time, Kelvin temperature, and annual income.

Role of Variable Independent Variable (IV) That is presumed to cause

changes to occur in Duration of study hours

Dependent Variable (DV)

The one which changes because of another variable OR the Outcome( output, effect and impact)

Test grades OR Assessment marks

Mediating (Intervening) Variable

It accounts the relation between the predictor and the criterion. Mediators explain how external physical events take on internal psychological significance

Memory OR Intelligent quotient

Moderator Variable It affects the direction and/or strength of the relation between an independent or predictor variable and a dependent or criterion variable

Number of books referred OR Number of times notes has been prepared

Predictor Variable It is used in regression to predict another variable

Intelligent Quotient

Extraneous variable compete with Independent variable in explaining the outcome

A particular school/ college and the higher grades obtained

Confounding variable An extraneous variable Which is the real reason for an outcome

Excellent teaching faculty in a school OR Innovative teaching methods responsible for higher grades

Page 28: Workshop Report Final

Page | 28

Types of Quantitative Research: 1 Experimental: The purpose of experimental research is to study

cause and effect relationships. It is characterised by active manipulation of

an independent variable and random assignment (which creates "equivalent"

groups).

Pre Test Treatment Post Test O1 Xe O2

O1 Xc O2 Where

E stands for the experimental group (e.g., new teaching approach)

C stands for the control or comparison group (e.g., the old or standard teaching approach)

2 Non-Experimental: In this approach, there is no manipulation of the

independent variable and there is no random assignment of participants to

groups. In this study, even though there exists a relationship between IV

and DV, the causal relationship can not be concluded because there will be

too many other alternative explanations for the relationship. There are two

basic strategies in this non-experimental approach. In the "basic case" of

causal-comparative research, there is one categorical IV and one

quantitative DV. Example: Gender (IV) and class performance (DV) where

the gender relationship is compared among male and female with the

performance levels. In the simple case of correlational research, there is

one quantitative IV and one quantitative DV. Example: Self-esteem (IV) and

class performance (DV). It can be concluded that stronger evidence for

causality can be obtained through experimental research than from non-

experimental research.

Selecting the right measures:

One of the most important components of a good research protocol is the

selection of the best outcome measures. However, these measurements are

Page 29: Workshop Report Final

Page | 29

difficult. There are often no pre-existing measures that can validate the

desired outcomes of a specific study. Often, researchers select the measures

on the basis of their ease of creation, their ease of administration, their

capacity to generate numbers, their perceived reliability, or their mere

existence. The outcome measures can be defined at abstract level but, it is

not clear whether these terms amount to in observable behaviours or

activities. Therefore, creating a valid and reliable measure is vital to the

interpretation of experimental results.

Critical & Practical issues:

1 Control group: Majority of the experimental studies require a control

group for comparison and relative effectiveness of the intervention.

Offering no intervention to the control group will result in a poor,

weak design of the study. At abstract level, selecting a control group

and the type of intervention/ no intervention seems to be relatively

simple. At the level of detail, it can get complicated quickly and

generate conundrums that are not easy to solve.

2 Motivational Factors: It plays an important role in improvement of

intervention, compliance and adherence to no intervention

3 Placebo Effect: Merely the belief that an intervention offered is

sufficient to produce some kind of improvement. It might be due to

the repeated follow-up, contact with researchers and enhanced

attention

4 Selection bias: Specific selection of volunteers for the intervention or

control group would result in a bias limiting the generalizability of the

study

Conditions for Causation:

There are three necessary conditions that must be established to conclude

that a relationship is causal.

1. condition: Variable A & Variable B must be related (Relationship)

2. condition: Temporal antecedence (time order) 3. Condition: Relationship between A & B not due to confounding/ third

Variable (Lack of alternative explanation)

Page 30: Workshop Report Final

Page | 30

A. Experimental Approach Research Methods:

Is a situation in which a researcher objectively observes phenomena which

are made to occur in a strictly controlled situation where one or more

variables are varied and the others are kept constant. The independent

variable is the variable that is assumed to be the cause of the effect. It is the

variable that the researcher varies or manipulates in a specific way in order

to learn its impact on the outcome variable.

Independent Variable Manipulation:

One of the important aspect of experimental research is the manipulation of

the independent variable by the researcher.

A. Presence versus Absence Technique: the independent variable can be

manipulated by presenting a condition or treatment to one group of

individuals and withholding the condition or treatment from another

group of individuals

B. Gradation/Amount Technique: the independent variable can be

manipulated by varying the amount of a condition or variable such as

increasing the number of training sessions or varying the amount of a

drug which is given to children with a learning disorder.

C. Type/ Modality Technique: independent variable is to vary the type of the

condition or treatment administered. One type of drug may be

administered to one group of learning disabled children and another type

of drug may be administered to another group of learning disabled

children

Confounding Variable Control:

Researcher should keep in mind the effect of confounding variables and the

research design should be able to eliminate the effect of known confounding

variables. The following methods can be employed to reduce the effect of

confounding variables.

1 Random Assignment / Random Selection: Random assignment

makes the groups similar on all variables at the start of the experiment. If

random assignment is successful, the groups will be mirror images of each

other.

Page 31: Workshop Report Final

Page | 31

Random Selection Random assignment

Purpose To generate a sample that represents a larger population.

To take a sample (usually a convenience sample) and use the process of randomization to divide it into two or more groups that represent each other. It is used to create probabilistically “equivalent” groups.

Uses It helps to ensure external validity

It helps to ensure internal validity. It also eliminates the problem of differential influence in the groups.

Importance in Experimental Research

Less important in educational research/ qualitative research

more important than random selection

2 Matching Variables: It controls for confounding extraneous variables

by equating the comparison groups on one or more variables that are

correlated with the dependent variable. Decide what extraneous variables

are to be matched (i.e.., decide what specific variables can be matched to

make the groups similar on). These decided variables are called the

matching variables. Matching eliminates any differential influence of the

matching variables. Matching can be carried out on one or more extraneous

variables.

3 Holding Extraneous variables constant: This technique controls for

confounding extraneous variables by insuring that the participants in the

different treatment groups have the same amount or type on a variable. Eg.

To study the influence of gender, either select only men or women but not

both. A problem with this technique is that it can seriously limit study‟s

ability to generalize the results.

4 Building Extraneous Variables into Design: This technique takes a

confounding extraneous variable and makes it an additional independent

variable in research study. E.g. Both Male and females can be studied

incorporating gender into the study design.

5 Counter Balancing for Sequencing Effects(Order/Carry-over) Priming

effect: It is a technique used to control for sequencing effects (eg. order

effects and carry-over effects). However, this technique is only relevant for a

design in which the participants receive more than one treatment condition

(e.g., such as the repeated measures design). Sequencing effects are biasing

Page 32: Workshop Report Final

Page | 32

effects that can occur when each participant must participate in each

experimental treatment condition. Eg. In a drug trial, when the same person

is used as intervention and control in sequence, there can be residual effect

of the drug. Hence, wash out period is allowed. Same can be applied to

educational research. It also point out that whether control should be tested

first or the intervention.

• Order effects arise from the order in which the treatments are

administered. For example, as people complete their participation in their

first treatment condition they will become more familiar with the setting and

testing process. When these people participate, later, in their second

treatment condition, they may perform better simply because are now

familiar with the setting and testing that they acquired earlier. Order effects

should be controlled.

• Carry-over effects occur when the effect of one treatment condition carries

over to a second treatment condition. That is, participants‟ performance in a

later treatment is different because of the treatment that occurred prior to it.

When this occurs the responses in subsequent treatment conditions are a

function of the present treatment condition as well as any lingering effect of

the prior treatment condition.

Counterbalancing is a control technique that can be used to control for

order effects and carry-over effects. This can be achieved by administering

each experimental treatment condition to all groups of participants, but

applying it in different orders for different groups of people. For example if

two groups making up the independent variable counterbalance can be

carried out by dividing the sample into two groups and giving this order to

the first group (treatment one followed by treatment two) and giving this

order to the second group (treatment two followed by treatment one).

6 Analysis of Co-variance: It is a statistical control technique that is

used to statistically equate groups that differ on a pretest or some other

variable. For example, in a learning research study intelligence level has to

be controlled, if there are more brighter students in one of two comparison

groups, then the difference between the groups might be because the groups

differ on IQ rather than the treatment variable. Analysis of covariance

Page 33: Workshop Report Final

Page | 33

statistically adjusts the dependent variable scores for the differences that

exist on an extraneous variable.

Types of Designs:

A research design is the outline, plan, or strategy that is going to be used to

obtain an answer to research question. Research designs can be weak or

strong or quasi which are moderately strong, depending on the extent to

which they control for the influence of confounding variables

A. Weak Designs: considered weak because they do not control for the

influence of many confounding variables.

1 The one-group posttest-only design: It is a very weak research

design where one group of research participants receives an

experimental treatment and is then post tested on the dependent

variable. A serious problem with this design is that it is not known

whether the treatment condition had any effect on the participants.

There is no pre-test or control group for comparison. Another problem

is that there can be confounding bias.

2 One-group pretest-posttest design: It is a design where one

group of participants is pretested on the dependent variable and then

posttested after the treatment condition has been administered. This

is a better design than the one-group posttest-only design because it

at least includes a pretest, that indicates how the participants did

prior to administration of the treatment condition. In this design, the

effect is taken to be the difference between the pretest and posttest

scores. It does not control for potentially confounding extraneous

variables such as history, maturation, testing, instrumentation, and

regression artifacts, so it is still difficult to identify the effect of the

treatment condition.

3 Posttest-only design with non-equivalent groups: It includes an

experimental group that receives the treatment condition and a

control group that does not receive the treatment condition or receives

some standard condition and both groups are posttested on the

dependent variable. While this design includes a control group, the

participants are not randomly assigned to the groups so there is little

Page 34: Workshop Report Final

Page | 34

assurance that the two groups are equated on any potentially

confounding variables prior to the administration of the treatment

condition. Because the participants were not randomly assigned to the

comparison groups, this design does not control for differential

selection, differential attrition, and the various additive and

interaction effects

1) Post Test Only Design

2) One Group-Pretest-Post test Design

3) Post test only with nonequivalent groups

B. Strong Designs:

A research design is considered to be a "strong research design" if it controls

for the influence of confounding extraneous variables. This is accomplished

by random assignment and presence of a control group (which is the

comparison group that either does not receive the experimental treatment

condition or receives some standard treatment condition).

Page 35: Workshop Report Final

Page | 35

1 Pretest-posttest control-group design: This design is one of the

most common and appealing design for educational interventions. In this

design group of research participants is randomly assigned to an

experimental and control group. Both groups of participants are pre tested

on the dependent variable and then post tested after the experimental

treatment condition has been administered to the experimental group. This

design controls for all of the standard threats to internal validity. Differential

attrition may or may not be a problem depending on what happens during

the conduct of the experiment. In this design, instead of two groups, it can

be expanded to include more than one experimental groups. This design

provides an opportunity to confirm effectiveness of randomised allocation

through direct visual inspection of the pre-test scores. This will ensure that

the experimental and control groups are identical/ relatively close on the

dependent variable (no/minimal sampling error). From a practical point of

view, sometimes the use of pre-test is impractical especially in medical

educational research. In such instances, both the experimental and control

groups are assumed to behave the same way at the beginning/ pre-test.

This is also called as Floor effect. Infrequently, the use of pre-test can limit

the generalizability of the conclusions of the study. Hence, it can be

concluded that Pre-test-Post-test Control group design is more powerful

statistically and more compelling as a demonstration of improvement. It is

somewhat limited in its generalizability and its feasibility.

2 Posttest-only control-group design: In this design, research

participants are randomly assigned to an experimental and control group

and then post tested on the dependent variable after the experimental group

Page 36: Workshop Report Final

Page | 36

has received the experimental treatment condition. This design includes a

control or comparison group and has random assignment. It also controls

for all of the standard threats to internal validity. Differential attrition may

or may not be a problem depending on what happens during the conduct of

the experiment. This design does not include a pretest of the dependent

variable. This design offers the advantage of treatment group not being

primed by the pre-test. A clean control group (no treatment has been

offered) scores can be interpreted as the pre-test scores for the treatment

group. However, a clean control group is inappropriate and unethical.

Offering no treatment to control group would be unfair and de-motivating

leading to low scores (under estimate of their true naïve potential). The

improvement in the treatment group can also be contributed by the placebo

effect (over estimate). The statistical power is lost in this post test only

design. Finally this design is easier to administer and has more true to life

condition.

3.The Solomon Four Group Design: In this complicated design, named

after Solomon(1946), all the four groups from Pre-test/Post-test and Post-

test only designs.

Group I: Pre-test Treatment. Pre-Test……Intervention……Post-test

Group II: Pre-test control Pre-test………………………….Post-test

Group III: Post-test only treatment ……………..Intervention…….Post-test

Group IV: Post-test only control …………………………………..Post-test

Page 37: Workshop Report Final

Page | 37

This design also has clean control group. The advantages of both pre-

test/post-test and post-test only designs can be found in this design. The

design also allows further comparisons between the two designs. Baseline

scores are available for comparisons. However, the disadvantages include

limited number of subjects availability and because of the noise of sampling

error it will be statistically difficult to demonstrate the effect of treatment.

Practically, this design is more complicated to enact and may lead to

decrease in statistical power due to lesser number of subjects in each group.

4 Factorial design: In this design two or more independent variables

are simultaneously investigated to determine the independent and

interactive influence which they have on the dependent variable. It also has

random assignment to the groups. Each combination of independent

variables is called a "cell." Research participants are randomly assigned to

as many groups are there are cells of the factorial design if both of the

independent variables can be manipulated. The participants are

administered the combination of independent variables that corresponds to

the cell to which they have been assigned and then they respond to the

dependent variable. The data collected from this research give information

on the effect of each independent variable separately and the interaction

between the independent variables. The effect of each independent variable

on the dependent variable is called a main effect. There are as many main

effects in a factorial design as there are independent variables. If a research

design included the independent variables of gender and type of instruction,

then there would potentially be two main effects, one for gender and one for

type of instruction.

An interaction effect between two or more independent variables occurs

when the effect which one independent variable has on the dependent

variable depends on the level of the other independent variable. For

example, if gender is one independent variable and method of teaching

physiology is another independent variable, an interaction would exist if the

lecture method was more effective for teaching males physiology and

individualized instruction was more effective in teaching females physiology.

Page 38: Workshop Report Final

Page | 38

5. Repeated-measures design: In this design all research participants

receive all experimental treatment conditions. For example, in case of effect

of type of instruction on learning physiology and two types of instructions

(lecture method and individualized instruction) were used, the participants

would experience both types of instruction, first one and then the other.

This design has the advantage of requiring fewer participants than other

designs because the same participants participate in all experimental

conditions. It also has the advantage of the participants in the various

experimental groups being equated because they are the same participants

in all of the treatment conditions. If counterbalancing is used with this

design, then all of the standard threats to internal validity are controlled for.

Differential attrition may or may not be a problem depending on what

happens during the conduct of the experiment.

Page 39: Workshop Report Final

Page | 39

6. factorial design based on a mixed model: This is based on a mixed

model is a factorial design in which different participants are randomly

assigned to the different levels of one independent variable but all

participants take all levels of another independent variable. All of the

standard threats to internal validity are controlled for with this design if

counterbalancing is used for the repeated measures independent variable.

Differential attrition may or may not be a problem depending on what

happens during the conduct of the experiment.

Quasi-Experimental Research Designs

The Quasi-experimental research designs are used when it is not possible to

control for all potentially confounding variables; when it is impossible to

randomly assign participants to comparison groups and when a researcher

is faced with a situation where only one or two participants can participate

in the research study (single case designs). These designs have manipulation

of the independent variable but are not able to satisfy the criterion of

random assignment to two or more groups. Causal explanations can be

Page 40: Workshop Report Final

Page | 40

made only when data collected demonstrate that plausible rival explanations

are unlikely, and the evidence will still not be as strong as with one of the

strong designs. Quasi-experiments fall in the center of a continuum with

weak experimental designs on the far left side and strong experimental

designs on the far right side.

A Non-equivalent Comparison-Group Design

This is a design also called as Cohort Design contains a treatment group

and a non-equivalent untreated comparison group about of which are

administered pre-test and post-test measures. The groups are “non-

equivalent” because there is no random assignment leading to no assurance

that the groups are highly are similar. Hence, confounding variables (rather

than the independent variable) may explain any difference observed between

the experimental and control groups. It is a good idea to collect data that

can be used to demonstrate that key confounding variables are not the

cause of the obtained results. Also it is advisable to use statistical control

techniques for confounding variables. The most common threat to the

internal validity of this type of design is differential selection. The drawbacks

of this design are Systematic differences, maturation effects, and self

selection bias. One solution to thee concerns is to use pre-test/post-test

design. Another solution is to repeat the comparison on several groups or to

use stratification of groups. The problem is that the groups may be different

on many variables that are also related to the dependent variable (e.g., age,

gender, IQ, reading ability, attitude, etc.).

The primary threats to this design are described as below.

Selection Bias: Non-equivalent groups in this design, leads to differential selection of experimental and control groups.

Selection Maturation: One group of research study participants might be

Page 41: Workshop Report Final

Page | 41

more experienced/ tired or bored than participants in the other group. Selection-Instrumentation: Instrument/measurement may vary among non-

equivalent groups Selection-Regression: Non-equivalent groups might lead to one group with

high reading scores and the other low resulting in regression to mean Selection-history: The groups might differ in their past experiences.

B Volunteer treatment Design:

Volunteers are recruited in this design and those who choose to participate

receive the training whereas those who choose not to participate in the

intervention act as the control group. These designs overcome the ethical

issues of with holding the treatment in a situation where the outcome

measure is important to the participants. However, if there is a difference

between the treatment group and control groups, it is difficult to attribute

the difference to the intervention because of systematic difference and

simple selection bias. This can be overcome by selecting control as those

who are completely not compliant with the intervention(due to unavoidable

reasons) among those who have agreed to participate.

C Interrupted Time-Series Design

This is a design in which a treatment condition is accessed by comparing

the pattern of pretest responses with the pattern of posttest responses

obtained from a single group of participants. In other words, the

participants are pretested a number of times ie. Baseline data and then

Volunteers

Agreed to participate Refused to participate

Completely compliant with intervention

Non compliant due to emergency

Intervention

Control

No Intervention (Contr

ol)

Page 42: Workshop Report Final

Page | 42

posttested a number of times after or during exposure to the treatment

condition. A treatment effect is demonstrated only if the pattern of

posttreatment responses differs from the pattern of pretreatment responses.

That is, the treatment effect is demonstrated by a discontinuity in the

pattern of pretreatment and posttreatment responses. Many confounding

variables are ruled out in this design because they are present in both the

pretreatment and posttreatment responses. However, the main potentially

confounding variable that cannot be ruled out is a history effect. The history

threat is a plausible rival explanation if some event other than the treatment

co-occurs with the onset of the treatment.

C. Single-Case Experimental Designs

These are weakest quasi-experimental designs where the researcher

attempts to demonstrate an experimental treatment effect using single

participants, one at a time. These designs are also called as Pre-

experimental design. One of the plausible explanation of improvement in

group might be due to the pre-test which itself was sufficient to generate

improvement. Another defect is that unstructured time gap between the pre-

test and the post-test. It is possible that, having been informed of their areas

of weakness with the pre-test, the participants were able to independently

obtain the information necessary to perform better on the post-test, even if

the formal intervention is not effective at all.

1 A-B-A and A-B-A-B Designs

The A-B-A design is a design in which the participant is repeatedly pretested

(the first A phase or baseline condition), then the experimental treatment

condition is administered and the participant is repeatedly posttested (the B

phase or treatment phase). Following the posttesting stage, the pretreatment

conditions are reinstated and the participant is again repeatedly tested on

Page 43: Workshop Report Final

Page | 43

the dependent variable (the second A phase or the return to baseline

condition).

Baseline (A) Post Test (B) Baseline (A)

The effect of the experimental treatment is demonstrated if the pattern of the

pre- and posttreatment responses ( the first A phase and the B phase) differ

and the pattern of responses reverts back to the original pretreatment level

when the pretreatment conditions are reinstated (the second A or return to

baseline phase). Including the second A phase controls for the potential rival

hypothesis of history that is a problem in a basic time series design (i.e., in

an A-B design).

One limitation of the A-B-A design is that it ends with baseline condition or

the withdrawal of the treatment condition so the participant does not receive

the benefit of the treatment condition at the end of the experiment. This

limitation can be overcome by including a fourth phase which adds a second

administration of the treatment condition so the design becomes an A-B-A-B

design. A limitation of both the A-B-A and the A-B-A-B designs is that they

are dependent on the pattern of responses reverting to baseline conditions

when the experimental treatment condition is withdrawn. This may not

occur if the experimental treatment is so powerful that its effect continues

even when the treatment is withdrawn.

Baseline (A) Post Test (B) Baseline (A) Post Test (B)

O1 O2 O3 O4 O5 O6 O7 O8 O9 O10 O11 O12 O12

Treatment/ Intervention

Remove Intervention

O1 O2 O3 O4 O5 O6 O7 O8 O9 O10 O11 O12 O13 O14 O15 O16

Intervention Remove Intervention

Intervention

Page 44: Workshop Report Final

Page | 44

2. Multiple-Baseline Design

This is a design that investigates two or more people, behaviors, or settings

to identify the effect of an experimental treatment. The key is that the

treatment condition is successively administered to the different people,

behaviors, or settings. The experimental treatment effect is demonstrated if

a change in response occurs when the treatment is administered to each

person, behavior, or setting. Rival hypotheses are unlikely to account for the

changes in the behavior if the behavior change only occurs after the

treatment effect is administered to each successive person, behavior, or

setting. This design avoids the problem of failure to revert to baseline that

can exist with the A-B-A and A-B-A-B designs.

Phase 1 Phase 2 Phase 3 Phase 4

Different people, Different behaviours, or Different settings

A Baseline Treatment Treatment Treatment

B Baseline Baseline Treatment Treatment

C Baseline Baseline Baseline Treatment

3 Time series Analysis Design:

In this design, test the participants several times prior to the intervention

and after the intervention. If the intervention has no effect, the change in

scores will be smooth and continuous. An effective intervention would be

signalled by a discontinuity in this smooth progression; either a discrete

discontinuity at the time of intervention. Time series analysis are widely

available.

4 Naturalistic Experiments:

In these designs the term experiment is used loosely as there is no formal

randomization of subjects or groups and there is no experimenter initiated

intervention that is systematically applied to one group. The control can be

historical control. The newly introduced educational intervention (not by the

researcher) group will serve as experimental group. Eg. There is a change in

the syllabus and curriculum prescribed by Maharashtra University of Health

Sciences, Nashik during 1998. The older curriculum followed by respective

university was followed earlier. Students in this category will serve as

historical control. The limitations of the study include systematic difference

Page 45: Workshop Report Final

Page | 45

between the two groups in time, and change in the outcome measures over a

period of time (eg. Evaluation was long theory questions earlier compared to

MCQs and SAQ in the revised curriculum), expert versus novice difference.

5 Changing-Criterion Design

This is a single-case design that is used when a behavior needs to be shaped

over time or when it is necessary to gradually change a behavior through

successive treatment periods to reach a desired criterion. This design

involves collecting baseline data on the target behavior and then

administering the experimental treatment condition across a series of

intervention phases where each intervention phase uses a different criterion

of successful performance until the desired criterion is reached. The

criterion used in each successive intervention phase should be large enough

to detect a change in behaviour but small enough so that it can be achieved.

Conclusion:

Selection of an appropriate research design is vital. Use traditional

qualitative techniques and pilot test to refine the research question prior to

the main experiment. Remember that the educational experimentation will

always involve a set of compromises (such as the nature of the control

group, the nature of evaluation process, the nature of the research design

selected). It is better to explore the limitations (compromises) and select the

research design that best satisfies the needs. Also decide whether these

compromises are reasonable.

Qualitative Research Methods:

There are five major types of qualitative research which are basically similar.

However, each approach has some distinct characteristics and tends to have

its own roots.

• Phenomenology – a form of qualitative research in which the researcher

attempts to understand how one or more individuals experience a

phenomenon. For example, interview 20 trainees and ask them to describe

their experiences of the training programme.

• Ethnography – is the form of qualitative research that focuses on

describing the culture of a group of people. Note that a culture is the shared

attitudes, values, norms, practices, language, and material things of a group

Page 46: Workshop Report Final

Page | 46

of people. E.g. The tribal anthropological studies, describing their beliefs,

attitudes and their culture.

• Case study research – is a form of qualitative research that is focused on

providing a detailed account of one or more cases. For an example, study a

PBL programme implemented in a institute.

• Grounded theory – is a qualitative approach to generating and developing

a theory form data that the researcher collects. For example, Collection of

data on how the habituation of tobacco has started and is being sustained

in youth. Based on this data a theory can be construed to explain the

mechanism of tobacco use.

• Historical research – research about events that occurred in the past.

Example, study the use of education system at Nalanda University.

Mixed Research Methods:

Mixed research is a general type of research in which quantitative and

qualitative methods, techniques, or other paradigm characteristics are

mixed in one overall study. There are two major types of mixed research.

• Mixed method research – is research in which the researcher uses the

qualitative research paradigm for one phase of a research study and the

quantitative research paradigm for another phase of the study. Mixed

method research is like conducting two mini-studies within one overall

research study.

• Mixed model research – is research in which the researcher mixes both

qualitative and quantitative research approaches within a stage of the study

or across two of the stages of the research process.

The Advantages of Mixed Research

Mixed research is advocated whenever it is feasible. It will help qualitative

and quantitative researchers to get along better and, will promote the

conduct of excellent educational research. The researcher who mixes

quantitative and qualitative research methods, procedures, and paradigm

characteristics will be able to get better results due to the complementary

strengths and non-overlapping weaknesses. When different approaches are

used to focus on the same phenomenon and they provide the same result,

you have "corroboration" which means superior evidence for the result.

Page 47: Workshop Report Final

Page | 47

Lesson IV:

Quantitative Methods: Data Collection, Questionnaire Preparation

Objectives:

4. Describe the various categories of data(Nominal, ordinal, interval & Ratio) 5. Understand the sampling procedures used in educational research. 6. Devise a questionnaire for data collection and data quality checks

Lesson Outline:

1 Methods of Data Collection: Tests; Questionnaires; Interviews; Focus Groups; Observation and secondary data. Scales of measurement. Assumptions underlying Testing and Measurement. Identifying a good test or Assessment Procedure. Assessment tools in Education. Achievement Tests. OSCE. Patient satisfaction questionnaire. Ratings. Student Write ups

2 Sampling Procedure. Terminology used. Random sampling, Systematic, Cluster sampling. Non random sampling techniques. Quota sampling, purposive sampling

3 15 Principles for preparation of Questionnaire. Rating Scales, Rankings and checklists. Strengths and weaknesses of questionnaire. Types of questions. Open, closed.

Measurement: It is defined as the act of measuring by assigning symbols or

numbers to something according to a specific set of rules.

Measurement can be categorized by the type of information that is

communicated. They are called the four "scales of measurement." The

numerical method of describing observations of materials or characteristics

is called as “Quantification”.

Scales of Measurement

1. Nominal Scale. This is a least precise method of nonquantitative

measurement scale. It is used to categorize, label, classify, name, or

identify variables. These scales are nonorderable. It classifies groups or

types. It describes differences between things by assigning them to

categories-such as professors, associate professors, lecturers, tutors,

residents –and subsets such as males or females.

2. Ordinal Scale.

It indicates not only that things differ but that they differ in amount or

degree. However, the real differences between adjacent ranks may not be

equal. Any variable where the levels can be ranked is an ordinal variable.

Page 48: Workshop Report Final

Page | 48

Some examples are order of academic position in a examination, top 10,

rank in class.

3. Interval Scale.

It is based on equal units of measurement indicating how much of a

given characteristic is present. It indicates the relative amount of a trait

or characteristic. It‟s primary limitation is the lack of zero. Psychological

tests and inventories are interval scales although they can be added,

subtracted, multiplied, and divided. Another example is Celsius

temperature, Fahrenheit temperature, IQ scores where zero degrees in

these scales does not mean zero or no temperature.

4. Ratio Scale. This is a scale having equal interval properties in addition to

a true zero point and can be added, subtracted, multiplied and divided

and expressed in ratio relationships. It also has all of the "lower level"

characteristics (i.e., the key characteristic of each of the lower level

scales) of equal intervals (interval scale), rank order (ordinal scale), and

ability to mark a value with a name (nominal scale). Some examples of

ratio level scales are number correct, weight, height, response time,

Kelvin temperature, and annual income.

In Qualitative behavioural research many of the qualities or variables of

interest are abstractions and can not be observed directly. It is necessary to

define them in terms of observable acts. This operational definition tells

what the researcher must do to measure the variable. Eg. Intelligence is an

abstract quality that can not be observed directly. However, it can be defined

operationally as scores achieved on a particular intelligence test. It must be

remembered that excessive emphasis on quantification may result in the

measurement of fragmentary qualities not relevant to the real behaviour.

Assumptions Underlying Testing and Measurement

Cohen, et al. considered twelve assumptions which are basic to testing and

assessment. Testing is defined as the process of measuring variables by

means of devices or procedures designed to obtain a sample of behaviour

and Assessment is the gathering and integration of data for the purpose of

making an educational evaluation, accomplished through the use of tools

Page 49: Workshop Report Final

Page | 49

such as tests, interviews, case studies, behavioural observation, and

specially designed apparatus and measurement procedures.

1. Psychological traits and states exist: A trait is a relatively enduring (i.e.,

long lasting) characteristic on which people differ; a state is a less enduring

or more transient characteristic on which people differ.

2. Psychological traits and states can be quantified and measured: For

nominal scales, the number is used as a marker. For the other scales, the

numbers become more and more quantitative. Most traits and states

measured in education are taken to be at the interval level of measurement.

3. Various approaches to measuring aspects of the same thing can be

useful: For example, different tests of intelligence tap into somewhat

different aspects of the construct of intelligence.

4. Assessment can provide answers to some of life's most momentous

questions: It is important that the users of assessment tools know when

these tools will provide answers to their questions.

5. Assessment can pinpoint phenomena that require further attention or

study: For example, assessment may identify someone as having dyslexia or

low self-esteem or at-risk for drug use.

6. Various sources of data enrich and are part of the assessment process:

Information from several sources usually should be obtained in order to

make an accurate and informed decision. For example, the idea of portfolio

assessment is useful.

7. Various sources of error are always part of the assessment process:

There is no such thing as perfect measurement. All measurement has some

error. The two main types of error are random error (e.g., error due to

transient factors; also leads to less reliability) and systematic error (e.g.,

error present every time the measurement instrument is used; leads to

decreased validity).

8. Tests and other measurement techniques have strengths and

weaknesses: It is essential that users of tests understand this so that they

can use them appropriately and intelligently.

Page 50: Workshop Report Final

Page | 50

9. Test-related behaviour predicts non-test-related behaviour: The goal of

testing usually is to predict behaviour other than the exact behaviours

required while the exam is being taken.

10. Present-day behaviour sampling predicts future behaviour: Perhaps the

most important reason for giving tests is to predict future behaviour. Tests

provide a sample of present-day behaviour. However, this "sample" is used

to predict future behaviour.

11. Testing and assessment can be conducted in a fair and unbiased

manner: This requires careful construction of test items and testing of the

items on different types of people to make sure tests are fair and unbiased.

12. Testing and assessment benefit society: Many critical decisions are

made on the basis of tests (e.g., employability, presence of a psychological

disorder, degree of teacher satisfactions, degree of student satisfaction, etc.).

In Medical Education, qualitative data collection techniques such as

projective tests, observation, open ended questionnaires and opinionnaires,

and interviews are used for quantitative data. It is unwise to draw a hard-

and-fast distinction between qualitative and quantitative studies. The

difference is not absolute; it is one of emphasis. One emphasis should not

be considered superior to the other. The appropriate approach depends on

the nature of the questions under consideration and the objectives of the

researchers.

Psychological and Educational Tests and Inventories

Psychological tests are among the most useful tools of educational research.

These tests are instruments designed to describe and measure a sample of

certain aspects of human behaviour. He tests yield objective and

standardised descriptions of behaviour, quantified by numerical scores.

Tests and inventories are used to describe status or prevailing condition, to

measure changes in status produced by modifying factors, or to predict

future behaviour on the basis of present performance. Tests can be

classified as Performance tests (to assess the skill) and paper-and-pencil

tests (to measure knowledge). They can be also divided into Power versus

timed or speed tests. Power tests have no time limit, and the subjects

attempt progressively ore difficult tasks until they are unable to continue

Page 51: Workshop Report Final

Page | 51

successfully. Speed tests limit the time; the subjects have to complete

certain tasks. Alternatively, tests can be either non-standardised (teacher

made: terminal internal examinations) and Standardised (tailor-made:

university examinations).

Good measurement is fundamental for research. Testing and assessment

procedures are characterized by high reliability and high validity. While

devising test one should also consider the economy and interest of the test.

Validity It is the best available approximation to the truth of a given

proposition, inference, or conclusion

Validity refers to the degree to which evidence and theory support the

interpretation of test scores entailed by proposed uses of tests. It is the

accuracy of the inferences, interpretations, or actions made on the basis of

test scores. Validity has to do with both the attributes of the test and the

uses to which it is put. Technically speaking, it is incorrect to say that a

test is valid or invalid. It is the interpretations and actions taken based on

the test scores that are valid or invalid. All of the ways of collecting validity

evidence are really forms of what used to be called construct validity. All

that means is that in testing and assessment, we are always measuring

something (e.g., IQ, gender, age, depression, self-efficacy).

The overall purpose of educational and psychological testing is to draw an

inference about an individual or group of individuals. Because these

inferences are made on the test scores, the tests must have appropriate

evidence of their validity for these uses.

Introduction to Validity

Virtually all educational research involves measurement or observation.

And, whenever we measure or observe we are concerned with whether we

are measuring what we intend to measure or with how our observations are

influenced by the circumstances in which they are made. We reach

conclusions about the quality of our measures -- conclusions that will play

an important role in addressing the broader substantive issues of our study.

When we talk about the validity of research, we are often referring to these

to the many conclusions we reach about the quality of different parts of our

research methodology.

Page 52: Workshop Report Final

Page | 52

Sources of validity evidence

It should be remembered that the evidence for the specific use of a given test

is available rather than validity as residing in the test itself. Hence instead of

using the traditional terms such as content validity, predictive validity and

construct validity, general term validity evidence is being used in

educational research. Validity evidence is based on three broad sources:

content, relations to other variables, and construct. Not all test uses must

meet all three types. Different types of tests are used for different purposes

and, therefore, need different types of evidence. E.g. Intelligence test is

designed to predict academic achievement and is based on psychological

theory or construct. Thus it needs demonstration of evidence for both

construct and prediction but not necessarily demonstrate evidence for the

content.

1 Evidence of test content

It refers to the degree to which the test items actually measure, or are

specifically related to, the traits for which the test was designed and is to be

used. The content includes the issues, the actual wording, the design of the

items, or questions, and how adequately the test samples the universe of

knowledge and skills that a student is expected to master. The content

should match the course text books, syllabi, objectives and the judgements

of subject experts. It is high importance for achievement tests but not so for

aptitude tests. Also assess whether the test‟s content is appropriate for the

persons to be tested.

2 Evidence based on relations to other variables

This type of evidence has traditionally been referred to as criterion-related

validity. There may be two types namely, Predictive (the test is designed and

used to predict other variables) and concurrent (other tests that are

supposed to measure the same or similar construct). E.g. Usefulness of MH-

CET scores in predicting medical college performance scores. The faults in

the prediction can be attributed to the test itself or in the criteria of success,

Page 53: Workshop Report Final

Page | 53

or both. Hence it can be concluded that predictive validity is not easy to

assess. Evidence for the validity of the relationship to other measures refers

to whether test is closely related to other measures, such as institute ratings

(NAAC/ NBA), teacher‟s experience, or scores on another test of known

validity. Through this process, more convenient and more appropriate tests

can be devised to accomplish the measurement of behaviour more

effectively. In such cases, evidence will be convergent; otherwise, it would

be discriminate evidence.

3 Evidence based on Internal Structure

This type of evidence is also called as construct validity. It is the degree to

which test items and the structure of a test can be accounted for by the

explanatory constructs of a sound theory. A construct is a trait that can not

be observed. If one were to study such a construct as dominance, one would

hypothesize that people who have this characteristic will perform differently

from those who do not. Theories can be built describing how dominant

people behave in a distinctive way. Different Intelligence tests are based on

different theories; each test should be shown to measure what the

appropriate theory defines as intelligence. Evidence of internal structure is

important for personality and aptitude tests.

Validity is a unitary concept based on all of the evidence, the totality of the

evidence should be considered as evidence for validity of a given test use.

Page 54: Workshop Report Final

Page | 54

Reliability

Reliability refers to consistency or stability. In psychological and educational

testing, it refers to the consistency or stability of the scores that we get from

a test or assessment procedure. A test is reliable to the extent that it

measures whatever it is measuring consistently. Reliable tests are stable

and yield comparable scores on repeated administration. It is usually

determined using a correlation coefficient (it is called a reliability coefficient

in this context). The correlation coefficient is a measure of relationship that

varies from -1 to 0 to +1. Increase in the number of items in a test would be

able to increase the reliability because a test with few items has a great deal

of measurement error. There are a number of types of reliability.

1 Stability over time (test-retest): The scores on a test will be highly

correlated with scores on a second administration of the test to the

same subjects at a later date.

2 Stability over item samples (equivalent or parallel forms): Some tests

have two or more forms that may be used interchangeably. In these

cases, scores on one version will be very similar with scores on the

alternate form of the test. This can be practically carried out through

Page 55: Workshop Report Final

Page | 55

the administration of a longer test comprising of both the versions of

the test. Later separating the two versions, scores can be compared.

3 Stability of items (internal consistency): Test items should be highly

related to other test items. This is important because the test, or in

some cases the subtest, needs to measure a single construct. This can

be achieved by a) Split halves (test through Spearman-Brown formula)

or Coefficient of consistency (test through Kuder-Richardson

formula).

4 Stability over scorers (inter-scorer): Projection tests have a great deal

of judgement of the person scoring the test. Scorer reliability can be

determined by two independent scorers scoring the same test papers

or video tapes of the test.

5 Stability over testers: Differently trained testers and their personality

or other attributes can affect the test scores. This can overcome

through two different testers administer the two testings, with each

one giving the test first half of the time.

6 Standard error of measurement: This statistic permits the

interpretation of individual scores obtained on a test. No tests are

perfectly reliable. The standard error of measurement tells how much

difference can be expected by obtained score which is away from the

true score.

A test may be reliable even though it is not valid. However, for a test to be

valid, it must be reliable. That is, a test can consistently measure (reliability)

nothing of interest (be invalid), but if a test measures what is designed to

measure (validity), it must do so consistently (reliability).

Economy

Tests that can be given in a short period of time are likely to gain

cooperation of the subjects. Ease of administration, scoring, and

interpretation are important factors in selection of the test.

Interest

Tests that are interesting and enjoyable help to gain the cooperation of the

subject. Those that are dull or seem silly may discourage or antagonise the

Page 56: Workshop Report Final

Page | 56

subject. Under these unfavourable conditions the test is not likely to yield

useful results.

When psychological tests are used in educational research, one should

remember that standardised test scores are only approximate measures of

the traits under consideration. This limitation is inevitable and may be

ascribed to a number of possible factors.

1 Errors inherent in any psychological test-no test is completely

valid or reliable

2 Errors that may result from poor test conditions, inexpert or

careless administration or scoring of the test, or faulty tabulation

of test score

3 Inexpert interpretation of test results

4 The choice of an inappropriate test for the specific purpose in

mind.

Methods of Data Collection

There are six major methods of data collection.

• Tests (i.e., includes standardized tests that usually include

information on reliability, validity, and norms as well as tests

constructed by researchers for specific purposes, skills tests, etc).

• Questionnaires (i.e., self-report instruments).

• Interviews (i.e., situations where the researcher interviews the

participants).

• Focus groups (i.e., a small group discussion with a group moderator

present to keep the discussion focused).

• Observation (i.e., looking at what people actually do).

• Existing or Secondary data (i.e., using data that are originally

collected and then archived or any other kind of “data” that was

simply left behind at an earlier time for some other purpose).

Tests

Tests are commonly used in research to measure personality, aptitude,

achievement and performance. Tests can also be used to complement

other measures (following the fundamental principle of mixed research).

A researcher must develop a new test to measure the specific knowledge,

Page 57: Workshop Report Final

Page | 57

skills, behaviour, or cognitive activity that is being studied. For example,

a researcher might need to measure response time to a memory task

using a mechanical apparatus or develop a test to measure a specific

mental or cognitive activity (which obviously cannot be directly observed).

Remember that if a test has already been developed that purports to

measure what is intended to measure, then consider that test.

Strengths and Weaknesses of Tests

Strengths Weaknesses

Can provide measures of many

characteristics of people.

Can be expensive if test must be

purchased.

Often standardized (same stimulus is provided to all participants).

Reactive effects such as social desirability can occur.

Allows comparability of common measures across populations.

Test may not be appropriate for a local or unique population.

Strong psychometric properties (high measurement validity).

Open-ended questions and probing not available.

Availability of reference group data. Tests are sometimes biased against certain groups of people.

Many tests can be administered to groups which saves time.

Non response to selected items on the test.

Can provide “hard,” quantitative data.

Some tests lack psychometric data.

Tests are usually already developed. Can be expensive if test must be purchased.

A wide range of tests is available. Reactive effects such as social desirability can occur.

Response rate is high for group administered tests.

Test may not be appropriate for a local or unique population.

Ease of data analysis because of quantitative nature of data.

Open-ended questions and probing not available.

Educational and Psychological Tests in Medical Education

Three primary types of tests viz. intelligence tests, personality tests, and

educational assessment tests are commonly used in educational research.

1) Intelligence Tests: Intelligence has many definitions because a single

prototype does not exist. Although the construct of intelligence is hard to

define, it still has utility because it can be measured and it is related to

many other constructs.

2) Personality Tests: Personality is a construct similar to intelligence in

that a single prototype does not exist. Personality is the relatively permanent

Page 58: Workshop Report Final

Page | 58

patterns that characterize and can be use to classify individuals. Most

personality tests are self-report measures. Performance measures of

personality are also used. A performance measure is a test-taking method in

which the participants perform some real-life behaviour that is observed by

the researcher. Personality has also been measured with projective tests. A

projective test is a test-taking method in which the participants provide

responses to ambiguous stimuli. The test administrator searches for

patterns on participants‟ responses. Projective tests tend to be quite difficult

to interpret and are not commonly used in quantitative research.

3) Educational Assessment Tests.

There are four subtypes of educational assessment tests:

• Achievement Tests: These tests are important in Medical

Education. These are used in placing, advancing, or retaining

students at particular grade levels. These will measure the degree of

learning that has taken place after a person has been exposed to a

specific learning experience. They can be teacher constructed or

standardized tests. Many of these achievements tests are non-

standardised, teacher-designed tests which lack content validity.

Concurrent validity might be used to help establish a new

achievement test‟s validity. The only forms of reliability that are

critical are test-re-test, stability over test items, and the standard

error of measurement.

• Aptitude Tests: These focus on information acquired through the

informal learning that goes on in life. They are used to predict future

performance whereas achievement tests are used to measure current

performance. Aptitude tests attempt to predict an individual‟s capacity

to acquire improved performance with additional training. Eg.

Stanford-Binet Intelligence scale. These tests, particularly those that

deal with academic aptitude, that are used for purpose of placement

and classification have become highly controversial because of the

culturally different content. It is extremely difficult to eliminate

culture totally and develop one test that is equally fair to all

communities including the minority. For these tests, Predictive

Page 59: Workshop Report Final

Page | 59

validity and construct validity are important. The forms of reliability

that are critical to these tests are test-retest, stability over test items,

and the standard error of measurement. The tests that have some

degree of subjectivity also require inter-scorer and inter-tester

reliability.

• Personality Inventories: Personality scales are usually self-report

instruments. Because of the individual‟s inability or unwillingness to

report their own reactions accurately or objectively, these instruments

may be of limited value. They provide data useful in suggesting the

need for further analysis. Test setting also influence the results. Eg. A

test applied in clinical setting correlate well with psychiatrist‟s

diagnosis; but when applied to college students, it‟s diagnostic value

might be disappointing.

Diagnostic Tests: These tests are used to identify the locus of

academic difficulties in students.

Questionnaires: Inquiry Forms

A questionnaire is a self-report data collection instrument that is filled out

by research participants to record the factual information desired. When

opinions rather than facts are desired, an Opionnaire or Attitude scale is

used. Questionnaires are usually paper-and-pencil instruments, but they

can also be placed on the web for participants to go to and “fill out.”

Questionnaires are sometimes called survey instruments, but the actual

questionnaire should not be called “the survey.” The word “survey” refers to

the process of using a questionnaire or interview protocol to collect data.

Whenever questionnaires are mailed, the accompanying letter should not

commit that the sender needs the information to complete the requirements

for a graduate course, thesis or dissertation. This would lead to poor

response rates. Less than 50% ( 50% is adequate; 60% is good; and 70% is

very good) of response rates lead to limited validity. A questionnaire is

composed of questions and/or statements. Researcher should learn to write

questionnaires. Questionnaires that call for short, check mark responses are

known as restricted, or closed-form, type. In such forms besides providing

Page 60: Workshop Report Final

Page | 60

all the possible alternatives, add another alternative called other. This

category of response permits respondents to indicate what might be their

most important reason. When the questionnaire contains open questions, it

is called as open form. When developing a questionnaire make sure to follow

the 20 Principles of Questionnaire Construction to improve questionnaire

items.

Principle 1: Make sure the questionnaire items match the research

objectives.

Principle 2: Understand the research participants as they will be filling out

the questionnaire. Consider the demographic and cultural characteristics of

potential participants so that they can understand the questions properly.

Principle 3: Be careful in using descriptive adjectives and adverbs that have

no agreed upon meaning. Use natural and familiar language which is

comforting. If required, underline a word to indicate special emphasis.

Principle 4: Define or qualify terms that can be easily misunderstood. Write

items that are clear, precise, and relatively short. Short items are more

easily understood and less stressful than long items.

Principle 5: Do not use "leading" or "loaded" questions. Leading questions

lead the participant to ideal/ desired situations. Loaded questions include

loaded words (i.e., words that create an emotional reaction or response by

participants).

Principle 6: Avoid double-barreled questions. A double-barreled question

combines two or more issues in a single question (e.g., here is a double

barreled question: “Do you elicit information from parents and other

teachers?” It‟s double barreled because if someone answered it, it would not

possible to know whether they were referring to parents or teachers or both).

If the question includes the word "and"? If yes, it might be a double-barreled

question. Answers to double-barreled questions are ambiguous because two

or more ideas are confounded.

Principle 7: Avoid double negatives. Does the answer provided by the

participant require combining two negatives? (e.g., "I disagree that teachers

should not be required to supervise their students during library time"). If

yes, rewrite it.

Page 61: Workshop Report Final

Page | 61

Principle 8: Be careful of inadequate alternatives. Determine whether an

open-ended or a closed ended question is needed. Open-ended questions

provide qualitative data in the participants' own words. Open-ended

questions are common in exploratory research and closed-ended questions

are common in confirmatory research. Here is an open ended question:

How can your principal improve the morale at your school?

______________________________________________________________

Closed-ended questions provide quantitative data based on the researcher's

response categories.

Principle 9: Use mutually exclusive and exhaustive response categories for

closed-ended questions. Mutually exclusive categories do not overlap (e.g.,

ages 0-10, 10-20, 20-30 are NOT mutually exclusive and should be

rewritten as less than 10, 10-19, 20-29, 30-39, ...). Exhaustive categories

include all possible responses (e.g., if in a national survey of adult citizens

(i.e., 18 or older) then these categories (18-19, 20-29, 30-39, 40-49, 50-59,

60-69) are NOT exhaustive because there is no where to put someone who is

70 years old or older.

Principle 10: When asking for ratings or comparisons, provide a point of

reference. Also, provide if necessary, systematic quantification of response.

Consider the different types of response categories available for closed-ended

questionnaire items. Rating scales are commonly used. Numerical rating

scales (where the endpoints are anchored; sometimes the center point or

area is also labelled) are as below.

1 2 3 4 5 6 7

Very Low Very

High

1 2 3 4 5

Strongly Agree Neutral Disagree Strongly Agree

Disagree

Omitting the centre point on a rating scale (e.g., using a 4-point rather than

a 5-point rating scale) does not appreciably affect the response pattern.

Page 62: Workshop Report Final

Page | 62

Some researchers prefer 5- point rating scales; other researchers prefer 4-

point rating scales. Both generally work well.

Rankings (where participants put their responses into rank order, such as

most important, second most important, and third most important) can be

converted into Likert Scale.

Semantic differential (i.e., where one item stem and multiple scales that are

anchored with polar opposites or antonyms, are included and are rated by

the participants). It is similar to Likert Method in that the respondent

indicates an attitude or opinion between two extreme choices.

Checklists (i.e., where participants "check all of the responses in a list that

apply to them") can also be used.

Principle 11: Use multiple items to measure abstract constructs. This is

required to have high reliability and validity. One approach is to use a

summated rating scale also known as Likert Scale(such as the Rosenberg

Self-Esteem Scale that is composed of 10 items, with each item measuring

self-esteem).

Principle 12: Use multiple methods to measure abstract constructs. Use of

only one method might result in artefact of that method of measurement. If

more than one method is used, check can be kept whether the answers

depend on the method.

Principle 13: Use caution if reverse wording is used in some of the items to

prevent response sets. (A response set is the tendency of a participant to

respond in a specific direction to items regardless of the item content).

Reversing the wording of some items can help ensure that participants don't

just "speed through" the instrument, checking "yes" or "strongly agree" for

all the items. Evidence suggests that the use of reverse wording reduces the

reliability and validity of scales.

Principle 14: Phrase questions so that they are appropriate for all

respondents. Also, avoid unwanted assumptions. Design questions that will

give a complete response. Develop a questionnaire that is easy for the

participant to use. The participant must not get confused or lost anywhere

in the questionnaire.

Page 63: Workshop Report Final

Page | 63

Principle 15: Always pilot test the questionnaire with a small group of

persons similar to those who will be used in the study. These dry runs will

be worth the time and effort. Based on the observations of pilot study, revise

and re-revise the questions if necessary.

Principle 16: Questionnaire should seek information that can not be

obtained from other sources such as college record/ reports or census data.

Principle 17: Questions are to be presented in good psychological order,

proceeding from general to more specific questions.

Principle 18: It is advisable to pre-construct a tabulation sheet, anticipating

how the data will be tabulated (Dummy Tables) and interpreted, before the

final form of the questionnaire is decided on.

Principle 19: Questionnaire should be attractive in appearance, neatly

arranged, and clearly duplicated or printed.

Principle 20: It should be as short as possible and only long enough to get

the essential data

Strengths and Weaknesses of Questionnaires

Strengths Weaknesses

Good for measuring attitudes and eliciting other content

Usually must be kept short.

Inexpensive (mail questionnaires & group administered questionnaires).

Reactive effects may occur (e.g., interviewees may show only what is socially desirable).

Can provide information about participants‟ internal meanings and ways of thinking.

Non-response to selective items.

Can administer to probability samples. People may not recall important information and may lack self-awareness.

Quick turnaround. Response rate may be low for mail and email questionnaires.

Can be administered to groups. Open-ended items may reflect differences in verbal ability, obscuring the issues of interest.

Perceived anonymity by respondent may be high.

Data analysis can be time consuming for open-ended items.

Moderately high measurement validity (i.e., high reliability and validity)

Measures need validation.

Closed-ended items can provide exact information

Open-ended items can provide detailed information

Ease of data analysis for closed-ended items.

Useful for exploration as well as confirmation.

Interviews

Page 64: Workshop Report Final

Page | 64

In an interview, the interviewer asks the interviewee questions (in-person or

over the telephone). It is often superior to other data gathering devices.

Through this technique, the researcher may stimulate the subject‟s insight

in to his/her own experiences, thereby exploring significant areas not

anticipated in the original plan of investigation. It is the appropriate

techniques when dealing with children. Trust and rapport are important.

Probing is available and is used to reach clarity or gain additional

information. It is necessary to consider the gender, race, and possibly other

characteristics of the interviewer. However, distilling the essence of the

reaction is difficult, and interviewer bias may be a hazard. In interviews

actual wording of the responses should be retained. The validity can be

increased by conducting a structured interview. Though time consuming,

this technique, is useful in areas where human motivation is revealed

through actions, feelings, and attitudes. Interviews can be classified as

qualitative and quantitative. Quantitative interviews utilise closed-ended

questions and are standardised. Unlike qualitative interviews consist of

open ended questions. These can be further subdivided into informal

conversational interview (which is spontaneous and is loosely structured);

Interview Guide Approach (which is more structured having interview

protocol. Wording and sequence of questions can be altered by the

interviewer); and

Standardized Open-Ended Interview (where the questions are in a protocol

strictly adhered to and the wording can not be changed).

Strengths and Weaknesses of Interviews

Strengths Weaknesses

Good for measuring attitudes and most other content of interest.

In-person interviews are expensive and time consuming.

Allows probing and posing of follow-up questions by the interviewer.

Reactive effects (e.g., what is socially desirable).

Can provide in-depth information. Investigator effects may occur (e.g., untrained interviewers distort data due to personal biases and poor interviewing skills).

Can provide information about participants‟ internal meanings and ways of thinking.

Interviewees may not recall important information and may lack self-awareness.

Closed-ended interviews provide exact information.

Perceived anonymity by respondents may be low.

Telephone and e-mail interviews provide very Data analysis is time consuming for open-ended

Page 65: Workshop Report Final

Page | 65

quick turnaround. items.

Moderately high measurement validity (i.e., high reliability and validity)

Measures need validation

Can use with probability samples.

Relatively high response rates are often attainable.

Useful for exploration as well as confirmation.

.

Q Methodology

Q-Methodology is a technique for scaling objects or statements. It is a

method of ranking attitudes or judgements and is primarily effective when

the number of items to be ranked is large. The procedure is known as Q-

Sort, in which cards or slips bearing the statements or items are arranged in

a series of numbered piles.

Social Scaling

Sociometry: It is a technique for describing the social relationships among

individuals in a group. In an indirect way this technique attempts to

describe attraction or repulsion between individuals by asking them to

indicate whom they would choose or reject in various situations.

Diagrammatically, it can be represented as Venn or Chapati. In medical

education, health care seeking behaviour of a community or group can be

studied.

Sociogram: Sociometric choices may be represented graphically on a chart

known as sociogram. In this chart, those most chosen are referred as Stars,

and those less chosen as Isolates. Small groups made up of individuals who

choose one another are Cliques. Sociometry is a peer rating rather than a

rating by superiors. Students of group relationships and classroom

teachers may construct a number of sociograms over a period of time to

measure changes that may have resulted from efforts to bring isolates into

closer group relationships or to transform cliques into more general group

membership. Another technique used also determines the social-Distance.

This Social-Distance Scale attempts to measure to what degree an

individual or group of individuals is accepted or rejected by another

individual or group. The target sociogram is depicted as below. In this

Page 66: Workshop Report Final

Page | 66

diagram, nested series of concentric circles are drawn based on the points

that are equal in terms of how frequently they were chosen. Points in the

central circle are more central in the sense that they were chosen more

often. Points at the edge were chosen less often. The lines connecting them

represent the primary links between pairs. And all the points are placed in

the rings in such a way that the lines connecting them are relatively short. d

“Guess-Who” Technique

Developed by Hartshorne and May, 1929, Guess-Who technique is a

process, consists of descriptions of the various roles played by children in a

group. Children are asked to name the individuals who fit certain verbal

descriptions.

Name the teacher who always comes late to the class

Name the teacher who always uses the word” you know”

Name the student who always smiles and happy

Focus Groups

A focus group is a situation where a focus group moderator keeps a small

and homogeneous group (of 6-12 people) focused on the discussion of a

research topic or issue. Focus group sessions generally last between one

and three hours and they are recorded using audio and/or videotapes.

These groups are useful for exploring ideas and obtaining in-depth

information about how people think about an issue.

Strengths and Weaknesses of Focus Groups

Page 67: Workshop Report Final

Page | 67

Strengths Weaknesses Useful for exploring ideas and concepts.

Sometimes expensive.

Provides window into participants‟ internal thinking.

Difficult to find a moderator with good facilitative and rapport building skills.

Can obtain in-depth information. Reactive and investigator effects occur if participants feel they are being watched or studied (Hawthorne Effect).

Can examine how participants react to each other.

May be dominated by one or two participants.

Allows probing. Difficult to generalize results if small, unrepresentative participants sample

Most content can be tapped. May include large amount of extra or unnecessary information.

Allows quick turnaround. Measurement validity may be low.

Usually should not be the only data collection methods used in a study.

Data analysis can be time consuming because of the open-ended nature of data

Observation

In the method of data collection called observation, the researcher observes

participants in natural and/or structured environments as participant or

non-participant. Observation can be carried out as time sampling technique

(based on observation of individuals behaviour for every 60 seconds or more)

or it can be carried out as a frequency count (based on the number of

occurrences of a particular type of behaviour). Observation is specifically

used effectively to scout the performance of opposing teams in sports. It is

important to collect observational data (in addition to attitudinal data)

because what people say is not always what they do! Observation can be

carried out in two types of environments namely Laboratory observation

(which is done in a lab set up by the researcher) and Naturalistic

observation (which is done in real-world settings). However, observation

should always be systematic, directed by a specific purpose, carefully

focussed and thoroughly recorded. Criterion-related and construct validity

are necessary. It is recommended that observations should be double-blind

(both the observers and the observed are unaware of the purpose of the

study and are unaware of the observation process). It is also suggested in

order to reduce observer bias, conduct study by more than one observer.

Always have a dry-run phase before the implementation. Simultaneous

Page 68: Workshop Report Final

Page | 68

recording of observations is highly recommended. Educational research

seeks to describe behaviour under less rigid controls and more natural

conditions. There are two important forms of observation.

1 Quantitative observation involves standardization procedures, and it

produces quantitative data (The following data is collected: Who is observed;

what is observed; when the observations are to take place; where the

observations are to take place; and how the observations are to take place).

Standardized instruments (e.g., checklists) are often used in quantitative

observation. Two sampling procedures are also often used in quantitative

observation. Time-interval sampling (i.e., observing during time intervals,

e.g., during the first minute of each 10 minute interval) and Event sampling

(i.e., observing after an event has taken place, e.g., observing after teacher

asks a question).

2 Qualitative observation is exploratory and open- ended, and the

researcher takes extensive field notes. The qualitative observer may take on

four different roles that make up a continuum:

• Complete participant

• Participant-as-Observer (i.e., spending extensive time "inside" and

informing the participants that you are studying them).

• Observer-as-Participant (i.e., spending a limited amount of time "inside"

and informing them that you are studying them).

• Complete Observer

Strengths and Weaknesses of Observational Data

Strengths Weaknesses Allows one to directly see what people do without relying on what they say.

Reasons for observed behaviour may be unclear.

Provides firsthand experience. Reactive effects may occur when respondents know they are being Observed( Hawthorne Effect).

Can provide relatively objective measurement of behaviour.

Investigator effects (e.g., personal biases and selective perception)

Observer may see things that escape the awareness of people in the setting.

Sampling of observed people and settings may be limited.

Excellent way to discover what is occurring in a setting.

Cannot observe large or dispersed populations.

Helps in understanding importance of contextual factors.

Some settings and content of interest cannot be observed.

Can be used with participants with weak Collection of unimportant material

Page 69: Workshop Report Final

Page | 69

verbal skills. may be moderately high.

Provide information on things people would otherwise be unwilling to talk about.

More expensive to conduct than questionnaires and tests.

Observer may move beyond selective perceptions of people in the setting.

Data analysis can be time consuming.

Good for description.

Provides moderate degree of realism.

Secondary/Existing Data

Secondary data (i.e., data originally used for a different purpose) are

contrasted with primary data (i.e., original data collected for the new

research study). The most commonly used secondary data are documents,

physical data, and archived research data.

1. Documents. These are Personal documents (i.e., Letters, diaries,

family pictures) and Official documents (i.e., Newspapers, annual reports,

yearbooks, minutes).

2. Physical data (are any material thing created or left by humans that

might provide information about a phenomenon of interest to a researcher).

3. Archived research data (i.e., research data collected by other

researchers for other purposes, and these data are save often in tape form or

CD form so that others might later use the data).

Strengths and Weaknesses of Secondary Data

Physical Data

Strengths Weaknesses

Can provide insight into what people think and what they do.

May be incomplete.

Unobtrusive, making reactive and investigator effects very unlikely.

May be representative only of one perspective.

Can be collected for time periods occurring in the past (e.g., historical).

Access to some types of content is limited.

Provides background and historical data on people, and organizations.

May not provide insight into participants‟ thinking for physical data.

Useful for corroboration. May not apply to general populations.

Grounded in local setting.

Useful for exploration.

Archived research data

Strengths Weaknesses

Page 70: Workshop Report Final

Page | 70

• Archived research data are available on a wide variety of topics.

• May not be available for the population of interest to you.

• Inexpensive. • May not be available for the research questions of interest to you.

• Often are reliable and valid (high measurement validity).

• Data may be dated.

• Can study trends. • Open-ended or qualitative data usually not available.

• Ease of data analysis. • Many important findings have already been mined from the data.

• Often based on high quality or large probability samples.

Page 71: Workshop Report Final

Page | 71

Lesson V:

Qualitative and Mixed Methods in Educational Research

Objectives:

1. Understand the types of qualitative research (phenomenology, ethnography, grounded theory, case study and concept maps)

2. Describe the various qualities (SWOT) of qualitative research 3. Understand the data collection tools and techniques for qualitative

data(questionnaire, Interview, Focus group, Observation ) 4. Types of Qualitative data. Common misconceptions about qualitative research 5. Mixed Methods

Lesson Outline:

1 Definition and General Characteristics of qualitative research namely Phenomenology, Ethnography, Discourse Analysis, Grounded Theory and Case Study. The characteristics of each method. Concepts in Cognitive mapping.

2 The design, data collection, field work and Analysis strategies of qualitative research. SWOT Analysis of each method. Utilisation of these methods in medical education.

3 Quality in qualitative research. General Characteristics of Data collection methods such as questionnaire, Interviews, Focus group discussions and Observations. How can these methods be utilised in Educational Research

4 Types of data such as video recordings, Audio recordings, unstructured text. Common questions and doubts expressed against qualitative research such as is it scientific, Are the findings generalizable, effect of presence of researcher on the observations etc.

5 What are the mixed methods? How effectively qualitative and quantitative methods can be amalgamated.

Qualitative research relies primarily on the collection of qualitative data (i.e.,

nonnumeric data such as words and pictures). There are four major types of

qualitative research:

• Phenomenology.

• Ethnography.

• Grounded theory.

• Case study.

1 Phenomenology

Page 72: Workshop Report Final

Page | 72

Phenomenology is a descriptive study of how individuals experience a

phenomenon. In this , researcher study what is the meaning, structure, and

essence of the lived experience of this phenomenon by an individual or by

many individuals. The researcher tries to gain access to individuals' life-

worlds, which is their world of experience; it is where consciousness exists.

Conducting in-depth interviews is a common method for gaining access to

individuals' life- worlds. The researcher, next, searches for the invariant

structures of individuals' experiences. Phenomenological researchers often

search for commonalities across individuals (rather than only focusing on

what is unique to a single individual). For example, what are the essences of

peoples' experience of the death of a loved one?

After analysing the phenomenological research data, a report has to be

documented which provides rich description and a "vicarious experience" of

being there for the reader of the report.

2 Ethnography

Ethnography is the discovery and description of the culture of a group of

people. Here is the foundational question in ethnography: What are the

cultural characteristics of this group of people or of this cultural scene?

Because ethnography originates in the discipline of Anthropology, the

concept of culture is of central importance. Culture is the system of shared

beliefs, values, practices, language, norms, rituals, and material things that

group members use to understand their world. One can study micro

cultures (e.g., such as the culture in a classroom) as well as macro cultures

(e.g., such as the Pawra or bhil tribal culture in nandurbar district.).

There are two additional or specialized types of ethnography.

1. Ethnology (the comparative study of cultural groups).

2. Ethnohistory (the study of the cultural past of a group of people). An

ethnohistory is often done in the early stages of a standard

ethnography in order to get a sense of the group's cultural history.

Page 73: Workshop Report Final

Page | 73

The final ethnography (i.e., the report) should provide a rich and holistic

description of the culture of the group under study.

3 Case Study Research

Case study research is the detailed account and analysis of one or more

cases. Here is the foundational question in case study research: What are

the characteristics of this single case or of these comparison cases? A case

is a bounded system (e.g., a person, a group, an activity, a process). Because

the roots of case study are interdisciplinary, many different concepts and

theories can be used to describe and explain the case.

Robert Stake classifies case study research into three types:

1. Intrinsic case study (where the interest is only in understanding the

particulars of the case).

2. Instrumental case study (where the interest is in understanding

something more general than the case).

3. Collective case study (where interest is in studying and comparing

multiple cases in a single research study).

Multiple methods of data collection are often used in case study research

(e.g., interviews, observation, documents, questionnaires). The case study

final report should provide a rich (i.e., vivid and detailed) and holistic (i.e.,

describes the whole and its parts) description of the case and its context.

4 Grounded Theory

Grounded theory is the development of inductive, "bottom-up," theory that is

"grounded" directly in the empirical data. Here is the foundational question

in grounded theory: What theory or explanation emerges from an analysis of

the data collected about this phenomenon? It is usually used to generate

theory. Grounded theory can also be used to test or elaborate upon

previously grounded theories, as long as the approach continues to be one of

constantly grounding any changes in the new data.

Four important characteristics of a grounded theory are

• Fit (i.e., Does the theory correspond to real-world data?),

• Understanding (i.e., Is the theory clear and understandable?),

Page 74: Workshop Report Final

Page | 74

• Generality (i.e., Is the theory abstract enough to move beyond the

specifics in the original research study?),

• Control (i.e., Can the theory be applied to produce real-world results?).

Data collection and analysis continue throughout the study. When collecting

and analyzing the researcher needs theoretical sensitivity (i.e., being

sensitive about what data are important in developing the grounded theory).

Data analysis often follows three steps:

1. Open coding (i.e., reading transcripts line-by- line and identifying and

coding the concepts found in the data).

2. Axial coding (i.e., organizing the concepts and making them more

abstract).

3. Selective coding (i.e., focusing on the main ideas, developing the story,

and finalizing the grounded theory).

The grounded theory process is "complete" when theoretical saturation

occurs (i.e., when no new concepts are emerging from the data and the

theory is well validated). The final report should include a detailed and clear

description of the grounded theory.

Mixed Research:

Mixed research is research in which quantitative and qualitative techniques

are mixed in a single study. Proponents of mixed research typically adhere

to the compatibility thesis as well as to the philosophy of pragmatism. The

compatibility thesis is the idea that quantitative and qualitative methods are

compatible, that is, they can both be used in a single research study. The

philosophy of pragmatism says that researchers should use the approach or

mixture of approaches that works the best in a real world situation. In

short, what works is what is useful and should be used, regardless of any

philosophical assumptions, paradigmatic assumptions, or any other type of

assumptions.

Today, proponents of mixed research attempt to use what is called the

fundamental principle of mixed research. According to this fundamental

principle, the researcher should use a mixture or combination of methods

that has complementary strengths and non-overlapping weaknesses.

Page 75: Workshop Report Final

Page | 75

Quantitative research Qualitative research

Strengths

Already constructed theories Data based on participants meaning

Already constructed hypothesis Useful to describe complex phenomenon

Random samples, sufficient size

generate findings

Provide individual case information

Quantitative predictions possible Provides insider‟s view point

Confounders eliminated Dynamic process is documented

Data collection is quick Data collected in naturalistic setting

Precise quantitative data Data useful to generate hypothesis/

theory

Analysis less time consuming

Researcher independent results

Higher credibility

Large population can be studied

Weaknesses

Local constituents may not

understand

May not be generalizable

Focussing on hypothesis might miss

some phenomenon

Difficult to quantitative prediction

Abstract knowledge Difficult to test hypothesis/ theory

Low credibility

Data analysis time consuming

Researcher bias can influence results

Data collection takes more time

Mixed Research

Strengths Weaknesses

Words, pictures, narrative add

meaning to numbers

Difficult for single researcher to

conduct qualitative & quantitative

Can have strengths of both

qualitative and qualitative methods

More exepensive

Page 76: Workshop Report Final

Page | 76

Can generate and test theory More time consuming

Can answer broad range of questions Complex analysis

Can provide stronger evidence

Increases generalizability

The Research Continuum

Research can be viewed as falling along a research continuum with “mono

method” research placed on the far left side, “fully mixed” research placed

on the far right side, and “partially mixed” located in the center.

Types of Mixed Research Methods

There are two major types of mixed research: they are mixed model research

and mixed method research.

Mixed Model Research : In this type, quantitative and qualitative

approaches are mixed within or across the stages of the research process.

Here are the two mixed model research subtypes: within-stage and across-

stage mixed model research.

1. In within-stage mixed model research, quantitative and qualitative

approaches are mixed within one or more of the stages of research. An

example of within-stage mixed model research would be where a

questionnaire during data collection that included both open-ended (i.e.,

qualitative) questions and closed-ended (i.e., quantitative) questions.

2. In across-stage mixed model research, quantitative and qualitative

approaches are mixed across at least two of the stages of research. A

researcher wants to explore (qualitative objective) why people take on-line

Mixed Research

Collect Qualitative data

Collect Quantitative

Data

Perform Qualitative

Analysis

Perform quantitative

Analysis

Perform Qualitative

Analysis

Perform quantitative

Analysis

Page 77: Workshop Report Final

Page | 77

college courses. The researcher conducts open-ended interviews (qualitative

data collection) asking them why they take on-line courses, and then the

researcher quantifies the results by counting the number of times each type

of response occurs (quantitative data analysis); the researcher also reports

the responses as percentages and examines the relationships between sets

of categories or variables through the use of contingency tables.

Mixed Method Research

In this method, a qualitative phase and a quantitative phase are included in

the overall research study. It‟s like including a quantitative mini-study and a

qualitative mini-study in one overall research study.

Mixed method research designs are classified according to two major

dimensions:

1. Time order (i.e., concurrent versus sequential) and

2. Paradigm emphasis (i.e., equal status versus dominant status).

Stages of Mixed Research Process

There are eight stages in the mixed research process. It is important to note

that although the steps in mixed research are numbered, researchers often

follow these steps in different orders, depending on what particular needs

and concerns arise or emerge during a particular research study. For

example, interpretation and validation of the data should be done

throughout the data collection process.

Page 78: Workshop Report Final

Page | 78

(1) Determine whether a mixed design is appropriate

(2) Determine the rationale for using a mixed design

Rationale for Mixed Research

Purpose Explanation

Triangulation Seeks convergence, corroboration of results from

different methods

Complementary Seeks elaboration, enhancement, illustration and

clarification of results from one method to the other

Development Seeks the results of one method to improve, develop

the other method

Initiation Seeks new perspectives

Expansion Seeks expansion of results by different methods

(3) Select the mixed method or mixed model research design

(4) Collect the data

(5) Analyze the data

(6) Validate the data

(7) Interpret the data

(8) Write the research report.

In conclusion, mixed research is the newest research paradigm in

educational research. It offers much promise, and we expect to see much

more methodological work and discussion about mixed research in the

future as more researchers and book authors become aware of this

important approach to empirical research

Page 79: Workshop Report Final

Page | 79

Lesson VI:

Qualitative Techniques and Computer aided Analysis

Objectives:

1 Enumerate the various qualitative methods commonly practiced in anthropology

2 How these methods can be applied to medical education with particular reference to under-graduate medical education

3 Enumerate various qualitative data analysis software and their familiarity Lesson Outline:

1 Qualitative methods include Participatory techniques, in-depth techniques and systematic techniques. The participatory techniques are valuable and popular. Identifying the health resources, drawing socio-cultural relationships, mapping of health needs, transects etc are utilised by the educationalists for teaching and learning in medical curriculum.

2 These methods can be either demonstrated in the field practice area of

department of community medicine. Or it can be practiced in the hospital

3 Anthropac, Answer, Atlas-ti are some of the qualitative freeware available

for qualitative data analysis.

Introduction:

Qualitative research is type of formative research that includes specialized

techniques for obtaining „in-depth responses‟ from respondents. Qualitative

research is often conducted to answer the question - why? A purpose of qualitative

research is the construction (not the discovery) of new understanding. At present

there is a revival in the qualitative methods. The reasons for this revival of interest

in qualitative Methods are depicted below. 1) Growing realization of unsuitability of

survey research methods in the context of developing countries where population is

predominantly oral and illiterate and where magnitude of non-sampling errors is

high in surveys. 2) Increased interdisciplinary team work. 3) Demand of quick

results from the ethnographic work.

The latest trend in the field of research is the combined use of quantitative and

qualitative research methods i.e. mixed-method design within a single data set.

According to Morse (2005), it is in this area that the largest abuses of qualitative

data are occurring, largely because methodological principles have not been

followed.

Types of Qualitative Methods:

Page 80: Workshop Report Final

Page | 80

The qualitative data collection techniques range from the highly structured

systematic techniques to the highly flexible people-centered participatory

techniques.

Participatory research (PR) techniques

In-depth techniques

Systematic techniques

1 Participatory research (PR) Techniques: In conventional research, knowledge is generated by the researchers and the study

subjects have no control over it. PR process intends to change existing local

problems and synthesize local people‟s knowledge with existing scientific

knowledge. PR involves people as „stakeholders‟ for their empowerment. It assumes

that the ordinary people already possess knowledge and have an understanding of

their reality.

Classification of PR Tools: Space-related PRA applications: Social mapping, transect, mobility map,

mapping

Time-related PRA applications: Daily activity schedule, seasonal diagram,

trend analysis

Relational PRA applications: Venn diagram, spider diagram, force field

analysis, pair wise ranking

Application of PR in Medical Education:

E.g. Transect walk as Public health teaching-learning tool

Social Mapping exercise for medical students, Pulai

Participatory mapping of stool positive cases

Venn Diagram with the group of students

Participatory Group work

E.g. Force Field Analysis (FFA) with medical undergraduates to explore pressing

issues in their academic life

Further resources/ Reading:

Training in Participation Series [PRA tips on CD-ROM]. Patna (India): Institute for Participatory Practices; 2004.

Rajesh Tandon (Ed). Participatory Research: Revisiting the Roots. New Delhi (India), Mosaic books; 2005.

Page 81: Workshop Report Final

Page | 81

2 In Depth Techniques:

These are qualitative in-depth flexible discussions or interviews with the group or

person who knows what is going in community about the topic on which we want

to get information. Some commonly used methods are Focus Group Discussion

(FGD), Key Informant Interviews (KII) and In-depth Interview (IDI).

Application of in-depth techniques in medical education

Formative exploration of students‟ perception about Community Medicine teaching at Mahatma Gandhi Institute of Medical Sciences, Sewagram, India. Online J Health Allied Scs. 2008;7(3):2. Link: http://www.ojhas.org/issue27/2008-3-2.htm

Portfolio based approach for teaching public health among medical under-graduates and assessment of their learning in a Medical college of rural India

3 Systematic Techniques:

These techniques can be used with almost any qualitative research methods such

as focus group or participatory research to collect systematic and structured data.

Page 82: Workshop Report Final

Page | 82

Principle - Respondent make sense of their words by grouping their observation or

experiences in class known as domain. Examples - Free listing, Pile sorting, Delphi

panel etc. Free list combined with pile sort can be used for systematic exploration

of the perceptions of respondents on a given research topic.

Application of Systematic Techniques in Research

Process Documentation of Health Education Interventions for School Children and Adolescent Girls in Rural India. Education for Health, Volume 22, issue 1, 2009.

Available from: http://www.educationforhealth.net/

Eliminating Childhood Malnutrition : Discussions with Mothers and

Anganwadi Workers. Journal of Health Studies / I: 2,3 / May - Dec. 2008.

Available from: http://www.esocialsciences.com/essJournals/ essJournalIssuesMain.asp?jid=1&issue=Current

Sample Size and Sampling Techniques:

Sample size: No mathematical formula to calculate sample size in qualitative

research. The validity, meaningfulness and insights generated from the qualitative

data have more to do with the richness of data obtained. The process of data

collection is continued till the saturation point i.e. where no new information is

added after additional interviews or focus group discussions.

Sampling Techniques:

1) Purposive sampling, where sample units are selected with definite purpose in

view, e.g. victims of some events etc.

2) Convenient sampling, where the conveniently available respondents are

selected, e.g. participants of camp or workshop

3) Quota sampling is a restricted type of convenient or purposive sampling

defining the quota of sample to be drawn from different strata and then drawing

the required sample.

Page 83: Workshop Report Final

Page | 83

4) In Snow-ball sampling, the sample is driven by the respondents. It is used when

the target population is unknown or difficult to approach, e.g. such as MSM

population, Sex workers etc.

Sequencing of the Methods:

The qualitative data collection should be „on-going‟ or „iterative‟ process where one

method directs the other. In mixed methods design, quantitative (survey) and

qualitative methods are used in same research design. Pre-survey qualitative

research: for better pre understanding of the underlying dynamics, for exploring

local terms. Post- survey qualitative research: to bridge the gaps of information in

survey.

Analysis of Qualitative Data:

It is a multi-faceted endeavor. It requires planning, capacity for being open to views

that that are different from your very own, an appreciation of provisional nature of

human knowledge, strong conceptual skills and excellent scholarship

Some Commonly Used Terms:

Interim analysis: On-going and iterative (non-linear) process. Interim analysis

continues until the process or topic the researcher is interested in is understood.

Coding: It is defined as making the segments of data with symbols, descriptive

words or category name.

Memo: It is recording reflective notes about what you are learning from your data.

Include those memos as „additional data‟ to be analyzed.

Content analysis: Subjective interpretation of content of text data through the

systematic classification process of coding and identifying themes or patterns.

Steps in the process of Content Analysis:

Step1: Transcription: The raw data need to be transformed into written text

format before analysis.

Step 2: Deciding the unit of analysis: Defining the coding unit is one of the most

fundamental and important step. The commonly used coding units are word,

concept, sentence, paragraph and theme.

Step 3: From units to categories: categories and code schemes can be derived

from three sources such as a) data itself, 2) previous related studies, 3) theories.

Step 4: Test coding on sample test: If there is low inter-coder agreement then

revise the rules of coding sample text and checking coding consistency.

Step 5: Code all text data: When sufficient consistency is achieved then coding

rules can be applied to code all text data.

Page 84: Workshop Report Final

Page | 84

Step 6: Assess the coding consistency: After coding all text data, coding

consistency needs to be re-checked. Human coders are subject to fatigue and are

likely to make mistakes as coding proceeds.

Step 7: Drawing conclusions from the coded data: This is a crux of qualitative

data analysis. It involves reading and re-reading of text data. The activities involve

exploring properties and dimensions of categories and identify relationships

between categories.

Step 8: Reporting: While writing report it is important to maintain the balance

between description and interpretation. Here, one can use conceptual frameworks

derived from the data set.

Methods to ensure Validity in Qualitative Research:

Researcher as detective: The researcher has to develop the understanding of the

data through careful consideration of potential causes and effects by systematically

eliminating the rival explanations and hypothesis until the final cause is made

beyond a reasonable doubt.

Extended field work: The researcher should collect data in the field over the

extended period of time.

Low-inference descriptors: The use of descriptions phrased very close to the

participant‟s account or researcher‟s field notes. Verbatim i.e. direct quotations are

used as low-inference descriptors.

Triangulation: A combination of multiple methods, multiple investigators to collect

and interpret data adds to the validity of the results.

Participant feedback: The feedback and discussion on the researcher‟s

interpretation and conclusions with actual participants and other members of the

community helps in verification and better insight into the research problem.

Peer-review: Discuss the findings with the disinterested peer e.g. other researcher

who is not directly involved. Peer should be skeptical and play the devil‟s advocate,

challenging the researcher to provide solid evidence for any interpretation or

conclusion

Use of Software in the Analysis of Qualitative Data:

For smaller data set „manual content analysis‟ is undertaken. Here, coding is done

manually along a narrow blank column of the text document. A computer assisted

coding using software packages has significantly reduced the need for the

traditional filing technique. The most popular qualitative data analysis packages

are ATLAS-ti and Anthropac

Page 85: Workshop Report Final

Page | 85

Lesson VII:

Focus Group Discussions: Participatory and Non-Participatory techniques of

Qualitative Data Collection in Medical Education

Objectives:

1. Describe the merits and demerits of Focus group discussions 2. Understand the method of conducting Focus group and small group

Discussions. 3. Understand the method of Participatory Learning Appraisal(PLA) 4. Describe the concepts of non participatory educational techniques like Objective

structured examinations/assessments

Lesson Outline:

1 Definition and Characteristics of Focus Group. When to use Focus Groups. Role in Educational Research. What are advantages and disadvantages of FGD

2 How to conduct the discussions and the ground rules. Purpose, Preparations and Process. Recording of Focus Group Data.

3 Triangulation of Qualitative Research (Team members, Participants and Methods). Other methods of Participatory (Interviews, Venn Diagram, Pile sorting, Transects, Direct Observation (Walkabout), Portfolio, Participatory Learning for Assessment, Rapid Assessments, case/ event narratives, Role Play) and Non-participatory techniques(Time Line, Free Listing, Priority Matrix, Objective Structured assessments ). Details of purpose, preparation and process of Participatory Learning for Assessment.

4 Details of conducting the Objective structured Assessments both clinical and practical (OSCE,OSPE)

What is Focus Group Discussion? It is a technique of gathering data and insights from discussions and interactions among participants in a group, facilitated by a moderator . It promotes exchange of ideas among participants and is focused, but flexibly structured discussion. It is ideal for exploring norms, expectations, values and beliefs and NOT personal experiences. FGD is a group discussion of approximately 6-12 persons guided by a facilitator, during which the members of the group talk freely and spontaneously about a certain topic. It thus, aims to be more than question-answer interaction

where in the members are also encouraged to discuss the topic among themselves. When to use FDG:

o Focus research & develop relevant research hypotheses o Formulate appropriate questions for more structure and large scale surveys o Help understand and solve unexpected problems in interventions o Develop appropriate messages for health education programs o Explore controversial topics

Why Focus Group?

o Defining the research concept o Developing hypothesis o Generating Vocabularies

Page 86: Workshop Report Final

Page | 86

o Framing a questions in large scale surveys o Providing supplementary information related to the community's

knowledge, beliefs, attitudes and behavior on specific issues Composition and Selection of Participants: Select 8-10 individuals in a group willing participants NOT individuals who will dominate the discussion or inhibit the participation of others in the group. Participants are selected in advance by either random sampling or by any alternative criteria. The members are homogenous viz. regarding major social divisions and/or background characteristics. Age and sex often considered for assigning participants into different groups. Inform the participants about the topic of exploration through personal experience or interest arising from a particular role or position. The date, time and venue of the FGD is fixed in advance. A time limit of one and one-half hours is desirable and two hours is the maximum. Anonymity of the participants is preferred Members of the research team: a moderator (facilitator), a note-taker & recorder Guidelines for the FDG Participants: One participants speak at one time and clearly. Try gathering everyone‟s perspective/opinion and encouraging participation. Process of FDG: FGD guidelines are to be pre-tested in advance. More than one FGD is to be conducted Moderator /note-takers should be trained in advance. In recruitment of the participants take help from key informant so that homogeneity can be maintained. The process need to be recorded in addition to the routine note-taking. Ideally FGD should be of 90 minutes duration. Make physical arrangements for setting, equipment, food and drinks, and child care if necessary. Select the location and time for FGD. Essential Steps: Starting the Discussion

• Collect socio-demographic details informally • Summarize the purpose of the study • Describe the focus group discussion process

- No right or wrong answers - All should participate - All should respect the opinions of others

• Make sure everyone understood the informed consent • Ask participants to guard the confidentiality of others in the groups

Conducting the Discussion • Begin with warm-up questions • Be aware of who is talking and who is not

- Do not allow one or two individuals to dominate • Use broad, open-ended questions

- Avoid yes or no or short answer questions • Always probe • Record body language, nonverbal communication

Documenting the Discussion • Expand notes of the discussion • Record in writing any nonverbal data • Don‟t imply judgement • Add researcher‟s comments in parentheses • Finalise field notes as soon as possible after the discussion • If tape recorder is used, complement taped transcripts with field notes in

preparing final transcripts

Page 87: Workshop Report Final

Page | 87

Role of Moderator

• Introduce the session • Encourage discussion • Creating a climate for open exchange

- explaining the goal of discussion - setting ground rules - encouraging participation by all

• Guiding the discussion - introducing topics with main questions - eliciting detailed information with follow up questions - probing meaning of responses - Should not dominate the discussion

• Keeping the discussion focused • Encourage involvement of every member

• Monitor involvement & interaction among participants • While maintaining the core theme of the discussion ensure flow of

conversation. • Avoid being placed in the role of expert • Control of rhythm of the meeting, but avoid an unobtrusive way • Take time to end the meeting to summarize and check for agreement /

disagreement on important topics • Build rapport, empathize • Thank each of the participants personally for their participation • Monitor involvement & interaction among participants • Encourage involvement of every member • While maintaining the core theme of the discussion ensure flow of

conversation. Role of Note Taker/ Recorder Items to be recorded

• Date, time and place • Number, names and description of each participant • General description of group dynamics (level of participation, presence of

dominant participant, level of interest etc.) including non-verbal interaction among the participants

• Opinions of the participants including key statements • Emotional aspects (reluctance, strong feelings attached to a certain topic)

including any non-verbal communication • Taking notes without disturbing the discussion including identity of the

speakers • Spontaneous relevant discussion during breaks or after the formal session /

discussion • Works as back up to the moderator by drawing attention to missed

comments from participants and missed topics Designing the Interview Guide for FGD

o Must provide the moderator with he topics and issues that are, to the extent possible, to be covered at some point during the discussions

o The guide is loosely structure and does not suggest potential responses. o The questions should be unstructured, unbiased and non-threatening o Progression of the topics in the guidelines should be logical and should

move from general topic to specific topic

Page 88: Workshop Report Final

Page | 88

o The guide should not overly done or have too many questions (preferably should have no more than 20 questions / topics)

o Pretesting of guidelines with several mock sessions is essential Strengths and Limitations of FGD

o Should not be used for quantitative purpose, e.g. the testing of hypotheses or generalization of findings for larger areas that may need more elaborate surveys

o FGD can be used to complement findings from the surveys and other qualitative techniques as using it alone may be risky as the people tend to centre their opinion on the most common ones on Social Norms.

o FGD may not be very useful on sensitive topics where members may hesitate to air their feelings and experiences freely (sexual behavior/HIV AIDS)

o Evaluator has less control than individual interview o Groups are often difficult to assemble

Do‟s and Don‟ts of Moderator

Dos Don‟ts

Make everyone feel welcome Speak in a loud and clear voice Be flexible Include everyone in the discussion Leave enough time for people to answer question (enjoy the silence) Vary your style of asking questions to get a variety of answers Probe for clarity Allow diverse opinion to emerge

Talk too much Let one person dominate Fail to stay neutral on the issue More than one question at a time Ask „Yes‟ or „No‟ questions (instead ask open ended questions) Go over the allotted time Forget to thank people for participating

Exercise:

• Select a study topic • Prepare three questions for FGD discussion

Key Informant‟s Interview: Definition

• “Qualitative, in-depth, flexible interviews with persons who know what is going in the community, “experts” (knowledgeable) about a topic on which we want to get information.”

• Note that the key informant interview is usually not about that person herself, but about the topic on which she has information.

Objective: • The purpose of Key Informant Interviews is to collect information from a wide

range of people- including community leaders, professionals, or residents-who have first hand knowledge about the community , and our research topic.

• To get general information about the local community • These community experts, with their particular knowledge and

understanding, can provide insight on the nature of problems and give recommendations for solution.

When to Conduct Key Informant Interview:

Page 89: Workshop Report Final

Page | 89

• To get more candid or in-depth answers. Focus Group dynamic may prohibit you from candidly discussing sensitive issues or getting the depth of information you need. Sometimes group dynamic can prevent some participants from voicing their opinions about sensitive topics

Choosing Key informants: • KI must have first- hand knowledge about community, its residents and

issues or problems you are trying to investigate • KI can be a wide range of people, agency representatives, community

residents, community leaders, or community business owners. • ex., Religious leaders, government officials, young mothers, youth, minority

population etc. • Should have a diverse mix of key informants to ensure variety of

perspectives Exercise:

• Select a study topic • Identify key informant for your research topic

Page 90: Workshop Report Final

Page | 90

Lesson VIII:

Descriptive & Inferential Statistics

Objectives:

1. Understand the concepts in Frequency distribution, Measures of central tendency, measures of variability.

2. Describe the various sampling distributions and Procedures used in educational research ( purposive, opportunistic, critical case)

3. Able to conduct Hypothesis testing. 4. Understand the concept of t-test, Analysis of Variance and Chi-square tests.

Lesson Outline:

1 Basics of Frequency Distribution both tabular and graphic representations. Comparison of Mean, Median, Mode. Utility of Percentiles in education. Normal and Skewed curves. Rankings. Z-Scores and Regression Analysis.

2 Types of Samples used in education for qualitative research. Purposive sampling, Random Sampling. Opportunistic and Critical case sample. How to calculate sample size through computers.

3 What is null and alternate hypothesis? How they can be constructed for educational studies. What is the probability value and Significance level. Various steps involved in Hypothesis testing.

4 Basics on performing t-test and Analysis of Variance (both one way and two way). Utilisation of x2 test for contingency tables.

Descriptive and inferential statistics

A field of statistics can be divided into Descriptive and Inferential statistics.

Flow charts are as below Statistics

Descriptive Inferential

Estimation Hypothesis

Testing

Point Interval

Descriptive statistics: to describe, Summarize and explain the data.

Page 91: Workshop Report Final

Page | 91

How to prepare the data set?

A data set with the “cases” going down the row and the “Variable” going

across the columns.

Once you put your data set into a statistical software Programme such as

SPSS, Minitab, Epi-info, SAS etc., you are ready to obtain all the descriptive

statistics.

In descriptive statistics you will get

Inferential statistics: is the branch of statistics that is used to make inference

about the characteristics of a population based on sample data.

Medical Uncertainty: Uncertainties arising from various kinds of variations, lack

of knowledge, partial compliance, errors etc., in medical decision process.

A surgeon carried out the major surgery of his career and was successful. He

was quite uncertain about the success of the second. Uncertainty level

decreased as the number of successful surgeries piled up, Naturally! All of 1st

10 major surgeries were successful. An amazing feat indeed. The Surgeon was

certain that the Eleventh too would be successful. But it failed for no apparent

reason. This could be one of those unlucky flukes that can always occur in

Medical Sciences due to biological and other variations. The Surgeon had not

cared to examine the statistical chances. If the long-term failure rate is as high

as 25%, there is still a significant chance that all 10 surgeries would be

successful. Failure in the eleventh was not such a surprise after all at least

statistically.

Biostatistics

There are many fields of applied statistics depending on the science where it is

applied. Biostatistics is the branch of statistics applied to biological or medical

sciences. It is also called biometry. The Greek roots are bios (life) and metron

(measured); hence biometry means measurement of life. It may be stated as the

application of statistical methods to the solution of biological problems.

Biostatistics covers applications and contributions not only from health,

medicine and nutrition but also from fields such as agriculture, genetics,

biology, biochemistry, demography, epidemiology, anthropology and many

others. Biostatistics today has a wide coverage of applications.

Application of statistical methods in biological sciences, which include Medical

and Dental sciences. Or The science dealing with medical uncertainties – their

identification, measurement, and control, leading to decision with less error.

Page 92: Workshop Report Final

Page | 92

Sources of Medical Uncertainties:

This can be divided into three broad groups.

1. Genuine variability:

This arising from natural variations due to Biological, Environmental, and

Sampling factor.

a. Biological variability:

*The primary source of uncertainty in health and disease is variations in

biological characteristics among individuals. You know that, every person is

unique because of decreasing morphological features.

*Individuals also vary according to age and sex, height and weight, heredity and

parity etc. They are also vary with regard to parameters such as blood glucose level,

cholesterol level and creatinine excretion.

b. Environmental variability:

* The environmental factors give variability of these biological characteristics.

*Unclean water and lack of sanitation are responsible for diseases such as Typhoid,

Cholera, and Polio. Goitre is found in area with Iodine deficiency.

*The environmental factors affect not only the incidence of various diseases but

also the patient management methods.

Chance variability:

*Identical twins born at the same time they may have different birth weight. How do

you explain such a variation?

*Nobody knows why one set of parents has three daughters and the other has three

sons. „Chance‟ determines the gender of the unborn child.

*If you repeatedly analyze same blood sample two or more times in the same

laboratory with same method and reagents, there could still be some variation,

although this could be minor.

2. Variability due to Unstandardized Methods (Experimental

variability):

Variation occur evening case of perfectly executed standardized methods. Also

realize that it is not all that easy to attain complete standardization.

Unstandardized methods and procedures cause another type of uncertainty.

a. Observer Variability:

We can say with confidence that physician in India dramatically differ on the

average duration of survival after detection of HIV infection. This varies from

Page 93: Workshop Report Final

Page | 93

patient to patient but the average should not vary too much. Eg. One physician

insisting that the average survival is just 3 years and the other insisting that it is

10 years.

b. Instrumental Variability / Error:

Pain intensity measured by visual analogue scale can differ from visual rating

scale.

c. Laboratory Variability:

For inter-laboratory variability, suppose we split the same blood sample into two

bottle and send to two different laboratories for estimation of Hb / lipoprotein (a) /

CBC etc. One laboratory reported 12 mg./dl and other 14 mg./dl. Both used the

same technique. Possibly the quality of chemicals and reagents was not the same.

Above all, the human skill in the two laboratories could be vary different.

3. Variability due to Partial Information:

a. Unavoidable incomplete information:

We have already discussed sources of uncertainty that are genuine and can be

avoided. Incomplete information on a patient can be due to carelessness of the

assessor but for the time being we are concern with unavoidable situation. Some

time this can happen eg. When a patient comes in coma after an injury, in this

case you have to observe the case on this basis you have to manage the case, there

is not much that you can obtain by way of history.

b. Avoidable incomplete information:

Needless to say that you would like to have as complete information on a patient as

possible before advising him. In an OPD of a crowed hospital where 100 patients

are attended in a three-four period, you will not have time to go into details even if

there are interns and residents to assist. Prescriptions are made on the basis of

incomplete information that can produce uncertain results. Sometime you may like

to get an investigation done such as thyroid function test or CT Scan etc. but this

is not done because it is either to expensive for the patient or simply the facility is

not available.

It is not uncommon that patients intentionally suppress part of the history as in

the case of sexually transmitted diseases (STDs). In some cases you might forget to

ask about a vital aspect or consider at that time that this is not necessary.

c. Partial Compliance:

Medical uncertainties seem to never end. If a patient is properly assessed for his

conditions and adequately prescribed, it is difficult to ensure that he is following

the complete regimen. Incomplete therapy in the case of tuberculosis is very well

Page 94: Workshop Report Final

Page | 94

known, which causes its resurgence. In a case of myocardial infarction (MI), the

patient may not follow, fully or partially, advice on exercise, dietary restrictions,

and drug intake. Hospital patient may fail to take rest as advised. The response

could accordingly vary, and the outcomes remain uncertain.

Sources of Health Data:

1. Primary: Data generate with specific objective from subject under study. or Data

collected for specific purpose directly from the field of enquiry and are original in

nature. This data gives detailed information and very less errors.

a. Experiments: It is performed in the laboratories of physiology, biochemistry,

pharmacology, clinical pathology, hospital wards, in the community etc. for

investigation and fundamental research.

b. Surveys: It is carried out for epidemiological studies in the field by trained team.

Eg. Census, Population survey, Disease survey etc.

2. Secondary Data: Data generated as a routine administrative procedure. or

When we collect the data from reports which are already published for some other

purposes which may be a processed one. While collecting secondary data we have

to observe that the source must be reliable.

a. Records / Registers: It is maintained as a routine in registers or books over a

long period of time for various purposes such as vital statistics.

b. Publications: Reports of various national, international health agencies,

scientific journal papers, books etc.

Random Number: The functions define by sample space to real number. Eg. A

couple may decide to stop the reproduction till they may get the male/female child.

Variable: Any character that varies. Or Variables are those characteristics or

attributes that varies from person to person, from time to time, or from Place to

place.

Types of Data:

There are two types of data: Qualitative & Quantitative

Scale of Measurement:

Qualitative variable are measured either on a Nominal or an Ordinal scale and

quantitative variable are measured on an Interval or Ratio scale.

1. Nominal scale: Observations are placed into broad categories, which may be

denoted, by symbols or labels or names or divide qualities into two types. One that

can be graded and the other cannot be graded.

Eg. Diagnostic groups like cancer heart disease etc. site of malignancy such as

lung, mouth, breast or ovary have no order complaints such as pain; vomiting and

Page 95: Workshop Report Final

Page | 95

constipation have no order among themselves. You can write constipation before

vomiting. Characteristics or attributes with this feature are said to be on nominal

scale. Eg.

i. Disease as present / absent,

ii. Sex as male / female,

iii. Occupation as farming / business / labour / service, etc. and

iv. Diagnosis as hepatitis / cirrhosis / malignancy, etc.

2. Ordinal scale: Categories are ranked or ordered/graded. Each category is in

unique position in relation to other categories but distances between the categories

are not known. Eg. Severity of illness such as cancer is graded as stage I, II, III, IV.

Inability to do routine work of life can be graded as none, mild, moderate, and

serious. There is a definite order in these grades. Try writing them as moderate,

serious and mild and see how awkward do you feel. Such measurements are said

to be ordinal scale.Social status as low, middle and high. Age as child, adult and

old. Health as poor, fair, good and excellent.

It is used in two situations:

a. Hypertension measured as mild, moderate, and serious, although blood pressure

can be measured exactly as a quantity. The ordinal scale is adopted in such

situations for convenience.

b. When quantitative scale is not available or is extremely difficult to adopt stage of

cancer comes under this category.

Suppose you are a medical superintendent of a hospital and want to know how

much the patients are satisfied with the service of the hospital. This can be graded

as fully satisfied, partially satisfied, and not satisfied.

3. Metric scale: When a characteristic is measured exactly in terms of quantity, it

is said to be measure on a metric scale. Eg. Duration of disease, body temperature,

pulse rate, and number of deaths in one year.

Interval scale: Distance between any two number (values of the variable) is fixed

and equal. The origin is arbitrary. Eg. Temperature in C or F. Can you say that

body temperature of 102 F is double of 51 F? Even for whether, 0 C does not mean

no temperature such measurement are said to be on interval scale.

Ratio scale: In addition to interval scale / level of measurement, it has true zero

point as its origin. Eg. Weight in kg, or pounds, height in Cms, or Kgs. Weight 30

kg is triple of 10 kg. , Parity is on ratio scale because there is absolute zero and

parity 4 is double of parity 2. Measurements on ratio scale are easy to handle.

Page 96: Workshop Report Final

Page | 96

Since you can add, multiply or divide. This convenience is not easily available with

measurements on interval scale.

Methods of Presentation:

The three basic method of summarization are:

1. Tabular, 2. Graphic / Drawing,

1. Tabulation: It consists in a systematic arrangement of data in rows and

columns.

Rules: Tabulation can develop by experience. No hard and fast rule can be stated

for satisfactory tabulation.

a. A table should be a brief but self-explanatory title, which can answer what, when

and where about the data.

b. Heading of rows and columns should be clearly stated.

c. Unit should be mention whenever necessary.

d. Avoid short forms as far as possible and large figure should be abbreviated.

e. Classes and subclasses should be clearly separated by lines.

f. Figures to be compared should be placed in neighboring columns.

g. Explanation of sign, rounding and abbreviation of figure etc. should be given in

footnote and reference or sources of data should be given in source note.

Eg. Sexwise prevalence of carriers of filaria in Miraj during the year 2003.

Particulars Sex Total

Male Female

Carriers

Non-

carriers

Total

Footnote: - -Source note: -

The above example of qualitative data. The presentation of frequency is very small

because the characteristic is not variable.

Simple table or One way classification

Eg. Distribution of girls by timing of menarche.

Timing of

menarche

Number of

girls

Early (<11Yrs) 80

Average (11-14Yrs) 140

Page 97: Workshop Report Final

Page | 97

Late (>=14Yrs) 40

Total 260

Eg. Distribution of girls by birth order.

Birth order Number of girls

1 60

2 80

3 40

4+ 80

Total 260

Cross tabulation or Two-way classification.

A frequency table involving at least two variable that have been cross – classified

(tabulated against each other).

Eg. Distribution of girls by timing of menarche and birth order.

Timing of

menarche

Birth order Total

1 2 3 4+

Early (<11Yrs) 20 20 15 25 80

Average (11-14Yrs) 30 50 20 40 140

Late (>=14Yrs) 10 10 5 15 40

Total 60 80 40 80 260

Contingency table: When the tabulation is in mutually exclusive and exhaustive

categories. Eg. Mutually exclusive means that one person can belong to only one

category. Or a girl can have timing of menarche early, average or late and not two

together. If the categories are symptoms such as pain in abdomen, vomiting and

constipation, one person can have two or three of these together. They are not

mutually exclusive.

Eg. Exhaustive means that all possible categories are included. If the 4th row in

eg.4 is for birth order 4 and not 4+, the categories would not be exhaustive.

2. In quantitative data: The characteristics and frequency are both variable.

Frequency table or frequency distribution table:

A table in which the value of a variable are classified according to size.

Page 98: Workshop Report Final

Page | 98

Rules:

a. The class or group interval between the class or group should not too broad or

too narrow.

b. The number of groups should not be too many or to few but ordinarily between 8

and 20.

c. The class interval should be same throughout.

d. Heading must be clear.

e. The rates and proportions are given mentioned in the denominator.

f. Group should be tabulated in order, from lowest to highest value in the range.

g. If certain data are omitted or excluded, reason for the same should be given.

Eg. Number of diagnosed cases of TB by age in Miraj during the year 2003.

Age in (Yrs.) Number of

cases

0 – 04 1242

05 – 14 1081

15 – 24 2482

25 – 44 8153

45 – 64 10916

65 + 7124

Total 30998

Footnote: - Source note: -

Exclusive and Inclusive Methods

Classes Frequency Classes Frequency

0 – 10 2 0 – 9 2

10 – 20 3 10 – 19 3

20 – 30 4 20 – 29 4

30 – 40 2 30 – 39 2

40 – 50 1 40 – 49 1

Total 12 Total 12

It is called exclusive method of classification. It is called exclusive method of

classification. In this case the upper limit is not included in the class. In this case

the upper and lower limit are included in the particular class.

GRAPHICAL PRESENTATION

Page 99: Workshop Report Final

Page | 99

In day-to-day reading, including newspapers, we all come across various kinds of

diagrams such as a bar diagram, a pie diagram and line diagram. They are used to

readily show the pattern of value. Some people use tricks in drawing such diagrams

to highlight their positive own feature. You should be able to track down such

instances. Also you should be able to make a judicious choice yourself about the

kind of diagram that is most appropriate for a particular kind of data.

Drawing or graphical presentation: After class-wise or group-wise tabulation the

frequency of a characteristic can be presented by two kinds of drawing

a. Graphs b. Diagrams.

Presentation of quantitative data: Continuous or measured data is done by

graphs and those in common use.

a. Histogram:

It is a set of adjoining vertical bars whose areas are proportional to the frequency

represented by the bar. Here by taking class interval on X-axis and the frequency

on Y-axis. Eg. Tuberculin reaction measured in 206 persons.

0

10

20

30

40

50

60

No

of

case

s

Pain in

abdomen

Backache Discharge

PV

Bleeding PV Urinary

complaint

Pelvic

pressure

Symptoms

Bar diagram showing symptoms wise distribution of cases in study

group

b. Frequency polygon:It is easy to construct and simple to interpret. It is a line

chart plotted in the same way as the histogram. Here class mid point on X-axis and

Page 100: Workshop Report Final

Page | 100

the frequency on Y-axis, joined this points by straight line to give us frequency

polygon.

Scatter diagram showing correlation between days of delirium

and ICU stay in study group

0

5

10

15

20

25

0 2 4 6 8 10 12 14

Days of delirium (Days)

ICU

sta

y (D

ays)

c. Frequency curve:

When the numbers of observations are very large with small class intervals, it gives

smooth curve known as frequency curve. It is slightly modification of frequency

polygon. Here by taking class interval on X-axis and the frequency on Y-axis.

Joined this points by smooth curve instead of straight line.

d. Cumulative frequency curve or Ogive curve:

We are interested to knowing “How many cases attending the hypertension clinic

had cholesterol level less than 200 mg / dl and more than 200 mg / dl”,

“Percentage of students who have failed” etc. To answer these questions, it is

necessary to add the frequencies. When the frequencies are added, they are called

cumulative frequencies. The curve obtained by plotting cumulative frequencies is

called a Cumulative frequency curve or an Ogive curve.

There are two types:

1. Less than type: We start with the upper limits of the classes and go on adding

the frequencies. Plot these points we get a rising curve.

2. More than type: We start with the lower limits of the classes and subtract the

frequencies of each class. Plot these points we get a declining curve.

e. Line chart:

This is a frequency polygon presenting variation by line. It shows an event

occurring over a period of time rising, falling or fluctuations. This kind of diagram

is best used to show trend in a metric measurement over time or over age. Growth

charts, or “Road to Health” card, used for assessment in children, are line diagram.

Eg. This is a temperature chart for a patient of tuberculosis for these consecutive

days. Evening rise in temperature in this case can indicate toxaemia.

Eg. Birth rate, growth rate, IMR, death rate from 1951 to 1991.

Page 101: Workshop Report Final

Page | 101

f. Scatter or dot diagram:

To show the nature of correlation between two variable in the same persons or

groups. Eg. Height and weight, BMI and BSL in your batch. The points are plotted

on graph paper, one for each observation. Such type of diagram shows how far the

points are scattered. Hence it is called scatter diagram. Draw a line passing

through these points maximum points on a line, half point lie above and half lie

below, to show the nature of correlation at a glance.

Eg. BMI and BSL are recorded for 23 persons; a point can be plotted for each

person with BMI on horizontal axis and BSL on the vertical axis. The vertical axis

in a scatter diagram should be dependent or the outcome variable. Eg. BSL depend

on BMI and BMI does not depend on BSL. Thus BSL should be on vertical axis.

The trend of these points may show that BSL increases when the BMI increases.

Presentation of qualitative data: Continuous or measured data is done by graphs

and those in common use.

a. Bar Diagram:

It is easy and popular method. Length of bars, vertical or horizontal indicates the

frequency of a character to be compared. Bar may be drawn in ascending or

descending order of magnitude or serial order of event. Spacing between two bars

should be equal. There are 3 types of bar diagrams.

-Simple bar diagram

-Multiple bar diagram

-Proportional bar diagram

b. Pie or Sector diagram:

This is another way of presenting discrete data of qualitative characters such as

blood group, Rh group, social group, sex group etc. Pie diagram has circular shape.

A circle is divided into sectors with areas proportional to the frequencies or the

relative frequencies of the categories of the variable.

Eg. Number of episodes of respiratory, digestive, cardiovascular, injuries etc.

attending a clinic are additive. On the other hand, some rates are not added (birth

rate: rural 30/1000 populations, and urban 16/1000 populations). So pie diagram

cannot draw for such rates. Rate will be in between 16 and 30.

Note: Since one patient can and will have more than one sign-symptoms, pie

diagram is not appropriate for this depiction also. All sign-symptoms do not add to

any thing, certainly not to the total number of patients, nor to the total number of

episodes. Thus pie diagram is not applicable.

Page 102: Workshop Report Final

Page | 102

Pie diagram showing gestational age wise distribution of cases in study

group

32%

64%

4%

28 – 31

32 – 34

35 – 37

c. Pictogram:

It is a popular method to impress the frequency of the occurrence of events to a

common man. Eg. Accidents, attacks, deaths, admitted, discharged etc. The

pictures are drawn on horizontal lines. Each picture indicates a unit of 10, 20, 100,

1000 etc. happenings. The number of pictures in each row gives quick idea of

frequency.

d. Map diagram or Spot Map:

To show the geographical distribution of frequencies of a characteristic. A dot

indicates on unit of occurrence such as attacks or deaths. The number of dots will

indicate the frequency in units.

e. Run chart: to display serial data points over time. Because our minds are

not good at remembering patterns in data, a visual display will allow you to

see the measurement of an entire process. This in turn will enable you to

see trends over time and to make adjustments accordingly.

Page 103: Workshop Report Final

Page | 103

Use; Run charts are particularly useful in conjunction with simple data from

tally or check sheets. Record tallies of a particular event that you would like

to capture, e.g. a patient's weight or times you are late for a meeting. Plot

the values on the y-axis versus number of measurements on the x-axis.

Note on the chart where any changes in the process are made. Then analyze

the data for trends. Is your patient's weight stabile over time? Did a

decrease coincide with the start of new medication? Since you bought your

electronic organizer, have you made it to more meetings on time. See

examples below.Examples

Example 1: Patient's weight over time.

SAMPLING

Sampling: The process of selecting a sample from a population.

Sample: A finite subset of statistical individuals in a population. Or sample is that

part of the target population, which is actually enquired on or investigated.

Sample size: The number of individuals in the sample. Or the number of sampling

unit included in the sample. Eg. Investigating mineral density in hipbone, you may

like to include 150 persons. In this case, this is the sample size. For ocular ailment,

the sample may be of 80 eyes irrespective of the number of persons. For clinical

trial, the sample size may be 200 cases divided equally to receive treatment and

placebo.

Sampling Unit: The unit of selection in the sampling process. Eg. a person, a

household, or a district. It is not necessarily the unit of observation or study.

Page 104: Workshop Report Final

Page | 104

In a large scale study that covers, eg. The entire country, it is desirable to select a

sample of states in the first stage, district in second stage, etc. Thus the sampling

unit is state for first stage sampling, district for second stage sampling etc.

The ultimate sampling unit is generally the same as the unit of study.

Eg. In a study on family pattern in hypertension, the sampling unit and the unit of

study is the same i.e. family.

Eg. But in other set up such as for incidence of injuries, the sampling unit could be

a family but the unit of study would be the individual. In this case, the families are

not further sampled and all individuals in the selected family are enquired

regarding the incidence of injury.

Sampling frame: A list of all units in the target population from which a sample is

drawn. If the target population comprises the deliveries in your hospital in one-

year period, the population size is known. Suppose this is N=7000. If you want to

include n=140 of these in your sample, the sampling fraction is n/N = 140/7000

=0.02 = 2%

Parameter: the statistical constant computed from the population value.

TYPES OF DATA COLLECTION DESIGN USED IN HEALTH AND

MEDICINE

Data collection

Objective: Descriptive Analytical

Method: Survey Observational Experimental

Time frame: Prospective Retrospective Cross-Sectional

Setting: Trial Animal

1. Simple random sampling:

Every sample of the same size (Every sampling unit in the sampling frame) has an

equal chance of being selected.

Advantages:

a. The sample is assured of being representative and subject only to sampling

error.

Page 105: Workshop Report Final

Page | 105

b. Estimates are easy to calculate

Disadvantages:

a. If the sampling frame is large, this method may be impracticable because of

difficulty and expense of constructing or updating it in large-scale surveys.

b. Minority subgroups of interest in the population may be present in the sample

in sufficient numbers for study.

Selection can be done by Lottery method or Random number.

Eg. Suppose you are in a big hospital where nearly 500 cases of Myocardial

infarction (MI) are reported every year. You are interested in their physiological

profile – their blood pressure, cholesterol level, creatinine phosphokinase (CPK)

level, lipoprotein (a) level and Homocysteine level. The objective for the time being is

not to compare these parameters with their healthy counterparts because that

objective will push the study into analytical domain. But the sampling will remain

the same in that case too.

You have resources to do these investigations in not more than 100 of these

patients. The target group still is these 500 cases reporting in one particular year.

That is, the findings should be applicable to all these 500 cases. Naturally you will

like to have a sample of 100 that represents a cross-section of these cases.

Lottery method: Prepare 500 similar cards of same size, mix them in a box, and

draw 1st card at random, note these number. Replace the card drawn again, mix

and draw 2nd card. Repeat the process till 100 cards drawn at random. Reject the

cards that are drawn 2nd time. The patients with these numbers will be in your

sample.

Random number: 500 cases of MI is three-digit figure. Three digit numbers are

chosen from random number table or to take help of computer, generate 100

distinct random numbers “between” 001 to 500. If the selected random number is

more that 500 then divide these number by 500 and reminder taken on your list.

2. Systematic random sampling:

The first unit is selected at random from among the first k – units. The selection of

every Kth unit in the population / list / file. Where K = N / n.

Advantages:

a. The sample is easy to select.

b. A suitable sampling frame can be identified more easily.

c. The sample is evenly spread over the entire reference population.

Disadvantages:

Page 106: Workshop Report Final

Page | 106

a. The sample may be biased if a hidden periodicity in the population coincides

with that of the selection.

b. It is difficult to assess the precision of the estimate from one survey.

Eg. One hundred cases out of 500 cases of MI are one out of five. Choose one

number at random out of 1st five and add systematically 5 cases each time. If the

1st random number is 3, the others are 8,13,18,23 etc. Patients with these

numbers will be in your sample.

Note: that in simple or systematic method, there is no assurance in this example

that there would be, say, enough obese person or enough females. Thus

physiological profile of patients in different obesity categories or genders may not be

obtained.

3. Stratified random sampling:

First the population is divided into groups or strata according to characteristic of

interest. (eg. Age, Sex, Geographical location etc.)

A simple random sample is then drawn from each stratum using the same

sampling fraction, unless otherwise prescribed for special reasons.

Advantages:

a. Every unit in a stratum has the equal chance of being selected.

b. Using the same sampling fraction for all strata ensures proportionate

representation in the sample of the characteristic being stratified.

Disadvantages:

a. The sampling frame of the entire population has to be prepared separately for

each stratum.

b. Varying the sampling fraction between strata, to ensure selection of sufficient

numbers in minority subgroups in the sample as a whole.

Eg. Out of 100 cases in the sample, 50 should be overweight or obese (BMI >=25)

and the other 50 of normal or low weight. This is because the physiological

parameters could be different in these two groups. This can be achieved when the

MI patients are first grouped into such categories. This grouping is called

stratification, and each group is called stratum. Suppose out of 500 cases of MI,

200 have BMI <25 and 300 have BMI >= 25 kg/m2. The procedures is now to select

50 out of 200 and another 50 out of the other 300 by Simple random sample

separately. This method ensures that a pre-specified number of individuals are in

the sample from different categories and none is under or over represented.

Eg. Select a stratified random sample of 20 patients from 200 patients given below

by stratification of diseases.

Page 107: Workshop Report Final

Page | 107

Strata I II III IV Total

No. of

patients

100 60 20 20 200

% 50 30 10 10 100

Sample size 10 6 2 2 20

S – I => 50% of 20 =10, S – II => 30% of 20 =6

S – III => 10% of 20 =2, S – IV =>10% of 20 =2

4. Cluster random sampling:

First the population is divided into clusters of homogeneous units, usually based

on geographical contiguity. A sample of such clusters is then selected. All the units

in the selected clusters are then examined or studied.

Advantages:

It reduces the cost of preparing a sampling frame and traveling between selected

units.

Disadvantages:

Sampling error is usually higher than for a simple random sample of the same size.

This procedure is quick and easy to administer. WHO recommends this procedure

to estimate the immunization coverage in developing countries.

Eg. Let us suppose that there are 24000 households in a city and we want to select

a sample of 800 such households. We divide the entire area of the city into clusters

(wards), which are clearly identifiable and ensure the location of each of the city

households in these clusters. Suppose 750 such clusters have been identified. The

average number of households in each cluster is 24000 / 750 = 32 household /

cluster. Cluster to be selected is an 800 / 32 = 25 cluster. Make a random selection

of 25 clusters from among the 750 identified clusters to constitute a sample of 800

households.

Eg. To evaluate vaccination coverage in Expanded Programme of Immunization

(EPI) and Universal Immunization Programme (UIP) where only 210 children, taking

7 from each cluster in the age group of 12 – 17 months are to be examined.

List of all cities, towns, villages and wards of cities with their population

falling in the target area under study for evaluation.

Calculate cumulative population and divide the same by 30, that give the

Sampling Interval (SI)

Select a random number <= SI having same number of digits, this forms the

first cluster.

Page 108: Workshop Report Final

Page | 108

Random number + SI gives 2nd cluster.

2nd cluster + SI gives 3rd cluster and so on

All houses with population are numbered. The first house should be selected

randomly with the help of random number table or number on currency note.

Before starting house-to-house survey defines the age group and item you wish to

study ie. Children age group is 12 – 17 months, fully vaccinated. They have 3 DPT,

3 Polio, 1 BCG and 1 Measles vaccination. Now survey houses starting with the

selected house till you get 7 children fully vaccinated. Thus 210 such children will

be found in 30 clusters.

5. Multistage random sampling:

-The selection is done in stages until the final sampling units (household or

persons) are arrived at. In the first stage, a list of large sized sampling units is

prepared. These may be towns or villages or schools. A sample of these is selected

at random, with probability of selection proportional to size. For each of the

selected first stage units, a list of smaller sampling units is prepared. (If the 1st

stage units are towns, then 2nd stage units may be households or houses). A

sample of these second stage units is then randomly selected from each of the

selected first stage units. These are then studied. The procedure may contain three

or more stages.

Advantages: Reduced the cost of preparing a sampling frame.

Disadvantages:

Sampling error is increased compared with a simple random sample of the same

size. Eg. Let us consider the prevalence of cataract in elderly population. In this

case to carry out sampling in stages. Select a few but predetermined number of

districts randomly in the 1st stage. Then select a few primary health center (PHC)

areas from rural and a few towns in urban areas again by random method from

each selected district. Now select a few villages from rural and a few census block

from urban. Finally select a few families from each selected village or census block.

Examine all elderly persons in these families for cataract.

If a state has 5 crore population with 3.5million elderlies, a random sample of 4

districts, 3 PHC and 3 urban area from each selected district, 10 villages or 10

census blocks from each selected areas and 20 families from each of these, would

yield a sample of 4x(3+3) x10x20 = 4800 families. These may have only about 1700

elderly persons. This sample of 1700 out of a target population of 3.5 million,

which is just about one in 2000, is an exceedingly small fraction. Yet this sample

could be very representative when random component of selection is faithfully

Page 109: Workshop Report Final

Page | 109

carried out. However the method requires that the sampling frame (list of sampling

units) be available within each selected unit.

6. Multiphase random sampling:

Part of the information is collected from whole sample and part from sub-sample.

Eg. In a Tuberculosis (TB) survey physical examination or montoux test may be

done in all cases of the sample in the 1st phase, in the second phase X-ray of the

chest may be done in mantoux test positive cases and in those with clinical

symptoms; while sputum may be examined in X-ray positive cases in the 3rd phase

only. Number in the sub-sample of 2nd and 3rd phase will become successively

smaller and smaller. Survey by such procedure will be less costly, less laborious

and more purposeful.

7. Purposive random sampling:

The sampling is purposive when cases that serve specific purpose are chosen for

generating data. Or In which the sample units are selected with definite purpose in

view. Eg. If we want to give the picture that the standard of living has been

increased in the city of New Delhi. We may take individuals in the sample from rich

and posh localities like defense colony, south extension, golf links, jor bagh,

chanakyapuri etc. and ignore the localities where low income group and the middle

class family live.

Measures of Central Tendency

The value of the measures of central tendency is regarded as the most

representative value of the given data. There are measures of central tendency such

as Mean, Median, Mode, Geometric Mean, Weighted Mean, & Harmonic Mean and

locations other than central are Quartiles, Percentiles, Deciles, & Tertiles.

Mean:

It is the sum of all the observations divided by total number of observations and

denoted by X (X bar).

Merits:

1. It is rigidly defined

2. It is It is easy to calculate and simple to understand.

3. It is based on all observations.

4. It is capable of further algebraic treatment.

Demerits:

1. It is vary much affected by extreme values.

2. When extreme class is open Arithmetic Mean cannot be calculated.

Median:

Page 110: Workshop Report Final

Page | 110

Arrange the data in ascending or descending order of magnitude. The value of the

middle most observation is called median and denoted by Me.

*Ungroup / Discrete series:

a. When the number of observations / subjects is odd then

Me = Middle most observation = ((n+1) / 2)th observation.

b. When the number of observations / subjects is even then

Me = Average of two middle most observation = 1/2((n/2)+((n/2)+1))

Merits:

1. It is rigidly defined

2. It is easy to calculate and simple to understand.

3. It is not affected by extreme values.

Demerits:

1. It is not based on all observations.

2. It is not an exact value when observations are even.

Mode:

The most common or most frequently occurring value is called mode and denoted

by Mo.

Ungroup / Discrete series:

Mo – Most common or frequently occurring value.

Merits:

1. It is easy to calculate and simple to understand.

2. It is not affected by extreme values.

Demerits:

1. It is not based on all observations.

2. It is not capable of further algebraic treatment.

Mean, Median and Mode in the symmetric and skewed distributions:

If the women are generally under-nourished, the lower values would be quite

common and the distribution would be left skewed. Fig. (a) shows a left skewed

distribution of Hb. Level in under-nourished women. A skewed distribution (left /

right) will have different Mean, Median and Mode. Very different values give a clear

indication of skewness. If left skewness Mean < Median < Mode and right skewness

Mean > Median > Mode.

If women are well nourished or in good health, maximum number may have Hb

around 14 g/dl and there will be as many on the lower side as on the upper side. If

Page 111: Workshop Report Final

Page | 111

all the three averages ie. Mean, median and mode are nearly equal; this is a good

indication that the frequency curve is symmetric, possibly Gaussian.

Where to use mean, where median and where mode to represent central value?

Guideline is as follows. Always use mean as the central value because the average

is so easy to comprehend, except when

a. Outliers are present in the data, then use median.

b. Interest specifically is in the most common value, then use mode.

Geometric mean:

It is the nth root of the products of n observation and denoted by GM.

GM =n√X1, X2, --- Xn g = antilog {logx / n}

Merits:

1. It is rigidly defined

2. It is based on all observations.

3. It is capable of further algebraic treatment.

Demerits:

1. It is neither easy to calculate nor simple to understand.

2. If any value in a series is zero then GM is also zero.

Weighted mean:

A mean for which individual values in the set are weighted, very often by their

respective frequencies.

Harmonic mean:

It is the reciprocal of the mean of the reciprocal of value/ observations and denoted

by HM.

HM = 1 / (1/n)* (1/Xi) = n / (1/Xi)

Merits:

1. It is rigidly defined

2. It is based on all observations.

3. It is capable of further algebraic treatment.

Demerits:

1. It is neither easy to calculate nor simple to understand.

2. If any value in a series is zero then HM is zero.

Location other than central:

You may have heard of a very difficult examination and the topper scoring only

72% marks. In medicine too, it is sometimes important how a patient appears

relative to the others, although he may not be optimal. In growth assessment of

Page 112: Workshop Report Final

Page | 112

children, it is common to say that the height of a particular child of age four years

is better than 80% children of his age. The interest is not in the central value but is

in other locations. Most popular measure of such location is percentile.

*Percentiles:

It divides the subject in 100 equal parts, each part containing n/100 subjects. If

n=400, then each part will have 4 subjects. The parts are identified by 99 cut

points of the measurement under consideration i.e. P1, P2, P3, - - - P99.

Ungroup / Discrete series:

Kth percentile = (k*n/100)th value after arranging in ascending order from lowest

to highest.

Group / Continuous series:

Pi = L1+(((iN/100)-c)*h/fi) i=1,2,3, - - - 99

P10=D1, P20=D2, - - - P90=D9

P25=Q1, P50=Q2=D5=Me, P75=Q3

Percentile curve is a cumulative curve drawn on a percentage basis (<type)

Deciles, Quartiles and Tertiles:

Deciles divide the group of subjects into 10 equal parts, quartiles into 4 equal parts

and tertiles into 3 equal parts. The procedures are same as above. The

denominator, which was 100 in the case of percentiles in formula Kth percentiles

mentioned above, would be 10 for deciles, 4 for quartiles and 3 for tertiles.

Measures of Variation (Dispersion)

The deviation of each and every observation from any measures of central tendency

is called measures of variation or dispersion.

When the mean value of a series of measurements has been obtained, it is usually

a matter of considerable interest to express the degree of variation or scatter

around this mean. Are the reading all rather closed to the mean or are some of

them scattered widely in each direction?

Eg. The daily Calorie requirement for a man of 25 years is given as 3200. This

requirement must very from one person to another, how large is the variation?

Types of measures of variation:

Range, Variation, Standard deviation and coefficient of variation.

Range:

The difference between the maximum value and the minimum value and denoted

by R.

R= Xmax. – Xmin.

Eg. Systolic blood pressure

Page 113: Workshop Report Final

Page | 113

Group I: 128, 132, 129, 130, 131

Group II: 140, 150, 120, 130, 110

Systolic level in-group I is 128 to 132, so that the range is 4 mmHg. In-group II,

this is from 110 to 150 and the range is 40 mmHg. The difficulty in measuring

dispersion by range is that an alteration in just one value to an extremely high or

extremely low value, drastically changes the range. If the last BP in-group I is 161

instead of 131, the range shoots to 33 mmHg. Although the other 4 values are still

closed to one another and the dispersion is not high, but the range unnecessarily

will indicate a very high dispersion because of one extremely high value. Because of

this demerit, we look for a measure that considers all the values, and not just the

minimum and the maximum value.

Standard deviation:

It is the square root of the mean of the square deviation from their mean and

denoted by sd/ (sigma)

Ungroup / Discrete series:

= √ (Xi-X)2/n =√(1/n){X2 –(X)2/n}

The square of sd is called variance and denoted by var. / sd2 / 2.

Steps:

1. Calculate the mean ie. Xbar

2. Find the difference of each observation from the mean ie. (Xi-X)

3. Square the difference of each observation from the mean ie. (Xi-X)2

4. Add the square value to get sums of square ie. (Xi-X)2

5. Divide this sums of square by number of observation –1 to get variance ie.

2 = (Xi-X)2/n-1

6.find the square root of the variance to get sd.

Coefficient of variance:

The ratio of sd to the mean and denoted by CV.

CV = sd/x

It is used to compare the variability of a characters in two group or two characters

in the same group. OR When measurements are for different persons and

parameters are different.

NORMAL DISTRIBUTION AND NORMAL CURVE

When the number of observation is very large of any variable characteristics are

taken at random to make it a representative sample eg. Height, Weight, Blood

Page 114: Workshop Report Final

Page | 114

Pressure, Pulse Rate etc. Prepared a frequency distribution table by keeping small

class interval then it will be seen that:

-Some observations are above the mean and other are below the mean.

-If they are arranged in order, deviating towards the extremes from the mean, one +

or – side, maximum number of observations will be seen in the middle around the

mean and fever at the extremes, decreasing smoothly on both sides.

-Normally half of the observations lie above and half lie below the mean

and all observations are symmetrically distributed on each side of the mean.

A nature or shape of this distribution is called Normal Distribution or Gaussion

Distribution.

If mean and standard deviation are known:

a. Mean ± 1 SD covers 68.27% observations. Remaining 32% observations lie

outside the range mean ± 1 SD.

b. Mean ± 2 SD covers 95.45% observations. Remaining 4.55% observations lie

outside the range mean ± 2 SD. (Mean ± 1.96 SD covers exactly 95% observations )

c. Mean ± 3 SD covers 99.73% observations. Remaining 0.27% observations lie

outside the range mean ± 3 SD. (Mean ± 2.58 SD covers exactly 99% observations )

You know that the normal range of fasting blood glucose level is 80 to 110 mg/dI.

Do you know how is this range obtained?

Opposed to a range for fasting blood glucose level, the normal body temperature is

a single value 98.6F. Why is this not a range?

Normal Range:

Most medical measurements show a substantial variation even in healthy subjects.

Thus a range of normal values is obtained. In above case, fasting blood glucose

level, the normal range may be 80 to 110 mg/dI but there will be people with 79 or

75 mg/dI yet absolutely healthy and other healthy people with 115 or 118 mg/dI

level. In other words, for any parameter, there will be healthy people with very low

Page 115: Workshop Report Final

Page | 115

or very high values. If such values are also included in the normal range, the

difficulty is that many diabetics would have overlapping levels such as 115 or 118

mg/dI. Similarly there will be hypoglycemic with levels 79 or 75 mg/dI.

No matter how a normal range is chosen, there is always a risk of exclusion of

healthy subjects and inclusion of non-healthy subjects. The best course, of course,

is to find levels beyond which persons or patients start feeling sick, or the levels

that have increased risk of early mortality. Such level can indeed be considered

pathologic. But this procedure is highly nonspecific and too difficult to adopt. Also,

even in such delineation there would still be a chance of exclusion of healthy and

inclusion of nonhealthy subjects. In view of these difficulties, it is considered

convenient to use statistical principles to determine the normal range.

Normal value:

Any biological measurement must very from person to person, and even in a person

from time to time. The variation could be small or large. Whenever the variation is

small, a single value is obtained as the representative value. There is no need to

worry about SD of such measurement. This is true for body temperature.

Although there would be some healthy people with temperature 98.5 F or 98.7 F,

even with 98F but those will be few. A 5% rise in body temperature has an

enormous clinical significance whereas 5% rise in cholesterol level may not be

much consequence. When the variation is small, the mean is generally chosen as

the reference value.

If you want to establish normal body temperature of adolescent boys in your area,

select a random sample of at least 300 apparently healthy boys, measure their

body temperature and calculate the mean. That will be the normal level for these

boys. No such exercise has ever been undertaken for Indian boys, girls, children‟s

or adults on a large scale. Thus our normal body temperature is not known.

However the internationally recognized level of 98.6 F, which actually was

established for Swedish adults, seems to work for Indian as well.

Characteristics of Normal Curve:

1. It is bell shaped curve.

2. It is symmetrical.

3. Mean, Median and Mode coincide.

4. It has two inflections never touches to the baseline.

5. Area under curve is one.

6. If standardized then mean is zero and SD is one.

Standard Normal deviate/ Standard Normal viriate/ Standard Normal curve:

Page 116: Workshop Report Final

Page | 116

This process of subtracting mean and dividing by SD is standardization or

sometime normalization and denoted by Z

Z =(X-)/SD

Inferential statistics: is the branch of statistics that is used to make inference

about the characteristics of a population based on sample data.

Sampling variation:

The sample after all the part of the population and they may or may not truly

represent all the features of the population.

One sample would differ from the other even if both were taken from the same

target population. Mean and SD obtained from one sample would be different from

the other sample. This is called sampling variation / sampling fluctuation /

sampling error.

Standard error:

The point estimate is simply the corresponding sample statistics of the population

parameter. The forced expiratory volume in one second (FEV1) in students of age

18 to 22 years. Take a random sample 40 students and found that mean FEV1 is

2.71 l. He was somehow not happy with this mean. He took another sample of 40

and found mean FEV1is 2.50 l. He repeatedly took sample of 40 students another

eight times. Considering the variation in various sample mean, he was not sure

how to express this uncertainty. He was then advised to find the SD of these 10

means obtained in 10 different samples. He then understood that the SD of these

10 sample means is the measures of variability in sample means. This SD of

sample mean is called the standard error (SE) of mean.

Estimation: There are two types estimation.

Point Estimation: is the value of your sample statistic (Sample mean or sample

correlation) and it is used to estimate the population parameter (Population mean

or Population correlation) eg. If you take a sample of Dr. living in pune city and you

find that the average income of Dr. in your sample is Rs 25000 then your point

estimate of Dr in pune city will be Rs 25000.

The value of the Sample Mean / Median is an estimate of the population Mean /

Median. Similarly, Sample Proportion is an estimate of the population Proportion.

These are called point estimates.

Point estimate for is Xbar and for P is p.

Interval estimation: is a range of numbers inferred from the sample that

has a known probability of capturing the population parameter over the long

run.

Page 117: Workshop Report Final

Page | 117

Consider the average haemoglobin (Hb) level in women going into hypertension

during labour. This enquiry for Hb level can possibly lead to some etiology of

hypertension during labour. In a random sample of 300 such women, this average

is 10.6 g/dl. Would you accept this 10.6 g/dl as the absolute truth for these

women or you would allow for some sampling fluctuation and say that it is most

likely somewhere between 10 and 11 g/dl? Obtaining such interval for any

parameter is called interval estimation.

It is used in patient care when you inform the relatives of a cancer patient that the

survival duration is somewhere between 2 and 7 months at that stage of disease.

Point estimates have reliability only when SE is small. If SE is large, interval

estimates are obtained.

Confidence level:

The 95% or any other level that is fixed as a measure of hope or expectation is

called the confidence level.

It must be very clear that uncertainties in medical practice can only be minimized

but not eliminated. You can never be 100% confident about the outcome. This is all

the more true while dealing with the samples. Thus a confidence level is fixed at a

sufficiently high level to ensure reasonable reliability. There is a tendency around

the world to consider 95% confidence as adequate while dealing with samples – be

it sample of patients, blood samples or sample of healthy people.

Confidence Interval:

The interval within which a parameter values expected to lie with a certain

confidence level.

* Specifically, if you have the computer provide you with 95 percent confidence

interval then you will be able to be “95% confident” that it will include the

population parameter. That is “level of confidence” is 95%.

Eg. The point estimate of annual income of Dr. in pune city is Rs 25000 and

surround it by a 95% confidence interval. You might find that the confidence

interval is Rs. 22000 to Rs. 27000. In this case, you can be 95% confident that the

average income is somewhere between Rs. 22000 and Rs. 27000.

TEST OF SIGNIFICANCE

A statistical procedure to test whether or not the observations fall into a specified

pattern, such as equal mean of two or more groups or following a linear trend. If

they do no, the result is called statistically significant. This requires prior fixing of

the level of significance that specifies the maximum tolerable probability of type I

error.

Page 118: Workshop Report Final

Page | 118

Hypothesis: Any statement regarding population parameter is called as

hypothesis. Or A statement of belief that is made before the investigation regarding

the status of parameters under study, including those that measure relationship.

Null Hypothesis: A hypothesis that says that there is no difference. The initial

assumption will be that the new regimen is not better. This kind of assumption is

called null hypothesis and denoted by Ho.

Eg. 1. The two regimens or two groups have no difference.

2. The incidence of leukemia in the four blood groups is same.

2. There is no difference in the mean aspartate amino-transferase (AST) level in

the case of hepatitis, cirrhosis and liver malignancy.

Alternative Hypothesis: A hypothesis, which is accepted by default when the null

hypothesis is rejected and denoted by H1.

Note: The null hypothesis is never completely right or wrong, or true or false, but is

only rejected or not rejected at the probability level of significance concerned.

Type I error: The probability of rejecting a null hypothesis (Ho) when it is in fact

true and denoted by alpha ()

= P(Rejecting Ho/Ho is true)

Type II error: The probability of not-rejecting i.e. accepting a null hypothesis (Ho)

when it is false and denoted by beta ()

= P (Accepting Ho/Ho is false)

Level of significance: Size of the type I error (). Or the distance from the mean at

which Ho is rejected. Or the maximum tolerable probability of type I error that is

fixed in advance, such as 5%, denoted by alpha ().

P-value: The probability of committing type I error is called the P-value. Or P-value

is the chance that the presence of difference is concluded when actually there is

none.

One tail Test: Checks only one of the tail (upper or lower of the normal

distribution curve).

Eg. 1. Comparing the rate of cancer between a population exposed to known

carcinogen and a control population. This test can be used because the only

alternative hypothesis of interest is that the exposure was harmful. It is assume

that the exposure was not harmful.

2. Consider a new haematinic that is supposed to increase the Hb. level among

anaemics. In a trial for this preparation, the null hypothesis again is that it is not

effective. This hypothesis implies that the average Hb. level will not alter taking this

preparation. What is the alternative hypothesis?

Page 119: Workshop Report Final

Page | 119

If the Ho is rejected, the conclusion will be that the new haematinic is effective or

that the average Hb level has increased. In this situation, the possibility of

reduction in Hb level is ruled out. Thus, H1 is one sided and is called one tail test.

3. While comparing a test regimen with placebo, if there is an assurance that test

regimen cannot be worse than placebo. This requires one tail test.

Two tail Test: Checks the upper and lower tails of the normal distribution curve.

Eg. 1. Comparing the rates of death between two neighboring communities. This

test is used to look for significant differences because no assumption is made about

the H1.

2. For dilating cervix by Misoprostol Vs the existing ethinyl estradiol, there is a

possibility that the efficacy of new regimen is even less than the existing regimen.

The efficacy can be higher or can be lower who know! The null hypothesis in this

case would be usual saying that the two regimens are equally effective.

When null hypothesis is rejected, what is accepted is called alternative hypothesis.

For Misoprostol efficacy, the alternative hypothesis is that it is either lower or

higher than the efficacy of the existing procedure. This type of alternative is called

two-sided because both possibilities are envisaged. A test in a situation where the

alternative hypothesis is two sided is called a two-tail test.

3. While testing equality of two groups. This happen, when a test regimen is being

compared with the existing regimen. This requires two-tail test.

Commonly used test of significance as per type and size of data:

Type of data

Size ↴

Quantitative Qualitative

Large sample

(n >= 30)

Z-test (SEx, SEx1-x2) Z-test (SEp, SEp1-p2)

χ2 test

Small sample

(n < 30)

t-test (Paired & unpaired)

N-P test

χ2 test with Yates correction

N-P test

Procedure and steps:

Find the type of problem and the question to be answered.

To state null (Ho) and alternative (H1) hypothesis.

Selection of a appropriate test.

Fixation of LOS (a), minimum desirable level is 5%.

Calculation of the test criterion based on the type of test.

Page 120: Workshop Report Final

Page | 120

Comparison of calculated value with theoretical value.

Drawing the conclusion. P-value.

* If the results of a sample fall within the Mean 1.96 SE, the null hypothesis is

accepted, hence this area is called zone of acceptance for null hypothesis.

* If the results of a sample fall outside the Mean 1.96 SE, the null hypothesis is

rejected, hence this area is called zone of rejection for null hypothesis.

Now let us discuss the various situations where we have to apply different test of

significance. For the sake of convenience and clarity these situation may be

summed up under the following three heads:

1. Test of significance for Attributes.

2. Test of significance for Variables (large sample).

3. Test of significance for Variables (small sample).

1.Test of significance for Attributes.

As distinguished from variables where quantitative measurement of a phenomenon

is possible, in case of attributes we can only find out the presence or absence of a

particular characteristic.

Eg. In a study of attribute „Literacy‟ a sample may be taken and people classified as

literates and illiterates. With such data the binomial type of problem may be

formed. The selection of an individual on sampling may be called „event‟ the

appearance of an attribute A may be taken „Success‟ and its non-appearances as

„Failure‟.

a. Test for number of success:

The sampling distribution of the number of success follows a binomial probability

distribution.

SE of number of success = √npq

Where n = Size of sample, p = Probability of success in each trial and q = 1-p =

Probability of failure.

Eg. In a hospital 480 female and 520 male babies were born in a week. Do this

figure; confirm the hypothesis that males and females are born in equal number?

Ho: The male and female babies are born in equal number. i.e. p = ½

H1: p =/ ½

n =1000, p = ½, q = ½

SE = √npq = 15.81

Difference between observed and expected number of female babies =520 – 500 =

20

Z = Observed difference / SE = 20 / 15.81 = 1.265

Page 121: Workshop Report Final

Page | 121

Since the difference is less than 1.96 SE at 5% LOS. Hence the male and female

babies are born in equal number. Or

Z < 1.96, Accept Ho, hence the male and female babies are born in equal number.

2.Test of significance for Variables (large sample). Z – test.

The Z-test for mean has two applications

a. To test the significance of difference between a sample mean (x) and a

known value of population mean (u)

Z = observed difference between x and u / SEx

b. To test the significance of difference between two sample means or

between experimental sample mean and control sample mean.

Z = Observed difference / SE

Standard error of mean (SEx):

When?

-Sample size should be large and drawn randomly.

-Data should be quantitative.

-The variable under study is assumed to follow Normal distribution in the

population.

Why?

-To calculate size of the sample.

-To estimate population parameter, when u is not known.

-To test whether the sample is drawn from representative of population or not,

when u is given.

How?

-See the type of problem and the question to be answered.

-State Ho and H1

-Given data

-Calculate SEx = sd / √n

-Calculate „Z‟ value

Z = Observed difference / SEx

-Fixation of LOS, minimum desirable is 5%.

-Comparison

If Z > 1.96, reject Ho, hence observed difference is significant.

If Z < 1.96, Accept Ho, hence observed difference is insignificant.

-Conclusion

Standard error of Proportion (SEp):

Page 122: Workshop Report Final

Page | 122

When?

-Sample size should be large and drawn randomly.

-Data should be qualitative.

-The variable under study is assumed to follow Normal distribution in the

population.

Why?

-To calculate size of the sample.

-To estimate population proportion.

-To test whether the sample proportion is drawn from representative of population

proportion or not.

How?

-See the type of problem and the question to be answered.

-State Ho and H1

-Given data

-Calculate SEp =√ pq/n

-Calculate „Z‟ value

Z = Observed difference / SEp

-Fixation of LOS, minimum desirable is 5%.

-Comparison

If Z > 1.96, reject Ho, hence observed difference is significant.

If Z < 1.96, Accept Ho, hence observed difference is insignificant.

-Conclusion

Standard error of difference between two mean (SEx1-x2):

When?

-Sample size should be large and drawn randomly.

-Data should be quantitative.

-The variable under study is assumed to follow Normal distribution in the

population.

Why?

-To compare the efficacy of two therapies.

-To test whether the sample is drawn from the representative of population or not.

How?

-See the type of problem and the question to be answered.

-State Ho and H1

-Given data

Page 123: Workshop Report Final

Page | 123

-Calculate SEx1-x2 =√ {(sd12/n1)+(sd22/n2)} = x √ {(1/n1)+(1/n2)}

Where =√ {[(X1-x1)2 +(X2-x2)2]/(n1+n2-2)}

-Calculate „Z‟ value

Z = Observed difference / SEx

-Fixation of LOS, minimum desirable is 5%.

-Comparison

If Z > 1.96, reject Ho, hence observed difference is significant.

If Z < 1.96, Accept Ho, hence observed difference is insignificant.

-Conclusion

Standard error of difference between two proportion (SEp1-p2):

When?

-Sample size should be large and drawn randomly.

-Data should be qualitative.

-The variable under study is assumed to follow Normal distribution in the

population.

Why?

-To test whether the sample proportions are drawn from the representative of

population proportion or not.

How?

-See the type of problem and the question to be answered.

-State Ho and H1

-Given data

-Calculate SEp1-p2 =√{(p1q1/n1)+(p2q2/n2)}

-Calculate „Z‟ value

Z = Observed difference / SEp1-p2

-Fixation of LOS, minimum desirable is 5%.

-Comparison

If Z > 1.96, reject Ho, hence observed difference is significant.

If Z < 1.96, Accept Ho, hence observed difference is insignificant.

-Conclusion

3.Test of significance for Variables (small sample). t-test.

Small samples or their Z values do not follow normal distribution, because the

samples SD do not adequately represent the population SD, even if the sample is

Page 124: Workshop Report Final

Page | 124

drawn from the normal distribution. So the Z value, in these cases will not give the

correct level of significance or probability.

In case of small sample, „t‟ test is applied instead of Z test. This test is design by

W.S. Gossett whose pen name was Student. Hence, this test is also called Student

„t‟ test.

There are two types of „t‟ tests.

1. Paired „t‟ test and 2. Unpaired „t‟ test.

Paired „t‟ test:

When?

-Sample size should be small and drawn randomly.

-Data should be quantitative.

Why?

To paired data of independent observations from one sample only when each

individual give a pair of observations.

-To study the role of a factor or cause when the observations are made before and

after its play. Eg. Of exertion on pulse rate; of a drug on blood pressure‟ of a anti-

depressive drug on the sleep of the patients; of meals on leucocytes count; of

Bengal gram, garlic, onion etc. on cholesterol level in the blood.

-To compare the effect of two drugs, given to same individuals in the sample at two

different occasions. Eg. Adrenaline and non-adrenaline on pulse rate.

-To study the comparative accuracy of two different instruments.

-To compare result of two different laboratory techniques.

-To compare observations made at two different sites in the same body. Eg.

Compare temperature between toes and between fingers or in axilla and mouth.

How?

-See the type of problem and the question to be answered.

-State Ho and H1

-Given data: paired observations

-Calculate difference for each pair of values i.e. d=x1-x2

-Calculate mean of difference i.e. d, SDd and SEd

Where SEd = SDd/√ (n-1)

-Calculate „t‟ value

t = Observed difference / SEd

-Degree of freedom = (n-1), refer „t‟ table and find out the probability of the

calculated „t‟ value to the degree of freedom (n-1)

Page 125: Workshop Report Final

Page | 125

-Comparison

If Z > 1.96, reject Ho, hence observed difference is significant.

If Z < 1.96, Accept Ho, hence observed difference is insignificant.

-Conclusion

OR

Wilcoxon Sign rank sum test is for paired or matched data.

Unpaired „t‟ test:

When?

-Sample size should be small and drawn randomly.

-Data should be quantitative.

Why?

To unpaired data of independent observations made on individual of two different

or separate groups or samples drawn from two populations.

How?

-See the type of problem and the question to be answered.

-State Ho and H1

-Given data:

* If one sample „t‟ test:

-Find the difference between the actually observed mean and the claimed mean.

The claimed mean is the Ho. In terms of notations, this difference is x-uo

-Estimate SEx = s/√n, Where s is the standard deviation and s is the number of

subjects in the actually studied sample. This SE measures the inter sample

variability of mean.

-Check the difference obtained in @ is sufficiently large relative to the SE. Calculate

student „t‟ test.

T = (x-uo)/SEx this follows with (n-1) df

-Comparison

If Z > 1.96, reject Ho, hence observed difference is significant.

If Z < 1.96, Accept Ho, hence observed difference is insignificant.

-Conclusion

* Two-sample „t‟ test:

-Calculate SE

Where SE = x√{(1/n1)+(1/n2)}

Where =√{[(X1-x1)2 +(X2-x2)2]/(n1+n2-2)}

Page 126: Workshop Report Final

Page | 126

-Calculate „t‟ value

t = Observed difference / SE

-Degree of freedom = (n1+n1-2), refer „t‟ table and find out the probability of the

calculated „t‟ value to the degree of freedom (n1+n2-2)

-Comparison

If Z > 1.96, reject Ho, hence observed difference is significant.

If Z < 1.96, Accept Ho, hence observed difference is insignificant.

-Conclusion

OR

Wilcoxon two-sample test is for unpaired data.

CHI-SQUARE TEST (χ2)

When?

-Sample size should be large & small and drawn randomly.

-Data should be qualitative.

Why?

-To test of goodness of fit.

-To test association between two events.

-To test the significance difference between two or more than two proportions.

How?

-See the type of problem and the question to be answered.

-State Ho and H1

-Given data: Observed frequency (O)

-Calculate Expected frequency (E)=(RTxCT)/N Where RT=Row total, CT=Column

total, N =Total number of observations.

-Calculate „χ2‟ value

χ2 =Σ (O-E) 2/E

-Degree of freedom (df)=(r-1)(c-1); (r=row and c=column.)

-Refer χ2 table and find out table value with (r-1)(c-1) df.

-Comparison

If χ2 > table value, reject Ho, hence observed difference is significant.

If χ2 < table value, Accept Ho, hence observed difference is insignificant.

-Conclusion

Association: They are either independent of each other or they are

dependent of each other.

χ2 test of goodness of fir:

Page 127: Workshop Report Final

Page | 127

It is applied as a test of “goodness of fit” to determine if actual numbers are

similar to expected or theoretical number. K- is the number of classes for χ2

in goodness of fit test. We can find whether the observed frequency

distribution fits to a theoretical distribution of qualitative data. Df=K-1

χ2 test of association between two events:

The simplest setup is when two characteristics under investigation for

association are Yes/No type, or any other two categories such as

Male/Female, Child/Adult, Blood group B/Other than B, Systolic

BP<140/Systolic BP>= 140 etc. The null hypothesis is that there is no

association. The sample data are then examined whether they provide

sufficient evidence against the null hypothesis. Two methods are available

for this setup:

1) Proportion test based on Gaussian distribution and

2) Chi-square test based on contingency table.

Both are equivalent and give same result. They both are applicable only

when „n‟ is large.

The association between two sets of events, this table is also called

association table because they are only two samples and each divided into

two classes, it is called 2x2 contingency table or fourfold table.

Eg. Cataract prevalence in males and females.

Gender Cataract Total

Yes No

Male a b a+b (R1)

Female c d c+d (R2)

Total a+c (C1) b+d (C2) N

The objective of the survey is to find whether or not the prevalence is

associated with the gender. The null hypothesis that there is no association.

If that is true, the prevalence in both the groups should be same. Can you

guess what should be the prevalence if it is same in males as in females?

Alternative formula for χ2 is

χ2= ((ad-bc) 2xN)/{(a+b)(c+d)(a+c)(b+d)}

Page 128: Workshop Report Final

Page | 128

Note:

-For any χ2 to be valid, it is necessary that the expected frequency in each

cell is at least 5.

-If any cell frequency is less than 5, a different procedure called Fisher‟s

exact test or yate‟s correction / continuity correction.

-Cochran (1954) recommends the use of the exact test, in preference to the

χ2 test with yate‟s correction, (1) if N<20 or (2) if 20<N<40 and the smallest

expected value is less than 5.

-Yate‟s correction is applicable only 2x2 contingency table.

-If the theoretical / Expected frequencies are smaller i.e. <5 then adjoining

classes should be merged together.

Fisher exact test= (R1! R2! C1! C2!)/(N! a! b! c! d!)

Yates correction (χ2)={(|ad-bc|-(N/2)) 2xN}/{R1R2C1C2}

Or. χ2= Σ (| O-E| - ½) 2/E)

Bigger contingency tables:

Now consider duration of AIDS development in various blood groups. The

duration is divided into three groups, viz; <5years, 5-8 years and >=8 years.

This is counted from the time of HIV infection to the appearance of clinical

symptoms of AIDS. The objective is

-To find whether the duration is associated with blood group or not i.e.

whether the duration is different in different blood groups.

-There is no association between blood group and duration of developing

AIDS.

Eg. Duration of developing AIDS in HIV positive of different blood groups of

are as follows:

Duration of

deve-loping

AIDS (Yrs)

Blood group Total

O A B AB

< 5 20 30 48 7 105

5 - 8 21 27 104 19 171

>=8 59 43 98 24 224

Page 129: Workshop Report Final

Page | 129

Total 100 100 250 50 500

Ho: there is no association between blood groups and the time taken for

development of AIDS in HIV infected cases.

H1: there is a association between blood groups and the time taken for

development of AIDS in HIV infected cases.

Calculate expected frequency of each cell. (12 expected frequency)

Apply χ2 test χ2 = 22.72. ,

df = (r-1)(c-1) =6 Table value χ26,0.05 = 12.59

Comparison: χ2 > χ26,0.05 , R eject Ho.

Conclusion: Hence there is a association between blood groups and the time

taken for development of AIDS in HIV infected cases.

Page 130: Workshop Report Final

Page | 130

Lesson IX: Objectives: 1. Understand the areas in Education and Learning requiring research 2. Delineate the Priority areas of educational research in India 3. Describe the Ethical problems involved in Educational Research Lesson Outline:

Understand the areas in Education and Learning requiring research including Teaching and Learning methods, Assessment, Evaluation and interventions. Able to formulate priority areas based on the need of the institution.

• The barriers and opportunities of initiating the medical education research • Ways to strengthen the research capabilities in medical education • Describe the specific Ethical problems involved in Educational Research

Introduction:

In India, medical institutions are established with an objective of three legged stool

consisting of research, education and service. Medical teachers hence referred as

„triple threat academicians‟. Medical teachers are thus original and productive

investigators, committed teachers and compassionate practicing physicians. Fourth

obligation recently emphasized is „social responsiveness‟. Medical schools are

confronted with the challenge of making their curricula relevant to the needs of the

times. One response to this challenge is increased interest in research in medical

education.

It has been under fire that Medical education in Asia has colonial-biased, subject-

oriented, teacher-centred, discipline-based, lecture-focused and hospital-based

traditions, which failed “to train medical students appropriately for national health

needs and for medical schools to assume leadership role in shaping services

oriented to the needs of the community”. However there are signs of positive winds

in medical education from government of India and Medical Council of India. The

recent trends in Medical Educational Research suggest that Research is either

quantitative or qualitative. Biomedical or objectifying and holistic or humanizing

research confined to more quantitative, experimental or quasi-experimental

approaches. There is a need to change the direction of research to shift towards

qualitative and descriptive methods

The priority areas in medical education are many and depend on the needs and the

mission of individual institutions. Need based research is a pragmatic approach for

Page 131: Workshop Report Final

Page | 131

budding medical education researcher or for a newly established medical education

unit. Need based research directly answers the questions related to individual or

institutional needs and is of immediate interest to the faculty and administrators.

The research in medical education has contributed significantly in our

understandings of teaching and learning medicine. Medical education research is

not merely academic and esoteric in nature. On the contrary, the vast majority of

the studies and publications address issues that are practical and of immediate

interest to medical teachers.

Hardens Approaches to Medical Education Research: Experimental Fact-finding Action-research Open ended research Creative research

*Due to the practical and problem solving nature, action research is becoming popular among teacher-researchers Areas in Education & Learning Requiring Research: Outcomes Interventions Teaching & Learning Methods Assessment Evaluation

Exercise: Suggest areas in medical education which need research Prioritize the areas of research by giving justification

Barriers to Medical Education Research: Poor socio-economic conditions Cultural and religious conservatism Lack of relevance Leadership crisis Faculty development Information poverty Unforeseeable short-term research outcomes

Ways to Strengthen Research Capabilities in Medical Education:

Leadership and commitment Relevance Establishment of centre for medical education research Availability of financial resources Research methodology Access to information

Ethical Problems involved in Research: Ethics deals with values and morals. It is based on people's personal value

systems. What one person or group considers to be good or right might be

Page 132: Workshop Report Final

Page | 132

considered bad or wrong by another person or group. There are three major

approaches to ethics.

1. Deontological Approach - This approach states that we should identify and use a

Universal code when making ethical decisions. An action is either ethical or not

ethical, without exception.

2. Ethical skepticism - Concrete and inviolate ethical or moral standards cannot be

formulated. In this view, ethical standards are not universal but are relative to

one's particular culture, time, and even individual.

3. Utilitarianism - Decisions about the ethics should be based on an examination

and comparison of the costs and benefits that may arise from an action. Note that

the utilitarian approach is used by most people in academia (such as Institutional

Review Boards) when making decisions about research studies.

Ethical Concerns

There are three primary areas of ethical concern for researchers:

1. The relationship between society and science.

• Should researchers study what is considered important in society at a given time?

• Should the government and other funding agencies use grants to affect the areas

researched in a society?

• Should researchers ignore societal concerns?

2. Professional issues.

• The primary ethical concern here is fraudulent activity (fabrication or alteration of

results) by scientists. Obviously, cheating or lying are never defensible.

• Duplicate publication (publishing the same data and results in more than one

journal or other publication) should be avoided.

• Partial publication (publishing several articles from the data collected in one

study). This is allowable as long as the different publications involve different

research questions and different data, and as long as it facilitates scientific

communication. Otherwise, it should be avoided.

3. Treatment of Research Participants

• This is the most fundamental ethical issue in the field of empirical research.

• It is essential that one insures that research participants are not harmed

physically or psychologically during the conduct of research.

Ethical Guidelines for Research with Humans

One set of guidelines specifically developed to guide research conducted by

educational researchers is the ICMR Guidelines. The ICMR is the largest

Page 133: Workshop Report Final

Page | 133

professional association in the field of Medical Research. Ethical guidelines are

based on the following general principles

1 Essentiality

2 Voluntariness, informed consent and community agreement

3 Non-exploitation

4 Privacy and confidentiality

5 Precaution and risk minimisation

6 Professional competence

7 Accountability and transparency

8 Maximisation of the public interest and of distributive Justice

9 Institutional arrangements

10 Public domain

11 Totality of responsibility

12 Compliance

British Educational Research Association (BERA) has issued Ethical Guidelines for

Educational Research in 2004. It emphasizes that research in education should be

conducted within an ethic of respect of Person, Knowledge, Democratic Values,

Quality of educational research and Academic Freedom. The guidelines framed

under the broad headings of Responsibilities of Participants, Sponsors of research

and community of educational researchers.

Institutional Review Board

The IRB is a committee consisting of professionals and lay people who review

research proposals to insure that the researcher adheres to federal and local ethical

standards in the conduct of the research. Virtually every medical college in

maharashtra has an IRB.

• Researchers must submit a Research Protocol to the IRB for review.

• Three of the most important categories of review are exempt studies (i.e.,

studies involving no risk to participants and not requiring full IRB review),

expedited review (i.e., the process by which a study is rapidly reviewed by

fewer members than constitute the full IRB board), and full board review

(i.e., review by all members of the IRB).

• Although many educational studies are fall into the exempt category, it is

essential that you understand that it is the IRB staff and not the researcher

that makes the decision as to whether a research protocol is exempt. The

IRB will provide the formal documentation of this status for your study.

Page 134: Workshop Report Final

Page | 134

Research Project Proposals Submitted Proposal I:

Title: To Study the effectiveness of Objective Structured Practical Examination (OSPE) in Assessing I MBBS students Type of Study:

Mixed Type Study Design:

50 students of I MBBS are included in this study. All the students are assessed twice, once with the traditional method and then with OSPE method

Objectives: 1 To conduct practical examination of I MBBS using traditional method

2 To conduct practical examination of I MBBS using OSPE method 3 To compare the performance of students by two methods and analyse

statistically

4 To analyse students satisfaction in both methods

OSPE: 1 Skill 2 Knowledge

3 Attitude(communicating skills) CONSENT CONVINCE

Members of the group:

Dr SS Pandey PDMC, Amaravati Dr SR Pandey PDMC, Amravati

Dr CD Dange SBH Govt Medical College, Dhule Dr Anita Jadhav SBH Govt Medical College, Dhule Dr Prashant Patil SBH Govt Medical College, Dhule

Dr Anita Kale SBH Govt Medical College, Dhule Dr Navid Shah ACPM Medical College, Dhule

Page 135: Workshop Report Final

Page | 135

Proposal II:

Title: Advantages of PBL over routine methods of teaching Type of Study:

Study Design: PBL for small group of 8 – 10 students. Over a period of 2-3 weeks

Initial disclosure of patients complaints Session I---- Learning issues

Swelling in the neck Swelling in the midline Progress of disease

Change in symptoms Facilitators- 1-2

What are resources to satisfy the knowledge seeking Faculty Internet

Library 3-5 days for study Session II:

Division of students------ Chairman, scribe, topic presenter for each learning issue, 2-3 students as observers

Disclosure no2 Symptoms of thyrotoxicosis/ myxedema Biochemical investigations---TFT, S.Cal, S Cholesterol, Bl.

Sugar, Pulse, BP, Biopsy, ECG, Weight, imaging investigations: X-ray neck, USG,

Radioactive uptake scan

Learning Issues 3-4 days for learning

Session III: Reassemble Regroup Missing investigations FNAC

Expert Common: Ca. Thyroid, other thyroid diseases PBL as topic for research project

1 Control group: Usual way of teaching inpatients 2 Test Group: Those who underwent PBL training

Assessment: Summative for control group and test group Test group may be exposed to formative assessment

Page 136: Workshop Report Final

Page | 136

Testing by questionnaire on topic of thyroid Validation of questionnaire

Objectives:

Members of the group: Dr Suvarna Joshi BJ Medical College, Pune

Dr Anita Kavatkar BJ Medical College, Pune Dr NK Wani ACPM Medical College, Dhule Dr SS Date ACPM Medical College, Dhule

Dr Mrs AB Patil ACPM Medical College, Dhule Dr Mrs AS Gadre ACPM Medical College, Dhule

Page 137: Workshop Report Final

Page | 137

Proposal III:

Title: Development of Newer Teaching Tools Type of Study:

Study Design: Qualitative Research Focus Group Discussion

Objectives: To create better physician

Methods: Using tools such as CDs, Photographs and Case studies

Equipment Required:

Members of the group: Residents of Paediatrics, Medicine, Obstetrics & Gynecology and Physiology

Page 138: Workshop Report Final

Page | 138

Proposal IV:

Title: Assessment of perceptions of third year medical students regarding curriculum in community medicine at Shri. Bhausaheb Hire Govt. Medical College, Dhule .

Type of Study: Cross sectional Study Design: Qualitative Research utilising Focus Group Discussion

Objectives: 1 To explore the present status and deficiencies if any in the teaching

and learning of community medicine 2 To suggest improvements in the teaching process

Participants: VI, VII & VIII Semester students.

As you know, important objective of medical education at graduate level is preparing basic doctor, who should be able to tackle common health problems & fulfilling the responsibilities of first contact public health expert

. In Medical Colleges, Dept. of Community Medicine carries the responsibility of educating & training undergraduate students in the context of above objective. But it is observed that attendance & involvement

of the students in the teaching learning process in community medicine subject is becoming less & less. After passing out, students feel that this

subject is useless subject. This is harmful to the important objective of the medical education. That is why the present study has been planned to explore the deficiencies

in the curriculum & lacunae in teaching learning process at departmental level from the students view. Perceptions of the students will better be appreciated through focus

group discussion.

Members of the Group: Dr RT Ankushe Dr Sarika P Patil Dr Vijay Singh Dr Dr Amol Patil

Dr Samir Sheik

Page 139: Workshop Report Final

Page | 139

Workshop on " Medical Educational research: Concepts & Methodologies" held at Medical Education Unit, SBH Government Medical College,

on 29 & 30 July, 2009 Sn Name Designation College

1 Dr S S Date Professor ACPM Medical College

2 Dr Mrs Alka Patil Professor ACPM Medical College

3 Dr Wasim Sheik Assistant Professor Pramukhswami Medical College

4 Dr Mrs Jadhav Assistant Professor SBH Government Medical College

5 Dr S N Wanjari Assistant Professor SBH Government Medical College

6 Dr Sarika Patil Assistant Professor SBH Government Medical College

7 Dr Arundhati Gadre Assistant Professor ACPM Medical College

8 Dr Rajashri damle PG Student ACPM Medical College

9 Dr Priya Bagle PG Student ACPM Medical College

10 Mr Sunil Kumar Patil Statistician ACPM Medical College

11 Dr Ajit Pathak Associate Professor SBH Government Medical College

12 Dr Naveenchandra Wani Professor ACPM Medical College

13 Dr Suvarna Joshi Associate Professor BJ Medical College

14 Dr Anita Kavatkar Associate Professor BJ Medical College

15 Dr Mrs Sushma Pandey Professor Dr PDM Medical College

16 Dr Santosh Pandey Associate Professor Dr PDM Medical College

17 Dr Anita Kale Assistant Professor SBH Government Medical College

18 Dr Danish Memon PG Student ACPM Medical College

19 Dr Shree deshmukh PG Student ACPM Medical College

20 Dr Jasleen Mavi PG Student ACPM Medical College

21 Dr Prabhneet Kahlon PG Student ACPM Medical College

22 Dr Asma Kahn PG Student ACPM Medical College

23 Dr AN Borde Associate Professor SBH Government Medical College

24 Dr RT Ankushe Associate Professor SBH Government Medical College

25 Dr CD Dange Assistant Professor SBH Government Medical College

26 Dr Mrs Kulkarni Assistant Professor SBH Government Medical College

27 Ms Shilpa Tyagi Student SBH Government Medical College

28 Dr Vujay Singh Assistant Professor LTM Medical College

29 Dr Santosh Suryawanshi Assistant Professor LTM Medical College

30 Dr Prayag Makwana PG Student ACPM Medical College

31 Dr Sandeep Gaidhani PG Student ACPM Medical College

32 Dr Puneet Patil PG Student ACPM Medical College

33 Dr Vishal Gaeikwad PG Student ACPM Medical College

34 Dr Kishore Suryawanshi PG Student ACPM Medical College

35 Dr Mayur Kahate PG Student ACPM Medical College

36 Dr Vaibhav Jain Assistant Professor ACPM Medical College

37 Dr K K Borgaonkar Assistant Professor SBH Government Medical College

38 Dr Naveed Agha PG Student ACPM Medical College

Page 140: Workshop Report Final

Page | 140

39 Dr Prakash Humlekar Assistant Professor ACPM Medical College

40 Mr Amitesh Khare Student SBH Government Medical College

41 Dr N N Shah Assistant Professor ACPM Medical College

42 Dr Prashant Patil Assistant Professor SBH Government Medical College

43 Dr Arun More Associate Professor SBH Government Medical College

44 Dr Amol Patil Assistant Professor SBH Government Medical College

45 Dr Samir Sheik PG Student SBH Government Medical College

Page 141: Workshop Report Final

Page | 141

Teaching & Learning Methods Suggested for the Revised MBBS Curriculum

Method

First MBBS Second MBBS Final MBBS

Anatomy

Physiology

Biochemistry

Pathology

Microbiology

Forensic

Medicine

Pharmacology

Community

Medicine

Medicine

Paediatrics

Dermatology

Psychiatry

TB &

Chest

Surgery

Orthopaedics

ENT

Ophthalmology

Obstetrics &

Gynecology

Anesthesia

Lectures

Structured

interactive

sessions

Small group

discussion

a)

Demonstrati

ons.

b) Tutorials.

c) Seminars.

d) Problem

Based

Learning.

Focused

group

discussion

(FGD)

Projects Participator

y learning

appraisal

(PLA)

Family and

community

visits

Institutional

visits Practical

including

demonstrati

ons

Problem

based

exercises

Video clips

Page 142: Workshop Report Final

Page | 142

Written case

scenario Self

learning

tools

Interactive

learning

e-modules Dissection /

Prosected

parts

demonstrati

ons /

Instructions

on

mannequins

.

Skills Lab

with CDs of

various

stages of

dissection.

Histology

Lab.

Surface

marking.

Imaging

anatomy

Lab.

Visit to the

museum.

Preparation

of scientific

article.

Preparation

of practical

drawing

book

Role Play

Seminars

Algorithms

Integrated

Page 143: Workshop Report Final

Page | 143

teaching

Field visits

Problem based paper & real cases

Simulated Patient Management Problems

Case Studies

Tutorials

Workshops

One to one teaching in theatre

Departmental Morbidity, Audit, Journal Club

Self Assignments