35
The Where, The Where, Why Why , and , and How of Data Collection How of Data Collection ©

The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

  • View
    218

  • Download
    0

Embed Size (px)

Citation preview

Page 1: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

The Where, The Where, WhyWhy, and , and How of Data CollectionHow of Data Collection

©

Page 2: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

Business StatisticsBusiness Statistics

Business statistics Business statistics consists of a set of tools consists of a set of tools and techniques that are and techniques that are used to convert data into used to convert data into meaningful information meaningful information for a business for a business environment.environment.

Page 3: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

Objective in Business Objective in Business StatisticsStatistics

Descriptive StatisticsDescriptive StatisticsDescribe

Compare

RelateInferential StatisticsInferential Statistics

Page 4: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

Descriptive StatisticsDescriptive Statistics

Descriptive StatisticsDescriptive Statistics consists of the tools and techniques designed to describe data, such as charts, graphs, and numerical measures.

Page 5: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

Descriptive StatisticsDescriptive Statistics(Figure 1-2: Histogram)(Figure 1-2: Histogram)

BAKER CITY HOSPITAL - LENGTH OF STAY DISTRIBUTION

0

10

20

30

40

50

60

70

0<2 2<4 4<6 6<8 8<10 10<12 12<14 14<16 16<18

Page 6: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

Descriptive StatisticsDescriptive Statistics

AVERAGEAVERAGEThe sum of all the values divided by the number of values. In equation form:

where:

N = number of data values

xi = ith data value

sdata valuenumber of

es data valusum of all

N

xi

Average

N

1

Page 7: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

Inferential StatisticsInferential Statistics

Inferential StatisticsInferential Statistics consists of techniques that allow a decision-maker to reach a conclusion about characteristics of a larger data set based upon a subset of those data

Page 8: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

Two Basic Categories of Two Basic Categories of Statistical Inference ToolsStatistical Inference Tools

EstimationEstimationHypothesis TestingHypothesis Testing

Page 9: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

Data TypesData Types

Primary DataPrimary DataThose that are collected by you or

another person with whom you are closely associated.

Secondary DataSecondary DataThose that are collected and compiled

by an outside source or by someone in your organization who may later provide access to the data to other users.

Page 10: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

Tools for Collecting DataTools for Collecting Data

ExperimentsExperimentsTelephone SurveysTelephone SurveysMail QuestionnairesMail QuestionnairesDirect Observation Direct Observation

and Personal and Personal InterviewInterview

Page 11: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

ExperimentsExperiments

An experimentexperiment is any process that generates data as its outcome.

Page 12: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

Major Steps for a Major Steps for a Telephone or Written Telephone or Written

SurveySurvey

Define the IssueDefine the Issue Define the Population of InterestDefine the Population of Interest Develop Survey QuestionsDevelop Survey Questions Pre-test the SurveyPre-test the Survey Determine the Sample Size and Sampling Determine the Sample Size and Sampling

MethodMethod Select Sample and AdministerSelect Sample and Administer

Page 13: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

SurveysSurveys

• Open ended questionsOpen ended questions

• Closed-ended questionsClosed-ended questions

• Demographic questionsDemographic questions

Page 14: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

Populations and SamplesPopulations and Samples

A populationpopulation is a set of specific data values on all objects or individuals of interest.

Page 15: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

Populations and SamplesPopulations and Samples

A samplesample is a subset of the population.

Page 16: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

Parameters and StatisticsParameters and Statistics

Descriptive numerical measures calculated from the entire population are called parametersparameters.Corresponding measures for a sample are called statisticsstatistics.

Page 17: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

Sampling TechniquesSampling Techniques

Non-statistical sampling Non-statistical sampling techniquestechniques refer to those methods of sampling using influence, judgement, or other non-chance processes.Example: Convenience Example: Convenience samplingsampling -- sample from the population based upon accessibility and ease of selection.

Page 18: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

Sampling TechniquesSampling Techniques

Statistical sampling Statistical sampling techniquestechniques refer to those methods of sampling that use selection techniques based upon chance selection.

Page 19: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

Statistical SamplingStatistical Sampling

Types of statistical sampling include:

Simple Random SamplingSimple Random Sampling Stratified Random SamplingStratified Random Sampling Systematic SamplingSystematic Sampling Cluster SamplingCluster Sampling

Page 20: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

Statistical SamplingStatistical Sampling

Simple random samplingSimple random sampling refers to a method of selecting items from a population such that every possible sample of a specified size has an equal chance of being selected.

Page 21: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

Statistical SamplingStatistical Sampling

Stratified random samplingStratified random sampling refers to a sampling method in which the population is divided into subgroups called strata so that each population item belongs to only one strata. The objective is to form strata such that the population values of interest are as much alike as possible.

Page 22: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

Stratified Sampling ExampleStratified Sampling Example(Figure 1-13)(Figure 1-13)

PopulationPopulation

Stratified PopulationStratified Population

Financial Institutions

Page 23: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

Stratified Sampling ExampleStratified Sampling Example(Figure 1-13)(Figure 1-13)

PopulationPopulation

Cash holdings of All Financial

Institutions in the United States

Stratified PopulationStratified Population

Financial Institutions

Page 24: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

Stratified Sampling ExampleStratified Sampling Example(Figure 1-13)(Figure 1-13)

PopulationPopulation

Cash holdings of All Financial

Institutions in the United States

Stratified PopulationStratified Population

Financial Institutions

Page 25: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

Stratified Sampling ExampleStratified Sampling Example(Figure 1-13)(Figure 1-13)

PopulationPopulation

Cash holdings of All Financial

Institutions in the United States

Large Institutions

Stratified PopulationStratified Population

Stratum 1 Select n1

Financial Institutions

Page 26: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

Stratified Sampling ExampleStratified Sampling Example(Figure 1-13)(Figure 1-13)

PopulationPopulation

Cash holdings of All Financial

Institutions in the United States

Large Institutions

Medium Size Institutions

Stratified PopulationStratified Population

Stratum 1

Stratum 2

Select n1

Select n2

Financial Institutions

Page 27: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

Stratified Sampling ExampleStratified Sampling Example(Figure 1-13)(Figure 1-13)

PopulationPopulation

Cash holdings of All Financial

Institutions in the United States

Large Institutions

Medium Size Institutions

Small Institutions

Stratified PopulationStratified Population

Stratum 1

Stratum 2

Stratum 3

Select n1

Select n2

Select n3

Financial Institutions

Page 28: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

Statistical SamplingStatistical Sampling

Systemic random samplingSystemic random sampling refers to a sampling technique that involves selecting the kth item in the population after randomly selecting a starting point between 1 and k. The value of k is determined as the ratio of the population size over the desired sample size.

Page 29: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

Statistical SamplingStatistical Sampling

Cluster samplingCluster sampling refers to a method by which the population is divided into groups, or clusters, that are each intended to be mini-populations. A random sample of m clusters is selected.

Page 30: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

Cluster Sampling ExampleCluster Sampling Example(Figure 1-14)(Figure 1-14)

42 22 52

Illinois Scotland Florida

25 105 20 36 152 76 37

Algeria California Alaska New York Idaho Mexico Australia

Mid-Level Managers by Location for Morrison-Knudsen Construction Company

Page 31: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

Cluster Sampling ExampleCluster Sampling Example(Figure 1-14)(Figure 1-14)

42 22 52

Illinois Scotland Florida

Mid-Level Managers by Location for Morrison-Knudsen Construction Company

All members selected from these clusters

Page 32: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

Quantitative and Quantitative and Qualitative DataQualitative Data

Data that are numeric and which define value or quantity are quantitative quantitative datadata.Data whose measurement scale is inherently categorical are qualitative qualitative datadata.

Page 33: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

Time Series Data and Time Series Data and Cross-Sectional DataCross-Sectional Data

Time series dataTime series data consist of a set of ordered data values observed at successive points in time.Cross-sectional dataCross-sectional data are a set of data values observed at a fixed point in time.

Page 34: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

Data Measurement LevelsData Measurement Levels

Nominal Nominal DataData

Ordinal Ordinal (Rank) Data(Rank) Data

Interval Interval Data Data

Ratio DataRatio Data

Page 35: The Where, Why, and How of Data Collection ©. Business Statistics Business statistics consists of a set of tools and techniques that are used to convert

Data Level HierarchyData Level Hierarchy(Figure 1-15)(Figure 1-15)

Ratio/Interval Data

Ordinal Data

Nominal Data

Highest Level

Complete Analysis

Higher Level

Mid-level Analysis

Lowest Level

Basic Analysis

Categorical Codes ID Numbers Category Names

Rankings

Ordered Categories

Measurements