33
DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006

DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006

Embed Size (px)

Citation preview

Page 1: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006

DLI TrainingNesstar Workshop

Ernie Boyko, Carol Perry

Ontario DLI TrainingUniversity of Guelph, Guelph, ON

April 10-11, 2006

Page 2: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006

DDI Refresher

What, Why, How?

Page 3: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006

Data Documentation Initiative

The Data Documentation Initiative is an international effort to establish a standard for technical documentation describing social science data

http://www.icpsr.umich.edu/DDI/index.html

Page 4: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006

DTD - Document Type Definition

Consists of a Tag Library Tags have been developed by DDI A set of tags, when filled, are known

as a codebook DDI – intends to comply to Dublin

Core

Page 5: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006

Tags

Tags present English language descriptions of XML (eXtensible Markup Language)

Each tag can be optional or mandatory, repeatable or non-repeatable

Set of tags for each section of DTD

Page 6: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006

5 Sections of DTD (Document Type Definition)

1.0 Document Description 2.0 Study Description 3.0 Data File Description 4.0 Variables Description 5.0 Other Study Materials

Page 7: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006

Document Description

Bibliographic description of the DDI document itself, otherwise known as a marked-up codebook

Page 8: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006

Study Description

Describes Study or Survey

Includes title, abstract, keywords, author, publisher, collection methods, etc.

Page 9: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006

Data File Description

Contains information describing the data file

Includes file name, file type, case quantity, logical record length, total number of records, etc.

Page 10: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006

Variables Description

Describes each variable

Includes variable label, values, value label, question, summary statistics, etc.

Page 11: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006

Other Study Materials

Includes documentation files in a variety of formats: pdf, excel, word, etc.

Includes codebooks, questionnaires, user guides, variability tables, etc.

Page 12: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006

Fast forward …

What has been done since 2004…

Page 13: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006

DLI Training 2005

The group tagging workshop

Page 14: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006

CANDDI Tag working group

Michelle Edwards - UG Marie-Joseè Bourgeois – DLI Irene Wong – RDC UA Jane Fry – Carleton U

Page 15: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006

DINO Dec 2005Questions for the Group

Sharing the metadata xml files What sections of DDI should be

included in the exchange? All five sections? Select sections?

Page 16: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006

How do we choose the tags?

Preliminary set put together by U of Guelph in consultation with DLI staff

Carleton, Guelph, DLI team using same set of tags

Revision 4 was distributed to CANDDI tag team in Dec 2005

Work in progress

Page 17: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006

How do we fill the tags?

DDI document occasionally vague Dublin Core tags –mandatory Do we fill these tags using

examples from DDI document? How do we build consistency?

Page 18: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006

Example – Study title

Examples from DDI documentation: <titl> 2.1.1.1 Title

<titl>Domestic Violence Experience in Omaha, Nebraska, 1986-1987</titl>

<titl>Census of Population, 1950 [United States]: Public Use Microdata Sample</titl>

<titl>Monitoring the Future: A Continuing Study of American Youth, 1995</titl>

Page 19: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006

How would a cataloguer do it?

<titl>Domestic violence experience in Omaha, Nebraska, 1986-1987</titl>

<titl>Census of population, 1950 [United States]: Public use microdata sample</titl>

<titl>Monitoring the future: A continuing study of American youth, 1995</titl>

Page 20: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006

Balance

Between structure of DDI and need to co-ordinate with cataloguing rules

Page 21: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006

How do we decide …

What is the right way to fill a tag for our needs

Some tags require consensus on how they should be filled

Page 22: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006

Example

<IDNo> 2.1.1.5 Description: Unique string or

number (producer’s or archive’s number) for the data collection. An “agency” attribute is supplied.

Page 23: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006

Choices – StatCan data

Bibliocat – pre-2000 surveys Statcan catalogue SDDS survey number in IMDB

Page 24: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006
Page 25: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006
Page 26: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006
Page 27: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006

Which choice is right for us?

Survey of Household Spending ID

<IDNo>62M0004</IDNo> <IDNo>62M0004XCB</IDNo> <IDNo>3508</IDNo>

Page 28: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006

Another decision, same tag Year must be added after IDNo to

distinguish files

<IDNo>3508-2000</IDNo>

What is appropriate separator? – , / : ; Will any cause problems later on?

Page 29: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006

Other study-related material

2.5 <othrStyMat> Other Study Description Materials

5.0 <othrMat> Other Study-related Materials

Page 30: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006

2.5 <othrStyMat>

may include: appendices, sampling information, weighting details, methodological and technical details, publications based upon the study content, related studies or collections of studies, etc

Page 31: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006

How do we identify them?

By catalogue number What if they are on-line

publications Link to pdf? Link to dsp site? Link to StatCan catalogue page Link to them in our own collection

Page 32: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006

5.0 <otherMat> may include: questionnaires, coding

notes, SPSS/SAS/Stata setup files (and others), user manuals, sample computer software programs, glossaries of terms, interviewer/project instructions, maps, database schema, data dictionaries, coding information, interview schedules, missing values information, frequency files, variable maps, etc

Page 33: DLI Training Nesstar Workshop Ernie Boyko, Carol Perry Ontario DLI Training University of Guelph, Guelph, ON April 10-11, 2006

We need to collaborate…

How do we decide as a group what we want

How do we articulate our reasons for making the decision