CANSIM
A look at 3 interfaces
Ontario DLI Training, University of Guelph
April 12, 2006
Suzette Giles, Data, Map and GIS Librarian, Ryerson University Library
A look at Stat. Can. website, E-STAT and CHASS
Where?
What?
Access
Content
Searching
Results
Visualization
Manipulation
Output formats
Which to use?
“Imitation is the sincerest flattery”
Sources:
Statistics Canada: About CANSIM: http://www.statcan.ca/english/ads/cansimII/index.htm
Statistics Canada E-STAT: http://estat.statcan.ca/content/English/over.shtml
University of Toronto CHASS CANSIM information: http://00dc1.chass.utoronto.ca/cansim2/English/index.html
University of Toronto Data Library Services: http://www.chass.utoronto.ca/datalib/codebooks/cstdli/cansim.htm
Where is CANSIM?
Statistics Canada home page:
Click on Advanced search; Search CANSIM is in the left-hand menu
OR click on Our Products and Services; CANSIM is under “Access our Online databases”
E-STAT: Left-hand menu of the Table of Contents page
CHASS: Google, or go via the University of Toronto’s Data Library Service page, “CHASS interface to selected databases”
What is CANSIM?
“CANSIM is Statistics Canada's key socio-economic database.” (Stat Can website)
“CANSIM: Canadian Socio-Economic Information Management System.” (CHASS)
“CANSIM is a multidimensional database containing more than 26 million time series regrouped in over 2,400 tables” (E-STAT, April 4, 2006)
CANSIM I and CANSIM II
CANSIM I: Original CANSIM database consisting of 908,879 time series in 9,380 matrices. Contains matrices and time series not in CANSIM II. Series identifiers start with a letter followed by numbers (called a label). Last updated June 1, 2002. (CHASS)
CANSIM II (CANSIM): Reorganized database. Matrices are called Tables. Time series identifiers all start with a V and are sometimes called a vector or a label. (CHASS)
Timing!!
NOTICE: The CANSIM service will be unavailable most of this coming weekend, from 7PM (Eastern time) Friday April 7 to approximately 7PM Sunday April 9, because of a major database reconfiguration.
Access
Product      Supplier            Use                 Cost
CANSIM       Statistics Canada   Unrestricted        Fee ($3.00 to $5,000)
CANSIM       E-STAT              Restricted – DSP    “Free” via IP Address
CANSIM I     CHASS               Restricted – DLI    “Free” via IP Address
CANSIM II    CHASS               Restricted – DLI    “Free” via IP Address
Content
                     Stat. Can      E-STAT         CHASS
Number of Tables     2,400+         2,400+         2,541
Number of Series     25 million+    26 million+    28 million+
Terminated series    Yes            Yes            Yes
Updates              Daily          Yearly         Weekly
CANSIM I data        No             No             Yes
CANSIM II data       Yes            Yes            Yes
Concordances         No             No             Yes
NOTES
When a method of measurement or definition or an attribute or concept changes, the old series is terminated, and a new series with a new series identifier is begun. (CANSIM – the many faces, UT/DLS)
When SIC 1980 was changed to NAICS 1997, series were terminated and new ones begun. This explains the limited time line of the NAICS series.
Content (CANSIM II)
                                 Stat. Can    E-STAT    CHASS
User Guide                       Yes          Yes       Limited
Table directory                  Yes          Yes       No
Terminated Series                Yes          Yes       Yes
IMDB/Survey lists                Yes          Yes       Yes
Numerical list of Series         No           No        Yes
Vector (series) listing          Yes          No        (Yes)
Link to publications & tables    Yes          No        No
Searching
                             Stat. Can    E-STAT    CHASS
By Keyword / Text            Yes          Yes       Yes
By Subject (Browse)          Yes          Yes       Yes
By Table number              Yes          Yes       Yes
By Series number             Yes          Yes       Yes
Survey number - get Tables   Yes          Yes       No
Searching
                                    Stat. Can    E-STAT    CHASS
Advanced / Boolean search           Yes          Yes       No
By Dimension member description     Yes          Yes       No
IMDB (surveys) by keyword           No           No        Yes
Frequently requested series         No           No        Yes
NOTES – Searching / Results
CHASS: get a listing of series unless searching by Table number
Stat Can: get a listing of Tables unless searching by Series number
Therefore difficult to compare retrieval:
PETS – CHASS got 60 series (82 with carPETS)
PETS – Stat Can got 5 tables – did not include carpets
Important to check “Match full keyword” in CHASS
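The PETS versus carPETS difference noted above is what one would expect when a keyword is matched as a substring rather than as a whole word. The following is a minimal sketch of that distinction only; it does not reproduce how CHASS or Statistics Canada actually implement their searches, and the series titles and function names are invented for illustration.

```python
import re

def substring_match(keyword, title):
    """Loose match: the keyword may appear inside a longer word."""
    return keyword.lower() in title.lower()

def full_keyword_match(keyword, title):
    """Strict match: the keyword must stand alone as a whole word."""
    pattern = r"\b" + re.escape(keyword) + r"\b"
    return re.search(pattern, title, flags=re.IGNORECASE) is not None

# Hypothetical series titles, made up for this example.
titles = [
    "Households reporting expenditures on pets",
    "Shipments of carpets and rugs",
    "Veterinary services for pets and livestock",
]

loose = [t for t in titles if substring_match("pets", t)]
strict = [t for t in titles if full_keyword_match("pets", t)]

print(len(loose), "titles with substring matching")
print(len(strict), "titles with full-keyword matching")
```

With substring matching the carpets title is retrieved along with the two pets titles; with full-keyword matching it is excluded.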
Results
Text / Keyword search    Stat. Can    E-STAT    CHASS
1st level - Tables       Yes          Yes       No
1st level - Series       No           No        Yes
Subject (browse)
1st level - Tables       Yes          Yes       Yes
Survey (browse)
1st level - Tables       Yes          Yes       -
Results
Text / Keyword search            Stat. Can    E-STAT    CHASS
2nd level get:
Link to survey information       Yes          Yes       Yes
Related subjects, categories     Yes          Yes       No
Vector directory                 Yes          No        (Yes)
Link to publications & tables    Yes          No        No
Results
                                         Stat. Can    E-STAT      CHASS
Selection of series - pick list          Yes          Yes         No
Date selection - series                  Multiple     Multiple    Single
Retrieve as individual series            Yes          Yes         Yes
Retrieve as a table                      Yes          Yes         No
Retrieve series from different tables    Yes          Yes         Yes
NOTES
Notes from Chris Leowski’s presentation in 2002:
CANSIM II: vector numbers are not recycled when a series is terminated. In CANSIM I they were.
No frequency conversion in the CHASS CANSIM II; this is not a CHASS priority.
Badly need a way of pointing users to series that replace terminated series and vice versa.
Visualisation of results
Individual Time Series              E-STAT    CHASS
Line(s) graph                       Yes       Yes
Bar(s) graph                        Yes       Yes
Lines graph with regression line    No        ?
Pie chart                           Yes       No
Scatter chart                       Yes       No
Histogram                           Yes       No
Box and whisker                     Yes       No
Manipulation of Results
                                E-STAT      CHASS
Change of frequency             Multiple    CANSIM I
Convert to annual - sum         Yes         CANSIM I
Convert to annual - average     Yes         CANSIM I
Percent changes                 Yes         No
Year to date sums & averages    Yes         No
Moving averages                 Yes         No
Centred moving averages         Yes         No
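Where an interface does not offer one of the manipulations listed above, the same calculation can be done after download. The sketch below shows one plausible definition of each operation in plain Python. It is not the formula E-STAT or CHASS uses, the function names are hypothetical, and the numbers are made up.

```python
from statistics import mean

def percent_changes(values):
    """Period-over-period percent change; the first period has no predecessor."""
    return [None] + [100.0 * (cur - prev) / prev
                     for prev, cur in zip(values, values[1:])]

def moving_average(values, window):
    """Trailing moving average over the current and previous window-1 periods."""
    return [None if i + 1 < window else mean(values[i + 1 - window:i + 1])
            for i in range(len(values))]

def centred_moving_average(values, window):
    """Moving average centred on each period (odd window sizes only)."""
    if window % 2 == 0:
        raise ValueError("this sketch handles odd windows only")
    half = window // 2
    return [None if i < half or i + half >= len(values)
            else mean(values[i - half:i + half + 1])
            for i in range(len(values))]

def to_annual(monthly, how="sum"):
    """Collapse {(year, month): value} to {year: sum or average}."""
    if how not in ("sum", "average"):
        raise ValueError("how must be 'sum' or 'average'")
    by_year = {}
    for (year, _month), value in sorted(monthly.items()):
        by_year.setdefault(year, []).append(value)
    aggregate = sum if how == "sum" else mean
    return {year: aggregate(vals) for year, vals in by_year.items()}

# Made-up observations, not real CANSIM data.
series = [100, 120, 90, 135]
monthly = {(2005, 11): 10, (2005, 12): 20, (2006, 1): 30}

print(percent_changes(series))
print(moving_average(series, 2))
print(centred_moving_average(series, 3))
print(to_annual(monthly, "sum"), to_annual(monthly, "average"))
```

Note that `to_annual` aggregates whatever months are present, so a partial year yields a partial sum; whether that is acceptable depends on the series.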
Output formats
                         E-STAT    CHASS
HTML table               Yes       No
Comma separated (CSV)    Yes*      Yes
Spreadsheet              Yes*      Yes
RATS, SAS, Shazam        No        Yes
SPSS, TSP, TSPterse      No        Yes
PRN (tab separated)      Yes*      No
* Choice of time as columns or rows
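The "time as columns or rows" choice is a transposition of the same data. If a file has been downloaded in one orientation and the other is needed, it can be flipped afterwards. A minimal sketch using Python's standard `csv` module follows; the vector names `v1` and `v2` and all values are invented, and the layout is only an assumption about what an exported file might look like.

```python
import csv
import io

def write_csv(rows):
    """Serialise a list of rows to CSV text."""
    buffer = io.StringIO()
    csv.writer(buffer, lineterminator="\n").writerows(rows)
    return buffer.getvalue()

def transpose(rows):
    """Swap rows and columns of a rectangular table."""
    return [list(column) for column in zip(*rows)]

# Time as rows: one line per period, one column per series (made-up values).
time_as_rows = [
    ["Period", "v1", "v2"],
    ["2005-01", 10, 7],
    ["2005-02", 12, 8],
]

# Time as columns: one line per series, one column per period.
time_as_columns = transpose(time_as_rows)

print(write_csv(time_as_rows))
print(write_csv(time_as_columns))
```

Most statistical packages expect time as rows, while time as columns can be easier to read on screen when there are many series and few periods.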
Choosing which to use
Currency – daily vs. weekly vs. yearly
Ease of searching – pick lists in Stat. Can. helpful
Sophistication of user – list of series can make finding data difficult with CHASS interface
Frequently used series are fast – in CHASS
Could use Stat Can interface to find series # and then go to CHASS to get most recent data
Output required – CHASS has more formats for statistical packages
Data manipulation required
Data visualisation required
Statistics Canada: Search results
Statistics Canada: Series selection
Statistics Canada: selecting Dimension members and dates
CHASS: Selection options
CHASS: Keyword search
CHASS: Results page
CHASS: Series information
CHASS: Retrieval, date and output selection
CHASS: Display of data
E-STAT: Search CANSIM
E-STAT: Text search
E-STAT: Advanced search
E-STAT: Search results
E-STAT: Series selection
E-STAT: selecting Dimension members and dates
E-STAT: output options
E-STAT: HTML table, time as rows
Search done on topic Pets in CANSIM II, CHASS interface