CXC
The Chandra Bibliography Database
Arnold Rots, Sherry Winkelman, Sarah Blecksmith, John Bright
Chandra Data Archive Operations Group, CXC/SAO
Stéphane Paltani
Observatoire de Marseille
CXC2004-01-12 Chandra Users Committee: Chandra Data Archive 2
Summary
Existing capability
Identifiers and automatic linking
Extension of the database
Attributes
Database design
Database maintenance
Services
Conclusion and coming attractions
This presentation is adapted from a paper given at ADASS XIII
Pages 9-13 are less relevant and provided FYI only
The interface on pages 15, 16 is an old version, provided for illustrative purposes only; please try the prerelease, using the URL on page 18
CXC2004-01-12 Chandra Users Committee: Chandra Data Archive 3
Existing Capability
On the part of the archive:
– Links from datasets (observations) to articles in the ADS
– Scattered links to some specific articles
On the part of the ADS:
– Links from articles (bibcodes) to datasets in data center archives
– General project tags
This is very valuable, but also very labor-intensive
CXC2004-01-12 Chandra Users Committee: Chandra Data Archive 5
Identifiers and Automatic Linking
The ADS, the data centers, and US journal editors have reached an agreement that will enable authors to insert these links directly in a manuscript
Central to such linking are IVOA-compliant dataset identifiers:
– Namespace: ivo:
– Authority Id: ADS
– Data collection
– Dataset
ivo://ADS/Sa.CXO#214
ivo://ADS/Sa.CXO#M31mosaic
We will provide services that will enable users to insert these IDs
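As a rough illustration of how the four parts of such an identifier fit together, the sketch below splits the two example identifiers from this page into namespace, authority, data collection, and dataset. The names `DatasetId` and `parse_dataset_id` are hypothetical and not part of any CXC or ADS service; the splitting rule is inferred only from the two examples shown here.

```python
from typing import NamedTuple


class DatasetId(NamedTuple):
    namespace: str   # "ivo"
    authority: str   # "ADS"
    collection: str  # data collection, e.g. "Sa.CXO"
    dataset: str     # dataset within the collection, e.g. "214"


def parse_dataset_id(identifier: str) -> DatasetId:
    """Split an identifier of the form ivo://AUTHORITY/COLLECTION#DATASET."""
    namespace, _, rest = identifier.partition("://")
    resource, _, dataset = rest.partition("#")
    authority, _, collection = resource.partition("/")
    # Every part must be present, and the namespace must be "ivo".
    if namespace != "ivo" or not (authority and collection and dataset):
        raise ValueError(f"not a dataset identifier: {identifier!r}")
    return DatasetId(namespace, authority, collection, dataset)


for example in ("ivo://ADS/Sa.CXO#214", "ivo://ADS/Sa.CXO#M31mosaic"):
    print(parse_dataset_id(example))
```

Keeping the parts separate makes it straightforward to route a request by data collection before looking up the dataset itself.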
CXC2004-01-12 Chandra Users Committee: Chandra Data Archive 6
Extension of the Database
Originally, the database contained only journal articles and conference proceeding papers that can unambiguously be connected with specific observations, plus an amorphous collection of papers that are “Chandra-related”
Extension of subject categorization:
– Referring to specific observations
– Referring to published results
– Predicting Chandra results
– Referring to instrumentation, software, or operations
– Other
Inclusion of all other types of publications (except preprints!)
CXC2004-01-12 Chandra Users Committee: Chandra Data Archive 7
Attributes
Subject:
– Observations, instruments, software, operations
Kind of publication:
– Book, journal, proceedings, thesis, circular, review, newsletter, internal
Type of publication:
– Article, abstract, memo, data, erratum, article (abstract only available), title only, electronic
Number of citations
Keywords (standard ApJ as well as custom)
A variety of other items:
– Date of publication, refereed or not, etc.
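The attribute list above could be modeled roughly as follows. This is only a sketch: the class and field names (`BibEntry`, `Kind`, `PubType`, `SUBJECTS`) are hypothetical, the enumerated values are copied from the list on this page, and the actual database columns may differ.

```python
from dataclasses import dataclass, field
from enum import Enum


class Kind(Enum):
    """Kind of publication, as listed on this page."""
    BOOK = "book"
    JOURNAL = "journal"
    PROCEEDINGS = "proceedings"
    THESIS = "thesis"
    CIRCULAR = "circular"
    REVIEW = "review"
    NEWSLETTER = "newsletter"
    INTERNAL = "internal"


class PubType(Enum):
    """Type of publication, as listed on this page."""
    ARTICLE = "article"
    ABSTRACT = "abstract"
    MEMO = "memo"
    DATA = "data"
    ERRATUM = "erratum"
    ABSTRACT_ONLY = "article (abstract only available)"
    TITLE_ONLY = "title only"
    ELECTRONIC = "electronic"


SUBJECTS = frozenset({"observations", "instruments", "software", "operations"})


@dataclass
class BibEntry:
    bibcode: str
    subjects: frozenset  # subset of SUBJECTS
    kind: Kind
    pub_type: PubType
    refereed: bool
    citations: int = 0
    keywords: list = field(default_factory=list)

    def __post_init__(self):
        # Reject subject labels that are not in the list above.
        unknown = set(self.subjects) - SUBJECTS
        if unknown:
            raise ValueError(f"unknown subjects: {sorted(unknown)}")
```

Using enumerations for the kind and type keeps data entry restricted to the agreed vocabulary, which matters when metrics are later derived per category.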
CXC2004-01-12 Chandra Users Committee: Chandra Data Archive 8
Database Design
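[Diagram: entity-relationship schema connecting the tables BibTable, ObsId, Subjects, Datasets, DatasetObsIds, Observation Catalog, Proposals, URLs, Keywords, and Std Keywords; the relationships are labeled with the cardinalities 1, 1+, 0+, and 0,1]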
CXC2004-01-12 Chandra Users Committee: Chandra Data Archive 9
Database Maintenance
Management of new entries through a dedicated database:
– Automatic filling of BibWork
– Attributes filled in through GUI
– Migrate entries to BibTable upon completion
– The Datasets and DatasetObsIds tables are shared with the main database
Automatic updating of number of citations
Automatic check on validity of bibcodes
[Diagram: tables BibWork, Datasets, DatasetObsIds, and ObsId]
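The step of migrating entries from BibWork to BibTable upon completion might look like the sketch below. It uses SQLite purely as a stand-in; the column names (`bibcode`, `subject`, `complete`) and the functions `make_schema` and `migrate_completed` are assumptions made for illustration, not the actual CXC schema or tooling.

```python
import sqlite3


def make_schema(conn: sqlite3.Connection) -> None:
    """Create hypothetical, heavily simplified BibWork and BibTable tables."""
    conn.executescript(
        """
        CREATE TABLE BibWork (
            bibcode  TEXT PRIMARY KEY,
            subject  TEXT,
            complete INTEGER NOT NULL DEFAULT 0
        );
        CREATE TABLE BibTable (
            bibcode TEXT PRIMARY KEY,
            subject TEXT NOT NULL
        );
        """
    )


def migrate_completed(conn: sqlite3.Connection) -> int:
    """Move entries marked complete from BibWork to BibTable; return the count."""
    with conn:  # one transaction: either both steps happen or neither does
        rows = conn.execute(
            "SELECT bibcode, subject FROM BibWork WHERE complete = 1"
        ).fetchall()
        conn.executemany(
            "INSERT INTO BibTable (bibcode, subject) VALUES (?, ?)", rows
        )
        conn.execute("DELETE FROM BibWork WHERE complete = 1")
    return len(rows)
```

Wrapping the copy and the delete in a single transaction means an entry is never lost or duplicated if the migration is interrupted.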
CXC2004-01-12 Chandra Users Committee: Chandra Data Archive 10
Database Maintenance Interface
Filling the database
CXC2004-01-12 Chandra Users Committee: Chandra Data Archive 11
Database Maintenance Interface
Checking the paper
CXC2004-01-12 Chandra Users Committee: Chandra Data Archive 12
Database Maintenance Interface
Set attributes
CXC2004-01-12 Chandra Users Committee: Chandra Data Archive 13
Database Maintenance Interface
Establish proposal links
CXC2004-01-12 Chandra Users Committee: Chandra Data Archive 14
Services
Exchange of information with ADS: harvesting of Bibcode – Dataset Identifier pairs in both directions
Provide access to datasets through either a Dataset Identifier or a Bibcode
Provide information to ADS on Bibcodes that are not related to specific observations
Provide access to publications through queries from our archive; see next page and:
http://cxc.harvard.edu/cgi-gen/cda/bibliography.cgi
Derive metrics through queries (standardized as well as custom; see Paul Green’s presentation)
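To make the idea of Bibcode – Dataset Identifier pairs usable in both directions concrete, here is a minimal sketch that turns a list of harvested pairs into two lookup tables. The function name `build_links` and the placeholder bibcodes are hypothetical; only the dataset identifiers are taken from the examples earlier in this presentation.

```python
from collections import defaultdict


def build_links(pairs):
    """Index (bibcode, dataset_id) pairs so either side can be looked up."""
    by_bibcode = defaultdict(set)
    by_dataset = defaultdict(set)
    for bibcode, dataset_id in pairs:
        by_bibcode[bibcode].add(dataset_id)
        by_dataset[dataset_id].add(bibcode)
    return dict(by_bibcode), dict(by_dataset)


# Placeholder bibcodes, made up for illustration only.
pairs = [
    ("BIBCODE-A", "ivo://ADS/Sa.CXO#214"),
    ("BIBCODE-A", "ivo://ADS/Sa.CXO#M31mosaic"),
    ("BIBCODE-B", "ivo://ADS/Sa.CXO#214"),
]
by_bibcode, by_dataset = build_links(pairs)
print(sorted(by_dataset["ivo://ADS/Sa.CXO#214"]))  # articles that use dataset 214
```

The same pair list serves both services named above: access to datasets from a bibcode, and access to publications from a dataset identifier.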
CXC2004-01-12 Chandra Users Committee: Chandra Data Archive 15
Services
Literature search from the archive (shown here is an old version)
CXC2004-01-12 Chandra Users Committee: Chandra Data Archive 16
Services
(Previous page) A simple query example: find all publications related to Chandra Crab observations
This returns 4 articles; be aware that there may be more (e.g., meeting abstracts!) that could not be traced to specific observations
The bibcodes link to the abstracts in the ADS
(This page) The link to the ADS provides more details on all four papers
CXC2004-01-12 Chandra Users Committee: Chandra Data Archive 17
Database Metrics (Oct 2003)
                                                   All articles     Refereed only
Category          1999  2000  2001  2002  2003  Total  No. cit.   Total  No. cit.
Observations        53   284   485   485   352   1659      5639     712      5597
Refer to obs.        9    94   333   499   322   1257      5300     897      5231
Instr., etc.        34   141   124    69    18    386      1362     354      1355
Predict result      11    67    21    14    21    135       306      22       296
Unclassified        15    90    70    29    40    244       663     118       650
Total              122   676  1033  1097   753   3681     13270    2103     13129
Reviewed          1011  2507  2735  2758  1859  10870
CXC2004-01-12 Chandra Users Committee: Chandra Data Archive 18
Conclusion
We have developed a comprehensive database that is capable of tracking all mission-related publications and preserving all relevant information
Added to this are a database and GUI that make maintenance (i.e., data entry) as painless as possible
Services include cross-linking with the ADS, a powerful literature search from the Chandra archive, and metrics
The entire package is reasonably mission independent and we are happy to provide it to other data centers
Try the new interface at: http://cxc.harvard.edu/cgi-gen/cda/bibliography.cgi
CXC2004-01-12 Chandra Users Committee: Chandra Data Archive 19
Coming Attractions
FITS keyword database: an interactive web-based tool that allows users to look up the meaning and use of all keywords used in CXC FITS files, and to construct compliant headers
Special Requests: a web-based tool that allows users to make special data requests, backed up by a database that tracks the status of these requests
– Request for previous data versions
– Request for special processing
– Request for data on physical medium
– Request for custom database query
– Anything else (reasonable)