PubMed Central Update Jennifer Jentsch Medical Library Association Conference May 2006

PubMed Central Update

Jennifer Jentsch

Medical Library Association Conference

May 2006

What is PubMed Central?

• PubMed Central (PMC) is the U.S. National Library of Medicine's digital archive of peer-reviewed, primary research articles from life science journals

• Launched Feb 2000

• In six years has grown to 230 journals, 596,000 articles

• Unique IP addresses: 2 million / month

• Articles retrieved / month – HTML full text, PDF, or scanned article summary: 7.65 million

• Total page views / month, incl. searches: 11.45 million

PMC Philosophy & Function

• A logical extension of NLM’s general mandate
• Digitize journal back issues to create a full electronic record of a journal
• Provide free and unrestricted access to PMC, which ensures durability and utility of the archive as technology changes over time
• Follow in the footsteps of other major NCBI resources
• Integrate the literature with other valuable information resources in the database family – PubMed, Nucleotide, etc.

What’s in PMC?

• Electronic content representing current journal issues

• Small subset is Open Access

• Digitized Print content for older material (Back Issue Digitization Project)

• NIH Manuscripts (NIH Public Access Policy)

• Wellcome Trust Manuscripts

PMC’s Archives

Recent & new electronic content
• Represents roughly 40% of PMC content
• Keep PDF, full-text XML, original images, supplementary material
• XML preferred for archiving:
– technology-independent and portable
– allows for richer linking and automated text analysis

Back issues (print only)
• Now approximately 60% of PMC content
• Print journals scanned cover to cover – from volume 1
• Create and archive high quality PDF with true reproduction of figures / images
• OCR text used to do full-text searching and build list of linked reference citations
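The point that XML "allows for richer linking and automated text analysis" can be illustrated with a minimal sketch. The fragment below is a hypothetical, heavily simplified article, not real PMC content; its element names (article-title, ref-list, ref, citation) are assumed to follow the general shape of the NLM Journal Publishing DTD, and the function name is made up for this example.

```python
import xml.etree.ElementTree as ET

# Hypothetical, heavily simplified article fragment (not taken from PMC).
SAMPLE_ARTICLE = """
<article>
  <front>
    <article-meta>
      <title-group>
        <article-title>An Example Article</article-title>
      </title-group>
    </article-meta>
  </front>
  <back>
    <ref-list>
      <ref id="B1"><citation>First cited work.</citation></ref>
      <ref id="B2"><citation>Second cited work.</citation></ref>
    </ref-list>
  </back>
</article>
"""


def summarize_article(xml_text):
    """Return the article title and a list of (ref id, citation text) pairs."""
    root = ET.fromstring(xml_text)
    title = root.findtext("./front/article-meta/title-group/article-title")
    # Every <ref> is addressable by its id, which is what makes linking easy.
    refs = [(ref.get("id"), ref.findtext("citation")) for ref in root.iter("ref")]
    return title, refs


title, refs = summarize_article(SAMPLE_ARTICLE)
print(title)
for ref_id, text in refs:
    print(ref_id, text)
```

Because the structure is explicit in the markup, the same few lines work regardless of how the article is later rendered, which is the sense in which XML is technology-independent.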

PMC – Electronic Content

PMC – Digitized Print Content

Scanned articles - References

Titles in PMC

Scanned Content – Coming Soon

• BMA Specialist Journals: 20+ titles including:
– Heart
– Thorax
– Journal of Clinical Pathology
– British Journal of Ophthalmology
– Annals of the Rheumatic Diseases

• Wellcome Trust Journals: 13 titles including:
– British Journal of Pharmacology
– Journal of Physiology
– Journal of Anatomy
– Immunology
– British Journal of Cancer

End of 2006: 500,000 scanned articles

PMC and Open Access

• Defined in Bethesda Statement on OA
– Goes beyond free access
– Author/copyright holder grant users right to access, copy, use, distribute, transmit and display work on condition of proper attribution of authorship
– Users can also make and distribute derivative works
– Copy of work deposited in online repository enabling “open access, unrestricted distribution, interoperability, and long-term archiving”

Open Access and Copyright

PMC Access Flavors

• Free (PMC default)
– Normal “fair use” copyright provisions

• Open Access
– Author pays
– Creative Commons or similar license to reuse with attribution
– Article source files freely available from PMC via OAI or FTP

• Open Access (me, too!)
– Free + Author pays

• Public Access
– Author Manuscripts from NIH-funded researchers
– No agreements with publishers

• PubLink
– Full text viewable only at journal site
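For the Open Access flavor, source files are described above as available "via OAI or FTP". As a rough sketch of what an OAI harvesting request looks like, the snippet below builds a standard OAI-PMH ListRecords URL. The base URL is a placeholder and the function name is hypothetical; neither is taken from the slides, and PMC's actual endpoint and supported sets should be checked in its own documentation.

```python
from urllib.parse import urlencode

# Placeholder endpoint: an assumption for illustration, not PMC's real OAI address.
BASE_URL = "https://example.org/oai"


def build_list_records_url(base_url, metadata_prefix, set_spec=None, from_date=None):
    """Build an OAI-PMH ListRecords request URL (no network access is made)."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec:
        params["set"] = set_spec      # optional: restrict harvesting to one set
    if from_date:
        params["from"] = from_date    # optional: only records changed since this date
    return base_url + "?" + urlencode(params)


# oai_dc (Dublin Core) is the metadata format every OAI-PMH repository must support.
url = build_list_records_url(BASE_URL, "oai_dc", from_date="2006-01-01")
print(url)
```

A harvester would fetch this URL, parse the XML response, and repeat the request with the returned resumption token until the list is exhausted.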

Open Access Journals in PMC

• Open Access Journals in PMC:
– BioMed Central Journals
– Amphibian and Reptile Conservation
– Biological Procedures Online
– Journal of Biomedicine and Biotechnology
– Journal of Insect Science
– Nucleic Acids Research
– PLoS Biology and Medicine
– Preventing Chronic Disease

• In October 2003, PMC began accepting individual open access articles from journals that do not participate in PMC on a routine basis.
– Science
– Biological Chemistry

PMC and Public Access

• The Public Access Policy requests that investigators funded by the National Institutes of Health (NIH) submit an electronic version of their final, peer-reviewed manuscripts upon acceptance for publication to the NIH National Library of Medicine's PubMed Central (PMC).

• To facilitate the submission process to PMC, NIH has developed the NIH Manuscript Submission (NIHMS) system (http://www.nihms.nih.gov/), a password-protected, Web-based system.

• Currently 2500 NIH Manuscripts in PMC

PMC Content Sources

Regular PMC Participation (since 2000)
• Formal agreement between NLM and publisher
• Journal may delay free access up to a year or more
• Copyright is retained by publisher or author
• Deposits are permanent
• Journal may stop depositing new material but may not withdraw material already deposited

NIH Public Access Policy (May 2005)
• Requests NIH grantees and intramural researchers to deposit final manuscript of paper accepted for publication
• Author specifies PMC release date – up to 12 months after publication – subject to publisher’s copyright policy
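The release window in the policy ("up to 12 months after publication") is a simple date calculation. The sketch below shows one way to compute the latest permissible release date; the function names are hypothetical, and the end-of-month handling (clamping to the last day of a shorter month) is an assumption of this example, not something the policy specifies.

```python
import calendar
from datetime import date


def add_months(d, months):
    """Return the date `months` calendar months after `d`, clamping the day
    to the last day of the target month when it is shorter."""
    month_index = d.month - 1 + months
    year = d.year + month_index // 12
    month = month_index % 12 + 1
    day = min(d.day, calendar.monthrange(year, month)[1])
    return date(year, month, day)


def latest_release_date(publication_date, max_delay_months=12):
    """Latest release date allowed under a delay of `max_delay_months`."""
    return add_months(publication_date, max_delay_months)


def is_within_release_window(publication_date, release_date, max_delay_months=12):
    """True if the chosen release date is no later than the allowed maximum."""
    return release_date <= latest_release_date(publication_date, max_delay_months)


print(latest_release_date(date(2006, 5, 15)))
print(is_within_release_window(date(2006, 5, 15), date(2007, 6, 1)))
```

A publisher's copyright policy may shorten the window further, which is why the maximum delay is a parameter here rather than a constant.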

Manuscript in PMC

• Distinctive banner with NIHPA-specific links

• Citation for MS and for published article

• Link to published article at journal site

• Otherwise, looks like other PMC articles

PMC – Recent Developments

• Springer ‘Open Choice’ and Blackwell ‘Online Open’ articles now coming in to PMC

• Also working with OUP ‘Oxford Open’

• Detailed tagging guidelines released for NLM Journal Publishing DTD

• Library of Congress and British Library are adopting NLM Journal DTD as a standard for tagging journal articles

PMC in the Future

• PMC International

– PMC-in-a-box for quick setup of a reliable mirror

– Tested by UK (Wellcome Trust), Italy (Italian National Research Council) and S. Africa (National Bioinformatics Network)

– Also working with British Library, China and Japan

• PMC Help as book on NCBI Bookshelf

More Info / Contact Us

PubMed Central Site: http://www.pubmedcentral.nih.gov

Questions about PMC: [email protected]

NIH Manuscript Submission System: http://www.nihms.nih.gov

NIH Public Access Policy: [email protected]