Archimer Ifremer’s institutional repository
Fred Merceur
IAMSLIC's 32nd annual conference
Every Continent, Every Ocean
October 8-12, 2006
Portland, Oregon, USA
Some general principles ...
In-house development, launched in August 2005. It is available at: http://www.ifremer.fr/docelec/
Includes: theses, post-publications, in-press publications, proceedings and Ifremer reports
All documents are available in PDF
Some general principles ...
All documents are loaded by the library's staff
To make sure we capture as many publications as possible:
Every week, we check the "Current Contents" database for publications written by Ifremer's staff,
For each of these publications, we look up the publisher's self-archiving policy on the Sherpa/Romeo web site,
If the publisher allows self-archiving of its own PDF files (e.g. EDP Sciences, The Company of Biologists…), we download the PDF file from the publisher's web site and load it into Archimer,
If the publisher only allows self-archiving of the author's last draft, we contact the authors to ask for this version, using automatic tools developed by Ifremer. If they can provide it, we produce a PDF file and load it into Archimer.
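The decision steps above can be sketched in a few lines of Python. This is only an illustration, not Ifremer's actual tooling: the `Policy` enum, the `next_action` function and the "do not archive" branch are assumptions introduced here, and the policy value is supposed to have been looked up by hand on Sherpa/Romeo beforehand.

```python
from enum import Enum


class Policy(Enum):
    """Hypothetical simplification of a publisher's self-archiving policy."""
    PUBLISHER_PDF = "publisher_pdf"  # the publisher's own PDF may be archived
    AUTHOR_DRAFT = "author_draft"    # only the author's last draft may be archived
    NONE = "none"                    # assumption: self-archiving is not allowed


def next_action(policy: Policy, author_has_draft: bool = False) -> str:
    """Return the next step for one publication, following the workflow above."""
    if policy is Policy.PUBLISHER_PDF:
        return "download publisher PDF and load into repository"
    if policy is Policy.AUTHOR_DRAFT:
        if author_has_draft:
            return "convert author draft to PDF and load into repository"
        return "contact author to request last draft"
    # Case not described in the slides; added so the function is total.
    return "do not archive"
```

For example, `next_action(Policy.AUTHOR_DRAFT)` returns the "contact author" step until the author has supplied a draft.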
Some results ...
More than 1400 documents are available in Archimer
More than 77% of the 2005-2006 articles with an Ifremer first author are freely available
More than 60% of the 2005-2006 articles with at least one Ifremer author are freely available
A repository linked to other library systems
Some of our tips to improve document visibility
Some facts:
90% of the documents are downloaded directly from standard web search engines, especially Google
A document indexed by Google will be downloaded 10 times more often than one that is not
We make sure that web crawlers have a way to find the full text:
We use the “text under image” PDF format when we need to scan a paper copy of a document
Image version
Text version (automatic OCR)
We try to convince authors to remove protection from PDF files
If an end user wants to copy the data, there are several workarounds to bypass the protection
Copying data without citation from a document that is highly visible on the internet is riskier than copying from a document that is not indexed anywhere
If data extraction is protected, web crawlers will not be able to index the document. This may reduce the number of downloads by a factor of 10.
Protection may also cause problems when the file format has to be changed several years from now
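A quick way to spot protected files before loading them could look like the sketch below. It relies on a crude heuristic, not a full PDF parser: an encrypted PDF declares an `/Encrypt` entry in its trailer, so the function simply searches the raw bytes for that name. The function name `looks_protected` is an assumption introduced here, and a file that merely contains the string `/Encrypt` in its content would be flagged by mistake.

```python
def looks_protected(pdf_bytes: bytes) -> bool:
    """Heuristic check: True if the raw PDF bytes contain an /Encrypt entry.

    Encrypted (and therefore permission-restricted) PDF files reference an
    encryption dictionary through /Encrypt. This does not parse the file,
    so false positives are possible.
    """
    return b"/Encrypt" in pdf_bytes


# Synthetic byte strings standing in for real files (illustration only).
open_sample = b"%PDF-1.4\ntrailer\n<< /Root 1 0 R >>\n%%EOF"
protected_sample = b"%PDF-1.4\ntrailer\n<< /Root 1 0 R /Encrypt 5 0 R >>\n%%EOF"
```

Files flagged this way can then be sent back to the author with a request for an unprotected version.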
We take care with PDF files generated by LaTeX
We ask authors to provide documents to be loaded into repositories with a complete record in English: title, abstract, keywords (many), ...
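Checking that a record is complete before loading can be sketched as follows. The field names and the `missing_fields` helper are assumptions made for illustration; the slides only name title, abstract and keywords as examples of what a complete record contains.

```python
# Fields named on the slide; a real record would likely carry more.
REQUIRED_FIELDS = ("title", "abstract", "keywords")


def missing_fields(record: dict) -> list[str]:
    """Return the required fields that are absent or empty in a record."""
    return [field for field in REQUIRED_FIELDS if not record.get(field)]


# Hypothetical record with an empty keyword list.
example_record = {
    "title": "An example title",
    "abstract": "An example abstract.",
    "keywords": [],
}
```

Here `missing_fields(example_record)` reports `keywords`, since an empty list counts as missing.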