Recent trends in preserving and providing perpetual...

Preview:

Citation preview

Recent trends in preserving and providing perpetual access to electronic resources and effective use of related

metadata

G.K.Manjunath

Chief Librarian

Indira Gandhi Inst of Development Research

Film City Road

Goregaon (East)

Mumbai – 400 065

Email: gkm@igidr.ac.in

Phone: 22-28416528

Recent Trends in Librarianship

• Application of Information and Communication Technology (ICT) to libraries has witnessed sea of changes in storage and retrieval of information.

• Information resources are now widely accepted all over the world in electronic format.

• Round the clock access through the campus wide network, conservation of space, cost effectiveness, efficient retrieval methods, easy and effective way of sharing, etc are some of the reasons for their popularity and wide acceptance.

Recent Trends in Librarianship

• It is extreamly difficult to highlight all the trends and developments in LIS in one lecture or session.

• However, I shall make an attempt to highlight some of the existing practices in preserving and providing perpetual access to electronic resources.

• I shall also attempt to highlight certain trends in data standards and effective use of bibliographic data / metadata both by librarians and end users

My Approach

• What can be done ? Understanding • What has been done ? - Examples • How this can be done ? - Hands on / training

Preservation of metadata /bibliographic data and statistical Data

Bibliographic Data - ISO2709 and MARC DC-XML Statistical Data – MS-Excel, Structured PDF,

ASCII files, etc.

Preservation of Full-text materials

Perpetual access to what is subscribed - Online ( Books, Journals, etc). Access to and preservation of what is

purchased and owned- Offline (Books and other databases). Access to and preservation of what is

downloaded, subject to copyright clearance- Offline ( Working papers, govt publications ) Access to what is published by an institution

(IR)

Audios, Videos, Pictures, etc

• Lectures - Audio and Video • Events - Audio and video of Convocation,

Annual lectures, etc • Library members - Students, faculty

members, visitors

Data Standards and inter-conversion of Bibliographic Data/ Metadata

• Conversion of MS-Excel data to ISO2709 (CCF) and MARC (.mrc)

• Copy Cataloging • DC-XML • MARC-XML

File format ISO-2709 0050300000000025300045004600005000006100012000054

4000050001762000160002262000170 0038620001000055620000800065001000200073015000200

0750200006000770220011000830400 0040009405000040009806000040010212000020010610000

1400108200008500122300002000207 400002200227#229p#330.973/SHE#2010#Social

sciences#Financial crisis#Recession#Ec onomy#1#m#IGIDR#11/12/2010#Eng#010#100#1#97807656

25373#The Roller coaster econom y: Financial crisis, great recession, and the public

option#^aSherman, Howard J# ^aNew York^bM E Sharp##

File format .mrc

00371 am 001213u 042000700000100005600007245002600063260005900089520005100148546000800199690002600207655001600233 adc10aKitcher, Philid1947-. eauthor00aThe ethical project / bCambridge, Mass. : Harvard University Press,, c2011.. aIncludes bibliographical references and index. aeng aEthics, Evolutionary.7 atext2local

Dublin Core Tags

Implementation Notes

Contributor An entity responsible for making contributions to the resource.(Secondary Contribution). Co-authors (Added entry for personal names), corporate body / Name and conference name

Coverage The spatial or temporal topic of the resource, the spatial applicability of the resource, or the jurisdiction under which the resource is relevant.(Geographic Name)

Creator An entity primarily responsible for making the resource.(Primary Contribution - Authors) Date A point or period of time associated with an event in the lifecycle of the resource. Publication

Year. Description An account of the resource.( description may include but is not limited to: an abstract, a table

of contents, a graphical representation, or a free-text account of the resource).

Format The file format, physical medium, or dimensions of the resource. Ex : pdf, avi, etc.

Identifier An unambiguous reference to the resource within a given context. URL or DOI

Language A language of the resource. [ Use Language Table ]. Usually inbuilt in the software.

Publisher An entity responsible for making the resource available. Relation A related resource. Ex: Citations Rights Information about rights held in and over the resource. Source A related resource from which the described resource is derived. Subjects The topic of the resource. Title A name given to the resource Type The nature or genre of the resource.( Recommended best practice is to use a controlled

vocabulary such as the DCMI Type Vocabulary [DCMITYPE]. To describe the file format, physical medium, or dimensions of the resource, use the Format element.)

Standard XML file with Dublin Core tags (Alpha tags) <?xml version="1.0" encoding="utf-8"?> <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"

xmlns:dc="http://purl.org/dc/elements/1.1/"> <rdf:Description> <dc:title>Development-induced displacement, rehabilitation and resettlement in India : current issues and

challenges /</dc:title> <dc:creator>Somayaji, Sakarama.</dc:creator> <dc:creator>Talwar, Smrithi.</dc:creator> <dc:type>text</dc:type> <dc:publisher>Abingdon, Oxon ; New York : Routledge,</dc:publisher> <dc:date>2011.</dc:date> <dc:language>eng</dc:language> <dc:description>Includes bibliographical references and index.</dc:description> <dc:subject>Forced migration</dc:subject> <dc:subject>Internally displaced persons</dc:subject> <dc:subject>Land use, Rural</dc:subject> <dc:subject>Land settlement</dc:subject> <dc:subject>Eminent domain</dc:subject> <dc:subject>Economic development projects</dc:subject> </rdf:Description> </rdf:RDF>

Customised data in XML <record>

<DocType>Journal Article</DocType> <author>Samantaraya, Amaresh%Verrier, Jeanne</author> <atitle>Do Macroeconomic Indicators Explain India's Sovereign Ratings? An Empirical Analysis</atitle> <Jtitle>Margin: Journal of Applied Economic Research</Jtitle> <Jvol>3</Jvol> <Jissue>3</Jissue> <Jmonth>Jul-Sep</Jmonth> <Jyear>2009</Jyear> <Jpage>193-221</Jpage> <keywords>Sovereign Credit Rating%India</keywords> <abstract>Sovereign rating—an assessment of the relative likelihood of a country defaulting on its financial obligations—is important not only because better ratings facilitate external borrowing at favourable rates of interest, but also for the impetus it gives to the development of domestic financial markets in an emerging economy. Assessing the role of various macroeconomic factors influencing India's sovereign rating, our preliminary analysis indicated that while upgrading India's sovereign rating since 2003–04 both Standard and Poor's and Moody's apparently taken, high GDP growth, rising foreign reserves, declining fiscal deficit and rising exports into account, but the economic rational for downgrading the rating between 1998 and 2002 is not very convincing. The econometric analysis, using an ordered Probit model, did not find strong statistical support for several relevant macroeconomic indicators in the determination of India's sovereign ratings. Could a major part of the explanation have come from some other sources, such as qualitative social and political considerations? India's relatively low sovereign ratings compared to its peers raises questions about rating agencies appropriately accounting for India's excellent economic performance in recent years; its formidable record of serving all external liabilities on time; and factors related to political stability and the social fabric of unity amidst diversity. Had these factors been taken into account appropriately, perhaps India's rating would have been better reflective of reality. </abstract> <url>http://mar.sagepub.com/cgi/reprint/3/3/193</url> <license>SAGE Publications India Pvt Ltd</license> </record>

Backup of Data from your Automation software

• Only Bibliographic Data can be downloaded in ISO2709(CCF) or .mrc (MARC) or in MS-Excel Format

• Entire Data with all related files can be downloaded. In this case find out the path where in data with all the related files are store

• Another option- Get a utility (shell script) written so that backup is copied to a particular folder so that the library staff need not touch the original data.

Backup of Libsys Data • A Shell Scrip has written to take all the data from

data folder • The backup is kept in folder /usr2/libsys_backup • libsys.sql [ name of the backup file ] • This also can be kept in E-library server or a

backup server / external hard disk • W:\libsys\server\backup7\libsys-03-06-2015.sql

Do you need to remembers tags and delimiters ? Confusion ?

• Should you remember all the tag numbers and sub-field delimiters ? Are you a beginner ?

• No, there is no need. The data entry interface will assist you to enter data in the respective field. The system will take care of storing data in corresponding fields with delimiters

• Quite Easy, is it not ?

Free Data Converting Tools

• ISIS-ASCII - Plain text to ISO2709 • ISIS-MARC - Imports MARC records • Marc Edit - Converts MS-Excel file to

MARC. Copy Cataloging possible. Good tool for retrospective conversion.

* Workshop / training required.

Types of electronic documents

• Monographs, working papers, conference proceedings, handbooks, Govt Publications, annual reports, journal articles.

• Journals and other serials publications. • Statistical Data and Databases. • Audio, video, pictures, etc.

Access to what is subscribed ( Books and

Journals).

• Elsevier, Sci Direct • Wiley/Blackwell, Springer, Taylor and Francis,

etc. • Aggregators - Jstor, Project Muse, EBSCO and

Proquest,

Preservation of e-resources on local servers- CD servers, Juke Boxes and NAS

Juke Box: Number of slots for CD/DVD are limited.

CD/DVD Server: Makes Image of CD/DVD. Does fast copying

NAS ( Network Access Storage). Integration of multiple disks Possible.

Network Attached Storage (NAS) • Quick and simple installation of the server allows the users to instantly

store and share all types of files such as text, audio, videos, images, and other files, from any workstation.

• The server comes with pre-installed operating system and a file

management system (FMS). FMS allows the administrator or the general user, depending on their rights /permissions , to create folders and sub-folders, user, groups and to give rights to the users at all levels. In other words, the administrator or a user with such rights, can give rights to other users or to group, to create, modify, add, delete folders or sub-folders or files on the server.

• In a NAS system, the operating system usually will not be loaded on the

disk used for storing data. Rather will be residing on a separate hard disk, with or without mirroring or on a chip that comes along with the mother board.

Network Attached Storage (NAS)

• The operating system, if loaded on a chip of the mother board, instead of on a hard disk is known as thin operating system. An operating system with minimum functionalities, called as a 'Thin OS' or also known as ‘Stripped-down operating system’, is generally is used on a NAS system for storing and managing files as no data processing is involved.

• It is not possible to run other applications on this server. The NAS server usually consists of multiple hard disks configured as RAID [Redundant Array of Integrated Disks]. The acronym RAID means Redundant Array of Independent/Inexpensive Disks refers to a data storage unit using multiple hard drives to share or replicate data among the drives.

RAID – Redundant Array of Independent Disk

• The advantage of RAID is that, depending on its level, it can offer protection and enhance performance of data stored and volumes can be created using the multiple hard disks as one unit

• Accidentally if one hard disk crashes, the system will not come to a halt. The crashed hard disk can be replaced with a new one without stopping the system, what is known as ‘hot swap’. The RAID can be implemented either by adding hardware RAID controller card or through by the software. Generally Hardware RAID is recommended.

RAID • The RAID-5 technology requires minimum 3 hard disks,

preferably with identically sized disk drives and it provides only 75% - 80% of the disks' capacity for actual storage and rest is utilized for the storing configuration related files.

• A Pentium or Xeon server can easily handle multiple hard

disk of both SCSI and SATA hard disk. The maximum capacity available on SATA was 500 GB. Depending on the number of slots available the total storage capacity of the system can vary from system to system.

• The RAID offers increased data integrity and fault-tolerance

compared to single drives.

Backup system

• Even though RAID technology assures safety of data if one hard disk fails, there is risk if two hard discs go bad simultaneously

• Therefore, a secondary backup on external hard disk is always preferred

• Copy all the folders to the external hard disc once. Next time only copy and paste only those folders from the server which are updated ( over writing )

Backup On Cloud

• Google drive [ 15 GB free ]. Sharing possible • Microsoft Onedrive. Sharing possible • Drop Box -”-

• Free up to certain extent. Buy extra space for

a fee. • Best option – Your own external hard disk

Census, 2001

Maintenance of CDs coming accompanying books

• Create a folder with the same accession number given to the book.

• Copy the content of CD to this folder • Provide hyperlink

Perpetual Access to E-Journals

• Even though publishers assure online access to e-jls on continuous basis certain triggered events may cause seamless access.

• When a publisher ceases operations and titles are no longer available from any other source

• When a publisher ceases to publish and offer a title, and it is not offered by another publisher or entity

• When back issues are removed from a publisher’s offering and are not available elsewhere

• Upon catastrophic failure by a publisher’s delivery platform for a sustained period of time

Portico /

• Portico is a digital preservation service provided by ITHAKA, a not-for-profit organization with a mission to help the academic community use digital technologies to preserve the scholarly record and to advance research and teaching in sustainable ways.

• About 257 publishers are participating in this initiative.

CLOCKSS [Controlled Lots of Copies to Keep Stuff Safe

Institutional Repository (IR)

• It is nothing but a Digital Library of an Institution

• IR will usually host all the research papers emanating from an institution for which it has copyright

• Such an Initiative will provide wider access to the research papers and thus increases citations.

Digital Library vs Electronic Library • These are near synonyms • However, the Standard and accepted term is

digital library. • Certain Features : * Must be available on internet * OAI-PMH compatibility * UNICODE * Response to OAI verbs * Should have PURL or Handle for each Document

OAI Verbs

Identify : turns information about the repository e.g. http://oii.igidr.ac.in:8080/oai/request?verb=Identify

ListMetadataFormats Lists the metadata formats supported by the repository. The minumum requirement is

oai_dc (Dublin Core) e.g. http://oii.igidr.ac.in:8080/oai/request?verb=ListMetadataFormats

ListSets: Lists the sets provided by the repository (e.g. departments, subjects, etc.) e.g. http://oii.igidr.ac.in:8080/oai/request?verb=ListSets

ListIdentifiers : Lists record identifiers, dates & any other headers for each deposited item. e.g. http://oii.igidr.ac.in:8080/oai/request?verb=ListIdentifiers&metadataPrefix=oai_dc

ListRecords : Harvests metadata records from the repository Requires the argument 'metadataPrefix' - metadataPrefix=oai_dc should suffice. Results can be limited to specified sub-sets. e.g. http://oii.igidr.ac.in:8080/oai/request?verb=ListRecords&metadataPrefix=oai_dc

• GetRecord (Not working)

Some popular DL open source software

• Dspace • GSDL • E-prints • Fedora

Certain Problems

• LIS professionals may find it difficult to install, maintain and upgrade DL software.

• Require assistance from computer experts and expertise in Operating System

• Requires hardware and its maintenance • What is the Solution ?

Cloud Computing

• Your DL or library automation software on remote server

• Access and update your DL with a login and password

• Installation, maintenance, backup, upgradation will be done by the service provider for a nominal fee

Service Providers on Cloud in India

• INFORMATICS • OSS labs

Zotero- Free Reference Manager Tool

Installation of Zetero

Freeware Can be installed on Mozilla Firefox /

Chrome/Safari Available for Desktop and online Plugins available for MS-Office, MAC-Office

and Liberoffice ( Word processors)

Zotero for Firefox

Search on an Author

Now select the articles

Integration with MS-Word

Options

MENDELEY [Elsevier ]

• Another Reference management Software • Free upto 2 GB for individuals • Institution subscription possible

Other Ref Manager Tools

• Endnote • Refworks

Comparison

Unique Identifiers

• ISBN - Books • ISSN - Journals • DOI - For individual document / research

work. DOI provided by crossref • ORCID- Individual Researchers. Profile can

be retrieved by any one if ORCID id is given. Helps in getting funding/ recruiting agency.

What is DOI federation

ORCID - open researcher and contributor id

• Thanks.

• Any Question ?

Recommended