Upload
others
View
1
Download
0
Embed Size (px)
Citation preview
Digitally connecting the scattered heritage: a Polish perspective
Marcin [email protected]
September 8, 2014
Development of digital libraries infrastructure in Poland
0
10
20
30
40
50
60
70
80
90
2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013
Increase of the number of digitallibraries between 2002 and 2013
10
1
121
15
2
3
1
4
3
111
1
1
2
1
1
1
1
1
1
1
2
1
1
1
1
1
1
1
Digital libraries in the PIONIER network
- Several hundredsinstitutions- 1.8M objects
Digital libraries in the PIONIER Network
~70 institutional digita libraries ~40 regional digital libraries
In total around 2 mln of digital objects
Name Size
1 Cyfrowa Biblioteka Narodowa Polona 308 9332 258 2803 e-‐biblioteka Uniwersytetu Warszawskiego 161 9654 46 9585 43 9896 Polska Biblioteka Internetowa 32 0717 23 5938 22 4449 Muzeum Narodowe w Warszawie 13 060
10 11 789
Name Size
1 Wielkopolska Biblioteka Cyfrowa 222 5212 105 4233 86 9184 Kujawsko-‐Pomorska Biblioteka Cyfrowa 75 6745 Biblioteka Cyfrowa -‐ 52 3766 43 8477 39 9168 Pomorska Biblioteka Cyfrowa 38 221
9 Zachodniopomorska Biblioteka Cyfrowa "Pomerania" 29 733
10 Podlaska Biblioteka Cyfrowa 28 927Average size: 15 826 objectsMedian: 1 357 objects
Average size: 24 020 objectsMedian: 9 399 objects
Digital libraries in the PIONIER Network
Regional digital librariesDevelopment of idea of regional collaboration shaped during the initiation of Wielkopolska Digital Library in 2002Allow smaller institutions to secure collections in digital form and to make them availableon-lineOptimize the use of shared IT infrastructureThey are implemented also in country scale (FIDES, RCIN) as well as in local scale (Tarnowska DL, DL, Make access to digital content easier by providing single point of access
Practice of regional digital libraries
For reader they are simply web portals giving access to collections of cultural heritage from many institutions under a single WWW addressIn practice, realized as consortia, which on the basis of knowledge exchange and collaboration, give their participants:
Access to IT infrastructure necessary to put digital collections on-lineWays to professionally preserve digital copies for long timeKnow-how allowing to prepare high resolution digital materials and metadataWide promotion of resourcesVery good conditions to acquire additional funding in common projects
Digital library of Wielkopolska popularity in 2013According to Google Analytics
Practice of regional digital libraries
Structure of collections of regional digital library often reflects complexity of the consortiumRegional collectionsThematic collectionsInstitutional collections
Regional collaboration gives many benefits, but also requires compromisesCommon metadata schemaCommon web interface
Practice of regional digital libraries
Good solution to balance collaboration and promotion of individual institutions are virtual repositories built on top of regional digital libraries
Role of regional digital libraries
Regional digital libraries are more often a basis for new information services related to the heritage of a regionThey are used as repositories of source data, making the information services more rich and trusted
DInGO software Digitise
Technical ingredient of regional digital librariesdLibra: system for digital libraries (e.g.: http://jbc.bj.uj.edu.pl/) dMuseion: system for digital museums (e.g.: http://cyfrowe.mnw.art.pl/) dLab: system for management of digitisation processesdArceo: system for long-term digital preservation
http://dingo.psnc.pl/
Digitisation process and DInGO software
Planned objects
Presentation files
MASTER files
Digitisation, standarisation
On-‐line access
Preparation of digital object
Selection of objects for digitisation
Archiving
On-‐line publishing
Promotion of regional heritage on (inter)national level
Regional consortia allow small institutions to appear on the InternetRegional digital libraries aggregate local and regional heritage in a digital formNational level access and promotion is organized on the basis of metadata aggregation from distributed sources to one central databaseThis is the responsibility of Digital Libraries Federation of the PIONIER NetworkFederation collaborates with Europena, moving these regional collections even higher, to international level
http://fbc.pionier.net.pl/
Digital Libraries Federation (DLF)http://fbc.pionier.net.pl/
Public portalSearching, browsingDigitisation plans, persistent identifiers
Data provider for external servicesEuropeana, DART-‐EuropeKaRo
Information website for DL creatorsNews, publicationsDigital libraries database
Advanced services for DL administratorsTraffic monitoringMetadata analysis module
Competence center for professionalsE-‐learning coursesQ&A platform
Who is providing data to DLF?
Hundreds of institutions from entire Poland
Digital libraries, repositories, digital museums, digital archives
What kind of objects can you find in DLF?
Based on metadata analysis, done on September 3, 2014
journal46%
article14%
book12%
photo4%
electronic document
4%
PhD thesis3%
ephemera3%
other16%
journal80%
book5%
postcard2%
oldprint2%
manuscript1%
ephemera1%
photo1%
archival document
1%
other6%
80% objects: materials created before 1945 20% objects: materials created after 1945
Increase of the number of objects in the DLF
2014 -‐ ~2 million objects
2007 public opening of DLF,
~75 thousand objects
DLF statistics
Presently: During 2013:
105 data sources
325 institutions
~2 million objects
560 thousands unique users
1,1 million visits
4,5 million views
Collaboration with EuropeanaEuropeana.eu = European Digital Library, Museum and Archive
2009 2010 2011 2012 2013
Beginning of collaboration in EuropeanaLocal
Federation connected to Europeana
Europeana API pilot program participation
Polish edition of Hack4Europe
Two more Hack4Europe contests as a part of Europeana Awareness project
Collaboration on Europeana 1989 Europeana Cloud project started
Visibility of Polish collections in Europeana
Data from http://www.europeana.eu/ (September 3, 2014)
1 090 660
1 711 099
1 766 490
2 486 594
2 655 770
2 707 656
2 975 847
3 515 861
3 650 312
3 876 048
3,3%
5,2%
5,4%
7,6%
8,1%
8,3%
9,1%
10,8%
11,2%
11,9%
10. Irlandia
9. Polska
8. Norwegia
7. Wielka Brytania
5. Szwecja
4. Hiszpania
3. Holandia
2. Niemcy
1. Francja
Top 10 countries in Europeana
1 036 395
1 062 881
1 331 865
1 381 668
1 405 903
2 005 866
2 025 754
2 103 884
2 240 932
6 368 924
3,2%
3,3%
4,1%
4,2%
4,3%
6,1%
6,2%
6,4%
6,9%
19,5%
10. CultureGrid
9. Arts Council Norway
8. Swedish Open Cultural Heritage
7. Linked Heritage
6. Federacja Bibliotek Cyfrowych
5. CARARE
4. Athena
3. OpenUp!
2. Hispana
1. The European Library
Top 10 data providers to Europeana
Public collection days and home digitisation
Community contributions
Long term preservations
Europeana and private collections How to save private collections together with their social context?
europeana1989.eu
fbc.pionier.net.pl/zbiorki
fbc.pionier.net.pl/zbiorki
Example of high value of private collections
Summarizing - Most important success factors
Regional collaborationDevelopment of digital libraries in Poland as they are at the moment was initiated as a series of regional projects, often WITHOUT any dedicated external funding
One host institution which is providing the technical infrastructureA number of partners providing content
First consortium was: Poznan Foundation of Scientific Libraries, PSNC, academic and public institutions from the Wielkopolska region http://www.wbc.poznan.pl/Such approach
Allows to lower the costs for each participating institution (in many aspects)Gives small libraries opportunity to promote their collections on-lineProvides natural platform for collaboration for next projects
Summarizing - Most important success factors
Good technical supportShared technology platform (in case of Poland: dLibra/DInGO)
Common development directionsShared development costsLack of typical risks related to project-based funding
Not maintained in-house solutionsAbandoned commercial softwareRising prices and vendor lock-in
Documentation and technical support available locallyNatural environment for development of good users community
Requires reliable technology partner with proper business model
Summarizing - Lessons learned
Bottom-up approach made all that possible Did I forget to mention any central institutions in my presentation?
Some things were not standardized initially on central levelcreated in many places in parallel
40+ variatons of Dublin CoreOther solutions were blindly copied, while they could be tailored to specific local needs
The curse of DjVu format popularity
Most important challenges
Quality in mass digitization projectsHow to check within a month the quality of what a commercial company was preparing for 6-8 months?How to eliminate cheating companies and not cancel the project?
Long-term digital preservationHow to make sure that results of hundreds of digitisation projects are properly secured for the future?
Most important challenges
Data interoperabilityHow to make sure that newly developed small systems follow best digital libraries practices?How to use data automatically with tools for digital humanities researchers?
Open access to data and proper rights labellingMetadata copyrighted or not?
Europeana requires CC0 statementContent
Is digitisation a creative process?Can commercial reuse of public domain materials be free?
Coordination of Europeana-related effortsAssuring proper representation of Polish heritage
Cloud technologies in the cultural sector
Cloud servicesRemote support and education
Europeana
Mapping
Aggregation
Enrichment
DLaaS
Small libraries
Private archives
Home museums
Wide access
Local memory institutions
Small institutions: LoCloud http://locloud.eu/
Cloud technologies in the cultural sector
LoCloud Collections Digital Library Service in a cloud
https://locloud.pl/The service is now open and available for testing1.0 version is planned for January 2015Until the end of 2015 the service is free, after that time it must becomeself sustainable
Cloud technologies in the cultural sectorEuropean infrastructure: Europeana Cloud
The EuropeanLibrary
Digital Libraries Federation
EU-‐Screen
The EuropeanLibrary
Digital Libraries Federation
EU-‐ScreenPortal Europeana
Europeana Research
vs
http://pro.europeana.eu/web/europeana-‐cloud
IMPACT European Center of Competence
IMPACT
Co
Cin Digitsation
Tools
Data
Services
Trainings
Fou
nd
ing
me
mb
ers
Shared infrastructure for digital libraries competence centers
Optimization of resources usage in digitisation processes
Standardization of data and tools
Prizes, contests, events
Best practicies
http://digitisation.eu/
Virtual Transcription Laboratory
Virtual Transcription Laboratory(http://wlt.synat.pcss.pl) offers:
A free tool supporting creation of textual versions of historical documentsDedicated OCR service for all VTL usersCrowdsourcing platform allowing to collaborate while creating transcriptions of digitized documents
Examples of projects in VTLhttp://wlt.synat.pcss.pl
Books, old-
OCR training tool for profiling with historical documents
http://wlt.synat.pcss.pl/cutouts
OCR training tool
Thank you for your attention!Marcin Werla ([email protected])
http://dl.psnc.pl/
Supercomputing and Networking Center
ul. Noskowskiego 12/14, 61-Office: phone center: (+48 61) 858-20-00, fax: (+48 61) 852-59-54,
e-mail: [email protected], http://www.psnc.pl
affiliated to the Institute of Bioorganic Chemistry of the Polish Academy of Sciences,