« Bibliothèques Virtuelles Humanistes » in Tours:
Digital library of Image, Text and Data
TEI Meeting, Wien, September 30th, 2016
Sandrine Breuil, Mathieu Duboc, Guillaume Porte, Toshinori Uetani
http://www.bvh.univ-tours.fr
I. BVH and its workflows
1. BVH
2. Main goal
3. Two workflows
4. Latest development: Bordeaux copy of the Essais of Montaigne
II. Bibliotheques françoises
1. Project
2. Encoding principles
3. Two working hypotheses and our choice
4. Output
5. Conclusion
2 databases: Facsimiles and Electronic editions
• Facsimiles
  • About 1400 digitized documents, 1100 titles online (September 2016)
  • In JPEG with browsing tools; different PDF versions (binary, colour, MRC)
  • Extraction and indexing of 23 000 illustrations
  • BaTyR: database of more than 30 000 typographical materials (ornamental letters, portraits, pilcrows, etc.)
• Electronic editions
  • 153 editions encoded in XML-TEI
  • 8 plain-text transcriptions, displayed in HTML only
BVH Workflow(s): Work in Progress
[Workflow diagram: similarities between Facsimile & Text. Labels: Document; Image digitization; Images; Extracted images; Description; Notices database; Display: Facsimile; Text transcription; TEI encoding: structure; TEI encoding: semantic; TEI header; TEI Document; Summary database; Display: Epistemon.]
BVH Workflow(s): Work in Progress
[Workflow diagram: Facsimile & Text linked by enriched metadata. Labels: Document; Image digitization; Images; Extracted images; Description; Notices database; BaTyR database; Display: Facsimile; Text transcription; TEI encoding: structure; TEI encoding: semantic; TEI header; TEI Document; Summary database; Display: Epistemon.]
Source: BVH / Médiathèque Equinoxe, Châteauroux
BVH Workflow(s): Pantagruel, Rabelais, 1542 – f. 5
Indexing: Facsimile versus Text
BVH Workflow(s): Work in Progress
[Workflow diagram: interconnections between Facsimile & Text. Labels: Document; Image digitization; Images; Extracted images; Description; Notices database; Display: Facsimile; Text transcription; TEI encoding: structure; TEI encoding: semantic; TEI header; TEI Document; Summary database; Display: Epistemon.]
« L’Exemplaire de Bordeaux »: Essais, Montaigne, 1588 – Annotated by Montaigne himself
Exemplaire de Bordeaux, Livre I, f. 2v. – Source : Bibliothèque de Bordeaux
Visualisation : Book I, chapter II
« L’Exemplaire de Bordeaux »: Essais, Montaigne, 1588 – Annotated by Montaigne himself
Exploiting the XML-TEI encoding with TXM: quantifying the punctuation corrections
Bibliotheques françoises of La Croix du Maine and of Du Verdier
Source: Bibliothèque municipale de Lyon
Source: München, Bayerische Staatsbibliothek
Bibliotheques françoises of La Croix du Maine and of Du Verdier
► Le premier volume de la Bibliotheque du sieur de La Croix du Maine. – Paris : Abel L’Angelier, 1584. – [44], 1-558, [5, 1 bl.] p. ; in-fol.
– 2194 records of French authors
► La Bibliotheque d’Antoine Du Verdier, seigneur de Vauprivas. – Lyon : Jean d’Ogerolles pour Barthélemy Honorat, 1585. – i-xxviii, 1-1233, [1, 1 bl., 1] p. ; in-fol.
– About 2300 author records (20 % : non-French authors)
Premier volume de la bibliotheque de François Grudé, sieur de La Croix du Maine
Source: Bibliothèque municipale de Lyon
Bibliotheques françoises: Hasty publication (La Croix du Maine)
► Numerous corrigenda in the text
► Long list of addenda
► Corrections by the printer’s workshop (papillons, manuscript corrections, typeset corrections, …)
Source: Médiathèque Louis Aragon du Mans
Bibliotheques françoises: Encoding principles (from body to notices)
• Body
<body xml:id="B693836101_108467_body">
  <div>
    <!-- main part of the book -->
    <div part="I" type="partie" xml:id="B693836101_108467_partie_I"></div>
    <!-- addenda -->
    <div part="F" type="partie" xml:id="B693836101_108467_partie_F"></div>
    <!-- general trailer -->
  </div>
</body>
Source: Bibliothèque municipale de Lyon
Bibliotheques françoises: Encoding principles (from body to notices)
• Parts and chapters
<div part="I" type="partie" xml:id="B693836101_108467_partie_I">
  <!-- part or general head -->
  <div n="1" type="chapitre" xml:id="B693836101_108467_A">
    <!-- head: real, or restored in square brackets -->
    <head rend="hidden">[A]</head>
    <div type="notice" xml:id="lcdm-not_0001"></div>
    <div type="notice" xml:id="lcdm-not_0002"></div>
  </div>
  <div n="2" type="chapitre" xml:id="B693836101_108467_B">
    <head>B.</head>
    <!-- etc. -->
  </div>
  <!-- part trailer -->
</div>
Source: Bibliothèque municipale de Lyon
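The part / chapter / notice nesting above can be walked with a few lines of Python. This is only a sketch: it assumes the fragment is parsed without the TEI namespace, and the function name `notices_by_chapter` is illustrative, not part of the BVH tooling.

```python
import xml.etree.ElementTree as ET

# Fully qualified name under which ElementTree exposes the xml:id attribute.
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"

# Fragment copied from the slide (no TEI namespace declared: an assumption).
SAMPLE = """
<div part="I" type="partie" xml:id="B693836101_108467_partie_I">
  <div n="1" type="chapitre" xml:id="B693836101_108467_A">
    <head rend="hidden">[A]</head>
    <div type="notice" xml:id="lcdm-not_0001"></div>
    <div type="notice" xml:id="lcdm-not_0002"></div>
  </div>
  <div n="2" type="chapitre" xml:id="B693836101_108467_B">
    <head>B.</head>
  </div>
</div>
"""


def notices_by_chapter(part):
    """Map each chapter head to the xml:id values of its notices."""
    result = {}
    for chapter in part.findall("div[@type='chapitre']"):
        head = chapter.findtext("head", default="").strip()
        result[head] = [
            notice.get(XML_ID)
            for notice in chapter.findall("div[@type='notice']")
        ]
    return result


index = notices_by_chapter(ET.fromstring(SAMPLE))
print(index)
```

Because every notice carries its own `xml:id`, such an index is enough to jump from a chapter letter to individual notices.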
Bibliotheques françoises: Encoding principles (from body to notices)
• Notices
<div type="notice" xml:id="lcdm-not_0002">
  <head rend="hidden">[ABEL FOULON]</head>
  <ab>
    <milestone ana="start" unit="section"/>
    ABEL <choice><orig>FOVLON</orig><reg>FOULON</reg></choice> natif de la paroisse de Loüé au<lb/>
    <!-- encoded text --><lb/>
    <!-- encoded text -->
    <milestone ana="end" unit="section"/>
    <milestone ana="start" unit="section"/>
    <!-- etc. -->
    <milestone ana="end" unit="section"/>
  </ab>
</div>
Source: Bibliothèque municipale de Lyon
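The paired start/end milestones delimit sections inside a notice. As a rough sketch of how they could be exploited, the Python below cuts an `<ab>` into sections and prefers the `<reg>` reading of each `<choice>`. It assumes the milestones are direct children of `<ab>` and that no TEI namespace is declared; the second section's text is a placeholder, and the function name `sections` is hypothetical.

```python
import xml.etree.ElementTree as ET

# First section copied from the slide; the second section is a placeholder.
SAMPLE = (
    '<ab><milestone ana="start" unit="section"/>'
    "ABEL <choice><orig>FOVLON</orig><reg>FOULON</reg></choice>"
    " natif de la paroisse de Loüé"
    '<milestone ana="end" unit="section"/>'
    '<milestone ana="start" unit="section"/>'
    "(placeholder text)"
    '<milestone ana="end" unit="section"/></ab>'
)


def sections(ab):
    """Return the regularized text of each milestone-delimited section."""
    out, current = [], None
    for child in ab:
        is_milestone = child.tag == "milestone"
        if is_milestone and child.get("ana") == "start":
            # A new section opens; the text following the milestone is its start.
            current = [child.tail or ""]
        elif is_milestone and child.get("ana") == "end":
            if current is not None:
                # Join the pieces and normalize whitespace.
                out.append(" ".join("".join(current).split()))
            current = None
        elif current is not None:
            if child.tag == "choice":
                # Keep the regularized reading only.
                current.append(child.findtext("reg", default=""))
            else:
                current.append("".join(child.itertext()))
            current.append(child.tail or "")
    return out


result = sections(ET.fromstring(SAMPLE))
print(result)
```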
Bibliotheques françoises: <person> element as pattern
Children of <person> element | Chosen elements for notice
<persName> | <persName>
<roleName> | <roleName>
<occupation> | <rs type="occupation">
<birth/death/floruit when=""> | <date type="birth/death/floruit" when="">
<birth/death/floruit><placeName/></birth> | <rs type="birth/death/floruit"><placeName/></rs>
<langKnowledge> | <rs type="langKnowledge">

Elements | Attributes
<persName> | @ref => person’s id; @type="ved" if it is the main name of the notice
<roleName> / <rs type="occupation"> / <langKnowledge> | @ref => function’s or language’s id; @corresp => person’s id
<date> | @type => birth, death, floruit; @when/@notBefore/@notAfter => ISO date; @precision="low" if the given date is not certain; @corresp => person’s id
<rs type="birth/death/floruit"> | @corresp => person’s id
Bibliotheques françoises: use of @corresp for biographical information
ADAM <choice><orig>VVANDELAND</orig><reg>WANDELAND</reg></choice>, natif de la ville d'Angers, fils de feu Gilbert <choice><orig>Vvandeland</orig><reg>Wandeland</reg></choice>, Suisse de nation, tous deux excellents <rs type="occupation" ref="#pei" corresp="#bvhbibfr_01089 #bvhbibfr_03017">peintres</rs>
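To illustrate what the space-separated @corresp buys, here is a minimal Python sketch that reads the example above and attaches the occupation in @ref to every person id in @corresp. The wrapping `<ab>` element and the helper names (`pointers`, `occupations_by_person`) are assumptions made for the example, not the project's actual code.

```python
import xml.etree.ElementTree as ET

# Example from the slide, wrapped in <ab> so that it parses as one element.
SAMPLE = (
    "<ab>ADAM <choice><orig>VVANDELAND</orig><reg>WANDELAND</reg></choice>, "
    "natif de la ville d'Angers, fils de feu Gilbert "
    "<choice><orig>Vvandeland</orig><reg>Wandeland</reg></choice>, "
    "Suisse de nation, tous deux excellents "
    '<rs type="occupation" ref="#pei" '
    'corresp="#bvhbibfr_01089 #bvhbibfr_03017">peintres</rs></ab>'
)


def pointers(value):
    """Split a space-separated pointer list and drop the leading '#'."""
    return [p.lstrip("#") for p in (value or "").split()]


def occupations_by_person(root):
    """Attach each occupation id (@ref) to every person id in @corresp."""
    table = {}
    for rs in root.iter("rs"):
        if rs.get("type") != "occupation":
            continue
        for occupation in pointers(rs.get("ref")):
            for person in pointers(rs.get("corresp")):
                table.setdefault(person, []).append(occupation)
    return table


occupations = occupations_by_person(ET.fromstring(SAMPLE))
print(occupations)
```

A single `<rs>` thus records that both Adam and Gilbert Wandeland were painters, without repeating the markup.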
Bibliotheques françoises: bibliographical information, database and text area
1. First orientation: adding metadata (without <orig><reg>)
<bibl corresp="#LCDM-bibl_5896">
  <author ref="#bvhbibfr_00155"/>
  <title>De la fortune &amp; vertu de France, <choice><orig>auec</orig><reg>avec</reg></choice> <choice><orig>vn</orig><reg>un</reg></choice> sommaire discours, sur le<lb/> dessein de l'histoire de France</title>, imprimé à <pubPlace ref="#par">Paris</pubPlace> chez <publisher role="imp"><persName ref="#bvhbibfr_02626">l'Huillier</persName></publisher> l'an <lb/><date type="pubDate">1570</date>.
</bibl>
Bibliotheques françoises: From the database by Mark Greengrass to the Bibfr database
► 7256 bibliographical records from La Croix du Maine and Du Verdier
Bibliotheques françoises: bibliographical information, database and text area
2. Final choice
<bibl corresp="#lcdm-bibl_5319z">Il a escrit en vers François, <title>le triomphe du temps qui court</title>, imprimé l'an 1532.</bibl>
Id_bibl | Id_author | title | pubplace | Publisher_id | date
lcdm-bibl_5319z | bvhbibfr_xxxxxx | le triomphe du temps qui court | s.l. | s.n. | 1532
Bibliotheques françoises: bibliographical information, database and text area
2.1. Multi-imprint case
<bibl corresp="#lcdm-bibl_7242z #lcdm-bibl_7242a">Il a descrit le <title>plan, ou portraict de la ville, Cité &amp; Vniuersité d'Angers</title> imprimé l'an 1575. à Paris chez <persName>Nicolas Chesneau</persName> &amp; autres.</bibl>
Id_bibl | Id_author | title | pubplace | Publisher_id | date
lcdm-bibl_7242z | bvhbibfr_xxxxxx | plan, ou portraict de la ville, Cité & Vniuersité d'Angers | Paris | bvhbibfr_xxxxxx | 1575
lcdm-bibl_7242a | bvhbibfr_xxxxxx | plan, ou portraict de la ville, Cité & Vniuersité d'Angers | s.l. | s.n. | s.d.
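The link between the text area and the database can be sketched as a simple join on the identifiers in @corresp. The Python below is an illustration under assumptions: the `RECORDS` dictionary stands in for the Bibfr database and holds only the two columns shown on the slide, and `imprints` is a hypothetical helper name.

```python
import xml.etree.ElementTree as ET

# Stand-in for the Bibfr records of the slide (place and date columns only).
RECORDS = {
    "lcdm-bibl_7242z": {"pubplace": "Paris", "date": "1575"},
    "lcdm-bibl_7242a": {"pubplace": "s.l.", "date": "s.d."},
}

# <bibl> from the slide, with '&' escaped so that it is well-formed XML.
BIBL = (
    '<bibl corresp="#lcdm-bibl_7242z #lcdm-bibl_7242a">Il a descrit le '
    "<title>plan, ou portraict de la ville, Cité &amp; Vniuersité d'Angers"
    "</title> imprimé l'an 1575. à Paris chez "
    "<persName>Nicolas Chesneau</persName> &amp; autres.</bibl>"
)


def imprints(bibl, records):
    """Resolve each pointer in @corresp to its (id, place, date) record."""
    ids = [p.lstrip("#") for p in bibl.get("corresp", "").split()]
    return [
        (i, records[i]["pubplace"], records[i]["date"])
        for i in ids
        if i in records
    ]


found = imprints(ET.fromstring(BIBL), RECORDS)
print(found)
```

One `<bibl>` in the text therefore maps to two imprint records: the Paris edition of 1575 and the unspecified one implied by "& autres".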
Bibliotheques françoises: bibliographical information, database and text area
2.2. Multi-title case
<bibl>Il a escrit vn <bibl ref="#lcdm-bibl_6212z #lcdm-bibl_6213z"><title>traicté, des admirables vertuz de l'or potable</title>.</bibl> <bibl ref="#lcdm-bibl_6212a #lcdm-bibl_6213a"><title>L'apologie de la science d'Alchimie</title>.</bibl> Ausquels liures <persName>Iaques Gohorry</persName> Parisien a fait response. Ces liures susdits ont esté imprimez à Lyon, &amp; depuis à Paris l'an 1575. chez <persName>Ian de Lastre</persName>.</bibl>
Id_bibl | Id_author | title | pubplace | Publisher_id | date
lcdm-bibl_6212z | bvhbibfr_xxxxxx | traicté, des admirables vertuz de l'or potable | Lyon | bvhbibfr_xxxxxx | s.d.
lcdm-bibl_6213z | bvhbibfr_xxxxxx | traicté, des admirables vertuz de l'or potable | Paris | bvhbibfr_xxxxxx | 1575
lcdm-bibl_6212a | bvhbibfr_xxxxxx | L'apologie de la science d'Alchimie | Lyon | bvhbibfr_xxxxxx | s.d.
lcdm-bibl_6213a | bvhbibfr_xxxxxx | L'apologie de la science d'Alchimie | Paris | bvhbibfr_xxxxxx | 1575
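In the multi-title case each nested `<bibl>` carries its own pointers, so titles and records can be paired without looking at the shared imprint statement. A minimal Python sketch, assuming the nested elements use @ref as on the slide and that no TEI namespace is declared; `titles_with_records` is an illustrative name.

```python
import xml.etree.ElementTree as ET

# Example from the slide, with closing tags and '&' made well-formed.
SAMPLE = (
    "<bibl>Il a escrit vn "
    '<bibl ref="#lcdm-bibl_6212z #lcdm-bibl_6213z">'
    "<title>traicté, des admirables vertuz de l'or potable</title>.</bibl> "
    '<bibl ref="#lcdm-bibl_6212a #lcdm-bibl_6213a">'
    "<title>L'apologie de la science d'Alchimie</title>.</bibl> "
    "Ausquels liures <persName>Iaques Gohorry</persName> Parisien a fait "
    "response. Ces liures susdits ont esté imprimez à Lyon, &amp; depuis à "
    "Paris l'an 1575. chez <persName>Ian de Lastre</persName>.</bibl>"
)


def titles_with_records(outer):
    """Pair each nested title with the record ids listed in its @ref."""
    pairs = []
    for inner in outer.findall("bibl"):
        title = " ".join((inner.findtext("title") or "").split())
        ids = [p.lstrip("#") for p in inner.get("ref", "").split()]
        pairs.append((title, ids))
    return pairs


pairs = titles_with_records(ET.fromstring(SAMPLE))
for title, ids in pairs:
    print(title, ids)
```

Each title resolves to two records, one per imprint (Lyon, then Paris), which matches the four rows of the table above.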
Bibliotheques françoises website: Global operation
• User interface
• Application: JavaScript + PHP
• RESTXQ requests
• BaseX (XML database)
• XML-TEI documents: La Croix du Maine, Du Verdier
• XSL files
• Authority files: persons, bibliographical items
Bibliotheques françoises website
1. Searching for a name
2. Displaying a person’s information
3. Showing the notices attached to the person
4. Displaying bibliographical information
5. Navigating through the original text, one notice after the other
BVH Workflow(s): Work in Progress
[Workflow diagram: Facsimile & Text linked by enriched metadata. Labels: Document; Image digitization; Images; Extracted images; Description; Notices database; BaTyR database; Bibfr database; Display: Facsimile; Text transcription; TEI encoding: structure; TEI encoding: semantic; TEI header; TEI Document; Summary database; Display: Epistemon.]