Mapping Bibliographic Records with Bibliographic Hash Keys

  • Upload
    jakob-

  • View
    988

  • Download
    2

Embed Size (px)

Citation preview

Mapping Bibliographic Records with Bibliographic Hash Keys

Mapping Bibliographic Records with
Bibliographic Hash Keys

Jakob Vo
Verbundzentrale des GBV (VZG)

11. Internationales Symposium fr Informationswissenschaft (ISI 2009)

the problem

citation

record

data

document

same?

004A $00262025388011@ $a2003021A $aA @history of online information services, 1963 - 1976028A $dCharles P.$aBourne$9383311691028B/01 $dTrudi Bellardo$aHahn$9271366931033A $pCambridge, Mass. [u.a.]$nMIT Press034D $aXVI, 493 S

Bourne, C.P.; Hahn, T.B. (2003): A History of Online Information Services, 1963-1976. Cambridge, Mass.; London: MIT Press.

@book{Bourne2003, title={A History of Online Information Services, 1963-1976}, author={T.B. Hahn and C.P. Bourne}, year={2003},

Charles P. Bourne and Trudi Bellardo Hahn, A History of Online Information Services, 1963-1976 (Cambridge: MIT Press, 2003),

examples

solutions

persistent identifiers

duplicate detection

fingerprints

persistent identifiers

duplicate detection

multiple

comparisons

fingerprints (bibkey)

specification of bibkey

selection of fields

normalization

concatenating

prefix and hashing

004A $00262025388011@ $a2003021A $aA @history of online information services, 1963 - 1976028A $dCharles P.$aBourne$9383311691028B/01 $dTrudi Bellardo$aHahn$9271366931033A $pCambridge, Mass. [u.a.]$nMIT Press034D $aXVI, 493 S

Bourne, C.P.; Hahn, T.B. (2003): A History of Online Information Services, 1963-1976. Cambridge, Mass.; London: MIT Press.

@book{Bourne2003, title={A History of Online Information Services, 1963-1976}, author={T.B. Hahn and C.P. Bourne}, year={2003},

Charles P. Bourne and Trudi Bellardo Hahn, A History of Online Information Services, 1963-1976 (Cambridge: MIT Press, 2003),

selection of fields

authors/editorstitleyear

normalization

lowercase, letters, digits

Charles P. Bournec.bourneTrudi Bellardo Hahnt.hahn

unicode normalization form compatibility composition (NFKC)

composing the hash

bibkey level 114ed100f75dd4459cffeb272bdbc2d1e7

title

authors

year

sort andjoin by ,

prefix 1 and MD5 hash

bibkey level 0ahistoryofonlineinformationservices19631976 [t.hahn,c.bourne] 2003

usage in BibSonomy

aggregatedtag cloud

independentbibliographicrecords

bibkey

usage in web services

record

bibkey

response

service

challenges

determining

all authors

the full title

the year

problematic works

no authors

standard titles
(Introduction, News...)

unicode normalization

icons based on

www.gbv.de/wikis/cls/bibkey

Click to edit the title text format

Click to edit the outline text format

Second Outline Level

Third Outline Level

Fourth Outline Level

Fifth Outline Level

Sixth Outline Level

Seventh Outline Level

Eighth Outline Level

Ninth Outline Level

VZG

Bibliographic Hash Keys

Click to edit the title text format

VZG

Click to edit the outline text format

Second Outline Level

Third Outline Level

Fourth Outline Level

Fifth Outline Level

Sixth Outline Level

Seventh Outline Level

Eighth Outline Level

Ninth Outline Level