38
Reusing search statistics to Reusing search statistics to enhance the use of library enhance the use of library services services Morag Watson Morag Watson Digital Library Development Manager Digital Library Development Manager University of Edinburgh University of Edinburgh

Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data

Reusing search statistics to Reusing search statistics to

enhance the use of library enhance the use of library

servicesservices

Morag WatsonMorag Watson

Digital Library Development ManagerDigital Library Development Manager

University of EdinburghUniversity of Edinburgh

Page 2: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data

StructureStructure

�� BackgroundBackground

–– ChallengeChallenge

�� ProcessProcess

�� TechnologyTechnology

�� ToolsTools

�� DeliverablesDeliverables

Page 3: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data
Page 4: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data
Page 5: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data
Page 6: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data

ChallengeChallenge

�� How can we use this technology to How can we use this technology to

encourage use of library services? encourage use of library services?

�� How can we use this technology to present How can we use this technology to present

library services attractively?library services attractively?

Page 7: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data

Initial ThoughtsInitial Thoughts

�� WebcamsWebcams–– Views of the campusViews of the campus

–– Views of the cityViews of the city

�� ImagesImages–– Unique Items from Library collectionsUnique Items from Library collections

�� VideoVideo–– GraduationGraduation

–– Art ExhibitionsArt Exhibitions

�� and word cloudsand word clouds

Page 8: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data

Word Clouds (1)Word Clouds (1)

�� Library Web SiteLibrary Web Site

–– Generate from top level pagesGenerate from top level pages

–– Regularly UpdatedRegularly Updated

�� TechnologyTechnology

–– WordleWordle

�� RSS / Atom FeedRSS / Atom Feed

�� Flat text fileFlat text file

�� Del.icio.usDel.icio.us tagstags

Page 9: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data
Page 10: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data

Word Clouds (1)Word Clouds (1)

�� Library Web SiteLibrary Web Site

–– Generate from top level pagesGenerate from top level pages

–– Regularly UpdatedRegularly Updated

�� TechnologyTechnology

–– WordleWordle

�� Use restrictionsUse restrictions

�� Software StatusSoftware Status

Page 11: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data

Word Clouds (2)Word Clouds (2)

�� Other ToolsOther Tools

–– MakecloudMakecloud

–– TagCloudTagCloud

–– TagCrowdTagCrowd

–– Etc. etc. Etc. etc.

Page 12: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data
Page 13: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data
Page 14: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data
Page 15: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data

Time for ReviewTime for Review

�� RSS / Atom feedRSS / Atom feed

–– Only gave you word cloud based on Only gave you word cloud based on ‘‘newsnews’’

�� ‘‘ScrapingScraping’’ a web pagea web page

–– Gives you that web pageGives you that web page’’s contents content

�� Loading FileLoading File

–– Snapshot of data at that point in timeSnapshot of data at that point in time

Page 16: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data

Time for ReviewTime for Review

�� WidgetsWidgets

–– Embedded on web pagesEmbedded on web pages

–– PerformancePerformance

–– Reliability / SustainabilityReliability / Sustainability

�� PresentationPresentation

–– Nothing beats Nothing beats wordlewordle

Page 17: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data
Page 18: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data

New VisionNew Vision

�� Install open source word cloud toolInstall open source word cloud tool

�� Build our own word cloud toolBuild our own word cloud tool

�� Possible TechnologiesPossible Technologies

–– PHPPHP

–– JavaScriptJavaScript

–– HTMLHTML

–– JPGJPG

Page 19: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data

New VisionNew Vision

�� Use search statistics from library services to Use search statistics from library services to

drive word cloudsdrive word clouds

–– Library CatalogueLibrary Catalogue

�� VoyagerVoyager

–– Institutional RepositoryInstitutional Repository

�� DSpaceDSpace

�� Edinburgh Research ArchiveEdinburgh Research Archive

Page 20: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data

Voyager DataVoyager Data

�� OpacOpac search log tablessearch log tables

–– Turn Turn opacopac logging onlogging on

–– Extract of data using Extract of data using sqlsql

Page 21: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data

Raw Voyager Search DataRaw Voyager Search Data

Page 22: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data

VoyagerVoyager

�� OpacOpac search log tablessearch log tables–– Lots of data Lots of data

–– Lots of cleanupLots of cleanup�� DatesDates

�� Access methodAccess method–– WebopacWebopac

–– Z39.50Z39.50

�� zzzzzzzzzzzzzz

�� Search indexesSearch indexes–– TALL,NALL,TKEY,NKEY,JALL,CALL, ISBN,ISSNTALL,NALL,TKEY,NKEY,JALL,CALL, ISBN,ISSN

�� Boolean SearchBoolean Search–– AND , ORAND , OR

�� Other CharactersOther Characters–– , (,),", , (,),",

Page 23: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data

‘‘cleanedcleaned’’ voyager datavoyager data

Page 24: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data

Voyager Word CloudVoyager Word Cloud

�� AutomationAutomation

–– CronCron job to extract data from job to extract data from opacopac search logssearch logs

–– CronCron job to do data clean upjob to do data clean up

–– Transfer to server running PHP word cloud Transfer to server running PHP word cloud

generatorgenerator

–– Generation of word cloudGeneration of word cloud

–– And refresh / restart And refresh / restart

Page 25: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data

TagCrowdTagCrowd Word CloudWord Cloud

Page 26: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data

Voyager Word Cloud Voyager Word Cloud

�� Dynamic Word CloudsDynamic Word Clouds

–– extracts data directly from Oracle tablesextracts data directly from Oracle tables�� Last 2000 itemsLast 2000 items

–– clean the dataclean the data

–– build the word clouds build the word clouds

–– handle presentation as HTMLhandle presentation as HTML

–– Word cloud refreshed every time page is Word cloud refreshed every time page is reloadedreloaded

–– Word cloud always pointing at real time dataWord cloud always pointing at real time data

Page 27: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data

PHPPHP

Page 28: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data

Voyager Word CloudVoyager Word Cloud

Page 29: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data

Voyager Word Cloud (2)Voyager Word Cloud (2)

Page 30: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data

DSpaceDSpace / Edinburgh Research / Edinburgh Research

ArchiveArchive

�� Modified database to collect search termsModified database to collect search terms

�� Data less dynamicData less dynamic

�� Different types of searchingDifferent types of searching

�� Data clean up Data clean up

–– Search syntaxSearch syntax

–– Robots Robots

–– OAI harvestingOAI harvesting

Page 31: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data

DSpaceDSpace Search DataSearch Data

Page 32: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data

JavaJava

Page 33: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data

ERA Home PageERA Home Page

Page 34: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data

DSpaceDSpace Word CloudWord Cloud

Page 35: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data

HoloproHolopro plus Word Cloudplus Word Cloud

Page 36: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data

What next?What next?

�� Integrated with Library servicesIntegrated with Library services

–– Library websiteLibrary website

–– Library CatalogueLibrary Catalogue

–– Edinburgh Research ArchiveEdinburgh Research Archive

�� More Word CloudsMore Word Clouds

–– AquabrowserAquabrowser

–– WebfeatWebfeat

Page 37: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data

What next?What next?

�� EnhancementEnhancement

–– Clickable search termsClickable search terms

–– More admin optionsMore admin options

–– Browser supportBrowser support

�� Service ComparisonService Comparison

–– AquabrowserAquabrowser

–– Library CatalogueLibrary Catalogue

Page 38: Reusing search statistics to enhance the use of library ......Voyager Word Cloud Dynamic Word Clouds – extracts data directly from Oracle tables Last 2000 items – clean the data

Reusing search statistics to Reusing search statistics to

enhance the use of library enhance the use of library

servicesservices

Morag WatsonMorag Watson

Digital Library Development ManagerDigital Library Development Manager

University of EdinburghUniversity of Edinburgh

[email protected]@ed.ac.uk