Open Government and the Semantic Web: a field report
Enriching government gazette notices with knowledge graphs
Guido van der Wolk | Taxonic
SEMANTiCS 2019, Karlsruhe
Guido van der Wolk (PhD), Software Engineer, Utrecht, The Netherlands
• Model-driven development
• Data science
• Machine learning
IT consultancy company:
• Linked Data
• Dynamic Case Management
• ICT services
Partners with TopQuadrant, PoolParty, MarkLogic, Franz Inc., Pega and GO FAIR
Custom open government
• New Dutch legislation (starting 01-01-2021)
  – Every government organization must publish its notices in the online official gazette
  – Every 18+ citizen will receive customizable (frequency, geo-range, topics) e-mail notices about publications near their home address
Source: Dossier Bekendmakingswet, https://zoek.officielebekendmakingen.nl/dossier/35218
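
The customizable notices described above could be matched against publications roughly as follows. This is a minimal sketch, not the gazette's implementation: the `Subscription` and `Publication` classes, their field names, and the use of a great-circle radius as the geo-range are assumptions made for illustration. The frequency setting is left out.

```python
import math
from dataclasses import dataclass, field


@dataclass
class Subscription:
    # Hypothetical model of a citizen's notification preferences.
    home_lat: float
    home_lon: float
    radius_km: float                          # geo-range around the home address
    topics: set = field(default_factory=set)  # empty set means "all topics"


@dataclass
class Publication:
    # Hypothetical, simplified publication record.
    title: str
    topic: str
    lat: float
    lon: float


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two WGS84 points."""
    r = 6371.0  # mean Earth radius in km
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dp = p2 - p1
    dl = math.radians(lon2 - lon1)
    a = math.sin(dp / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dl / 2) ** 2
    return 2 * r * math.asin(math.sqrt(a))


def matches(sub, pub):
    """True if the publication is within range and on a subscribed topic."""
    distance = haversine_km(sub.home_lat, sub.home_lon, pub.lat, pub.lon)
    in_range = distance <= sub.radius_km
    on_topic = not sub.topics or pub.topic in sub.topics
    return in_range and on_topic
```

A radius is only one possible geo-range; a district or street filter would replace the distance test with a membership test on the publication's area code.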
Online official gazette of The Netherlands
• 300,000+ publications a year
• Official source since July 1, 2009
• Centralized government
  – state, parliament, treaties
• Decentralized government
  – provinces (12), municipalities (355), water boards (21)
• XML, HTML, PDF, ODT, Metadata (XML)
• Search platform
• Geo-based e-mail subscription platform
• Open data (CC-0 license): search and retrieve via URL (SRU) interface
• Web address: https://officielebekendmakingen.nl/
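
As an illustration of the SRU interface mentioned above, the sketch below builds a searchRetrieve request URL from the standard SRU 1.2 parameters. The endpoint is a placeholder and the CQL index name `dc.title` is an assumption; the gazette's own documentation gives the real endpoint and supported indexes.

```python
from urllib.parse import urlencode

# Assumption: placeholder endpoint, not the gazette's real SRU address.
SRU_ENDPOINT = "https://example.org/sru/Search"


def build_sru_url(cql_query, start_record=1, maximum_records=10):
    """Build an SRU searchRetrieve URL using the standard SRU 1.2 parameters."""
    params = {
        "version": "1.2",
        "operation": "searchRetrieve",
        "query": cql_query,                 # CQL query string
        "startRecord": start_record,        # 1-based offset, used for paging
        "maximumRecords": maximum_records,  # page size
    }
    return SRU_ENDPOINT + "?" + urlencode(params)


# Hypothetical query: the index name depends on the server configuration.
url = build_sru_url('dc.title any "omgevingsvergunning"', maximum_records=5)
print(url)
```

Paging through a result set means repeating the request with `startRecord` advanced by `maximumRecords` each time.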
Government publications in the neighbourhood
https://geozet.koop.overheid.nl/overuwbuurt/onl/
FAIR data
FAIR Data aims to support existing communities in their attempt to enable valuable scientific data and knowledge to be published and utilized in a ‘FAIR’ manner.
• Findable: (meta)data is uniquely and persistently identifiable. Should have basic machine-readable descriptive metadata.
• Accessible: data is reachable and accessible by humans and machines using standard formats and protocols.
• Interoperable: (meta)data is machine readable and annotated with resolvable vocabularies/ontologies.
• Reusable: (meta)data is sufficiently well-described to allow (semi)automated integration with other compatible data sources.
http://www.nature.com/articles/sdata201618
Enriching government gazette notices with knowledge graphs: a data hub
• Retrieve data and metadata
• Data wrangling
• Define semantic model
• Make data linkable
• Deploy enriched data/metadata
• Combine with other data
• Query combined data
• Monitoring and visualization
Data wrangling done FAIR
• Understanding the data
• Unique and persistent
• Data domain and range
• Unravel concatenation
• Merging fields
• Provenance
• XML to Turtle
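
The XML-to-Turtle step can be sketched with the Python standard library alone. The sample record, the element names, the base URI and the mapping to Dublin Core terms are all hypothetical; the real gazette metadata schema is richer and differs in detail.

```python
import xml.etree.ElementTree as ET

# Hypothetical metadata record; the real gazette schema differs.
SAMPLE_XML = """
<metadata>
  <identifier>example-2019-1</identifier>
  <title>Omgevingsvergunning Dorpsstraat 1</title>
  <creator>Gemeente Voorbeeld</creator>
</metadata>
"""

# Assumed mapping from XML element names to Dublin Core terms.
FIELD_TO_PROPERTY = {
    "title": "dcterms:title",
    "creator": "dcterms:creator",
}


def escape_literal(text):
    """Escape backslashes, double quotes and newlines for a Turtle string."""
    return text.replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")


def xml_to_turtle(xml_text, base="https://example.org/id/publication/"):
    """Convert one metadata record into a small Turtle document."""
    root = ET.fromstring(xml_text)
    identifier = root.findtext("identifier").strip()
    lines = [
        "@prefix dcterms: <http://purl.org/dc/terms/> .",
        "",
        f"<{base}{identifier}>",
    ]
    triples = [f'    dcterms:identifier "{escape_literal(identifier)}"']
    for element_name, prop in FIELD_TO_PROPERTY.items():
        value = root.findtext(element_name)
        if value is not None:
            triples.append(f'    {prop} "{escape_literal(value.strip())}"')
    # Predicate-object pairs share one subject, separated by ";" and closed by ".".
    lines.append(" ;\n".join(triples) + " .")
    return "\n".join(lines)


print(xml_to_turtle(SAMPLE_XML))
```

Deriving the subject URI from the record's own identifier is what keeps the resource unique and persistent across repeated conversions.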
Current metadata scheme
Semantic enrichment
• dcterms
• dcam
• dcat
• foaf
• geo
• prov
• rdf/rdfs
• skos

• bwb
• ecli
• lido

• oep
• overheid
• overheidop
Semantic enrichment (type/subject)
Semantic enrichment (spatial)
Coordinates in EPSG:28992 format
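
One way to keep the coordinate reference system explicit when storing such coordinates is a GeoSPARQL WKT literal prefixed with the CRS URI. The sketch below assumes GeoSPARQL's `wktLiteral` datatype is used; the `geo` vocabulary in the deck may be a different one, and the helper name is hypothetical.

```python
# CRS URI for EPSG:28992 in the OGC naming scheme.
RD_NEW_CRS = "http://www.opengis.net/def/crs/EPSG/0/28992"
WKT_DATATYPE = "http://www.opengis.net/ont/geosparql#wktLiteral"


def rd_point_wkt_literal(x, y):
    """Return a Turtle-formatted WKT literal for a point given in metres."""
    # GeoSPARQL allows an optional CRS URI in angle brackets before the WKT text.
    wkt = f"<{RD_NEW_CRS}> POINT({x} {y})"
    return f'"{wkt}"^^<{WKT_DATATYPE}>'


# 155000, 463000 is the reference point of the Dutch RD system (Amersfoort).
print(rd_point_wkt_literal(155000, 463000))
```

Without the CRS prefix, a GeoSPARQL-aware store would read the numbers as longitude and latitude, which places every point far outside the Netherlands.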
Semantic enrichment (legal source)
A new resource identifier strategy
Type           Example
Register       {prefix}/id/gemeente
Register item  {prefix}/id/gemeente/gm0004
Page           {prefix}/doc/gemeente/gm0004
Concept        {prefix}/def/concept/{identifier}
Class          {prefix}/def/class/gemeente
Property       {prefix}/def/property/gemeentecode
Dataset        {prefix}/set/gemeente
Dataset item   {prefix}/set/gemeente/20190101
Based on international guidelines:
• Gov.UK: http://ukgovld.github.io/ukgovldwg/recommendations/uri-patterns.html
• ISA: https://joinup.ec.europa.eu/sites/default/files/document/2013-02/D7.1.3%20-%20Study%20on%20persistent%20URIs.pdf
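
The identifier patterns in the table can be captured in a small URI-minting helper. This is a sketch: the `PREFIX` value is a placeholder for `{prefix}`, and the function and key names are hypothetical.

```python
# Assumption: placeholder for the {prefix} used in the table.
PREFIX = "https://example.org"

PATTERNS = {
    "register": "{prefix}/id/{register}",
    "register_item": "{prefix}/id/{register}/{item}",
    "page": "{prefix}/doc/{register}/{item}",
    "concept": "{prefix}/def/concept/{identifier}",
    "class": "{prefix}/def/class/{name}",
    "property": "{prefix}/def/property/{name}",
    "dataset": "{prefix}/set/{register}",
    "dataset_item": "{prefix}/set/{register}/{version}",
}


def mint_uri(kind, **parts):
    """Fill in the URI pattern for the given resource kind."""
    return PATTERNS[kind].format(prefix=PREFIX, **parts)


def page_for(item_uri):
    """Map a register-item URI (/id/) to its page URI (/doc/)."""
    return item_uri.replace("/id/", "/doc/", 1)


item = mint_uri("register_item", register="gemeente", item="gm0004")
print(item)
print(page_for(item))
```

Separating `/id/` (the thing itself) from `/doc/` (a page about it) follows the pattern in the table and lets a server redirect from one to the other.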
Linkable data
• Apply the semantic model to the original data to make data linkable
• Deploy to data hub
  – Search on data and metadata
  – Machine can discover the data
  – Inferencing: deriving new facts from existing data
  – Federated queries still a challenge
    • Serviceable SPARQL
    • Other datasets incomplete for certain time slices
    • Other datasets not FAIR
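
To make "deriving new facts from existing data" concrete, the sketch below applies two standard RDFS entailment rules (rdfs9 and rdfs11) to a handful of triples in plain Python. The class and resource names are hypothetical, and a real data hub would rely on its triple store's reasoner rather than code like this.

```python
SUBCLASS = "rdfs:subClassOf"
TYPE = "rdf:type"

# Hypothetical facts; class and resource names are illustrative only.
facts = {
    ("ex:Municipality", SUBCLASS, "ex:DecentralizedGovernment"),
    ("ex:DecentralizedGovernment", SUBCLASS, "ex:GovernmentOrganization"),
    ("ex:gm0004", TYPE, "ex:Municipality"),
}


def infer(triples):
    """Apply RDFS rules rdfs9 and rdfs11 until no new triples appear."""
    closure = set(triples)
    while True:
        new = set()
        for s, p, o in closure:
            for s2, p2, o2 in closure:
                # Only pairs where the second triple says "o is a subclass of o2".
                if p2 != SUBCLASS or o != s2:
                    continue
                if p == SUBCLASS:   # rdfs11: subClassOf is transitive
                    new.add((s, SUBCLASS, o2))
                elif p == TYPE:     # rdfs9: instances belong to superclasses
                    new.add((s, TYPE, o2))
        if new <= closure:
            return closure
        closure |= new


derived = infer(facts) - facts
for triple in sorted(derived):
    print(triple)
```

From three stated facts this derives three more, including that `ex:gm0004` is a government organization, which no source record states directly.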
Monitoring data quality
• Discover publications with
  – address in title but without geo:Point
  – 100+ geo:Points
  – dcterms:type “other” but type in title
  – article of law in title without reference to law
  – legal source “unknown”
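
Several of these checks can be expressed as simple rules over a publication record. The sketch below is illustrative only: the record layout (a dict with `title`, `type`, `points`, `legal_source`), the address pattern and the list of type keywords are assumptions, and the article-of-law check is left out.

```python
import re

# Rough pattern for a Dutch-style address: a street-like word plus a house number.
ADDRESS_RE = re.compile(r"\b[A-Za-z]+(?:straat|weg|laan|plein)\s+\d+", re.IGNORECASE)

# Hypothetical keywords that reveal a publication type in the title.
TYPE_KEYWORDS = ("omgevingsvergunning", "verkeersbesluit")


def quality_issues(pub, max_points=100):
    """Return a list of issue labels for one publication record (a dict)."""
    issues = []
    title = pub.get("title", "")
    points = pub.get("points", [])
    if ADDRESS_RE.search(title) and not points:
        issues.append("address in title but without geo:Point")
    if len(points) >= max_points:
        issues.append("too many geo:Points")
    if pub.get("type") == "other" and any(k in title.lower() for k in TYPE_KEYWORDS):
        issues.append('dcterms:type "other" but type in title')
    if pub.get("legal_source") == "unknown":
        issues.append('legal source "unknown"')
    return issues


sample = {
    "title": "Omgevingsvergunning Dorpsstraat 1",
    "type": "other",
    "points": [],
    "legal_source": "unknown",
}
print(quality_issues(sample))
```

In a knowledge-graph setting the same rules would more naturally be written as queries over the enriched data; plain Python is used here only to keep the example self-contained.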
Monitoring and visualization (1)
Sunburst diagram: publication product and organization for 2018
Type          Share
Municipality  74%
State         19%
Water board   3%
Province      2%
Monitoring and visualization (2)
Sunburst diagram: publication product and type for 2018
Combine with other data
• Registry of government organizations
• Geographic, demographic and socio-economic information
• Judicial information: laws, articles of law, court judgments, treaties, guidelines
Query combined data
Buildings by age inside a district (GeoSPARQL)
CBS and BAG data, https://data.labs.pdok.nl/yasgui
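
A query of this kind might look like the sketch below, which builds a SPARQL-protocol GET URL around a GeoSPARQL `geof:sfWithin` filter. The endpoint and the `ex:` vocabulary are placeholders; the real CBS and BAG property names differ and the query shown in the talk is not reproduced here.

```python
from urllib.parse import urlencode

# Assumptions: placeholder endpoint and an illustrative `ex:` vocabulary.
ENDPOINT = "https://example.org/sparql"

QUERY_TEMPLATE = """
PREFIX geo:  <http://www.opengis.net/ont/geosparql#>
PREFIX geof: <http://www.opengis.net/def/function/geosparql/>
PREFIX ex:   <https://example.org/def/>

SELECT ?building ?year WHERE {{
  ?district ex:districtName "{district}" ;
            geo:hasGeometry/geo:asWKT ?districtWkt .
  ?building a ex:Building ;
            ex:constructionYear ?year ;
            geo:hasGeometry/geo:asWKT ?buildingWkt .
  FILTER(geof:sfWithin(?buildingWkt, ?districtWkt))
}}
ORDER BY ?year
"""


def build_query(district):
    """Fill the district name into the query, escaping quotes and backslashes."""
    safe = district.replace("\\", "\\\\").replace('"', '\\"')
    return QUERY_TEMPLATE.format(district=safe)


def build_request_url(district):
    """Return a SPARQL-protocol GET URL carrying the query."""
    return ENDPOINT + "?" + urlencode({"query": build_query(district)})


print(build_query("Binnenstad"))
```

The spatial filter is what joins the two datasets: buildings and districts share no identifier, only geometry.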
The road ahead
• Which districts are under- or overpublishing?
• What are good geo-ranges to offer citizens in the notification service? District, street, radius?
• Offer search interface for
  – publications of similar type in street/city/province;
  – publications related to same legal source;
  – older publications for the same address;
  – publication types (higher granularity)
Open government and the Semantic Web: a field report
• Official gazette currently almost FAIR
• Enriching publications with knowledge graphs
  – enables data quality monitoring/analysis
  – enables custom open government
  – prepares for machine learning enrichment
Q&A