Upload
others
View
0
Download
0
Embed Size (px)
Citation preview
The Web
• A network of interlinked computers, protocols, software and applications
• A socio-cultural phenomenon
• Changed society over the last 20 years – work, trade, education,
science, entertainment, culture….
• Rapidly evolving – underlying technologies – usage
• Huge – 1 Trillion unique URLs (in
20081)
– 40 Billion pages indexed by Google2
1 http://googleblog.blogspot.com/2008/07/we-knew-web-was-big.html 2 http://www.worldwidewebsize.com/
Technological strata
• Backend – Building a Web “site” – Large complex programs!
• Formats and representations – Documents: HTML and CSS – Data: XML, JSON, RDF – Knowledge: OWL
• Frontend – Rich End User Experience – Mash ups – Social networks
http://www.ibm.com/developerworks/ibm/library/it-booch_web/
Advanced Web Technologies
• We examine – Web oriented and inspired technologies – Design issues and future directions
• You get – Practical skills – Theoretical and conceptual understanding – Appreciation of the design trade offs
This theme aims to provide students with an overview and understanding of advanced web technologies which are being used to facilitate the move from a web of documents, to a web of documents, data and applications.
Course Units in the Theme
• COMP60411: Semi-structured Data and the Web – Data models for the Web – XML family: data models, schema & query languages,
APIs, etc. • COMP60421: Ontology Engineering for the Semantic
Web – Knowledge on the Web – Formalisms, reasoning services, modelling principles
COMP60411: Semi-structured Data and the Web���
• The Web is – a very large (by several metrics) information structure – a complex, distributed, often interactive computing
platform • How do we put information on it?
• This course will acquaint you with the representation of data on (or inspired by) the Web – semi-structured data & data models – XML, HTML, and related technologies – data, documents, and (Web) applications
Sem1, P1: Uli Sattler, Connie Hedeler
An XML-centric study of webesque data, and definitional & manipulation languages from the designer/user point of view
COMP60411: Semi-structured Data and the Web��� • A firm grounding in an array of SSD
languages and supporting technologies – Make sense of the alphabet soup!
– Underlying, unifying, and contrasting principles
– Assess applicability, utility, and tradeoffs
• Acquaintance with applications – Designing formats and data
– Designing for extensibility/usability – Examine some existing formats
(e.g., SVG, XHTML, MathML)
• Understanding of the theory – Conceptual issues – Issues with language design
– Grammar & automata theory
In these areas, we see rapid change from the • top (from big corps and standards bodies) • bottom (from small communities or individuals)
• Goal: Separate hype and flash from fundamentals
COMP60411: Semi-structured Data and the Web��� • Background & XML Basics
• Self-Describingness • intro to XPath
• Schemas overview • DTDs - a simple schema language • more on XPath
• Modelling – some quality criteria • XML Namespaces • W3C XML Schema (WXS)
• more on Modelling • Schemas and Tree Grammars • XQuery
• Types & their use in queries • Validation against schemas via tree
grammars
• RELAX-NG • Internal & External Representations
• The Essence of XML • Roundtripping issues • Error handling
• CSS • Schematron • Errors on the Web
COMP60411: Semi-structured Data and the Web��� Prerequisites
• Moderate Java programming skills; unit testing/JUnit familiarity a plus
• Helpful: – good familiarity with relational database theory – some familiarity with regular expressions, grammars
and automata (e.g., EBNF) • You all have written web pages, right? • Worth reading: Learning XML • This module will look at topics in a rigorous way
COMP60411: Semi-structured Data and the Web��� Assessment
• 50% exam – Covers conceptual issues, simple problems
• 50% weekly course work, consisting of – Quiz to highlight relevant concepts – Short essay
§ 1-2 paragraphs, to get you reading & writing
– Short "thought" problems § small programs to acquaint you with data model, APIs and languages
§ Transform, query, style it
– Modelling exercise: encode data into various formats
COMP60421: Ontology Engineering for the SW
• Knowledge Representation and Ontologies – Increasing role in applications – W3C’s OWL and SKOS established standards
• Key Technologies in realising the Semantic Web and Web of Data
• Module provides Foundations and Practice of understanding, developing and using OWL ontologies
Sem 1, P2: Sean Bechhofer, Robert Stevens
COMP60421: Ontology Engineering for the SW
Ontology • A representation of the shared background knowledge for
a community • Providing the intended meaning of a formal vocabulary
used to describe a certain conceptualisation of objects in a domain of interest
• A vocabulary of terms plus explicit characterisations of the assumptions made in interpreting those terms
• Nearly always includes some notion of hierarchical classification (is-a)
• Richer languages allow the definition of classes through description of their characteristics – Introduce the possibility of using inference to help in
management and deployment of the knowledge.
COMP60421: Ontology Engineering for the SW
Theory • Foundations • Description Logics • Reasoning • Methodologies
Practice • Knowledge Acquisition • Ontology Development • Applications • Comment/critique
AI
v
x
yz
w
BI
COMP60421: Ontology Engineering for the SW
Assessment • 50% Exam
– Electronic/On-line • 50% Coursework
– Weekly short questions/tests – Practical exercises
§ Knowledge Acquisition § Modelling
§ Ontology Critique § Use of tools § Application development – that means Java programming!
Research Related to this Theme
• UoM is a Leading Research Centre in Ontology Engineering, Language Design and Semantic Applications
• Instrumental in W3C Standardization – Web Ontology Language OWL – Simple Knowledge Organisation Systems SKOS – SPARQL Query language
• Tool Development – OWL API, Protégé, FaCT++, SWOOP
• Applications – Life Sciences, eScience
Past Projects in this Theme
• Web-based Support for Teaching Delivery • A Web-accessible Exhibition Knowledge Base Management System • Generation of Websites from Models
• Extracting SKOS vocabularies from News Feeds • Analysing Social Network Data • Publishing Scientific Workflow Provenance
• Design of a modular ontology • Developing Probabilistic���
Ontologies
• Evaluating Java-OWL ��� Integration
• Many many more!
Is it for me?
• This theme is not for… – those who just want to build ���
web sites – those who don’t like to “get their hands dirty” – those who don’t like to read around the subject
• This theme is for… – those who are seeking a deeper understanding of the
technologies that are being used to support the continuing evolution of the Web.
http://www.flickr.com/photos/-bast-/349497988/