Upload
christopher-morris
View
219
Download
1
Embed Size (px)
Citation preview
SWAP-07
Ontology Engineering with OntoClean
Chris WeltyIBM Watson Research Center
2SWAP-07
Acknowledgements
People
Nicola GuarinoCladio MasoloAldo GangemiAlessandro Oltramari
Bill Andersen
OrganizationsIBM Research
Vassar College, USA
LADSEB-CNR, PadovaCNR Cognitive
Science Institute, Trento
OntologyWorks, Inc.
3SWAP-07
Which one is better?
T-Series
ThinkPad
T Series
ThinkPad Model
Thinkpad
model
4SWAP-07
Which one is better?Computer
has-part
MemoryDisk Drive
Computer Part
Memory PartDisk Part
Computer Part
Disk Drive Memory
Computer
has-part
Due to: Guizzardi, et al, 2004.
5SWAP-07
Formal Ontology of Relations
• Subsumption• Instantiation• Part/Whole• Constitution• Spatial (Cohn)• Temporal (Allen)
6SWAP-07
Subsumption• The most pervasive relationship in ontologies
– Influence of taxonomies and OO
• AKA: Is-a, a-kind-of, specialization-of, subclass (Brachman, 1983)– “horse is a mammal”
• Capitalizes on general knowledge– Helps deal with complexity, structure– Reduces requirement to acquire and represent redundant specifics
• What does it mean?
□ x (x) (x)
Every instance of the subclass is necessarily an instance of the superclass
7SWAP-07
Overloading Subsumption
Common modeling pitfalls
• Instantiation• Constitution• Composition• Disjunction• Polysemy• Temporality• Spatial/Containment
8SWAP-07
Instantiation Pitfall
T21
My ThinkPad (s# xx123)
ThinkPad Model
Ooops…
Question: What ThinkPad models do you sell?Answer should NOT include My ThinkPad -- nor yours.
Does this ontology mean that My ThinkPad is a ThinkPad Model?
9SWAP-07
Instantiation
T Series
My ThinkPad (s# xx123)
ThinkPad ModelNotebook Computer
model T 21
10SWAP-07
Composition Pitfall
MemoryDisk Drive
Computer
Question: What Computers do you sell?Answer should NOT include Disk Drives or Memory.
Micro Drive
11SWAP-07
Composition
MemoryDisk Drive
Computer
Micro Drive
part-of
12SWAP-07
Disjunction Pitfall
MemoryDisk Drive
Computer
Micro Drive
has-partComputer Part
Flashcard-110Camera-15has-part
Unintended model: flashcard-110 is a computer-part
13SWAP-07
Disjunction
Computerhas-part
Disk Drive Memory …
14SWAP-07
Polysemy Pitfall(Mikrokosmos)
Abstract EntityPhysical Object
Book
Question: How many books do you have on Hemingway?Answer: 5,000
…..
15SWAP-07
Polysemy(WordNet)
Abstract EntityPhysical Object
BookSense 1
BookSense 2
….. Biography of Hemingway
16SWAP-07
Constitution Pitfall(WordNet)
Amount of Matter
Physical Object
Entity
ComputerClayMetal
Question: What types of matter will conduct electricity?Answer should NOT include computers.
17SWAP-07
Constitution
Amount of Matter Physical Object
Entity
ComputerClayMetal
constituted
18SWAP-07
Temporality Pitfall(Wikipedia)
1963 1964
1960s
Chris
19SWAP-07
Temporality Pitfall(Wikipedia)
1963 births 1964 births
1960s births
Chris
20SWAP-07
Temporality
1963 1964
1960s
Chris
contains
bornIn
Person
Decade
Year
21SWAP-07
Spatial/Containment Pitfall(OWL Guide)
Alsace Region Loire Region
French Region
22SWAP-07
Spatial/Containment
Alsace Loire
France
Region
Country
contains
23SWAP-07
Its about the instances
• For every class, think about what an instance of it is– What is an instance of “Loire Region”?
• Classes do not describe their subclasses– “Regions by Country” is a class of classes
• Criteria for individuation must remain constant within a taxonomy– Instance of a class is also an instance of every superclass• Thus “Chris” is not an instance of “1963 births”
– Explore the “boundary conditions”• E.g. Changes in existence, distinctions with similar classes
• “Leaf Nodes” of a hierarchy have no special significance– Don’t switch to instances
24SWAP-07
Common Pitfalls
• Composition (part of)– Arm subclass body
• Constitution– Statue subclass marble
• Disjunction– (class Car partial (all hasPart CarPart)
– (Engine subclass CarPart)
– (Tire subclass CarPart)• Spatial
– NewYork subclass US
• Polysemy– Book subClass PhysicalObject– Book subClass ConceptualCreation
• Arbitrary organizational nodes– FictionalBookbyLatinAmericanAuthor subClass FictionalBook
• Instance– PinotNoir instanceof Grape
• Temporality– YoungElvis instanceOf Elvis
25SWAP-07
The linguistic tests• If P subclass Q, you should be able to say “P is a kind of Q”
• If a instanceOf P, you should be able to say, “a is a P”
• If a instanceof P subClassOf Q, you should be able to say “a is a Q”
• For every instance, there should be a class it is (rigidly) an instance of that is its natural label
• You should not find it natural to say, if P subclassOf Q, “P has Q”, “P might be Q”, “P was Q”, “P is in Q”, “P is part of Q”
26SWAP-07
What’s in a name• Don’t argue about what specific terms mean
– Common software architecture argument: “What is a bridge?”
• Try and find the distinctions that matter– Assign them labels later
• Avoid “ish” “-thing” & “other-” classes– Find good names that will avoid meaning creep– Other- classes create a maintenance nightmare
• Classes describe their instances– Remember the linguistic tests
• The superclass is not part of the name– So don’t assume it is (e.g. Best_Practices subClassOf Document)