Medical Ontologies: An Overview

Preview:

DESCRIPTION

Medical Ontologies: An Overview. Barry Smith http://ifomis.de. Three levels of ontology. formal (top-level) ontology dealing with categories employed in every domain: object, event, whole, part, instance, class 2) domain ontology , applies top-level system to a particular domain - PowerPoint PPT Presentation

Citation preview

Medical Ontologies: An Overview

Barry Smithhttp://ifomis.de

http:// ifomis.de 2

Three levels of ontology1) formal (top-level) ontology dealing with

categories employed in every domain: object, event, whole, part, instance, class

2) domain ontology, applies top-level system to a particular domaincell, gene, drug, disease, therapy

3) terminology-based ontologylarge, lower-level systemDupuytren’s disease of palm, nodules with no contracture

http:// ifomis.de 3

Three levels of ontology1) formal (top-level) ontology dealing with

categories employed in every domain: object, event, whole, part, instance, class

2) domain ontology, applies top-level system to a particular domaincell, gene, drug, disease, therapy

3) terminology-based ontologylarge, lower-level systemDupuytren’s disease of palm, nodules with no contracture

http:// ifomis.de 4

Three levels of ontology1) formal (top-level) ontology dealing with

categories employed in every domain: object, event, whole, part, instance, class

2) domain ontology, applies top-level system to a particular domaincell, gene, drug, disease, therapy

3) terminology-based ontologylarge, lower-level systemDupuytren’s disease of palm, nodules with no contracture

http:// ifomis.de 5

Three levels of ontology1) formal (top-level) ontology dealing with

categories employed in every domain: object, event, whole, part, instance, class

2) domain ontology, applies top-level system to a particular domaincell, gene, drug, disease, therapy

3) terminology-based ontologylarge, lower-level systemDupuytren’s disease of palm, nodules with no contracture

http:// ifomis.de 6

IFOMISInstitute for Formal Ontology and Medical

Information ScienceLeipzig

http://ifomis.de

philosophers and medical informaticians attempting to build and test a Basic Formal Ontology for applications in biomedical and

related domains

http:// ifomis.de 7

IFOMIS

use basic principles of philosophical ontologyfor quality assurance and alignment of biomedical ontologies

http:// ifomis.de 8

Compare:

1) pure mathematics (theories of structures such as order, set, function, mapping) employed in every domain

2) applied mathematics, applications of these theories = re-using the same definitions, theorems, proofs in new application domains

3) physical chemistry, biophysics, etc. = adding detail

http:// ifomis.de 9

Three levels of ontology1) formal (top-level) ontology =

medical ontology has nothing like the technology of definitions, theorems and proofs provided by pure mathematics

2) domain ontology = UMLS Semantic Network, GALEN CORE

3) terminology-based ontology = UMLS, SNOMED-CT, GALEN, FMA

?????

http:// ifomis.de 10

Strategy

Part 1: Provide an overview of medical ontologies and of the top-level ontologies which they implicitly define

Part 2: Show how principles of classification and definition derived from top-level ontology can help in quality assurance of terminology-based ontologies and in ontology alignment

Parts 3 and 4: IFOMIS Collaboration with L&C

http:// ifomis.de 11

http:// ifomis.de 12part of the UMLS Semantic Network

http:// ifomis.de 13

UMLS Semantic Network

entity event

physical conceptual object entity

http:// ifomis.de 14

UMLS Semantic Network

entity event

physical conceptual object entity

http:// ifomis.de 15

conceptual entity

Organism AttributeFinding

Idea or ConceptOccupation or Discipline

OrganizationGroup

Group AttributeIntellectual Product

Language

http:// ifomis.de 16

Conceptual EntityIdea or Concept

Functional ConceptQualitative ConceptQuantitative ConceptSpatial Concept

Body Location or RegionBody Space or JunctionGeographic AreaMolecular Sequence

Amino Acid SequenceCarbohydrate SequenceNucleotide Sequence

http:// ifomis.de 17

Fairfax Countyis an Idea or Concept

http:// ifomis.de 18

Why is Fairfax County a Conceptual Entity for UMLS-SN?UMLS-SN Spatial Concepts share the following characteristics: a) they are extended in space b) their boundaries are determined not by any underlying physical discontinuities but rather by human fiat. The referent of ‘Fairfax County’ satisfies these conditions, but so also does hand, which is not classified by UMLS as a conceptual entity.

http:// ifomis.de 19

http:// ifomis.de 20

http:// ifomis.de 21

gene part_of cell component

body system conceptual_part_of fully formed anatomical structure

http:// ifomis.de 22

conceptual entity

idea or concept

functional concept

body system

http:// ifomis.de 23

But:

Gene or Genome is defined as: “A specific sequence … of nucleotides along a molecule of DNA or RNA …”

and nucleotide sequence is_a conceptual entity

http:// ifomis.de 24

entity

physical conceptual object entity

idea or concept

functional concept

body system

confusion of entity and concept

http:// ifomis.de 25

Functional Concept:

Body system is_a Functional Concept.

but:

Concepts do not perform functions or have physical parts.

http:// ifomis.de 26

This:

is not a concept

http:// ifomis.de 27

Problem: Confusion of Is_A and Has_Role

Physical Entity

Chemical Entity

Chemical ChemicalViewed Viewed

Structurally Functionally

http:// ifomis.de 28

Chemical Viewed Structurally vs. Chemical Viewed Functionally

reflects a distinction between types of classification – not between types of entity

compare a classificationof people into:tall people, people who play tennis, people who look like flies from a distanceetc.

http:// ifomis.de 29

Confusion of Is_A and Has_Role

Physical Object

Substance

Food Chemical Body Substance

http:// ifomis.de 30

Roles

A box used for storage is not (ipso facto) a special kind of box

An animal belonging to the emperor is not a special kind of animal

http:// ifomis.de 31

The Hydraulic Equation

BP = CO*PVR

arterial blood pressure is directly proportional to the product of blood flow (cardiac output, CO) and peripheral vascular resistance (PVR)

http:// ifomis.de 32

Confusion of Ontology and Epistemology

blood pressure is an Organism Function,cardiac output is a Laboratory or Test Result

or Diagnostic Procedure

BP = CO*PVR thus asserts that blood pressure is proportional either to a laboratory or test result or to a diagnostic procedure

http:// ifomis.de 33

Disease History

is classified by UMLS under Health Care Activity

This runs togetherthe history or course of a disease on the side of the patient (ontology)

with the act of eliciting that history (epistemology).

http:// ifomis.de 34

Object vs. Process =Continuant vs. Occurrent

Continuant entities = endure through time organisms, cells, molecules exist in full in every instant at which you exist at all

Occurrent entities (processes, events, activities, changes, histories) unfold themselves in time; never exist in full in any single instant

http:// ifomis.de 35

Dependent vs. Independent Entities

Dependent entities require support from other entities in order to exist:there is no mass or shape without some body Independent entities are themselves the substrates for qualities, dispositions, motions, functions and other dependent entities

http:// ifomis.de 36

entities

independent dependent occurrents continuants continuants (always

dependent)

ORGANISMS ROLES PROCESSES CELLS FUNCTIONS HISTORIES

MOLECULES CONDITIONS LIVES (diseases) (courses of

diseases)

http:// ifomis.de 37

entities

independent dependent occurrents continuants continuants (always

dependent)

ORGANISMS ROLES PROCESSES CELLS FUNCTIONS HISTORIES

MOLECULES CONDITIONS LIVES (diseases) (courses of

diseases)

classes

instances

http:// ifomis.de 38

A three-category ontology along these lines accepted by

DOLCE = first module of Semantic Web Wonderweb Foundational Ontologies Library

BFO = IFOMIS Basic Formal OntologyUMLS-SN, GO

http:// ifomis.de 39

GALEN

independent dependent occurrents continuants continuants (always

dependent)

GENERALISED MODIFIER GENERALISEDSTRUCTURES + CONCEPT PROCESSGENERALISED (features, SUBSTANCES states, roles)

http:// ifomis.de 40

GALEN

independent dependent occurrents continuants continuants (always

dependent)

GENERALISED MODIFIER GENERALISEDSTRUCTURES + CONCEPT PROCESSGENERALISED (features, SUBSTANCES states,cell + sputum roles)

http:// ifomis.de 41

GALEN

independent dependent occurrents continuants continuants (always

dependent)

GENERALISED MODIFIER GENERALISEDSTRUCTURES + CONCEPT PROCESSGENERALISED (features, SUBSTANCES states, roles)

http:// ifomis.de 42

GALEN

independent dependent occurrents continuants continuants (always

dependent)

GENERALISED ASPECT GENERALISEDSTRUCTURES + PROCESSGENERALISED (features, SUBSTANCES states, roles)

http:// ifomis.de 43

http:// ifomis.de 44

http:// ifomis.de 45

http:// ifomis.de 46

http:// ifomis.de 47

http:// ifomis.de 48

immune system is_a logical structure

http:// ifomis.de 49

GALEN CORE (1996)

Phenomenon = those categories which can be observed

TopThing

DomainCategory DomainAttribute

ModifierConcept ValueType Phenomenon

Feature Collection Selector

GeneralisedProcess

GeneralisedStructure

GeneralisedSubstance State

Modality

Role

Unit

GeneralLevelOfSpecification

Status

Aspect

http:// ifomis.de 50

GALEN CORE (1996)

Phenomenon = categories whose instances can be observed

TopThing

DomainCategory DomainAttribute

ModifierConcept ValueType Phenomenon

Feature Collection Selector

GeneralisedProcess

GeneralisedStructure

GeneralisedSubstance State

Modality

Role

Unit

GeneralLevelOfSpecification

Status

Aspect

http:// ifomis.de 51

SNOMED-CT ConceptSubstanceBody StructureSpecimenContext-Dependent CategoriesAttributeFindingStaging and ScalesOrganismPhysical ObjectEventsEnvironments and Geographic LocationsQualifier ValueSpecial ConceptPharmaceutical / Biological ProductSocial ContextDiseaseProcedurePhysical Force

http:// ifomis.de 52

SNOMED-CT ConceptSubstanceBody StructureSpecimenContext-Dependent CategoriesAttributeFindingStaging and ScalesOrganismPhysical ObjectEventsEnvironments and Geographic LocationsQualifier ValueSpecial ConceptPharmaceutical / Biological ProductSocial ContextDiseaseProcedurePhysical Force

http:// ifomis.de 53

SNOMED-CT ConceptSubstanceBody StructureSpecimenContext-Dependent CategoriesAttributeFindingStaging and ScalesOrganismPhysical ObjectEventsEnvironments and Geographic LocationsQualifier ValueSpecial ConceptPharmaceutical / Biological ProductSocial ContextDiseaseProcedurePhysical Force

http:// ifomis.de 54

An unintuitive top-level

with unintuitive rules for classificationleads to coding errorsdifficulties in training of curatorsobstacles to alignment with other ontology

and terminology systemsobstacles to harvesting content in automatic

reasoning systems

http:// ifomis.de 55

Principles for Building Medical Ontologies

Barry Smithhttp://ifomis.de

http:// ifomis.de 57

Examples

Don’t confuse entities with concepts Don’t confuse domain entities with logical

structuresDon’t confuse ontology with epistemologyDon’t confuse is_a with has_role

http:// ifomis.de 58

Further Principles

univocity: terms should have the same meanings (and thus point to the same referents) on every occasion of use

UMLS-SN: ‘organization’ = body plan‘organization’ = social organization

http:// ifomis.de 59

univocity

Gene Ontology:‘part_of’ = ‘can be part of’ (flagellum part_of

cell)‘part_of’ = ‘is sometimes part of’ (replication

fork part_of the nucleoplasm)‘part_of’ = ‘is included as a sublist in’

http:// ifomis.de 60

don’t forget instances

part_of as a relation between classesvs. part as a relation between instances

A part_of B1. every instance of A is part of some

instance of B 2. every instance of B has some instance of

A as part

http:// ifomis.de 61

Part_of as a relation between classes is more problematic than is

standardly supposed

testis part_of human being ?

heart part_of human being ?

http:// ifomis.de 62

UMLS-SN Semantic Relations

preventsDefinition: Stops, hinders or eliminates an action or condition.Inverse: prevented_by

contraception prevents pregnancypregnancy prevented by contraception

http:// ifomis.de 63

UMLS-SN Semantic Relation

producesDefinition: Brings forth, generates or creates. Inverse: produced_by

artificial insemination produces pregnancypregnancy produced by artificial insemination

http:// ifomis.de 64

positivity

complements of classes are not themselves classes. (Terms such as ‘non-mammal’ or ‘non-membrane’ do not designate natural kinds.)

http:// ifomis.de 65

objectivity

which classes exist is not a function of our biological knowledge. (Terms such as ‘unknown’ or ‘unclassified’ or ‘unlocalized’ do not designate biological natural kinds.)

http:// ifomis.de 66

rules governing levels

the terms in a classificatory hierarchy should be divided into predetermined levels (analogous to the levels of kingdom, phylum, class, order, etc., in traditional biology).

the terms in a partonomic hierarchy should be divided into predetermined granularity levels (organism, organ, cell, molecule, etc.)

http:// ifomis.de 67

JEPD (jointly exhaustive and pairwise disjoint)

single inheritance: no class in a classifi-catory hierarchy should have more than one parent on the immediate higher level

exhaustiveness: the classes on any given level should exhaust the domain of the classificatory hierarchy (difficult to satisfy in biomedicine but is accepted as a goal by every scientist)

http:// ifomis.de 68

Shortfalls from single inheritance

are often clues as to bad coding, since they mark deviations from is_a relations which also block ontology alignment

http:// ifomis.de 69

Is_a Overloading

The success of ontology alignment depends crucially on the degree to which basic ontological relations such as is a and part of can be relied on as having the same meanings in the different ontologies to be aligned.

http:// ifomis.de 70

Use of multiple inheritance

involves the assignment to the is_a relation of a plurality of different meanings within a single ontology.

The resultant mélange makes coherent integration across ontologies achievable (at best) only under the guidance of human beings with relevant biological knowledge

http:// ifomis.de 71

rules for definitions

intelligibility: the terms used in a definition should be simpler (more intelligible) than the term to be defined

otherwise the definition provides no assistance to the understanding (for humans)or is unprocessable (for machines)

http:// ifomis.de 72

substitutabilityin all so-called extensional contexts a defined term should be substitutable by its definition in such a way that the result is both grammatically correct and has the same truth-value as the sentence with which we begin

GO:0015070: toxin activity Definition: Acts as to cause injury to other living

organisms.

http:// ifomis.de 73

substitutability

There is toxin activity here

There is acts as to cause injury to other living organisms here

http:// ifomis.de 74

modularity: you can’t define everything

isolate primitive terms (= level 0)define terms on level n + 1, for each n 0 using only:

• terms taken from levels n and belowplus• logical and ontological constants such as

‘and’, ‘all’, ‘is_a’ and ‘part_of’

http:// ifomis.de 75

univocity and modularity

if these rules are not satisfied then error checking and ontology alignment can be achieved, at best, only with human intervention

http:// ifomis.de 76

The Foundational Model of Anatomy

follows formal rules for definitions laid down by Aristotle. A definition is the specification of the essence (nature, invariant structure) shared by all the members of a class or natural kind.

http:// ifomis.de 77

The Foundational Model of Anatomy

Topmost node are the undefinable primitives. The definition of a class lower down in the hierarchy is provided by specifying the parent of the class together with the relevant differentia, which tells us what marks out instances of the defined class within the wider parent class, as in: human = rational animal.

http:// ifomis.de 78

FMA Examples

Cell is an anatomical structure that consists of cytoplasm surrounded by a plasma membrane with or without a cell nucleus

Plasma membrane is a cell part that surrounds the cytoplasm,

http:// ifomis.de 79

The FMA regimentationbrings the advantage that each definition reflects the position in the hierarchy to which a defined term belongs. The position of a term within the hierarchy enriches its own definition by incorporating automatically the definitions of all the terms above it.The entire information content of the FMA’s term hierarchy can be translated very cleanly into a computer representation

http:// ifomis.de 80

These Rules are Rules of Thumb

The world of biomedical research is a world of difficult trade-offsthe benefits of formal (logical and ontological) rigor need to be balanced

1. against the constraints of computer tractability, 2. against the needs of biomedical practitioners.

But automatic alignment biomedical information resources will be achieved only to the degree that such resources conform to the standard principles of classification and definition

http:// ifomis.de 81

Axioms Every class has at least one instance

Distinct classes on the same level never share instances

Distinct leaf classes within a classification never share instances

http:// ifomis.de 82

Axioms

Every genus has an instantiated species

Each species has a smaller class of instances than its genus

http:// ifomis.de 83

Axioms

Every genus has at least two children

http:// ifomis.de 84

Theorems

Every instance is also an instance of some leaf class

Classes which share a child in common are either identical or one is subordinated to the other

http:// ifomis.de 85

Mathematical Structure

Each class hierarchy constitutes a supremum-semilattice with respect to is_a

http:// ifomis.de 86

Classes vs. SetsBoth classes and sets are marked by

granularity – but sets are timelessA class endures through time and

survives the turnover in its instances

A set is determined by its members A class is not determined by its instances

(as a state is not determined by its citizens and as an organism is not determined by its molecules)

http:// ifomis.de 87

Classes vs. SumsClasses are marked by granularity: they divide up the corresponding domain into whole units or membersThe class of human beings is instantiated only by human beings as single, whole units.

The sum of human beings includes also all cells and molecules existing inside human beings as parts

http:// ifomis.de 88

Classes sets

A set is an abstract structure, existing outside time and space. The set of human beings existing at t is (timelessly) a different entity from the set of human beings existing at t because of births and deaths.

http:// ifomis.de 89

Classes vs. Sets

A set with n members has in every case exactly 2n subsetsThe subclasses of a class are limited in number(which classes are subsumed by a larger class is a matter for empirical science to determine)

http:// ifomis.de 90

Conclusion

The analogue of pure mathematics for biomedical informatics

(the theory of biomedical classification) must look very different from standard mathematical set theory (and from its progeny, including Description Logic)

The formal theory of biological classification is still in its infancy

http:// ifomis.de 91

Biological classes are marked always by an opposition between standard or prototypical instances and a surrounding penumbra of non-standard instances (not all instances of the class human being are marked by the presence of amputation stumps or pituitary tumors). To do justice to these matters FMA introduces the factor of idealization, which means (in first approximation) that the classes of the FMA’s Anatomy Taxonomy AT include only those instances to which canonical anatomy applies.

This means that we need to revise definitions D1–D4 by restricting the range of variables x, y, ... to the realm of individuals which satisfy the generalizations of canonical anatomy, so that the same abstraction of anatomy (structure) will be represented in all the instances of any given AT-class. This device of specifying different ranges of variables gives us the means also to represent the generalizations belonging to the different branches of canonical anatomy, for example to canonical anatomy for male vs. female human beings, for human beings at various developmental stages, and for organisms in other species. It can allow us also to represent the generalizations governing the anatomical variants yielded by the presence of, for example, coronary arteries or bronchopulmonary segments, which deviate from canonical anatomical patterns of organization.

http:// ifomis.de 92

Classes vs. Sets: Granularity and TimeSets in the mathematical sense, too, are marked by the factor of granularity, which means that each

set comprehends its members as single, whole units. A class or set is laid across reality like a grid consisting (1) of a number of slots or pigeonholes each (2) occupied by some member. (This informal talk of grids and slots is formalized in [[14]] in terms of the theory of granular partitions.) Classes are distinguished from sets, however, by the fact that a set is determined by its members. This means that it is (1) associated with a specific number of slots, each of which (2) must be occupied by some specific member. A set is thus specified in a double sense. A class, in contrast, survives the turnover in its instances, and so it is specified in neither of these senses, since both (1) the number of associated slots and (2) the individuals occupying these slots may vary with time.

Sets are distinguished from classes also in this: a set with n members has in every case exactly 2n subsets, constituted by all the combinations of these members. The subclasses of a class, on the other hand, are limited in number, and which classes are subsumed by a larger class is a matter for empirical science to determine. Leaves (lowest nodes) in the taxonomy are (changing) collections of instances. As we move up the taxonomy we encounter in succession collections of such collections of instances, collections of collections of such collections, etc., organized in a nested hierarchy reaching up to the maximal class or ‘root’. We can visualize the classes at different levels as being analogous to geopolitical entities (towns, counties, states) as represented on a map. Instances correspond in this analogy to the corresponding populations: a class is not determined by its instances as a state is not determined by its citizens.

http:// ifomis.de 93

Classes are distinguished from sets also by their relation to time. A set is an abstract structure, existing outside time and space, and this is so even when its members are parts of concrete reality. Since each set is determined by its members, the set of human beings existing at t is (timelessly) a different entity from the set of human beings existing at t because of births and deaths.

Matters are different with regard to classes. The class human being can survive the change in the stock of its instances which occurs when John and Jane die, because classes exist in time. John and Jane themselves can similarly survive changes in the stock of cells or molecules by which they are constituted.

To do justice to the fact that classes in the biological domain endure even when their extensions change, a full definition of the is_a relation must involve a temporally indexed reading of inst (with variables t, t, etc., ranging over times):

D1*A is_a B =def t x ( inst(x, A, t) inst(x, B, t) ),so that A is_a B means: at all times t, if x is an instance of A at t then x is an instance of B

at t. D1* will also take care of false positives such as adult is_a child, which an untensed reading of D1 would otherwise allow. In general, all statements of inst and part relations involving objects in biomedical ontologies, like all the data of instantiated anatomy, are indexed by times.

http:// ifomis.de 94

Taxonomy and PartonomyA taxonomy such as AT is formally speaking a tree in the mathematical sense.

It satisfies axioms to the effect that (1) it has a root or unique maximal genus (here: anatomical entity) and (2) all other classes are connected to this root via finite chains of is_a relations satisfying a principle of single inheritance. A partonomy, in contrast, is a partial order in the mathematical sense, with top (here: organism – the class instantiated by mereologically maximal entities), to which all other classes are connected via chains of part_of relations.

We can then define the concepts of root and leaf of a taxonomy and top and bottom of a partonomy as follows.

D5 root(A) =def B (B is_a A) D6 leaf(A) =def B (B is_a A A = B)D7 top(A) =def

B (A = B or B part_of A) & not-B (A part_of B)D8 bottom(A) =def not-B (B part_of A).

http:// ifomis.de 95

We can then postulate axioms to the effect that every class includes some leaf as subclass, and that every instance of every class instantiates some leaf:

AB ( leaf(B) & B is_a A ) Ax ( inst(x, A) B (leaf(B) & inst(x, B) ) )

The taxonomical union AÈB of classes A and B is defined as the minimal class satisfying the condition that it contains both A and B as subclasses. Such a class always exists, since A and B are in any case subclasses of the root. The taxonomic union of femur and liver, for example, is organ. The partonomic union of two classes A+B is the class, if it exists, whose instances are sums x+y of instances of classes A and B respectively. While every pair of classes has a taxonomic union, only some classes have a partonomic union, since entities of the form x+y are instances of classes only in some highly restricted cases, for example: left lung = upper-lobe-of-left-lung + lower-lobe-of-left-lung. Such examples characteristically involve the phenomenon of fiat boundaries. [[15],[16]]

http:// ifomis.de 96

As concerns taxonomic intersection, a class is never immediately subordinated to more than one higher class within a tree. This means that if two classes overlap in sharing some common sub-class, then this is because one is a subclass of the other. AB, the taxonomic intersection of A and B, if it exists, is then simply the smaller of these two classes. We can add further an axiom to the effect that, if two classes are such as to overlap in sharing some common instances, then this, too, is because one is a subclass of the other:

x (inst(x, A) Ù inst(x, B)) A is_a B or B is_a A.Classes can overlap partonomically, on the other hand, in such a way that there is a

class which stands in the part_of relation to both, though neither stands in this relation to the other:

D9 A1 partonomic_overlap A2 = def A (A part_of A1 & A part_of A2).

For example: pelvis and vertebral column overlap in the sacrum and coccyx. Most classes in the biomedical domain do not overlap partonomically in this sense, yet it is this difference in behavior between taxonomic and partonomic overlap which captures the essential difference between the tree structure of taxonomies and the partial order structure of partonomies.

http:// ifomis.de 97

ConclusionPractitioners in the biomedical sciences move easily between the realm of classes and the realm of

instances existing in time and space. For historical reasons, however, work on biomedical ontologies and terminologies – which grew out of work on medical dictionaries and nomenclatures – has focused almost exclusively on classes (or ‘concepts’) atemporally conceived. This class-orientation is common in knowledge representation, and its predominance has led to the entrenchment of an assumption according to which all that need be said about classes can be said without appeal to formal features of instantiation of the sorts described above. This, however, has fostered an impoverished regime ofof definitions in which the use of identical terms in different systems has been allowed to mask underlying incompatibilities. Matters have not been helped by the fact that description logic, the prevalent framework for terminology-based reasoning systems, has with some recent exceptions (e.g. [[i]]) been oriented primarily around reasoning with classes.

Certainly if we are to produce information systems with the requisite computational properties, then this entails recourse to a logical framework like that of description logic. At the same time, however, we must ensure that the data that serves as input to such systems is organized formally in a way that sustains rather than hinders successful alignment with other systems. The way forward is to recognize, as does the FMA, that these are two distinct tasks, both of which are equally important to the construction of biomedical ontologies and terminologies.

http:// ifomis.de 98

The problem of ontology alignmentGOSCOPSWISS-PROTSNOMEDMeSHFMA

…all remain at the level of TERMINOLOGY (two reasons:

legacy of dictionaries + DL)What we need is a REFERENCE ONTOLOGY = a

formal theory of the foundational relations which hold TERMINOLOGY ONTOLOGIES and APPLICATION ONTOLOGIES together

http:// ifomis.de 99

Analogous distinctions required for nearly all foundational relations of ontologies and semantic

networks:A causes BA is associated with BA is located in Betc.

Reference to instances is necessary in defining mereotopological relations such as spatial occupation and spatial adjacency

http:// ifomis.de 100

Instances are elite individuals

Which classes (and thus which instances) exist in a given domain is a matter for empirical research.

Cf. Lewis/Armstrong “sparse theory of universals”

http:// ifomis.de 101

D extension(A) = {x | inst(x, A)}

D9 differentia(A) =def BC nearestspecies(B, C) & A B & A C & extension(C) = extension(B) extension(C)

http:// ifomis.de 102

The genus together with the differentia of a species constitutes the essence of the species.

differentia (A) not-class(A)

http:// ifomis.de 103

Axioms (Berg)

A1 lowestspecies(A) x inst(x, A) A2 lowestspecies(A) & lowestspecies(B) & A

B (not-x inst(x, A) & inst(x, B))A3 nearestspecies(A, B) & nearestspecies (A, C)

B = CA4 genus(A) & inst(x, A)

B nearestspecies(B, A) & inst(x, B) A5 nearestspecies(A, B) the extension of A

is a subset of the extension of B

http:// ifomis.de 104

Axioms (Berg)

genus(A) & inst(x, A) B nearestspecies(B, A) & inst(x, B)

EVERY GENUS HAS AN INSTANTIATED SPECIES

nearestspecies(A, B) the extension of A is a subset of the extension of B

EACH SPECIES HAS A SMALLER CLASS OF INSTANCES THAN ITS GENUS

http:// ifomis.de 105

Axioms (Berg)nearestspecies(B, A)

C (nearestspecies(C, A) & B CEVERY GENUS HAS AT LEAST TWO CHILDREN

nearestspecies(B, A) & nearestspecies(C, A) & B C) not-x (inst(x, B) & inst(x, C))

SPECIES OF A COMMON A8 There is no infinite sequence <A1, A2, …>

such that nearestspecies(Ai, Ai+1) for all i 1A9 There is no infinite sequence <A1, A2, …>

such that nearestspecies(Ai+1, Ai) for all i 1

http:// ifomis.de 106

Theorems (Berg)

T1 nearestspecies(A, B) the extension of A is a proper subset of the extension of B

T2 A x inst(x, A)T3 nearestspecies(A, B) not-C

(nearestspecies(A, C) & nearestspecies(C, B))T4 lowestspecies(A1) & lowestspecies(A2) &

nearestspecies(A1, B) not-C(nearestspecies (B, C) &

nearestspecies (C, A2)

http:// ifomis.de 107

Theorems (Berg)T5 (genus(A) & inst(x, A)) B

(lowestspecies(B) & B is_a A & inst(x, B))T6 (genus(A) & lowestspecies(B) & x (inst(x,

A) & inst(x, B)) B is_a A T7 A is_a B & A is_a C

(B = C or B is_a C or C is_a BT8 (genus(A) & genus(B) & x(inst(x, A) &

inst(x, B))) C(C is_a A & C is_a B) T9 class(A) & class(B) (A = B or A is_a B

or B is_a A or not-x(inst(x, A) & inst(x, B)))

http:// ifomis.de 108

WordNetNOT: wheel PART OF car

WordNet represents part-of quite sparingly It normally gives trivial holonymic relations which are just true by definition).

wheel PART OF wheeled vehicle steering wheel PART OF steering system

http:// ifomis.de 109

WordNetWith has_part relations it is more generous:

car, auto, automobile, machine, motorcar --    HAS PART: air bag    HAS PART: glove compartment    etc.

http:// ifomis.de 110

Circular definitions

and associated problems in general endemic in biomedical terminology systemsConfusion of use and mention

Confusion of concepts and objectsConfusion of concepts and classes Confusion of terms and objectsConfusion knowledge with what is knownConfusion of object-level with machine-levelSimple stupidity… all of which lead to poor coding

http:// ifomis.de 111

UMLS-SN

http:// ifomis.de 112

UMLS-SN Semantic Relations

Semantic Relation: affectsTUI: T151Definition: Produces a direct effect on. Implied here is the altering or influencing of an existing condition, state, situation, or entity. This includes has a role in, alters, influences, predisposes, catalyzes, stimulates, regulates, depresses, impedes, enhances, contributes to, leads to, and modifies.Inverse: affected_by

http:// ifomis.de 113

UMLS-SN Semantic Relations

Semantic Relation: carries_outTUI: T141Definition: Executes a function or performs a procedure or activity. This includes transacts, operates on, handles, and executes.Inverse: carried_out_by

http:// ifomis.de 114

UMLS-SN Semantic Relations

Semantic Relation: causesTUI: T147Definition: Brings about a condition or an effect. Implied here is that an agent, such as for example, a pharmacologic substance or an organism, has brought about the effect. This includes induces, effects, evokes, and etiology.Inverse: caused_by

http:// ifomis.de 115

UMLS-SN Semantic Relations

Semantic Relation: consists_ofTUI: T172Definition: Is structurally made up of in whole or in part of some material or matter. This includes composed of, made of, and formed of.Inverse: constitutes

http:// ifomis.de 116

UMLS-SN Semantic Relations

Semantic Relation: containsTUI: T134Definition: Holds or is the receptacle for fluids or other substances. This includes is filled with, holds, and is occupied by.Inverse: contained_in

http:// ifomis.de 117

UMLS-SN Semantic Relations

Semantic Relation: derivative_ofTUI: T178Definition: In chemistry, a substance structurally related to another or that can be made from the other substance. This is used only for structural relationships. This does not include functional relationships such as metabolite of, by product of, nor analog of.Inverse: has_derivative

http:// ifomis.de 118

UMLS-SN Semantic Relations

Semantic Relation: developmental_form_ofTUI: T179Definition: An earlier stage in the individual maturation of.Inverse: has_developmental_form

http:// ifomis.de 119

UMLS-SN Semantic Relations

Semantic Relation: evaluation_ofTUI: T161Definition: Judgment of the value or degree of some attribute or process.Inverse: has_evaluation

http:// ifomis.de 120

UMLS-SN Semantic Relations

Semantic Relation: exhibitsTUI: T145Definition: Shows or demonstrates.Inverse: exhibited_by

http:// ifomis.de 121

UMLS-SN Semantic Relations

Semantic Relation: functionally_related_toTUI: T139Definition: Related by the carrying out of some function or activity.Inverse: functionally_related_to

http:// ifomis.de 122

UMLS-SN Semantic Relations

Semantic Relation: indicatesTUI: T156Definition: Gives evidence for the presence at some time of an entity or process.Inverse: indicated_by

http:// ifomis.de 123

UMLS-SN Semantic Relations

Semantic Relation: ingredient_ofTUI: T202Definition: Is a component of, as in a constituent of a preparation.Inverse: has_ingredient

http:// ifomis.de 124

UMLS-SN Semantic Relations

Semantic Relation: issue_inTUI: T165Definition: Is an issue in or a point of discussion, study, debate, or dispute.Inverse: has_issue

http:// ifomis.de 125

UMLS-SN Semantic Relations

Semantic Relation: manifestation_ofTUI: T150Definition: That part of a phenomenon which is directly observable or concretely or visibly expressed, or which gives evidence to the underlying process. This includes expression of, display of, and exhibition of.Inverse: has_manifestation

http:// ifomis.de 126

UMLS-SN Semantic Relations

Semantic Relation: property_ofTUI: T159Definition: Characteristic of, or quality of.Inverse: has_property

http:// ifomis.de 127

UMLS-SN Semantic Relations

Semantic Relation: result_ofTUI: T157Definition: The condition, product, or state occurring as a consequence, effect, or conclusion of an activity or process. This includes product of, effect of, sequel of, outcome of, culmination of, and completion of.Inverse: has_result

http:// ifomis.de 128

UMLS-SN Semantic Relations

Semantic Relation: surroundsTUI: T176Definition: Establishes the boundaries for, or defines the limits of another physical structure. This includes limits, bounds, confines, encloses, and circumscribes.Inverse: surrounded_by

http:// ifomis.de 129

UMLS-SN Semantic Relations

Semantic Relation: traversesTUI: T177Definition: Crosses or extends across another physical structure or area. This includes crosses over and crosses through.Inverse: traversed_by

http:// ifomis.de 130

UMLS-SN Semantic Relations

Semantic Relation: performsTUI: T188Definition: Executes, accomplishes, or achieves an activity.Inverse: performed_by

http:// ifomis.de 131

UMLS-SN Semantic Relations

Semantic Relation: physically_related_toTUI: T132Definition: Related by virtue of some physical attribute or characteristic.Inverse: physically_related_to

http:// ifomis.de 132

UMLS-SN Semantic Relations

Semantic Relation: conceptually_related_toDefinition: Related by some abstract concept, thought, or idea.Inverse: conceptually_related_to

http:// ifomis.de 133

Prototypicality

Biological classes are marked always by an opposition between standard or prototypical instances and a surrounding penumbra of non-standard instances How solve this problem: restrict range of instance variables x, y, to standard instances?Recognize degrees of instancehood? (Impose topology/theory of vagueness on classes?)

http:// ifomis.de 134

Example: joint anatomyjoint HAS-HOLE joint spacejoint capsule IS-OUTER-LAYER-OF jointmeniscus

– IS-INCOMPLETE-FILLER-OF joint space– IS-TOPO-INSIDE joint capsule– IS-NON-TANGENTIAL-MATERIAL-PART-OF

jointjoint

– IS-CONNECTOR-OF bone X– IS-CONNECTOR-OF bone Y

synovia– IS-INCOMPLETE-FILLER-OF joint space

synovial membrane IS-BONAFIDE-BOUNDARY-OF joint space

http:// ifomis.de 135

SNOMED RT (2000)

already has description logic definitionsbut it also has some bad coding, which

derives from failure to pay attention to ontological principles:

e.g.both testes is_a testis

http:// ifomis.de 136

entity

physical conceptual object entity

organism anatomical structure

fully formed anatomical structure

body part, organ or organ component

http:// ifomis.de 137

Body System

Circulatory SystemNervous SystemImmune SystemMusculo-Skeletal Systemetc.

http:// ifomis.de 138

UMLS Semantic Network

entity event

physical conceptual object entity

http:// ifomis.de 139

P1. entities in different highest-level categories (independent continuant, dependent continuant, occurrent) should not be combined within a single class;

P2. objects should not be combined within a single class with the roles they play or with the functions they exercise;

P3. entities in reality should not be combined within a single class with our knowledge about or with our concepts of such entities;

P4. what is concrete (what exists in space and time and enters into causal relations) should not be combined within a single class with what is abstract (for example with abstract spatial regions, measures, and the like);

P5. classifications should respect the factor of time; for example classes should be assigned in a way that is consistent with the fact that continuant entities endure through time.

http:// ifomis.de 140

Anatomical Structure – (Embryonic Structure; Fully Formed Anatomical Structure; Anatomical Abnormality): Anatomical Structure is defined as: “A normal or pathological part of the anatomy or structural organization of an organism.” Note that in the phrase ‘structural organization’, the term ‘organization’ is not used in conformity with SN’s own definition (see below) as meaning ‘social organization’. Rather it is used to mean an entity’s Bauplan. The latter, however, would be, not a concrete three-dimensionally extended independent thing but rather some dependent abstract feature which gives shape and functionality to an entity of this sort. Then, however, it should not subsume liver or leukocyte.

http:// ifomis.de 141

The UMLS Semantic Network

is ‘an upper-level ontology … in which all concepts are given a consistent and semantically coherent representation’.

Alexa McCray, “An upper level ontology for the biomedical domain”. Comp Functional Genomics 2003; 4: 80-84.

http:// ifomis.de 142

Concepts

CEN/TC251 ENV 12264 : – This ENV is applicable to the description of the categorial structure of

systems of concepts supporting computer-based terminological systems, including coding systems, for health-care.

– concept : “unit of thought constituted through abstraction on the basis of properties common to a set of one or more referents”

BUT THEY NEVER IN FACT LOOK AT THE REFERENTS AT ALL!

ISO/TC215/N142: Health informatics —Vocabulary of terminology – The purpose of this International Standard is to define a set of basic

concepts required to describe and discuss formal representation of concepts and characteristics, for use especially in formal computer based concept representation systems.

– concept: “unit of knowledge created by a unique combination of characteristics”

THEY ARE ALREADY TWO LEVELS REMOVED FROM THE REFERENT!

http:// ifomis.de 143

Chemical

Chemical ChemicalViewed Viewed

Structurally Functionally

Inorganic Organic Enzyme Biomedical or Chemical Chemical Dental Material

http:// ifomis.de 144

Genbank

gene =df

DNA region of biological interest with a name and that carries a genetic trait or phenotype

http:// ifomis.de 145

Why is this a problem?

All biomedical ontologies and terminology systems must make themselves conform to the UMLS Semantic Network

Foundational Model of Anatomy divides body parts into physical entities and conceptual entities ‘with some regret’ …

http:// ifomis.de 146

Entities

http:// ifomis.de 147

Entities

universals (classes, types, roles …)

particulars (individuals, tokens, instances …)

Axiom: Nothing is both a universal and a particular

http:// ifomis.de 148

Two Kinds of Elite Entities

classes, within the realm of universals

instances within the realm of particulars

http:// ifomis.de 149

Entities

classes

http:// ifomis.de 150

Entities

classes*

*natural, biological

http:// ifomis.de 151

Entities

classes of objects

different axioms for classes of functions, processes, etc.

http:// ifomis.de 152

Entities

classes

instances

http:// ifomis.de 153

Classes are natural kinds

Instances are natural exemplars of natural kinds(problem of non-standard instances must be dealt with also)

http:// ifomis.de 154

Entities

classes

instancesinstances

penumbra of borderline cases

http:// ifomis.de 155

Entities

classes

instancesjunkjunk

junk

example of junk: beachball desk

http:// ifomis.de 156

Primitive opposition between universals and particulars

variables A, B, … range over universalsvariables x, y, … range over particulars

http:// ifomis.de 157

Primitive relations: inst and part

inst(Jane, human being)part(Jane’s heart, Jane’s body)

A class is anything that is instantiatedAn instance as anything (any individual) that

instantiates some class

http:// ifomis.de 158

Entities

human

Jane

inst

http:// ifomis.de 159

Entities

human

Jane’s heart part Jane

http:// ifomis.de 160

Axioms for part

Axioms governing part (= ‘proper part’) (1) it is irreflexive (2) it is asymmetric (3) it is transitive (+ usual mereological axioms)

part is the usual mereological relation among individuals

http:// ifomis.de 161

Definitions

class(A) =def x inst(x, A)

instance(x) = defA inst(x, A)

Theorem: Nothing can be both an instance and a class

http:// ifomis.de 162

Axiom of Extensionality

Classes which share identical instances are identical(need to take care of the factor of time)

http:// ifomis.de 163

Entities

classes

x, y, …

differentiae (roles, qualities…)

http:// ifomis.de 164

Differentiae

Aristotelian Definitions An A is a B which exemplifies C

C is a differentiaNo differentia is a classexemp(individual, differentia)exemp(Jane, rationality)objects exemplify roles

http:// ifomis.de 165

role

http:// ifomis.de 166

A is_a B genus(A)

species(A)

classes

instances

http:// ifomis.de 167

A is_a B =def x (inst(x, A) inst(x, B))

genus(A)=def B (B is_a A & B A) species(A)=def B (A is_a B & B A)

classes

instances

http:// ifomis.de 168

nearest species

nearestspecies(A, B)=def A is_a B &

C ((A is_a C & C is_a B) (C = A or C = B)

B

A

http:// ifomis.de 169

Definitions

highest genus

lowest species

http:// ifomis.de 170

lowest species and highest genus

lowestspecies(A)=def

species(A) & not-genus(A)

highestgenus(A)=def

genus(A) & not-species(A)

Theorem:class(A) genus(A) or lowestspecies(A)

Recommended