
Hierarchical Annotation of Medical Images

Ivica Dimitrovski¹, Dragi Kocev², Suzana Loškovska¹, Sašo Džeroski²

¹Department of Computer Science, Faculty of Electrical Engineering and Information Technologies, Skopje, Macedonia

²Department of Knowledge Technologies, Jožef Stefan Institute, Ljubljana, Slovenia

Overview

• Introduction and problem definition
• Feature extraction
  – Edge Histogram Descriptor (EHD)
• Classifier
  – PCTs for HMLC
• Experiments and results
• Conclusions and future work

Introduction

• The amount of medical images is constantly growing
• The cost of manually annotating these images is very high
  – Automatic image annotation algorithms are needed to perform the task reliably
• Feature extraction from the images
• A classifier to distinguish between the different classes
• Application: multilingual image annotations and corrections of DICOM standard headers

IRMA code

• IRMA coding system: four axes, each position marked with {0, …, 9, a, …, z}
  – T (Technical): image modality
  – D (Directional): body orientation
  – A (Anatomical): body region
  – B (Biological): biological system
• IRMA code: TTTT – DDD – AAA – BBB
• The code is strictly hierarchical
  – Example:
    2    cardiovascular system
    21   cardiovascular system; heart
    216  cardiovascular system; heart; aortic valve

IRMA code - example

• IRMA code: 1123-211-520-3a0
  – 1123 (x-ray, projection radiography, analog, high energy)
  – 211 (sagittal, left lateral decubitus, inspiration)
  – 520 (chest, lung)
  – 3a0 (respiratory system, lung)
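Since each additional character refines its prefix, an image's IRMA code implicitly carries every prefix of every axis as a label, which is exactly what makes the task hierarchical multi-label classification. A minimal Python sketch of this expansion (the function name and label representation are illustrative assumptions, not part of the IRMA system):

```python
def irma_labels(code):
    """Expand an IRMA code such as '1123-211-520-3a0' into the set of
    hierarchical labels it implies: every non-empty prefix of every axis."""
    axes = ("T", "D", "A", "B")
    labels = set()
    for axis, part in zip(axes, code.split("-")):
        for i in range(1, len(part) + 1):
            labels.add(f"{axis}:{part[:i]}")  # e.g. 'A:5', 'A:52', 'A:520'
    return labels

print(sorted(irma_labels("1123-211-520-3a0")))
```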

Feature extraction

• Obtain features that describe the visual content of an image
• Histogram of local edges (a sketch follows this list)
  – Marks the points in a digital image at which the luminous intensity changes sharply
  – Reduces the amount of data to be processed, while retaining important information about the shapes of objects in the image
  – Captures the frequency and the directionality of the brightness changes in the image
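The slides do not spell out the descriptor computation, so here is a minimal sketch of the MPEG-7-style Edge Histogram Descriptor that the 80 features per image (see the experimental design) point to: 4×4 sub-images × 5 edge types. The block count, the edge-strength threshold (which assumes 0-255 intensities), and the normalization are assumptions of this sketch:

```python
import numpy as np

# 2x2 edge filters from the MPEG-7 EHD: vertical, horizontal,
# 45-degree diagonal, 135-degree diagonal, non-directional.
FILTERS = [
    np.array([[1.0, -1.0], [1.0, -1.0]]),               # vertical
    np.array([[1.0, 1.0], [-1.0, -1.0]]),               # horizontal
    np.array([[np.sqrt(2), 0.0], [0.0, -np.sqrt(2)]]),  # 45 degrees
    np.array([[0.0, np.sqrt(2)], [-np.sqrt(2), 0.0]]),  # 135 degrees
    np.array([[2.0, -2.0], [-2.0, 2.0]]),               # non-directional
]

def edge_histogram(image, blocks=8, threshold=11.0):
    """80-bin EHD: split the image into 4x4 sub-images; in each,
    small blocks vote for their dominant edge type (5 bins)."""
    h, w = image.shape
    sh, sw = h // 4, w // 4
    descriptor = []
    for i in range(4):
        for j in range(4):
            sub = image[i * sh:(i + 1) * sh, j * sw:(j + 1) * sw]
            bins = np.zeros(5)
            bh, bw = sh // blocks, sw // blocks
            for bi in range(blocks):
                for bj in range(blocks):
                    block = sub[bi * bh:(bi + 1) * bh, bj * bw:(bj + 1) * bw]
                    qh, qw = bh // 2, bw // 2
                    # mean intensities of the block's four quadrants
                    q = np.array([
                        [block[:qh, :qw].mean(), block[:qh, qw:].mean()],
                        [block[qh:, :qw].mean(), block[qh:, qw:].mean()],
                    ])
                    strengths = [abs((f * q).sum()) for f in FILTERS]
                    k = int(np.argmax(strengths))
                    if strengths[k] > threshold:  # ignore near-uniform blocks
                        bins[k] += 1
            total = bins.sum()
            descriptor.extend(bins / total if total else bins)
    return np.asarray(descriptor)  # 16 sub-images x 5 bins = 80 values
```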

Feature extraction

[Three figure-only slides illustrating the feature extraction process; the images are not recoverable in this text version.]

Classification Methodology

• Predictive Clustering Trees (PCTs) framework (Blockeel et al. Top-down induction of clustering trees. In Proc. of the 15th ICML, pp. 55-63, 1998)
• Ensemble methods to improve the predictive performance
  – Bagging (L. Breiman. Bagging predictors. Machine Learning, vol. 24, issue 2, pp. 123-140, 1996)
  – Random Forests (L. Breiman. Random Forests. Machine Learning, vol. 45, pp. 5-32, 2001)

Predictive Clustering Trees (PCTs)

• A tree is a hierarchy of clusters
• Standard top-down induction of decision trees (TDIDT) algorithm
  – The best acceptable attribute-value test is placed in each node
• The heuristic for selecting the tests is the reduction in variance in the induced subsets (see the sketch after this list)
  – This maximizes cluster homogeneity and improves predictive performance
• PCTs can handle different types of target concepts: multiple targets, time series, hierarchies
  – Through the instantiation of the variance and prototype functions
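A minimal sketch of the variance-reduction split selection just described, for vector-valued targets such as label vectors; the exhaustive threshold search and the function names are illustrative simplifications, not the actual CLUS implementation:

```python
import numpy as np

def variance(labels):
    """Variance of a set of label vectors: the average squared distance
    between each vector and the set's mean vector."""
    if len(labels) == 0:
        return 0.0
    mean = labels.mean(axis=0)
    return float(((labels - mean) ** 2).sum(axis=1).mean())

def best_split(X, Y):
    """Select the attribute-value test with the largest variance
    reduction in the induced subsets (the PCT/TDIDT heuristic)."""
    n, best = len(Y), None
    for attr in range(X.shape[1]):
        for value in np.unique(X[:, attr]):
            left = X[:, attr] <= value
            right = ~left
            if not left.any() or not right.any():
                continue  # the test must actually split the set
            gain = variance(Y) - (left.sum() / n * variance(Y[left])
                                  + right.sum() / n * variance(Y[right]))
            if best is None or gain > best[0]:
                best = (gain, attr, value)
    return best  # (variance reduction, attribute index, threshold)
```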

Hierarchical Multi-Label Classification

• HMLC: an example can be labeled with multiple labels that are organized in a hierarchy

  – Example label set: { 1, 2, 2.2 }, where label 2.2 is a child of label 2 in the hierarchy


• Variance instantiation (formalized below):
  – the average squared distance between each example's label vector and the set's mean label vector
  – the i-th component of the arithmetic mean of a set of such vectors is the proportion of examples in the set belonging to class c_i
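Written out, with v_k the 0/1 label vector of example k over the classes c_1, …, c_m (this is the unweighted form matching the description above; the CLUS-HMC literature additionally weights classes by their depth in the hierarchy, which is not shown here):

$$\mathrm{Var}(S) = \frac{1}{|S|} \sum_{k=1}^{|S|} \left\lVert v_k - \bar{v} \right\rVert^2, \qquad \bar{v}_i = \frac{1}{|S|} \sum_{k=1}^{|S|} v_{k,i}$$

so that the i-th component of the mean, \bar{v}_i, is exactly the proportion of examples in S labeled with class c_i.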

Ensemble methods

• Ensemble: a set of classifiers
• A new example is classified by combining the predictions of all classifiers in the ensemble
  – Regression: average
  – Classification: majority vote
• Bagging
• Random Forests

Ensemble methods

[Figure, shown on four consecutive slides: the bagging schema. The training set is resampled into n bootstrap replicates (1, 2, 3, …, n); a (randomized) decision tree algorithm is run on each replicate, yielding n classifiers L1, L2, L3, …, Ln; for each example in the test set, the n predictions are combined by a vote.]
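A minimal sketch of the schema in the figure; `learn_tree` stands in for the PCT learner and is an assumption of this sketch, as is averaging the label-vector predictions (the regression-style combination mentioned above):

```python
import numpy as np

def bagging(X, Y, learn_tree, n_trees=100, seed=0):
    """Learn one tree per bootstrap replicate of the training set."""
    rng = np.random.default_rng(seed)
    forest = []
    for _ in range(n_trees):
        idx = rng.integers(0, len(X), size=len(X))  # bootstrap replicate
        forest.append(learn_tree(X[idx], Y[idx]))   # (randomized) tree learner
    return forest

def predict(forest, x):
    """Combine the n predictions: averaging the per-class label vectors
    gives, for each class, the proportion of trees voting for it."""
    return np.mean([tree(x) for tree in forest], axis=0)
```

For Random Forests, `learn_tree` would additionally restrict each split to a random subset of the features; the feature subset size of 7 in the experimental design is that parameter.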

Experimental design

• Goal: provide the IRMA code for an image
• Data
  – ImageCLEF 2008
  – 12,076 images sorted into 197 classes
  – 82 classes have fewer than 10 elements (129 images)
  – Each image is described with 80 features
• Feature extraction
  – Contrast enhancement
  – Histogram equalization

Experimental design

• Classifier
  – Number of classifiers: 100 unpruned trees
  – Random Forests feature subset size: 7 (logarithmic in the number of features)
• Comparison of the performance of a single tree and an ensemble
  – Precision-recall (PR) curves and the area under the PR curve (AUPRC); the PR computation is sketched after this list
  – 10-fold cross-validation
• Two scenarios:
  1) Each axis is a separate dataset (4 in total)
  2) A single dataset for all axes
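The slides do not detail how the PR curves are computed; a common choice for HMLC, and a plausible reading here, is to sweep a threshold over the ensemble's per-class probabilities with micro-averaging across classes. The following function is an illustrative sketch, not the evaluation code actually used:

```python
import numpy as np

def pr_points(prob, truth, thresholds=np.linspace(0.0, 1.0, 101)):
    """Micro-averaged precision-recall points.

    prob:  [n_examples, n_classes] predicted class probabilities
    truth: [n_examples, n_classes] 0/1 ground-truth label matrix
    """
    points = []
    for t in thresholds:
        pred = prob >= t
        tp = np.logical_and(pred, truth == 1).sum()
        # convention: precision is 1.0 when nothing is predicted
        precision = tp / pred.sum() if pred.sum() else 1.0
        recall = tp / truth.sum()
        points.append((recall, precision))
    return points  # the area under these points is the AUPRC
```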

Results per axis

[Two slides of per-axis results; the figures are not recoverable in this text version. The AUPRC values appear in the Discussion below.]

Results for all axes

[Results figure not recoverable in this text version.]

Discussion

• Ensembles increase the predictive performance compared to a single tree
• Excellent performance for axes T and B (AUPRC of 0.9994 and 0.9862)
  – The hierarchies for axes T and B contain only a few nodes (9 and 27, respectively)
• The classifiers for axes A and D also have high predictive performance (AUPRC of 0.8264 and 0.9064)
  – The hierarchies for axes A and D contain 110 and 36 nodes, respectively
• Predicting the complete hierarchy at once yields improvements

Summary

• Medical image annotation using Hierarchical Multi-Label Classification (HMLC)

• Local Edge Histogram Descriptor (EHD) to represent grayscale radiological (X-ray) images

• Images annotated with IRMA code

• Ensembles of PCTs for HMLC as classifier

Future work

• Other algorithms for feature extraction:
  – SIFT, TAMURA, Scale, Color Histogram, …
• Combination of the features obtained from different techniques:
  – Each technique captures different aspects of an image
• Extension of the classification algorithm:
  – Distance measures for hierarchies
  – Learning under covariate shift
