CHAPTER 2 A REVIEW ON BREAST ABNORMALITY …shodhganga.inflibnet.ac.in/bitstream/10603/26433/7/07_chapter 2.pdf · A REVIEW ON BREAST ABNORMALITY SEGMENTATION AND CLASSIFICATION TECHNIQUES

13

CHAPTER 2

A REVIEW ON BREAST ABNORMALITY

SEGMENTATION AND CLASSIFICATION TECHNIQUES

Segmentation or abnormality detection is the initial step in

mammographic Computer Aided Detection (CAD) system. The review on

different approaches to the segmentation and classification of mammographic

masses and microcalcifications are described in this chapter. It also describes

main features and differences of these approaches. The key objective is to

point out the advantages and disadvantages of these approaches.

2.1 INTRODUCTION

A segmentation algorithm is used to detect region of interest,

usually the part of breast or a specific kind of abnormalities like

microcalcifications or masses. It is generally known that the detection of

masses is technically difficult, because masses can be simulated or obscured

by normal breast parenchyma. Moreover, there is an outsized variability in

these lesions, which is reflected in the morphology features (shape and size of

the lesions). Variations exist in large number of features that have been used

to detect and classify them.

Microcalcifications are considered to be important indicators of

breast cancer. However, its interpretation is very difficult and 10% - 30% of

breast microcalcifications are missed during routine screening (Bird et al

1992, Burhenne et al 1994). It appears as tiny objects which can be described

14

as granular, linear, or irregular on mammograms. Although they have higher

inherent attenuation properties, they cannot be distinguished from the high

frequency noise because of their smaller size. The microcalcifications

typically range in size from 0.1 to 1.0 mm (Thor Ole Gulsrud and John Hakon

Husoy, 2001). Microcalcifications often appear in a heterogeneous

background describing the structure of the breast tissue. Some elements of the

background, like dense tissue, could also be brighter than the

microcalcifications in the fatty part of the breast. The Regions of Interest

(ROI’s) may be of low contrast. The intensity difference between suspicious

areas and their surrounding tissues can be quite slim. Dense tissues may be

easily misinterpreted as calcifications yielding a high False Positive (FP) rate,

which is a major problem with most of the algorithms.

In this chapter, lesion segmentation and classification algorithms

are reviewed in detail. In section 2.2, the algorithms which look for

segmentations of masses and microcalcifications using mammographic

images are described. Section 2.3 describes the literature study on

mammographic classification. The evaluation methodology is given in section

2.4. The summary of reviews is given in section 2.5.

2.2 BREAST ABNORMALITY SEGMENTATION

Segmentation of breast abnormality relies on the fact that pixels

inside a mass or microcalcifications have different characteristics from the

other pixels within the breast area. The characteristics used can be gray level

values, texture features, shape features or morphological features of the

lesions. The outcome of segmentation of image is a set of segments that

collectively cover the entire image, or a set of edges extracted from the image.

Each one of the pixels in a region is related with respect to some

characteristics or computed property, such as color, intensity, or texture.

15

Neighboring regions are considerably different with respect to the

characteristic(s).

In computer vision terminology, segmentation techniques can be

divided into unsupervised and supervised approaches. Supervised

segmentation, also known as model-based segmentation, relies on prior

knowledge about the object and background regions to be segmented. The

prior information is used to determine if specific regions are present within an

image or not. Alternatively, unsupervised segmentation partitions an image

into a set of regions which are distinct and uniform with respect to specific

properties, such as grey features, texture or colour features. Classical

methods used to solving unsupervised segmentation are divided in three major

groups (Fu and Mui, 1981). These are region based methods, contour based

methods and clustering methods. The rows of Table 2.1 shows the reviewed

works arranged according to their unsupervised segmentations. A detailed

description of the methods in each category is given in subsequent sections.

2.2.1 Region Based Methods

The main goal of segmentation is to partition an image into regions.

Region based segmentation relies on the principle of homogeneity, which means

there should be at least one feature which remains uniform for all pixels within a

region. The basic formulation for Region-based segmentation is:

a) 1

n

i iR R (2.1)

b) is a connected region, i=1,2,...,niR

c) iR for all i=1,2,...,njR (2.2)

d) ( ) for i=1,2,...,niP R TRUE (2.3)

e) i i jP(R ) for any adjacent region R and RjR FALSE (2.4)

16

where,

R – Region, n – number of pixels, - null set, ( )iP R - logical predicate.

More than 30 years have passed, since Zucker reviewed region

growing algorithms (Zucker 1976). Region growing is based on the

propagation of an initial seed point according to a specific homogeneity

criterion, iteratively increasing the size of the region. Since then, region

growing has seen a number of improvements, primarily due to the integration

of boundary information in the algorithm. Region growing algorithms have

been widely used in mammographic mass segmentation with the aim of

extracting the potential lesion from its background. Since early nineties,

researchers from the University of Chicago investigated the introduction of

shape information into the homogeneity criterion.

William Mark Morrow et al (1992) have developed an adaptive

method for enhancing the contrast of mammographic features of varying size

and shape. The method uses each pixel in the image as a seed to grow a

region. The size and shape of the region adapt to local image gray-level

variations, corresponding to an image feature. The contrast of every region is

calculated with respect to its individual background. Contrast is, then,

improved by applying an empirical transformation based on each region’s

seed pixel value, its dissimilarity, and its background.

With the aim to integrate the radiologist’s experiences, Huo et al

(1995) have developed a semi-automatic region growing approach. In this

approach, the growing step was automatically computed after a radiologist

had manually placed the seed point. Later, Matthew Kupinski and Maryellen

Giger (1998) have developed radial gradient index method and a probabilistic

method for segmentation of lesions. These methods are seeded segmentation

methods. In both methods, a series of image partitions is created using gray-

level information as well as prior knowledge of the shape of typical mass

lesions.

17

Naga Mudigonda et al (2001) have developed a method for the

detection of masses in mammographic images. The method employs Gaussian

smoothing and sub sampling operations as initial processing steps. The mass

portions are segmented by establishing intensity links from the central

portions of masses into the surrounding areas. Chu et al (2002) have

proposed a region growing approach that represented as a growing tree whose

root is the selected seed. Active leaves are removed in the connection area

between adjacent regions to avoid merging adjacent structures. The authors

affirmed that this graph-based segmentation more closely matches

radiologists’ outlines of masses.

Jiang et al (2007) have developed a genetic algorithm to

automatically classify and detect microcalcification clusters. The genetic

algorithm technique is characterized by transforming input images into a

feature domain. Here, every pixel is represented by its mean and standard

deviation inside a surrounding window of size 9×9 pixel. In the feature

domain, chromosomes are constructed to populate the initial generation. The

features are extracted to enable the genetic algorithm to search for optimized

classification and detection of microcalcification.

Peter Filev et al (2008) have developed a computerized regional

registration and characterization system for analysis of microcalcification

clusters. The system consists of two stages. In the initial stage, a regional

registration procedure is used to identify the local area that may contain the

cluster. A search program is used to detect cluster candidates within the local

area. In the second stage, a temporal classifier is used to classify the region.

Alfonso et al (2009) have developed two methods called dynamic-

programming-based method and a constrained region-growing method to

segment the mass contours. The simplified versions of these contours were

employed to extract a set of six features designed for characterization of mass

18

margins (contrast between foreground and background region, two measures

of the fuzziness of mass margins, coefficient of deviation of edge strength, a

measure of spiculation based on edge-signature information and a measure of

spiculation based on relative gradient orientation). Three accepted classifiers

(Fisher's linear discriminant, Bayesian classifier and a support vector

machine) were used to predict the lesions.

Claudio Marrocco et al (2010) have developed a novel system for

detection of clusters of microcalcifications. The mammogram first extracts the

elementary homogeneous regions of interest in this system. An analysis of

such regions is then performed by means of a two-stage, coarse-to-fine

categorization based on both heuristic rules and classifier combination. Giulia

Rabottino et al (2011) have developed a region growing technique to segment

the lesion region. The shape and texture features are extracted, and fuzzy

classifier is used to classify the lesion regions.

2.2.2 Contour Based Methods

Image segmentation techniques based on contour based method

have been used in the early work of Roberts (1965). Contour-based

approaches usually start with edge detection, followed by a linking process

that seeks to exploit curvilinear continuity. However, the identification of

regions based on the edge detection is far from trivial. The algorithms for

edge detection do not usually possess the ability of the human vision system

to complete interrupted edges. Therefore, sometimes edges which are not the

transition from one region to another are detected. The properly detected

edges often present gaps at places where the transitions between regions are

not abrupt enough. Hence, detected edges might not essentially form a set of

closed connected curves that surround distinct regions.

19

The Table 2.1 shows number of publications is trying to detect

masses in mammogram using contour based methods. The algorithms for

finding edges are based on filtering the image in order to enhance relevant

edges prior to the detection stage. The location of edges, in Petrick et al (1996),

is based on a Gaussian–Laplacian edge detector, after which the image is

enhanced by an adaptive density-weighted contrast enhancement filter.

Judy Kilday et al (1993) have developed an interactive

segmentation procedure to identify the tumor boundary using a thresholding

technique. The several features are extracted based on the gross and fine

shape describing properties of the tumor boundaries. Joachim Dengler et al

(1993) have used a two-stage algorithm for spot detection and shape

extraction. The topology and the number of the spots are determined by using

weighted difference of Gaussians filter and the shape by means of

morphological filters. Parr et al (1994) used Gabor filters to locate the

spicules of stellated lesions.

Kobatake and Yoshinaga (1996) have described an approach, which

starts with a sub-image containing a possible mass lesion. It looks for spicules

using gradient information in three steps: First, the morphological line-

skeletons are extracted in order to detect long and thin anatomical structures

(like spicules). Second, a modified hough transform is used to extract lines

passing near the centre of the mass, and finally the algorithm automatically

select objects based on the number of line skeletons that satisfy the second step.

Fauci et al (2005) and Cascio et al (2006) have looked for the

contours of the mass using an iterative algorithm. At each local maxima a

threshold was selected which is used to draw an intensity contour. The

threshold value is based on user interaction and histogram information.

Consequently, the area of the selected region is refined by adjusting the

threshold. Stelios Halkiotis et al (2007) have used the mathematical

20

morphology tools for the extraction of microcalcifications even if the

microcalcifications are located on a non-uniform background.

Table 2.1 Mammographic unsupervised segmentation techniques

Author Class

Features

Texture ShapeGray

LevelGradient

Region Based Methods

Huo et al (1995) Mass

Matthew Kupinski et al (1998) Mass

Mudigonda et al (2001) Mass

Peter Filev et al (2008) MC

Alfonso et al (2009) Mass

Claudio Marrocco et al (2010) MC

Giulia Rabottino et al (2011) Mass

Chu et al (2002) Mass

Contour-Based Methods

Joachim Dengler (1993) MC

Judy Kilday et al (1993) Mass

Parr et al (1994) Mass

Petrick et al (1996) Mass

Kobatake and Yoshinaga (1996) Mass

Fauci et al (2005) Mass

Cascio et al (2006) Mass

Stelios Halkiotis et al (2007) MC

Clustering Methods

Kobatake and Murakami (1996) Mass

Li et al (1995) Mass

Heng-Da cheng et al (1998) MC

Lei Zheng and Chan (2001) Mass

Wei Qian et al (2002) MC

Cao et al (2008) Mass

Suliga et al (2008) Mass

21

2.2.3 Clustering Methods

Clustering methods are one of the most commonly used techniques

for image segmentation, as discussed in Jain et al (1999). The lesion detection

and segmentation can be seen from the reviewed approaches shown in

Table 2.1. Based on the work of Jain et al (1999), clustering techniques can be

divided into hierarchical and partitional algorithms. The hierarchical methods

produce a nested series of partitions, while partitional methods produce only a

single partition. The hierarchical methods can be more accurate even in small

data sets. The partitional methods are used in applications involving large

datasets, like the ones related to images.

The traditional partitional clustering algorithm is the K-means

algorithm (MacQueen 1967), which is characterized by simple

implementation and low complexity. The Fuzzy C-Means (FCM) algorithm

(Bezdek, 1981) is an extension of the K-means algorithm which allows each

pattern of the image to be associated with every cluster using a fuzzy

membership function. Chang wen chen et al (1998) have developed a robust

segmentation algorithm for three dimensional image data. The algorithm

based on a novel combination of adaptive K-means clustering and knowledge

based morphological operations.

Heng-Da Cheng et al (1998) have used a novel approach to

microcalcification detection based on fuzzy logic technique. In this approach,

microcalcifications are first enhanced based on their brightness and non-

uniformity. Then, the irrelevant breast structures are excluded by a curve

detector. Finally, microcalcifications are located using an iterative threshold

selection method. The shapes of microcalcifications are reconstructed and the

22

isolated pixels are removed by employing the mathematical morphology

technique.

In contrast to fuzzt C-means which improves K-means using a

fuzzy approach of the energy function, the Dogs and Rabbit (DaR) algorithm

performs a more robust seed placement. The DaR was used by Lei Zheng and

Chan (2001) to obtain an initial set of regions which subsequently were used

to initialize a Markov Random Field (MRF) approach.

Andrew et al (2001) have proposed a method called discrete values

clustering algorithm with applications to bio-molecular data. Tapas Kanungo

et al (2002) have developed a simple and efficient implementation of

Lioyd’s K-means clustering algorithm for image analysis. Wei Qian et al

(2002) have developed a distance-based and dense-to-sparse grouping method

for detection of microcalcification clusters. The grouping result should be

independent of the size, shape and orientation of real clusters. The

application, namely cluster-oriented analysis including an adaptive

segmentation method and cluster level feature extraction scheme is employed.

Ng et al (2006) have developed a method for medical image segmentation

using K-means clustering and improved watershed algorithm.

Cao et al (2008) have investigated a robust information clustering

(RIC) algorithm incorporating spatial information for breast mass. The

detection system employs RIC algorithm based on the raw region of interest

extracted from global mammogram by two steps of adaptive thresholding.

Pixels on the fuzzy margin of a mass and noisy data were identified by RIC.

The identified pixels (outliers) were recalculated by incorporating spatial

distance information. It takes into account of the influence of a neighborhood

of 3x3 window. The nine texture features are extracted from the region of

23

interest and SVM classifier is used to further classification of the region of

interest.

Suliga et al (2008) have developed a new pixel based clustering

method for the analysis of digital mammograms. The image pixels are

described only by their intensity (gray level). Therefore, the available

information is limited to one dimension. The Markov random field based

technique is suitable for performing clustering in an environment which is

described by poor or limited data. This method is a statistical classification

method that labels the image pixels based on the description of their statistical

and contextual information.

2.2.4 Model Based Methods

The model based segmentation methods initially train the system to

detect specific objects. Subsequently, the system has to be able to detect and

classify new images depending on the presence or absence of similar object.

The training system covers examples with and without the object (lesion)

present. Thus:

From mammograms containing a lesion, the system learns the

probable location and the variation in shape and size of the

lesion.

From mammograms not containing a lesion, the systems can

learn features that represent normal tissue.

Based on both training aspects, the system learns what features to

look for when presented with a new image. Table 2.2 shows publications

based on such strategy.

24

One of the most commonly used model-based segmentation

methods is pattern matching. In pattern matching, the training is usually based

on images containing the object to detect. Pattern matching has been used in

segmentation of mammographic images by Lai et al (1989) and by

Constantinidis et al (2001). Freixenet et al (2008) have developed a model to

use a probabilistic template matching method to detect lesions. The shape and

deformations of a deformable template were learnt from real mass examples.

Subsequently, a Bayesian scheme was used to adapt the learnt deformable

template to the real contours of the mammogram. On the other hand,

Hatanaka et al (2001) effectively used the same approach to detect masses

with a partial loss of region, i.e. those masses located at the border of the

image or on the boundary between the pectoral muscle and the breast.

Tianhu Lei and Wilfred Sewchand (1992) have developed an

unsupervised stochastic model based image segmentation technique for X-ray

computed tomography image. This model utilizes the finite normal mixture

distribution and the underlying Gaussian random field as the stochastic image

model. The number of tissue classes in the observed image is detected by

information theoretic criteria. The parameters of the model are estimated by

expectation-maximization and classification-maximization algorithms. Image

segmentation is performed by Bayesian classifier.

Zhengrong Liang et al (1994) have developed a statistical

procedure to classify tissue types and to segment the corresponding tissue

regions. The strategy assumes that the distribution of image intensities related

with each tissue type can be expressed as a multivariate likelihood function.

The procedure, further assumes that the underlying tissue regions are

piecewise contiguous and can be characterized by a markov random field

prior. In classifying the tissue types, the strategy models, the likelihood of

25

realizing the images as a finite multivariate-mixture function. The class

parameters associated with the tissue types are estimated by maximum

likelihood. The estimation fits the class parameters to the image data via

expectation-maximization algorithm. The number of classes related with the

tissue types is determined by the information criterion of minimum

description length. The procedure segments, the tissue regions, are given the

estimated class parameters, by maximum a posteriori probability.

One of the approaches using model based strategy was the work of

Nico Karssemeijer and te Brake (1996). They initially found spicules using

second order Gaussian derivatives operators. If a line-like structure is present

at a given site, the method provides an estimation of the orientation of these

structures, whereas in other cases the image noise will generate a random

orientation. With this information they constructed two new features that

formed the input for the classification stage.

Chang et al (1996) have developed a simple method for detecting

suspicious regions based on five rules. The selected regions should contain:

(1) a global maximum in a Gaussian smoothed image; (2) a local maximum in

the original image; (3) a local maximum in the image coming from the

subtraction of two smoothed images (one using a Gaussian filter and the other

using a box filter); and either (4) a small suspicious region of low contrast; or

(5) a small suspicious region of high contrast. There is a series of approaches

which model the masses using statistical approaches. For instance, Reyer

Zwiggelaar et al (1999) detected spiculated lesions as the union of two

techniques: the first one modeled the centre of the mass using a directional

recursive median filter, while the second technique modeled the surrounding

pattern of linear structures applying a multi-scale directional line detector.

26

The combination of both the methods results in a probability image, and the

detection is performed by thresholding the resulting probability image.

Huai Li et al (2001) first applied an image enhancement algorithm

using morphological filtering. Subsequently, they employed a finite

generalized Gaussian mixture distribution to model the histogram. They

incorporated the EM algorithm to determine the optimal number of image

regions and the kernel shape in the finite generalized Gaussian mixture model.

The final step was the use of Bayesian relaxation labeling to perform the

selection of suspected masses. In a recent approach, Szekely et al (2006) used

a decision tree to classify a sliding window to contain mass or normal tissue.

Consequently, a markov random field is used to refine the obtained

segmentation.

Table 2.2 Mammographic model based segmentation techniques

Author

Features

Texture ShapeGray

LevelGradient

Lai et al (1989)

Nico Karssemeijer and te Brake (1996)

Chang et al (1996)

Reyer Zwiggelaar et al (1999)

Constantinidis et al (2001)

Hatanaka et al (2001)

Huai Li et al (2001)

Szekely et al (2006)

Oliver et al (2006)

Freixenet et al (2008)

27

2.2.5 Other Segmentation Methods

Shuk-Me1 Lai et al (1989) have developed a method for detecting

circumscribed masses in mammograms. It relies on a combination of criteria

used by experts including the shape, brightness, contrast, and uniform density

of mass areas. The method uses modified median filtering to enhance

mammogram images and template matching to detect masses. In the template

matching step, suspicious areas are picked by thresholding the cross-

correlation values and a percentile method is used to determine a threshold for

each film. In addition, neighborhood test and histogram test are designed to

remove false alarms from the resulting candidates.

Nafi Gurcan et al (1997) have developed a new method for

detection of microcalcifications. In this method, the mammogram image is

first processed by a sub-band decomposition filter bank. The band pass sub-

image is divided into overlapping square regions in which skewness and

kurtosis as measures. The detection method utilizes these two parameters. A

region with high positive skewness and kurtosis is marked as a region of

interest.

Hidefumi Kobatake et al (1999) have developed for detection of

malignant tumors in digitized mammograms. The method uses iris filter to

enhance the images, and parameters based on boundary characteristics are

extracted to classify the malignant and other tumors. This iris filter can

enhance rounded convex regions such as tumors.

Reyer Zwiggelaar et al (1999) have developed a statistical models

to detect spiculated lesions. The model described a generic method of

representing patterns of linear structures, which relies on the use of factor

analysis to separate the systematic and random aspects of a class of patterns.

28

The model consists of the appearance of central masses using local scale-

orientation signatures. It is based on recursive median filtering and

approximated using principal component analysis.

Giuseppe Boccignone et al (2000) have developed a new method

for computer aided detection of microcalcifications in digital mammograms.

The detection is performed on the wavelet transformed image. The

calcifications are separated from the background by exploiting the evaluation

of Renyi's information at the different decomposition levels of the wavelet

transform.

Reyer Zwiggelaar et al (2004) have described a method for

detecting linear structures in mammograms, and for classifying them into

anatomical types (vessels, spicules, ducts, etc). They described different

methods for extracting linear structures. Each method provides an estimate at

each pixel of both line-strength and orientation. In principle, these methods

can be used for detecting either dark or bright linear structures. They

investigated different approaches such as Line Operator, Orientated Bins,

Gaussian Derivatives and Ridge Detector for detecting linear structures.

Gaussian filtering and sub-sampling methods are used to enhance the images.

The line-strength and orientation images are used to extract a simple

representation of linear structure. Finally, they classified the linear structures

into vessels, spicules and ducts, etc.

Rafayah Mousa et al (2005) have developed two techniques based

on wavelet analysis and fuzzy-neural approaches for breast cancer diagnosis.

These techniques are mammography classifier based on globally processed

image and locally processed image (region of interest). The system is

classified normal from abnormal and mass or microcalcification. They have

29

investigated and analyzed wavelet transform for image enhancement, features

extraction and the adaptive neuro-fuzzy inference system algorithm for

classification process.

Ryohei Nakayama et al (2006) have developed a computerized

scheme for detecting early-stage microcalcification clusters. The system first

developed a novel filter bank based on the concept of the Hessian matrix. The

mammogram images were decomposed into several sub-images. The eight

features were extracted from each sub-image. The Bayes discriminant

function was employed for distinguishing normal and abnormal region.

Tomasz Arodz et al (2006) have developed a new computer-aided

detection system for small field digital mammography. The system first

processes the image using filter that is sensitive to microcalcification contrast

shape. Then, enhance the mammogram contrast by using wavelet-based

sharpening algorithm. Finally, the output image is presented to radiologist, for

visual analysis.

Guillaume Kom et al (2007) have developed a CAD system for

detection of masses in mammograms by local adaptive thresholding. In order

to enhance the image, a mass pattern-dependent enhancement approach was

designed based on the linear transformation of pixels values. This approach is

implemented by two linear functions. Finally, the mass area is segmented by

using local adaptive thresholding method.

Whi-Vin Oh et al (2009) have developed a CAD system for the

automatic detection of clustered microcalcifications in digitized

mammograms. The system consists of three main steps. First, the breast

region is segmented from original mammogram using contrast property of

30

grey level co-occurrence matrix. Second, the potential microcalcification

pixels in the mammograms are detected by foveal method. Third, in order to

reduce false-positive rate, the individual microcalcifications are detected by a

set of eight features extracted from the potential individual microcalcification

objects.

Mohamed Meselhy Eltoukhy et al (2010) have developed an

approach for breast cancer diagnosis in digital mammogram using curvelet

transform. After decomposing the mammogram images in curvelet basis, a

special set of the biggest coefficients is extracted as feature vector. The

Euclidean distance is then used to construct a supervised classifier.

Maciej Mazurowski et al (2011) have developed a CAD system for

mammographic masses that uses a mutual information-based template

matching scheme with intelligently selected templates. The system proceeds

in several steps. First, it starts with reducing the spatial resolution of the

original images to 0.4 mm per pixel. Second, a simple breast segmentation

algorithm is utilized based on global thresholding of the images to segment

the breast region. The global threshold is established separately for each

mammographic image via peak detection in the image histogram. Third, a

mask is created that includes the 30% highest intensity pixels within the

breast. The pixels within the mask are considered as suspicious enough in the

further steps. For each pixel within the mask, a likelihood score is calculated

of that pixel corresponding to an abnormality. Finally, iterative multi-level

thresholding algorithm is applied to extract the suspicious region.

31

2.3 LITERATURE STUDY ON MAMMOGRAPHIC

CLASSIFICATION

Image classification analyzes the numerical properties of various

image features and organizes data into categories. Classification algorithms

typically employ two phases of processing: training and testing. The

characteristic properties of typical image features are isolated in training

phase. The feature-space partitions are used to classify image features in

testing phase.

Classifiers play a crucial role in the implementation of computer-

aided diagnosis of mammography. The features or a subset of features are

employed by classifiers to classify the suspicious regions into normal and

lesions. After the features have been extracted and selected, they are input

into a classifier to categorize the images into lesion/non-lesion or

benign/malignant classes. Majority of the publications focuses on classifying

malignant and benign lesions (usually called lesion classification), and some

of the articles focus on classifying lesions and non-lesions (usually called

lesion detection), and only a few of them focus on both. A detailed

description of the classification methods is given in subsequent section.

Artificial Neural Networks (ANN)

ANNs are the collection of mathematical models that imitate the

properties of biological nervous system and the functions of adaptive

biological learning. They are made of many processing elements that are

highly interconnected together with the weighted links that are similar to the

synapses. Unlike linear discriminants, ANNs usually use non-linear mapping

functions as decision boundaries. The advantages of ANNs are their

capability of self-learning, and often suitable to solve the problems that are

32

too complex to use the conventional techniques, or hard to find algorithmic

solutions.

It includes an input layer, an output layer and one or more hidden

layers between them. Depending on the weight values of ( , )j i and ( , )k j ,

the inputs are either amplified or weakened to obtain the solution in the best

way. The weights are determined by training the ANN using the known

samples.

Baoyu Zheng et al (1996) have developed a system for detection of

microcalcification using mixed feature neural networks. The features

computed in both the spatial and spectral domain and use spectral entropy as a

decision parameter. Back propagation with Kalman filtering is employed as

classifier.

Jong Kook Kim and Hyun Wook Park (1999) have developed a

texture-analysis method for detection of microcalcification. Textural features

are extracted by spatial gray-level dependence method, gray-level run-length

method, and the gray-level difference method. A three-layer back propagation

neural network is used as a classifier.

Wang et al (1998) have developed a probabilistic neural network

based technique for segmentation of brain tissues from magnetic resonance

images. The new technique uses suitable statistical models for both the pixel

and context images. It also formulates the problem in terms of model-

histogram fitting and global consistency labeling. The quantification is

achieved by probabilistic self-organizing mixtures and the segmentation by a

probabilistic constraint relaxation network.

33

Songyang Yu and Ling Guan (2000) have developed a computer-

aided diagnosis system for detection of clustered microcalcifications. The

system consists of two steps. First, the potential microcalcification pixels in

the mammograms are segmented out by using mixed features consisting of

wavelet features, gray level, and statistical features. Second, the individual

microcalcifications are detected by using a set of 31 features extracted from

the individual microcalcification objects. The discriminatory power of these

features is analyzed using general regression neural networks via sequential

forward and sequential backward selection methods. The classifiers used in

these two steps are both multilayer feed forward neural networks.

Paul Sajda et al (2002) have developed a Hierarchical

Pyramid/Neural Network (HPNN), that learns to exploit image structure at

multiple resolutions for detecting clinically significant features. Networks are

trained using a novel error function for the supervised learning of image

search, when the position of the objects to be found is uncertain or ill defined.

Christoyianni et al (2002) have implemented a CAD system

consists of a feature extraction module that extracts a gray level and texture

features from the regions of interest and a classification module based on a

Radial Basis Function Neural Network (RBFNN) classifier that can classify

any normal and abnormal region in digital mammograms.

Papadopoulos et al (2002) have developed a hybrid intelligent

system for the identification of microcalcification clusters in digital

mammograms. The reduction of false positive cases is performed using an

intelligent system containing two subsystems: a rule-based and a neural

network sub-system. In the first step of the classification, 22 features are

computed from individual or cluster of microcalcifications. Further reduction

in the number of features is achieved through principal component analysis.

34

Cheng et al (2004) have used a new Fuzzy Neural Network (FNN)

approach to detect malignant mass lesions on mammograms. The FNN used

four layers. The first layer is the input layer consisting of four input fuzzy

neurons. The second layer has four ordinary neurons. The third layer consists

of N maximum fuzzy neurons. The number of fuzzy neurons in the third layer

is determined during the training process and varies with the network

parameters and data distribution. The fourth layer has two maximum fuzzy

neurons and one competitive fuzzy neuron.

Papadopoulos et al (2005) have developed a novel computer-based

automated method for the characterization of microcalcification clusters in

digitized mammograms. The system has been implemented in three stages: (a)

the cluster detection stage to identify clusters of microcalcifications, (b) the

feature extraction stage to compute the important features of each cluster and

(c) the classification stage, which provides with the final characterization. In

the classification stage, a rule-based system, an artificial neural network and a

support vector machine have been implemented and evaluated using receiver

operating characteristic analysis.

Celia Varela et al (2007) have investigated the behavior of an iris

filter at different scales. After iris filter was applied, suspicious regions were

segmented by means of an adaptive threshold. Suspected regions were

characterized with features based on the iris filter output and, gray level,

texture, contour-related, and morphological features extracted from the image.

A back propagation neural network classifier is trained to reduce the number

of false positives.

Pasquale et al (2007) have developed for the mass characterization

and is mainly based on a segmentation algorithm and sixteen features based

35

on shape, size and intensity of the segmented masses are extracted. The

multi-layered perceptron neural network was used to train the system.

Stelios Halkiotis et al (2007) have developed a new algorithm for the

detection of clustered microcalcifications using mathematical morphology and

artificial neural networks. Mathematical morphology provides tools for the

extraction of microcalcifications even if the microcalcifications are located on a

non-uniform background. Each candidate object is marked as such, using a

binary image. The original mammogram is used for the final feature extraction.

The neural network classifier with multi-layer perception (MLP) and radial

basis function neural networks is employed to detect microcalcifications. The

gray level and statistical features are used as input vector.

Anna Karahaliou et al (2008) have investigated texture properties

of the tissue surrounding microcalcification clusters. The tissue surrounding

microcalcifications is defined on original and wavelet decomposed images,

based on a redundant discrete wavelet transform. Gray-level texture and

wavelet co-efficient texture features at three decomposition levels that are

extracted from surrounding tissue regions of interest. The ability of each

feature set in differentiating malignant from benign tissue is investigated

using a probabilistic neural network.

Brijesh Verma et al (2008) have used network architecture and a

new learning algorithm for the classification of mass abnormalities. The idea

is based on the introduction of an additional neuron in the hidden layer for

each output class. The additional neurons for benign and malignant classes

help in improving memorization ability without destroying the generalization

ability of the network. Brijesh Verma et al (2009) have developed a novel soft

cluster neural network technique for the classification of suspicious areas in

36

digital mammograms. The technique introduces the concept of soft clusters

within a neural network layer and combines them with least squares for

optimizing neural network weights.

Sung-Nien Yu et al (2010) have investigated the performance of

clustered microcalcifications recognition in digital mammograms by using

combined model-based and statistical textural features. In the first stage, a

wavelet filter and two thresholds were used to detect suspicious

microcalcifications from the mammograms. In the second stage, textural

features based on Markov random field (MRF) and fractal models together

with statistical textural features were extracted from the suspicious MCs and

were classified by a three-layer back propagation neural network.

Jinchang Ren et al (2011) have developed a method for early

detection of breast cancer through classification of microcalcification clusters

from mammograms. The method used an improved neural classifier, in which

balanced learning with optimized decision making are introduced to enable

effective learning from imbalanced samples. In ANN classifier, the outputs

are continuous values rather than binary symbols. Conventional methods use

simple thresholding in decision making. If the outputs are larger than a chosen

threshold, a positive sample is detected. Otherwise, it is decided as negative.

However, this simple thresholding suffers uneven distribution of the training

outputs and leads to poor performance. To overcome this drawback, the

author used optimized decision making using optimal thresholding method.

Linear Discriminant Analysis (LDA)

LDA is a traditional method for classification. The main idea of this

method is to construct the decision boundaries directly by optimizing the error

37

criterion to separate the classes of objects. If there are n classes, and linear

discriminant analysis classifies the observations as the following n linear

functions:

( ) . , 1 i n.T

i i ig x W x c (2.5)

where T

iW is the transpose of a coefficient vector, x is a feature vector and ic

is a constant as the threshold. The values of T

iW and ic are determined through

the analysis of a training set. Once the values of T

iW and ic are determined,

they can be used to classify the new observations. The observation is

abnormal, if ( )ig x is positive, otherwise it is normal.

Judy Kilday et al (1993) have developed a method for classification

of lesions using computerized image analysis. An interactive segmentation

procedure is used to identify the tumor boundary. The several features were

extracted based on shape describing properties of the tumor boundaries. The

LDA was used to select the features for classification.

Bruce et al (1999) have used the discrete wavelet transform

modulus-maxima (mod-max) method, and is utilized for the extraction of

mammographic mass shape features. These shape features are used in a

classification system to classify masses as round, nodular, or stellate. The

discriminating power of the shape features were analyzed via Linear

Discriminant Analysis (LDA). The classification system utilized a simple

Euclidean metric to determine class membership.

Naga Mudigonda et al (2001) have used gray level, shape and

gradient features as the input vector and LDA was used for pattern

classification. Sahiner et al (2001) have used two sets of texture,

morphological, and spiculation features as input vectors and a classifier based

38

on stepwise feature selection and linear discriminant analysis was trained and

tested.

Bayesian Network

Bayesian network uses a probabilistic approach to determine an

optimal classification for a given database. It builds an “acyclic” graph in

which the nodes represent the features variables, and connections between

nodes represent direct probabilistic influences between the variables. Each

variable must have at least two discrete states and each state is associated with

a probability value. The total of the probability values for all states equals to 1

for each node. If there is no path between any two nodes, it indicates the

probabilistic independence of two variables.

Charles et al (1997) have used history of five patients, two physical

findings and fifteen mammographic features as input vector and Bayesian

network is used as classifier. Zheng et al (1999) have used Bayesian Belief

Network (BBN) in a computer-assisted diagnosis scheme for mass detection

in digitized mammograms. After initial processing of image segmentation and

adaptive topographic region growth in their scheme, 304 true-positive and

1,586 false-positive regions are identified in the testing set. A BBN was

constructed based on 12 local and four global features in order to classify

these regions as positive or negative for mass.

Sung-Nien Yu et al (2006) have developed a technique for

detection of microcalcifications. At first, all suspicious microcalcifications are

preserved by thresholding a filtered mammogram via a wavelet filter.

Subsequently, Markov random field parameters based on the Derin–Elliott

model are extracted from the neighborhood of every suspicious

microcalcification. The texture features are extracted and Bayesian classifier

was used for computer experiments.

39

Imad Zyout et al (2009) have developed a new framework that

integrates Bayesian classifier and a pattern synthesizing scheme for detecting

microcalcification clusters. This frame work extracts textural, spectral, and

statistical features of each input mammogram and generates models of real

microcalcifications (MC) to be used as training samples through a simplified

learning phase of the Bayesian classifier. Followed by an estimation of the

classifier’s decision function parameters, a mammogram is segmented into the

identified targets (MCs) against healthy tissue.

Support Vector Machines (SVM)

Support Vector Machine is a supervised learning technique that

seeks an optimal hyperplane to separate two classes of samples. Kernel

functions are used to map the input data into a higher dimension space where

the data are supposed to have a better distribution, and then an optimal

separating hyperplane in the high-dimensional feature space is chosen.

Tomasz Arodz et al (2005) have used the Gabor filters for

extracting feature vectors from images and SVM had used as classifier.

Papadopoulos et al (2005) and Fu et al (2005) have used shape, texture and

gray level features as input vector and SVM is used for classification of

microcalcifications.

Jacob Levman et al (2008) have developed a CAD system for

classification of lesions on MR images. A key component of CAD system is

the selection of an appropriate classification function for separating malignant

and benign lesions. This study is to evaluate the effects of variations in

temporal feature vectors and kernel functions on the separation of malignant

and benign lesions by support vector machines. They also demonstrated the

support vector machine approach as offering significant flexibility in the

design of a computer aided-detection system.

40

Defeng Wang et al (2009) have introduced a structured SVM model

to determine if each mammographic region is normal or cancerous by

considering the cluster structures in the training set. Various types of features,

including curvilinear features, texture features, Gabor features, and multi-

resolution features, are extracted from the sample images. Recursive feature

elimination algorithm is used to select the salient features.

Llado et al (2009) have developed method for detection and

classification of masses in mammogram. The key point of this method is the

use of local binary patterns for representing the textural properties of the

masses. Further, they extend the basic local binary patterns histogram

descriptor into a spatially enhanced histogram. It encodes both the local

region appearance and the spatial structure of the masses. Support vector

machines are then used for classifying the true masses from the ones being

actually normal parenchyma.

Subashini et al (2010) have developed a method for assessment of

breast tissue density in digital mammograms. Gray level thresholding and

connected component labeling is used to eliminate the artifacts and pectoral

muscles from the region of interest. Statistical features are extracted from

these regions which signify the important texture features of breast tissue.

These features are fed to the support vector machine (SVM) classifier to

classify it into any of the three classes namely fatty, glandular and dense

tissue.

Ioan Buciu and Alexandru Gacsadi (2011) have used the Gabor

wavelets to filter the images and directional features are extracted at different

orientation and frequencies. Principal Component Analysis is employed to

reduce the dimension of filtered and unfiltered high-dimensional data.

Proximal support vector machines are used to, finally, classify the data.

41

Wener Borges Sampaio et al (2011) have developed a

computational methodology to detect breast masses. The first stage of the

methodology aims to improve the mammogram image. This stage consists in

removing objects outside the breast, reducing noise and highlighting the

internal structures of the breast. Next, cellular neural networks are used to

segment the regions that might contain masses. These regions have their

shapes analyzed through shape descriptors (eccentricity, circularity, density,

circular disproportion and circular density) and their textures analyzed

through geo-statistic functions. Support vector machines are used to classify

the candidate regions as masses or non-masses.

Stylianos Tzikopoulos et al (2011) have developed a segmentation

and classification scheme for mammograms, based on breast density

estimation and detection of asymmetry. First, image preprocessing and

segmentation techniques are applied. It includes breast boundary extraction

algorithm and pectoral muscle segmentation scheme. Features for breast

density categorization are extracted and support vector machines are

employed for classification.

Binary Decision Tree

A Binary Decision Tree recursively divides the feature space into

two subspaces by selecting a threshold to separate input data into two classes

each time. An ordered list of binary threshold operations on the features is

organized as a tree. Each node has a threshold associating with one or more

features to divide the data into its two descendents. The process stops when it

only contains patterns of one class. Comparing with neural networks, the

decision tree approach is much simpler and faster.

Li et al (1995) have developed a method called modified markov

random field for initial reliable segmentation of region of interest. Further, the

42

regions are classified into suspicious and normal by a fuzzy binary decision

tree based on a series of radiographic and density-related features.

Sheng Liu et al (2001) have developed novel multiresolution

scheme for the detection of spiculated lesions in digital mammograms. First, a

multiresolution representation of the original mammogram is obtained using a

linear phase non-separable two-dimensional wavelet transform. A set of

features is then extracted at each resolution in the wavelet pyramid for every

pixel. This approach addresses the difficulty of predetermining the

neighborhood size for feature extraction to characterize objects that may

appear in different sizes. Detection is performed from the coarsest resolution

to the finest resolution using a binary tree classifier.

Lei Zheng and Andrew Chan (2001) have used the Dogs-and-

Rabbits clustering algorithm to initiate the segmentation at the Low Level

sub-band of a three-level discrete wavelet transform decomposition of the

mammogram. A binary tree-type classification strategy is applied at the end to

determine whether a given region is suspicious for cancer.

Other Classification Techniques

Nico Karssemeijer et al (1996) have developed a method for

detection of stellate distortions in mammograms. This method is based on

statistical analysis of a map of pixel orientations. Orientations of the image

intensity map are determined at each pixel using a multiscale approach.

Different classifiers such as K-nearest neighbor, decision tree, neural network

and Bayesian have been compared.

Subhash Bagui et al (2003) have developed a new generalization of

the rank nearest neighbor rule for multivariate data for diagnosis of breast

cancer. The several features such as radius (mean of distances from center to

43

points on perimeter), texture (standard deviation of gray-scale values),

perimeter, area, smoothness (local variation in radius lengths), compactness,

concavity (severity of concave portions of the contour, concave points

(number of concave portions of the contour), symmetry and fractal dimension

are used to classify the breast cancer.

Liyang Wei et al (2005) have used the machine-learning technique

called relevance vector machine for detection of microcalcifications. The

system formulated microcalcification detection as a supervised-learning

problem, and applied relevance vector machine as a classifier. A two-stage

classification network is applied to increase the computation speed.

Thangavel and Kaja Mohidee (2009) have developed a novel

association rule mining approach for classification of microcalcifications. In

this system, the shape features are extracted from the digital mammograms.

With these feature values, association rules are constructed to develop a rule

based system for classification of microcalcifications. A novel

Multidimensional Genetic Association Rule Miner is used for rule

construction.

2.4 EVALUATION METHODOLOGY

In mammography, the most common evaluation methodology is to

compare the results obtained by the algorithms to those obtained by a set of

experts, which is considered ground-truth.

Free Receiver Operating Characteristic (FROC) is based on a

region based analysis (Chakraborty et al 2007). The FROC paradigm is,

nowadays, being increasingly used in the assessment of medical imaging

systems, particularly in the evaluation and comparison of CAD systems

(Bornefalk 2005). It is a plot of operating points showing the tradeoff between

44

the true positive rates versus the average number of false positives per image

(Egan et al 1961).

The FROC curve is the plot of true positive rate versus number of

false positives per image. Thus, FROC seeks location information from the

observer (CAD system), rewarding it when the reported disease is marked in

the appropriate location and penalizing it when it is not. This task is more

relevant to the clinical practice of radiology, where it is not only important to

identify disease, but also to offer further guidance regarding other

characteristics (such as location) of the disease. Each image or case can

contain any number of lesions. Each correctly located true-positive detection

and each false-positive location report is scored independently. The horizontal

axis cannot be normalized to range from 0.0 to 1.0, because the maximum

possible number of false-positives on each image or case is unknown. A

FROC curve is shown in Figure 2.1.

Figure 2.1 FROC curve (Source: www.devchakraborty.com)

45

2.5 SUMMARY

This chapter presents and reviews different approaches to automatic

segmentation and classification of mammographic lesions. It describes several

algorithms, pointing out their specific features. It is clearly shown that a few

algorithms are contour based, probably due to the fact that lesions often have

not a definite one. Moreover, some region based and clustering algorithms

take shape, gray level or texture information into account to segment lesions.

Most of the model based algorithms require the use of a classifier which

implies training the system. The classifier based approach mainly

characterized by single or double features. In this case, algorithms used in the

literature obtained good detection results on one type of lesions, but it

generated unreasonable detection results on other types of lesions. Three or

more features used in classifiers seems to be an all-round lesion segmentation

approach which copes equally well with the small and the large lesions.

Documents

CHAPTER 2 A REVIEW ON BREAST ABNORMALITY …shodhganga.inflibnet.ac.in/bitstream/10603/26433/7/07_chapter 2.pdf · A REVIEW ON BREAST ABNORMALITY SEGMENTATION AND CLASSIFICATION TECHNIQUES