10
ORIGINAL ARTICLE Acoustic thoracic image of crackle sounds using linear and nonlinear processing techniques Sonia Charleston-Villalobos Guadalupe Dorantes-Me ´ndez Ramo ´n Gonza ´lez-Camarena Georgina Chi-Lem Jose ´ G. Carrillo Toma ´s Aljama-Corrales Received: 21 January 2010 / Accepted: 4 July 2010 / Published online: 21 July 2010 Ó International Federation for Medical and Biological Engineering 2010 Abstract In this study, a novel approach is proposed, the imaging of crackle sounds distribution on the thorax based on processing techniques that could contend with the detection and count of crackles; hence, the normalized fractal dimension (NFD), the univariate AR modeling combined with a supervised neural network (UAR-SNN), and the time-variant autoregressive (TVAR) model were assessed. The proposed processing schemes were tested inserting simulated crackles in normal lung sounds acquired by a multichannel system on the posterior thoracic surface. In order to evaluate the robustness of the pro- cessing schemes, different scenarios were created by manipulating the number of crackles, the type of crackles, the spatial distribution, and the signal to noise ratio (SNR) at different pulmonary regions. The results indicate that TVAR scheme showed the best performance, compared with NFD and UAR-SNN schemes, for detecting and counting simulated crackles with an average specificity very close to 100%, and average sensitivity of 98 ± 7.5% even with overlapped crackles and with SNR correspond- ing to a scaling factor as low as 1.5. Finally, the perfor- mance of the TVAR scheme was tested against a human expert using simulated and real acoustic information. We conclude that a confident image of crackle sounds distri- bution by crackles counting using TVAR on the thoracic surface is thoroughly possible. The crackles imaging might represent an aid to the clinical evaluation of pulmonary diseases that produce this sort of adventitious discontinu- ous lung sounds. Keywords Discontinuous adventitious sound imaging Lung sound Fine and coarse crackles Time-variant autoregressive model Fractal dimension 1 Introduction Physicians use the classical auscultation procedure on the chest wall to look for peculiar lung sounds (LS), breathing and adventitious sounds, to diagnose lung disorders [30]. However, the findings are biased by the physician’s expertise to correctly identify diverse adventitious sounds [21]. Furthermore, the classical auscultation is performed using a single stethoscope at different positions on the thoracic surface, and the physician needs to integrate in a qualitative way the whole spatial and temporal acoustic information to reach a diagnosis. Several pulmonary dis- orders are described by the occurrence of discontinuous adventitious LS, also known as crackles, that depending on the severity of the disease the physicians are capable to perceive [28]. It is accepted that to know the extent of the pulmonary area where crackles are occurring is clinically important [31]. Therefore, providing a confident image of the spatial distribution of the pulmonary pathology by detecting and counting crackles using multiple acoustic sensors might help to overcome the restrictions of the clinical auscultation with the stethoscope. S. Charleston-Villalobos (&) G. Dorantes-Me ´ndez T. Aljama-Corrales Department of Electrical Engineering, Universidad Auto ´noma Metropolitana, Mexico City 09340, Mexico e-mail: [email protected] R. Gonza ´lez-Camarena Department of Health Science, Universidad Auto ´noma Metropolitana, Mexico City 09340, Mexico G. Chi-Lem J. G. Carrillo National Institute of Respiratory Diseases, Mexico City 14080, Mexico 123 Med Biol Eng Comput (2011) 49:15–24 DOI 10.1007/s11517-010-0663-5

Acoustic thoracic image of crackle sounds using linear and nonlinear processing techniques

  • Upload
    uam-mx

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

ORIGINAL ARTICLE

Acoustic thoracic image of crackle sounds using linearand nonlinear processing techniques

Sonia Charleston-Villalobos • Guadalupe Dorantes-Mendez •

Ramon Gonzalez-Camarena • Georgina Chi-Lem • Jose G. Carrillo •

Tomas Aljama-Corrales

Received: 21 January 2010 / Accepted: 4 July 2010 / Published online: 21 July 2010

� International Federation for Medical and Biological Engineering 2010

Abstract In this study, a novel approach is proposed, the

imaging of crackle sounds distribution on the thorax based

on processing techniques that could contend with the

detection and count of crackles; hence, the normalized

fractal dimension (NFD), the univariate AR modeling

combined with a supervised neural network (UAR-SNN),

and the time-variant autoregressive (TVAR) model were

assessed. The proposed processing schemes were tested

inserting simulated crackles in normal lung sounds

acquired by a multichannel system on the posterior thoracic

surface. In order to evaluate the robustness of the pro-

cessing schemes, different scenarios were created by

manipulating the number of crackles, the type of crackles,

the spatial distribution, and the signal to noise ratio (SNR)

at different pulmonary regions. The results indicate that

TVAR scheme showed the best performance, compared

with NFD and UAR-SNN schemes, for detecting and

counting simulated crackles with an average specificity

very close to 100%, and average sensitivity of 98 ± 7.5%

even with overlapped crackles and with SNR correspond-

ing to a scaling factor as low as 1.5. Finally, the perfor-

mance of the TVAR scheme was tested against a human

expert using simulated and real acoustic information. We

conclude that a confident image of crackle sounds distri-

bution by crackles counting using TVAR on the thoracic

surface is thoroughly possible. The crackles imaging might

represent an aid to the clinical evaluation of pulmonary

diseases that produce this sort of adventitious discontinu-

ous lung sounds.

Keywords Discontinuous adventitious sound imaging �Lung sound � Fine and coarse crackles � Time-variant

autoregressive model � Fractal dimension

1 Introduction

Physicians use the classical auscultation procedure on the

chest wall to look for peculiar lung sounds (LS), breathing

and adventitious sounds, to diagnose lung disorders [30].

However, the findings are biased by the physician’s

expertise to correctly identify diverse adventitious sounds

[21]. Furthermore, the classical auscultation is performed

using a single stethoscope at different positions on the

thoracic surface, and the physician needs to integrate in a

qualitative way the whole spatial and temporal acoustic

information to reach a diagnosis. Several pulmonary dis-

orders are described by the occurrence of discontinuous

adventitious LS, also known as crackles, that depending on

the severity of the disease the physicians are capable to

perceive [28]. It is accepted that to know the extent of the

pulmonary area where crackles are occurring is clinically

important [31]. Therefore, providing a confident image of

the spatial distribution of the pulmonary pathology by

detecting and counting crackles using multiple acoustic

sensors might help to overcome the restrictions of the

clinical auscultation with the stethoscope.

S. Charleston-Villalobos (&) � G. Dorantes-Mendez �T. Aljama-Corrales

Department of Electrical Engineering, Universidad Autonoma

Metropolitana, Mexico City 09340, Mexico

e-mail: [email protected]

R. Gonzalez-Camarena

Department of Health Science, Universidad Autonoma

Metropolitana, Mexico City 09340, Mexico

G. Chi-Lem � J. G. Carrillo

National Institute of Respiratory Diseases, Mexico City 14080,

Mexico

123

Med Biol Eng Comput (2011) 49:15–24

DOI 10.1007/s11517-010-0663-5

Murphy et al. [26] and then Hoevers and Loudon [14]

have described crackles through their time-expanded

waveform-analysis (TEWA), defining by visual inspection

time parameters as the initial deflection width (IDW), the

two cycle duration (2CD) and the largest deflection width

(LDW). Based on TEWA, fine and coarse crackles were

differentiated and associated to different pathologies [1].

Mori et al. [24] and Munakata et al. [25] studied crackle

sounds in the frequency domain and reported crackle

spectral content mainly in the range from 0.1 to 1000 Hz.

In particular, crackle sounds have been associated to both

cardiac and pulmonary diseases [29]. Furthermore, fine and

coarse crackles have been reported appearing during

inspiration and expiration phases, depending on the pul-

monary disorder. In fibrotic lung diseases, as it occurs in

many parenchymal lung diseases, fine crackles are more

frequently found at the ending of the inspiratory phase [29].

Several efforts have been made to automatically detect

fine and coarse crackles. Among such techniques are non-

linear digital filters [27], the spectral stationarity of LS [15],

high-order statistics AR modeling [10], wavelet transform

[11], neurofuzzy modeling [32], fractal dimension [12], the

empirical mode decomposition [3, 8], and recently a texture-

based classification [9]. Nevertheless, the solution for auto-

mated crackle detection remains as a challenging research

area due to factors such as: (a) non-stationarity behavior of

crackle and breathing sounds (BS), (b) the signal-to-noise

ratio (SNR) between crackle and BS, (c) temporal crackles

overlapping, (d) crackle waveform distortion by BS, and (e)

the task complexity in a multichannel scenario due to tem-

poral and spatial changes of crackle and BS characteristics. It

is interesting to note that most of the actual studies related to

crackles utilize the counting obtained by clinical experts

using TEWA to validate the results. However, it has been

shown that physicians have visual and auditory limitations to

detect fine or coarse crackles [17]. Here, we hypothesize that

2D imaging of crackles distribution on the thoracic surface is

feasible by processing multichannel acoustic information.

The 2D image modality has the advantage to be non-invasive

and related to the function of the lung. Besides, the meth-

odology can be applied to image other kind of adventitious

LS, as wheezes or squawk sounds.

The aim of this article is to propose a novel concept, the

imaging of crackle sounds to determine their spatial dis-

tribution, and the estimated number of crackles using linear

and non-linear processing schemes in a multichannel

framework.

2 Methodology

The methodology for imaging crackles or discontinuous

adventitious LS is depicted in Fig. 1. In order to construct a

simulated database controlling the extent of the abnormal

pulmonary zones, the SNR, the type, and the number of

crackles, a multichannel LS acquisition from healthy sub-

jects, was carried out. Afterward, simulated fine and coarse

crackles were inserted in normal breathing LS by an

additive model. As can be seen in Fig. 1, three schemes

were proposed to estimate and count the number of

crackles based on the normalized fractal dimension (NFD),

the univariate autoregressive modeling combined with a

supervised feedforward neural network (UAR-SNN), and

the time variant autoregressive modeling (TVAR). Finally,

the image of crackle sounds distribution was formed using

the estimated number of crackles or the mapping of a

crackling index.

2.1 Normal breathing (basic) LS and preprocessing

stage

The acquisition protocol was carried out at the Digital

Signal and Image Processing Laboratory of the Metropol-

itan Autonomous University in collaboration with the

Acoustic Laboratory of the National Institute of Respira-

tory Diseases, at Mexico City. Multichannel LS signals

were acquired from 10 healthy subjects with an average

age, weight, and height of 24.3 ± 1.5 years, 77.8 ± 11.0 kg,

and 174.8 ± 7.8 cm, respectively, who signed an informed

consent according to Helsinki guidelines. The sensor array

of 5-by-5, attached to the subject’s posterior thoracic sur-

face, consisted of electret microphones inserted in plastic

bells with a flat frequency response from 50 Hz to 3 kHz.

Nomenclature of the sensor array is described in detail

elsewhere [2, 3]. During the acquisition, the subjects were

seated, breathing through a calibrated type Fleisch pneu-

motachometer at airflow of 1.5 l s-1, and wearing a nose

clip; the acquisition session lasted 15 s with initial and final

apnea phases of 5 s. In order to digitize the multichannel

LS and airflow signals, a 12-bit A/D converter was used

with a sampling frequency of 10 kHz. LS signals were

processed by band pass filtering with cutoff frequencies of

75 and 1,000 Hz; the lower cutoff frequency was selected

to attenuate heart sounds interference. In this study, only

inspiratory phases were used and segmented by the airflow

signal as reference signal.

2.2 Simulation of abnormal LS and pattern images

In order to simulate different spatial distributions of

abnormal LS, mathematically simulated crackles (SCs)

were embedded by an additive model within the acquired

normal breathing LS signals from 10 healthy subjects. Two

crackle types were simulated using the mathematical

function proposed by Kiyokawa et al. [17], which is

16 Med Biol Eng Comput (2011) 49:15–24

123

included in Fig. 1. The Kiyokawa’s function reproduces

the time domain characteristics as 0.5 ms for IDW and

5 ms for 2CD for fine crackle, while for coarse crackle

IDW is 1.2 ms and 2CD is 9 ms. Using the segmented

inspiratory LS information, a known number of SCs was

inserted in a random fashion into the second half of the

inspiratory phase allowing random temporal overlapping.

Also, crackles were inserted at certain number of selected

channels to simulate abnormal pulmonary zones, leaving

the other channels as normal pulmonary zones (basic

sound). Using the known number of inserted crackles and

their specific distribution, a corresponding image was

created by interpolation; such images were defined as the

pattern images.

The SNR for the crackle and basic LS was manipulated

by computing the standard deviation (SD) of the basic LS

segment where the crackles were added. Then, scaling

factors of 1.5, 2.5, and 3.5 times the SD were used to

provide different crackles amplitude. The acoustic infor-

mation of 10 simulated abnormal subjects was used to

assess the performance of the proposed schemes obtaining

the average number and standard deviation values of

detected crackles.

Several pattern images were tested; however, in the

interest of space and simplicity as outcomes were similar

for all of them, here we present the results for a pattern

image with 10 SCs added at basal sensors PLX4, PLX5,

PLC4, and PLC5, and 6 SCs placed at apical sensors PRC2,

PRX2, PRC3, and PRX3. With the former crackles spatial

distribution, a pattern image was created using just fine or

coarse crackles. Besides, a third image was constructed

with a mix of two fine and two coarse crackles (four

crackles in total) per channel. In the images, the blue color

indicates the absence of crackles, whereas the red color is

associated with the highest number of crackles, as can be

seen in Fig. 2.

2.3 Normalized fractal dimension scheme

Fractal dimension (FD) is a measure of the complexity/

irregularity of a signal indicating its spatial extension, its

convolutedness, or its tendency to fill a space [16]. FD is a

number, often a fractional number, and it was originally

associated with the geometry of an object. It has been

extensively applied for the nonlinear study of the heart rate

variability, but for LS analysis its application is recent [12].

It has been proposed that a sudden change in complexity

can be produced by sudden transients signal as a crackle

[12]. The computation of FD can be done directly from the

time series to be analyzed or by the construction of the

corresponding phase-space; in this study, the time series

approach was followed. Hence, for a time series of length

N, FD is defined as:

Derivatives of time series

UARmodel

L

dn

nFD

loglog

log)(,1

nvinynianyp

i

)(1

nvinyianyp

i

SNN

pinia ,...,1,,

)(

)1(

pa

a

-Normal LS -Abnormal LS

LS LS LS

FD time series thresholding

TVARmodel

MultichannelAcquisition

Crackles Simulation

Simulated Data Base

0 0.002 0.004 0.006 0.008 0.01-1

-0.5

0

0.5

1

Crepitancia FinaCrepitancia Gruesa

Multichannel Crackles Detection Schemes

5.02cos1

4sin5.0

5.0t

tty

Estimated number of cracklesCrackling index

Crackles Imaging

Fig. 1 Key diagram of the

proposed methodology and

processing schemes

Med Biol Eng Comput (2011) 49:15–24 17

123

DF ¼ log Lð Þlog dð Þ;

where L is the total length of the signal, calculated as the

sum of all the distances between any two consecutive

points, and d corresponds to the diameter or planar extend

of the signal, calculated as the maximum distance between

the starting point and any other point in the signal. In order

to avoid the dependency of FD on signal’s units, we use the

normalized fractal dimension (NFD) version, Fig. 1, given

by:

NFD ¼ log nð Þlog nð Þ þ log d=Lð Þ;

where n = N - 1. Each inspiratory phase was divided into

99% overlapping temporal windows of length 0.006 times

the sampling frequency in accordance to other authors [12].

The NFD was calculated for each temporal window in the

second half of the inspiratory phase. Afterward, the esti-

mated number of crackles was obtained for each channel

by testing different threshold values on the NFD time series

and then a matrix of 5-by-5 was formed with the estimated

values. Finally, the matrix was interpolated by the Hermite

function to form the crackle sounds image.

2.4 Univariate AR modeling-SNN scheme

A stationary random process can be modeled as the output

of a time invariant linear all-pole filter excited by white

noise as follows:

u½n� ¼ �a1u½n� 1� � a2u½n� 2�. . .� apu½n� p� þ v½n�;

where u[n], the actual sample of the process, is represented

by a lineal combination of its p previous samples and the

actual sample of the error signal. The univariate autore-

gressive (UAR) model involves second-order statistical

information of the stationary process [19] and the param-

eters {ai, i = 1,…,p}, so as the model order p, need to be

determined. Burg’s algorithm was used to estimate the AR

coefficients, and the Akaike criterion was employed to

select the order of the AR model. It is assumed that normal

and abnormal LS are characterized by different sets of AR

coefficients representing two different classes in the feature

space.

In order to classify acoustic information as being normal

(absence of crackles) or abnormal (presence of crackles) by

a SNN, each inspiratory phase was segmented into 30

windows overlapped by 25% [22]. According to the Akaike

criterion, a sixth-order UAR model was obtained for each

6 6

6 6

10 10

10 10

6 6

6 6

9 9

9 9

6 7

6 6

10 10

10 10

5 6

5 5

10 10

9 10

1 1

2 1

4

1 1

2 3 1

1 4

1

1

3 6 1

2 2 4

26 26

26 26

48 48

48 48

33 33

33 33

46 45

46 46

30 30

30 30

54 54

54 54

TVAR NFD UAR-SNN

PATTERN IMAGE

(a) (b) (c)

Fig. 2 Estimated images for

fine simulated crackles

detection using a univariate

autoregressive model-neural

network scheme (UAR-SNN),

b normalized fractal dimension

scheme (NFD), and c time

variant autoregressive model

scheme (TVAR) for the three

scaling factors 1.5, 2.5, and 3.5,

from top to bottom, respectively

18 Med Biol Eng Comput (2011) 49:15–24

123

temporal window, Fig. 1, providing 30 feature vectors of

length 6 for each inspiratory phase and for each channel.

The SNN architecture was defined with two hidden layers;

the number of hidden nodes was varied from 5 to 30, until

the best performance was obtained according to the mean

squared error (MSE). Finally, a SNN with two hidden

layers having 20 and 10 neurons, and a single output unit

was applied for classification purpose. For adjusting the

SNN’s weights, the supervised training algorithm used the

hyperbolic tangent sigmoid function and the gradient-

descendent Levenberg–Marquardt method [7]. Besides,

each temporal window of the training data set was labeled

as normal or abnormal according to the position and

duration of the SCs. The temporal window was labeled as 1

for normal and -1 for abnormal condition. The simulated

database was divided in training (80% of database), vali-

dation (10% of database), and testing data (10% of data-

base). For the case of UAR-SNN scheme, we propose the

crackling index to indicate the relevance of crackle sounds

by the ratio:

Crackling index

¼ Number of windows classified as abnormal

Number of windows classified as normal� 100;

where a temporal window was classified as normal if the

SNN output node value was C0.8. Otherwise, if SNN

output was B-0.8, the information was labeled as abnor-

mal. The crackling index was obtained for each inspiratory

phase, represented by 30 feature vectors, for each one of

the 25 channels. Finally, the crackle image was formed

interpolating the 25 crackling index values by the Hermite

function.

2.5 Time variant AR model scheme

The time variant autoregressive (TVAR) model is proposed

to estimate the number of crackles by accounting for

temporal non-stationary changes produced by the occur-

rence of crackles. The TVAR model is given by:

Xp

i¼0

a�i ðnÞu n� kð Þ ¼ v nð Þ;

where the AR coefficients {ai(n), i = 1,…,p} are calcu-

lated sample by sample. The TVAR coefficients were

obtained by the recursive least squares (RLS) algorithm

minimizing the cost function n(n) =P

i=1n kn-i|e(i)|2,

where k represents the forgetting factor, and e(i) = u(i) -

y(i) is the error signal in terms of the output of the adaptive

filter that depends on the past values of u(n) [13]. The

forgetting factor, k [ (0, 1), controls the influence of prior

information on which the cost function is minimized, i.e.,

with a small k value the RLS algorithm has short memory

length, being more sensitive to recent samples. For crackle

detection, k = 0.97 was found to be adequate for tracking

sudden nonstationary changes by the RLS algorithm,

allowing a confident count of the crackles into the basic

LS. Although the forgetting factor can be determined

adaptively, this approach was not explored here. The order

of TVAR model, for all the channels, was fixed at four.

In addition, we propose a criterion to decide the pres-

ence of a crackle based on the abrupt changes in the

derivative of the TVAR coefficient time series. A thres-

holding procedure was applied to the four derivatives and

only if an abrupt change was above the threshold in the

four time series the information was considered as pro-

duced by a crackle. For all the scaling factors, a threshold

of 0.035 times the standard deviation of the derivatives of

the AR coefficient time series was empirically established.

The estimated number of crackles for each channel was

interpolated by the Hermite function and a discontinuous

adventitious sounds image was constructed.

2.6 Processing schemes assessment

The processing schemes were evaluated by their ability to

detect the simulated abnormal regions of the lung in

accordance with the estimated number of inserted SCs. The

robustness of the processing schemes was achieved fol-

lowing two approaches: (a) controlling the SNR, the

number of crackles, their timing in the inspiratory phase

and their spatial distribution and (b) contrasting the results

against the performance of a human expert to detect by

TEWA simulated and real crackle sounds.

3 Results

3.1 Influence of SNR on crackles morphology

A factor that influences the performance of almost any

signal detection scheme is the SNR between the desired

signal and the background noise. In order to visually cor-

roborate the influence of the basic LS on crackles mor-

phology, Fig. 3 shows fine SCs embedded in LS signals at

sensors PRC2 and PLX4, using scaling factors 1.5 and 3.5.

The arrows at the top of each signal point out the precise

temporal position of the SCs. Analyzing the signal’s mor-

phology with factor 1.5 in Fig. 3a, it is easy to conclude

that some crackles were barely visible and even lost the

TEWA criterion. On the other hand, and knowing in

advance their temporal position, visual recognition of some

crackles was relatively easy with a factor 3.5, Fig. 3b.

However, the morphology of the sixth embedded crackle

looks severely modified by the surrounding basic LS

increasing the difficulty to detect it even by a specialist.

Med Biol Eng Comput (2011) 49:15–24 19

123

Also, crackles’ morphology was severely distorted as the

crackles were in closer sequence or overlapped, as can be

seen for the first and second crackles in Fig. 3b.

3.2 Estimated images for fine SCs using UAR-SNN

scheme

In Fig. 2, the simulated pattern image for fine crackles is

shown at the center of the first column. The three estimated

images by the UAR-SNN scheme for scale factors 1.5, 2.5,

and 3.5 are shown, from top to bottom, in the second column

of Fig. 2; the color bar is related to the proposed crackling

index. The SNN’s weights were adjusted by considering the

acoustic information of each channel independently, and the

sensitivity obtained in the training phase, for the three scaling

factors, was between 98.6 and 99.1%, while the specificity

was between 95.3 and 96.8%. In the estimated images, there

were two well-defined zones and, more important, they were

associated with the crackle number used to synthesize the

pattern image. The estimated image for the scale factor 3.5

was the closest to the pattern image due to the highest SNR.

Furthermore, even with the lowest scale factor 1.5, it is

possible to recognize two regions containing different

number of crackles in the estimated image. Due to the fact

that the inspiratory phase was divided in 30 segments and the

crackles were inserted in its second half, the crackling index

values 54 and 30 correspond to 10.8 and 6 abnormal temporal

windows, respectively. Certainly, if the number of crackles is

unknown, crackles relevance in the estimated image needs to

be evaluated according to the intensity of the colored area on

the thorax. An extension of the former results is shown in

Fig. 4a, where the image was constructed by averaging the

estimated number of crackles for the 10 simulated abnormal

subjects and for the more critical scaling factor of 1.5.

3.3 Estimated images for fine SCs using NFD scheme

In Fig. 2, and using as reference the same simulated pattern

image, the third column presents the results for the NFD

scheme. In this case, the color bar values represent the

estimated number of crackles. As can be observed in the

estimated image for the scaling factor 3.5, there was a

single zone at the left thoracic base, coinciding with the

image pattern, but the estimated number of crackles was

highly underestimated. The results from the 10 subjects are

not shown here as the NFD scheme gave a very low per-

formance for all the scaling factors.

3.4 Estimated images for fine SCs using TVAR scheme

The results by TVAR are shown in the fourth column of

Fig. 2, where the color bar represents, as for the NFD

scheme, the estimated number of crackles. It is evident that

for the three scaling factors there was a good estimation of

the number of fine crackles, and consequently, the esti-

mated crackle images were in good agreement with the

pattern image. For the 10 simulated abnormal subjects, the

image formed by averaging the estimated crackles is pre-

sented in Fig. 4b, for the scaling factor 1.5. As can been

seen, the estimated average number of crackles was very

close to the number of inserted crackles in the pattern

image with standard deviation lower than 1.0. It is worthy

to note that for the remaining channels, where crackles

were not inserted, the TVAR scheme did not detect the

presence of any crackle. Finally, the average specificity

was very close to 100% and the average sensitivity was

98 ± 7.5%, even with overlapped crackles and with SNR

corresponding to a scaling factor as low as 1.5.

3.5 Estimated images for coarse SCs by TVAR scheme

In case of coarse crackles, all the schemes were also

evaluated. However, here we highlight the results for

TVAR as this scheme gave the best performance for all the

scaling factors and it does not require training for the dif-

ferent simulated scenarios as the UAR-SNN scheme does.

Figure 5 provides the estimated images for the scaling

factors 1.5, 2.5, and 3.5, and the color bar represents the

4.7 4.75 4.8 4.85 4.9 4.95 5 x 104

-10

0

10 ↓ ↓↓ ↓↓ ↓

4.6 4.65 4.7 4.75 4.8 4.85 4.9 x 104

-20 -10

0 10 ↓↓ ↓ ↓↓↓

Samples

Am

plit

ude

(m

V)

Fig. 3 Embedded fine crackles for scaling factors 1.5 (upper signal)and 3.5 (lower signal), arrows point out crackles’ temporal position

25.6±3.0

25.4 ±3.4

25.7±2.6

25.4 ±3.3

39.5 ±4.7

39.4 ±5.1

40.0 ±4.8

40.0 ±4.9

5.6

±0.75.9

±0.6

5.7

±0.55.8

±0.4

9.8 ±0.8

9.8 ±0.8

9.8 ±0.8

9.7 ±0.7

(b) (a)

Fig. 4 Estimated average images for ten subjects using fine

simulated crackles and scaling factor 1.5; a crackling index

(mean ± SD) by UAR-SNN and b crackles number (mean ± SD)

by TVAR schemes

20 Med Biol Eng Comput (2011) 49:15–24

123

number of estimated crackles. The TVAR scheme could

fairly identify the coarse crackle zones for the three scaling

factors, and the estimated number of crackles was very

close to the pattern image of Fig. 2.

3.6 Estimated images for combined fine and coarse

SCs using TVAR scheme

The estimated image by the TVAR for the pattern image in

Fig. 6a, using the scaling factor 1.5, is shown in Fig. 6b.

The result indicates the robustness of the TVAR scheme to

detect fine and coarse crackles as the four inserted crackles

were detected and counted at the right positions.

3.7 Estimated number of fine SCs by TVAR scheme

and the expert

Figure 7 depicts the images derived from detection and

counting of simulated crackles with a scaling factor 3.5 by

the TVAR scheme, Fig. 7a, and by a human expert using

TEWA criteria, Fig. 7b. It can be observed that the expert

had an approximation to the simulated abnormal regions

where the crackles were inserted, but crackles’ counting

was deficient. We corroborated that some crackles were

overlapped and some others were of too low amplitude, so

that detecting them by the expert was very difficult.

3.8 Estimated number of real crackles by TVAR

scheme and the expert

Finally, the performance of TVAR scheme was analyzed

on real acquired LS signals from patients with diagnosis of

idiopathic pulmonary fibrosis (IPF) and compared with the

crackle counting estimation by the expert using TEWA

criteria and listening skill. The IPF is a diffuse parenchy-

mal lung disease of difficult diagnosis and management,

usually leading to relatively short half-life after the diag-

nosis is established. Acoustical information from two IPF

patients with evident large amplitude crackles was used to

facilitate crackles identification by the physician; the

patients suffered dyspnea, cough, pulmonary restrictive

pattern, oxygen desaturation during exercise, and the high

resolution computed tomography (HRCT) showed bilateral

ground-glass with some honeycombing pattern. The

acoustic information was acquired with the patients

breathing at 1.5 l s-1. Figure 8a presents the LS signal

containing crackles from one of the patients. It can be seen

in the time expanded waveform, Fig. 8b, that the amplitude

of the crackles was high enough to visually identify them

matching the TEWA criteria. For the first patient, the

crackle counting was performed for seven inspiratory

phases, Table 1. In average, the TVAR scheme obtained

(a) (b)

(c)

6 7

6 6

10 10

10 10 1

5 6

5 5

10 10

9 9 1

6 6

6 6

10 11

10 11

Fig. 5 Estimated images by detecting and counting coarse crackles

using TVAR scheme; a scaling factor 1.5, b scaling factor 2.5, and

c scaling factor 3.5

(a) (b)

4 4

4 4

4 4

4 4

4 4

4 4

4 5

4 4

Fig. 6 Detection for mixed fine and coarse crackles by TVAR:

a pattern image including four fine and four coarse crackles and

b estimated image

5 4

2 2

0 3

4 5

6 6

6 6

10 10

9 9

(a) (b)

Fig. 7 Crackles’ counting comparison between TVAR and medical

expert in one ‘‘sick’’ subject: a estimated image by TVAR, b estimated

image by expert

Med Biol Eng Comput (2011) 49:15–24 21

123

5.7 crackles, while the expert counted 6.0 crackles. For the

second patient, using six inspiratory phases, TVAR scheme

estimated 3.2 crackles and the expert 3.0 crackles. The

results show that for the two patients, due to the high SNR,

the expert was able to identify and count the number of

crackles and, most important, his performance was in

agreement with the estimation of the TVAR scheme.

4 Discussion

X-ray, tomography, MRI, and other techniques for medical

imaging are currently applied almost in all the medical

fields. Furthermore, visualization of anatomical and func-

tional abnormal zones of the pulmonary fields is the focus

of considerable research. The acoustic imaging of the

thorax has been suggested and explored based on non-

simultaneous and simultaneous multichannel approach.

The acoustical imaging has been developed in two ways; in

the first approach, the focus was on determining breathing

sounds energy distribution on the surface of the thorax, as

an estimation of regional ventilation [6]. Accordingly, in

2004, Charleston et al. [2] encouraged the use of the

respiratory acoustic thoracic imaging (RATHI) to analyze

lung sound origin, spatial distribution, frequency content,

and relationship to ventilation in both healthy and ill sub-

jects. In addition, the same study included the visual cor-

relation between the acoustical image and the X-ray of a

patient to show the potentiality of the mapping of multi-

channel LS intensities. Further, since 2007, other authors

reported studies with some clinical applications of the LS

intensity acoustic image using the Vibration Response

Imaging system [4, 5, 23]. In the second approach, the

research was centered on the estimation of sound sources

within the thorax in an effort to reconstruct airway geom-

etry [18]. However, the second approach has been less

explored and no current clinical applications are derived. In

contrast with the former efforts and as an alternative in

providing additional information to the physician, in the

present work instead of mapping the intensity of LS we

propose the mapping of crackles sounds detecting and

counting them to get their relevance. It can be thought that

the crackle image is a kind of functional imaging associ-

ated with the number of abnormally closed airways that

could evidence the extent of the associated pulmonary

disease. The proposed image overcomes restrictions of the

conventional auscultation procedure where subjectivity and

non-controlled settings are common.

To establish the new concept of crackle imaging simu-

lating different scenarios represents an advantage to deal

with morphological changes of crackles due to the back-

ground LS. Besides, working with simulated scenarios is

relevant when a gold standard is not available. This

approach is one of the strengths of the present work as a

thorough control of the number, type, amplitude, and

timing of crackles was achievable. With real cases, such

control would be unfeasible since the TEWA criteria show

important limitations to identify correctly crackles when

they are overlapping or they have low amplitude. In this

article, the simulated scenarios allowed us to show that it is

feasible to automatically detect low amplitude crackles that

the physician missed using the TEWA criteria. The

robustness of the selected processing schemes was assessed

including both hemithorax using simulated cases and real

signals from two patients.

Our main findings pointed out that in contrast to the

human expert, simulated fine and coarse crackles with low

SNR and overlapped can be detected by UAR-SNN and

TVAR schemes, so that an automated and confident image

representing the spatial distribution of crackles was thor-

oughly possible. It should be considered that one of the

fundamental assumptions in this study was that crackle

sounds are added to the basic LS.

For many years, it has been accepted that crackle sounds

have a well-defined waveform. In fact, Murphy et al. [26]

gave the first description of the amplitude and time

5.64 5.66 5.68 5.7 5.72

-5

0

5

10

Samples

(b)

Am

plitu

de

(mV

)

X 1044 4.5 5 5.5 6 6.5 7

-20

-10

0

10

20

X 104

(a)

Fig. 8 a Acquired real LS signal from an IPF patient with five

inspiratory phases and b time-waveform expansion of the third

inspiratory phase

Table 1 Crackles’ counting by TVAR scheme and expert for real LS

signals

Mean ± SD

TVAR Expert

Patient 1 (n = 7 inspiratory phases) 5.7 ± 1.1 6.0 ± 1.0

Patient 2 (n = 6 inspiratory phases) 3.2 ± 1.9 3.0 ± 1.7

TVAR time variant autoregressive model, LS lung sounds

22 Med Biol Eng Comput (2011) 49:15–24

123

characteristics of crackles (TEWA), suggesting some spe-

cific criteria for their visual detection and parametrization.

Among the suggested characteristics, they pointed out that

amplitude of convincing crackles should be twice the

amplitude of the surrounding LS. However, in a recent

paper of the same group, they gave evidence that crackles

waveform shows some inconsistencies that made difficult

the parametrization of crackles by TEWA [34]. Our results

agree with such observation, as many of the simulated

crackles inserted in basic LS lost their morphology and lost

the conditions to be accepted as a factual crackle, more

even when they are overlapped. On the other hand, in the

same study, Vyshedskiy et al. [34] offer evidences that

support the stress-relaxation quadrupole theory, so that

based on the mechanism of generation of inspiratory and

expiratory crackles these authors explain the crackle’s

morphology. However, it is expected that the amplitude

and general morphology of crackles recorded at the tho-

racic surface depend on the mechanism of generation (i.e.,

transpulmonary pressure), the source’s depth, the trans-

mission properties of the pulmonary structure, and the

presence of destructive and constructive waves. Therefore,

the waveform of crackles recorded on the thoracic surface

may not always hold the TEWA criteria. This is extremely

important because if we use the TEWA criteria we might

be missing many crackles that are of low amplitude and do

not cover the defined deflections. Thus, instead of ampli-

tude and temporal parameters, we choose schemes based

on different parameterizations to estimate low amplitude

and random temporal overlapping crackles.

Although NFD scheme has been suggested by some

authors for detecting and counting real crackles [12], the

results were dependent on the crackles’ amplitude. Our

findings indicate that the NFD scheme was unable to

contend with a scaling factor of 3.5 even applying different

threshold values to the NFD time series, and consequently,

this technique alone could be not enough for this applica-

tion. On the other hand, UAR-SNN and TVAR schemes

seem encouraging for automated detection and counting

fine and coarse crackles even with amplitudes below two

times the amplitude of the surrounding basic LS. Even

though the abnormal regions were rightly detected by both

schemes, at this moment the detection and counting of

factual crackles can be assured only for the TVAR scheme

that even contended with the lowest simulated SNR and

with randomly overlapped crackles. A feasible reason of

the TVAR model performance over the UAR model may

be related to the fact that each time sample is modeled by

four coefficients in comparison with six UAR coefficients

used for all the samples included in each time window. In a

future work, it could be interesting to assess the perfor-

mance of both time variant and invariant AR model using

algorithms to rank the importance of the model coefficients

as stated by Zou et al. [35] and Lu et al. [20] and reported

to provide better results than either RLS or Burg’s algo-

rithms. Furthermore, it could be relevant to determine the

most significant coefficients for different respiratory

pathologies associated with different adventitious sounds.

It is noteworthy that for simulated cases, the UAR-SNN

and TVAR schemes provided a zero counting for pul-

monary regions where crackles were absent indicating that

both schemes have a high sensitivity and specificity for

crackles detection. In this study, the outcome of the UAR-

SNN scheme was related to the crackling index on the

pulmonary zone rather than a precise number of crackles as

provided by the TVAR scheme.

For the case of the comparison of the TVAR scheme

against the expert, our results revealed that the expert could

provide just a rough idea of the abnormal pulmonary

regions but he was unable to indicate the crackles number.

The former indicates the superiority of the automated

processing techniques for detecting and counting crackles

as the expert depends on a well-defined morphology and

high intensity crackles.

Some limitations of the present study, however, must be

pointed out. First, we simulated only two types of crackles,

fine and coarse, with typical waveforms, in accordance

with the mathematical model of Kiyokawa et al. [17].

However, since the mechanism of generation for crackles

varies according with the mechanical and dynamic airway

characteristics several crackles temporal morphologies

might be expected; the limits for detection of diverse

morphologies were not explored. Second, we did not care

at the present study about separating fine and coarse

crackles’ timing within the respiratory phases or even

transmission aspects of crackles sounds [33]. Third,

although the results with real data are encouraging, the

discontinuous sound imaging should be validated using a

large database of patients. Finally, although the threshold

of 0.035 times the standard deviation of the derivatives of

the TVAR coefficient time series performed well for all the

scenarios considered in this study, it is possible that the

threshold needs to be tuned according to different lung

characteristics or if the TVAR scheme is used to detect

other types of abnormal lung sound.

5 Conclusion

We conclude that a confident non-invasive and functional

image of crackle sounds distribution by counting crackles

on the thoracic surface using TVAR is thoroughly possible.

The proposed image might represent an important alter-

native for the clinical evaluation of pulmonary diseases that

present low amplitude and overlapped adventitious

Med Biol Eng Comput (2011) 49:15–24 23

123

discontinuous lung sounds. The concept of discontinuous

adventitious sounds imaging established in this study is

currently explored with real data in a clinical setting.

References

1. Al Jarad N, Davies SW, Logan-Sinclair R, Rudd RM (1994) Lung

crackle characteristics in patients with asbestosis, asbestos-rela-

ted pleural disease and left ventricular failure using a time-

expanded waveform analysis—a comparative study. Respir Med

88:37–46

2. Charleston-Villalobos S, Cortes-Rubiano S, Gonzalez-Camarena

R, Chi-Lem G, Aljama-Corrales T (2004) Respiratory acoustic

thoracic imaging (RATHI): assessing deterministic interpolation

techniques. Med Biol Eng Comput 42:618–626

3. Charleston-Villalobos S, Gonzalez-Camarena R, Chi-Lem G,

Aljama-Corrales T (2007) Crackle sounds analysis by empirical

mode decomposition. IEEE Eng Med Biol Mag 26(1):40–47

4. Dellinger RP, Jean S, Cinel I, Tay C, Rajanala S, Glickman YA,

Parrillo JE (2007) Regional distribution of acoustic-based lung

vibration as a function of mechanical ventilation mode. Crit Care

11(1):R26

5. Dellinger RP, Parrillo JE, Kushnir A, Rossi M, Kushnir I (2008)

Dynamic visualization of lung sounds with a vibration response

device: a case series. Respiration 75(1):60–72

6. Dosani R, Kraman SS (1983) Lung sound intensity variability in

normal men. A contour phonopneumographic study. Chest

83:628–631

7. Duda OR, Hart EP, Store GD (2001) Pattern classification. Wiley,

New York

8. Hadjileontiadis LJ (2007) Empirical mode decomposition and

fractal dimension filter: a novel technique for denoising explosive

lung sounds. IEEE Eng Med Biol Mag 26(1):30–39

9. Hadjileontiadis LJ (2009) A texture-based classification of

crackles and squawks using lacunarity. IEEE Trans Biomed Eng

56(3):718–732

10. Hadjileontiadis LJ, Panas SM (1996) Nonlinear separation of

crackles and squawks from vesicular sounds using third-order

statistics. In: Proceedings 18th annual international conference

IEEE-EMBS, Amsterdam, Netherlands, pp 2217–2220

11. Hadjileontiadis LJ, Panas SM (1997) Separation of discontinuous

adventitious sounds from vesicular sounds using a wavelet-based

filter. IEEE Trans Biomed Eng 44:1269–1281

12. Hadjileontiadis LJ, Rekanos IT (2003) Detection of explosive

lung and bowel sounds by means of fractal dimension. IEEE

Signal Process Lett 10(10):311–314

13. Haykin S (1998) Adaptive filter theory. Prentice Hall, Englewood

Cliffs, NJ

14. Hoevers J, Loudon R (1990) Measuring crackles. Chest

98:1240–1243

15. Kaisla TK, Sovijarvi ARA, Piirila P, Rajala HM, Haltsonen S,

Rosqvist T (1991) Validated method for automatic detection of

lung sounds crackles. Med Biol Eng Comput 29:517–521

16. Katz MJ (1988) Fractals and the analysis of waveforms. Comput

Biol Med 18:145–156

17. Kiyokawa H, Greenberg M, Shirota K, Pasterkamp H (2001)

Auditory detection of simulated crackles in breath sounds. Chest

119:1886–1892

18. Kompis M, Pasterkamp H, Wodicka GR (2001) Acoustic imaging

of the human chest. Chest 120:1309–1321

19. Ljung L (1987) System identification. Prentice Hall, Englewood

Cliffs, NJ

20. Lu S, Ju KH, Chon KH (2001) A new algorithm for linear and

nonlinear ARMA model parameter estimation using affine

geometry. IEEE Trans Biomed Eng 48(10):1116–1124

21. Mangione S, Nieman LZ (1999) Pulmonary auscultatory skills

during training in internal medicine and family practice. Am J

Respir Crit Care Med 159(4 Pt 1):1119–1124

22. Martinez-Hernandez HG, Aljama-Corrales T, Gonzalez-Cama-

rena R, Charleston-Villalobos S, Chi-Lem G (2005) Computer-

ized classification of normal and abnormal lung sounds by

multivariate linear autoregressive model. In: Proceeding 27th

annual international conference IEEE/EMBS, Shanghai, China,

pp 1464–1467

23. Mor R, Kushnir I, Meyer J, Ekstein J, Ben-Dov I (2007) Breath

sound distribution images of patients with pneumonia and pleural

effusion. Respir Care 52(12):1753–1760

24. Mori M, Kinoshita K, Morinari H, Shiraishi T, Koike S, Murao S

(1980) Waveform and spectral analysis of crackles. Thorax

35:843–850

25. Munakata M, Ukita H, Doi I, Ohtsuka Y, Masaki Y, Homma Y,

Kawakami Y (1991) Spectral and waveform characteristics of

fine and coarse crackles. Thorax 46:651–757

26. Murphy RL, Holford SK, Knowler WC (1977) Visual lung sound

characterization by time-expanded waveform analysis. N Engl J

Med 296:968–971

27. Ono M, Arakawa K, Mori M, Sugimoto T, Harashima H (1989)

Separation of fine crackles from vesicular sounds by a nonlinear

digital filter. IEEE Trans Biomed Eng 36:286–291

28. Piirila P, Sovijarvi ARA (1995) Crackles: recording, analysis and

clinical significance. Eur Respir J 8:2139–2148

29. Piirila P, Sovijarvi ARA, Kaisla T, Rajala HM, Katila T (1991)

Crackles in patients with fibrosing alveolitis, bronchiectasis,

COPD, and heart failure. Chest 99:1076–1083

30. Reichert S, Gass R, Brandt C, Andres E (2008) Analysis of

respiratory sounds: state of the art. Clin Med 2:45–58

31. Ryu JH, Daniels CE, Hartman TE, Yi ES (2007) Diagnosis of

interstitial lung diseases. Mayo Clin Proc 82(8):976–986

32. Tolias YA, Hadjileontiadis LJ, Panas SM (1998) Real-time sep-

aration of discontinuous adventitious sounds from vesicular

sounds using a fuzzy rule-based filter. IEEE Trans Inf Technol

Biomed 2:204–215

33. Vyshedskiy A, Bezares F, Paciej R, Ebril M, Shane J, Murphy R

(2005) Transmission of crackles in patients with interstitial pul-monary fibrosis, congestive heart failure and pneumonia. Chest

128:1468–1474

34. Vyshedskiy A, Alhashem RM, Paciej R, Ebril M, Rudman I,

Fredberg JJ, Murphy R (2009) Mechanism of inspiratory and

expiratory crackles. Chest 135:156–164

35. Zou R, Wang H, Chon KH (2003) A robust time-varying iden-

tification algorithm using basis functions. Ann Biomed Eng

31(7):840–853

24 Med Biol Eng Comput (2011) 49:15–24

123