Automatic Estimation of Multiple Target Positions and Velocities Using Passive TDOA Measurements of Transients

Embed Size (px)

Citation preview

  • 8/2/2019 Automatic Estimation of Multiple Target Positions and Velocities Using Passive TDOA Measurements of Transients

    1/13

    1

    Automatic Estimation of Multiple Target Positions

    and Velocities Using Passive TDOA Measurements

    of TransientsDragana Carevic

    Abstract This paper considers the problem of the estimationof the motion parameters of multiple targets moving linearly in athree-dimensional (3-D) observation area contaminated by clutter.The measurements are limited to time differences of arrival(TDOA) of short-duration acoustic emissions, or transients, gen-erated by the targets. This problem can arise in situations wherethe level of continuous broadband target-related noise is verylow. Owing to the fact that transient emissions are nonstationaryand can have low signal-to-noise ratio (SNR), the correspondingTDOA measurement errors are usually non-Gaussian. Therefore

    Gaussian mixture distributions are used to appropriately modelthese errors.

    An iterative maximum likelihood optimization technique basedon a modified deterministic annealing expectation-maximization(MDAEM) algorithm is applied to this problem. In each iterationthe algorithm uses a nonlinear least squares (LS) technique incomputing the motion parameters for each target. It generalizesthe variance deflation method previously used for the initializationof target tracking algorithms and increases the possibility ofattaining a globally optimal solution for random initial conditions.Simulation results are presented for several different numbers oftargets, clutter densities, and probabilities of gross error of thetarget related measurements and are found to be comparable tothe estimates obtained when the measurement-to-target assign-ments are exactly known.

    Index Terms Source localization, EM algorithm, deterministicannealing, relative time delay estimation, underwater acoustictransients, Gaussian mixture model.

    I. INTRODUCTION

    THE problem of estimating motion of underwater targetsusing passive sensors is of considerable interest in anumber of surveillance applications. Sensors can be mounted

    on board a ship, towed, or deployed in water, and their position

    is assumed to be known. The measurements are based on the

    differences in propagation travel time of an emitted signal from

    the source to each of the sensors [1], [2] and are commonly

    referred to as the time differences of arrival (TDOA) or relativetime delays.

    Transients belong to a particular class of wide-band acoustic

    emissions that are characterized by relatively short durations

    (from few tens of milliseconds to several seconds) and arbi-

    trary waveform shapes. Examples of these signals include a

    deck hatch slamming, a hull reverberation, and a momentary

    vibration caused by a pump. They are deemed to be important

    for detection and passive localization of quiet targets that would

    otherwise be hardly detectable.

    This work was supported by the Defence Science and Technology Organi-sation, Australia.

    The estimation of TDOA of a passive transient involves

    the computation of a cross-correlation between outputs from

    a given pair of receiving sensors [3], [4]. In the presence of

    multipath propagation the estimate is taken to be the time lag

    of an appropriately chosen peak in the crosscorrelogram [5],

    [6], [7]. The accuracy of the TDOA measurement depends on

    individual characteristics of the received signal spectrum [8]. It

    is also sensitive to low signal-to-noise ratios (SNRs) [8], [9]

    and to signal distortions resulting from acoustic propagationin spatially varying environments [10], [11]. Large errors in

    the TDOA estimation are possible and, as a consequence,

    the TDOA measurement error distribution is usually non-

    Gaussian (has wide tails). A two-component Gaussian mixture

    probability density function (pdf) is proposed as a general

    model for this error distribution [12], [13]. Moreover, transients

    radiated by targets are typically observed in the presence of

    clutter that can originate from other sources, such as nearby

    shipping traffic, biological noise, etc., or as a consequence of

    very large (gross) measurement errors caused by environmental

    effects such as multipath propagation. Besides, there may be a

    number of false detections returned by the transient detection

    algorithm.A number of techniques for tracking single and multiple

    targets using measurements of uncertain origin have been

    proposed. The probabilistic data association (PDA) [14] prob-

    abilistically associates measurements with targets; it defines a

    joint likelihood function and computes the target motion pa-

    rameters by a direct maximization of this function. It is success-

    fully applied to tracking targets in underwater acoustics [15],

    [16], [17]. Alternative approaches [18], [19], [20], [21] formu-

    late multitarget tracking as an incomplete data problem and

    apply an iterative maximum likelihood (ML) estimator based

    on the expectation-maximization (EM) algorithm [22]. This

    algorithm maximizes the likelihood function of incomplete data

    indirectly by iterating the expectation and maximization stepsuntil some appropriate convergence conditions are satisfied. In

    [18], [23] the authors apply the EM-based methods to tracking

    targets in cluttered underwater environments. The problem of

    the localization of a single target in clutter based on passively

    sensed transients in a 2-dimensional (2-D) observation area is

    considered in [12], [13]. Other approaches to localizing and

    tracking a single moving source in an underwater environment

    using TDOA measurements are discussed in [24], [25], [26].

    Recently, Vo et al. [27], [28] and Ma et al. [29] proposed

    a method for tracking unknown number of speakers in 2-D

    multipath environments using a sequential Monte Carlo (SMC)

  • 8/2/2019 Automatic Estimation of Multiple Target Positions and Velocities Using Passive TDOA Measurements of Transients

    2/13

    2

    implementation based on TDOAs. Several other interesting

    approaches to tracking single and multiple targets are presented

    in [30], [31], [32], [33].

    The PDA and the methods based on the EM algorithm are

    sensitive to the choice of the initial values of the parameters.

    The initialization of the algorithms using the values that are

    different from the true parameters may result in convergence

    to a suboptimal solution, i.e., the algorithm may convergeonly to a local ML solution or to a saddle point, and not

    to a globally optimal solution [16], [18], [23]. A technique

    commonly applied to improve the convergence of these al-

    gorithms is to deflate the measurement variance: the initial

    iteration is carried out with a large variance and the variance is

    decreased (or deflated) in the subsequent iterations [18], [34].

    An alternative approach is to use the deterministic annealing

    EM (DAEM) algorithm recently proposed by Ueda and Nakano

    [35], [36]. It formulates the ML estimation as the minimization

    of an effective cost function based on the thermodynamics free

    energy and incorporates a deterministic annealing process that

    is characterized by simultaneous reduction in both the entropy

    and the cost function of the system with gradual decrease ofa global control parameter called the computational tempera-

    ture. At high temperatures the range of possible solutions of

    the algorithm is considerably widened whereby its dependance

    on the initial conditions is decreased.

    Both the EM algorithm implemented with the variance

    deflation and the DAEM algorithm increase the log-likelihood

    in each iteration. Consequently they are not guaranteed to

    obtain a global optimum in the cases where the initial guess

    is poor (e.g., when the initial values are set at random).

    Also, they are highly susceptible to the presence of false

    measurements [13], [18]. To improve the quality of the result

    Takada and Nakano [37], [38] propose the use of multiple-thread search with the DAEM algorithm. Each time a saddle

    point of the likelihood function is reached, which is indicated

    by detecting the eigenvalues of the Hessian matrix of the

    free energy function that are smaller than a given threshold,

    new search paths are instigated along the directions of the

    corresponding eigenvectors. The number of search paths that

    are simultaneously investigated in this way can be very large

    which significantly increases the computational complexity of

    the algorithm.

    An alternative approach that introduces stochastic pertur-

    bation in the DAEM algorithm, called the modified DAEM

    (MDAEM) algorithm, is proposed in this paper. This algorithm

    uses a stochastic imputation principle whereby, in each itera-tion, pseudo-complete data is simulated by sampling from a

    posterior distribution conditioned on the measurements and the

    current approximation of the parameters [39]. This data is used

    to obtain the updated parameter values. Unlike the standard

    DAEM algorithm that guarantees that the log-likelihood is

    increased in each iteration, the MDAEM algorithm has a non-

    zero probability of accepting a solution with a lower likelihood

    value than that in the previous iteration. In this way it avoids

    saddle points or insignificant local maxima of the likelihood

    function. Contrary to the approach using the multiple-thread

    search described in [37], [38], the computational complexity

    of the MDAEM algorithm is close to that of the standard

    DAEM algorithm. Moreover, while in a general case techniques

    based on stochastic simulated annealing [40] in each iteration

    accept parameter updates with a probability that depends on

    the temperature, the MDAEM algorithm accepts such changes

    with probability 1. In this way the total search is executed more

    efficiently than in the simulated annealing.

    This paper considers the problem of estimating position andvelocity of multiple targets observed in clutter based on TDOAs

    of passive transients. To increase robustness with respect to

    unknown initial conditions and number of false measurements

    the proposed technique uses the MDAEM algorithm and jointly

    solves two different problems: 1) measurement-to-target asso-

    ciation, and 2) estimation of the target motion parameters. The

    TDOA measurement errors are modelled as having Gaussian

    mixture pdfs. The Gaussian mixture modelling is solved so as

    to enable the use of a nonlinear least squares (LS) technique

    in computing the motion parameters for each target.

    The proposed algorithm estimates motion parameters for Ptargets, where P is set by the operator. Often the true number

    of targets in the measurements Pt is not known. In these casesP should be selected so as to be equal or greater than themaximum expected number of the true targets, so that P Pt.The algorithm performs robustly under such conditions and is

    capable of estimating motion parameters for the Pt true targetsin addition to computing parameters for the P Pt dummytargets.

    The paper is organized as follows. Background information

    on the problem of localizing multiple targets observed in

    clutter based on TDOAs of passive transients is presented in

    Section II. The derivation of the DAEM algorithm for multiple

    target localization using Gaussian mixture pdfs is described in

    Section III. This section also presents the MDAEM algorithm

    for target localization that uses stochastic imputations. Theresults of numerical simulations for several target geometries

    are presented in Section IV. Finally, some concluding remarks

    are given in Section V.

    I I . BACKGROUND INFORMATION

    A. Problem Formulation

    We consider a scenario where P targets are assumed to bemoving in a three-dimensional (3-D) observation region. Let

    Vp = [Xp(t)T,vTp ]

    T (1)

    denote a six-dimensional position-velocity parameter vec-tor that corresponds to the pth target where: Xp(t) =[Xp(t), Yp(t), Zp(t)]

    T is the position of the target at time t,vp = [vpx, vpy, vpz]

    T is the constant target velocity and t isthe reference time at which the target position is estimated.

    Also, denote by

    V = [V1,V2, . . . ,VP] (2)

    the motion parameters for all P targets.At different discrete time instants the targets emit transient

    signals asynchronously and independently of each other. The

    signals may propagate through the environment along different

  • 8/2/2019 Automatic Estimation of Multiple Target Positions and Velocities Using Passive TDOA Measurements of Transients

    3/13

    3

    paths and are passively sensed using a number of omnidirec-

    tional sensors. For simplicity we assume an isovelocity acoustic

    propagation model. In a general case the number of sensors

    may vary over time and it is assumed to be known. To ensure

    unique solution in 3-D sensors are not allowed to lie on a

    straight line or on a plane [41].

    B. Measurements of TDOAs of Transient SignalsLet there be N transient detections during an observation

    time period which includes both target- and nontarget-related

    signals. Targets are assumed to have emitted a limited number

    of transients that may convey information sufficient for the

    estimation of their positions and velocities. We also define

    a set of generation time instants associated with the detected

    transients1 {ti}Ni=1. Note that, in general, these time instantsare not evenly spaced.

    Let the ith transient be emitted by the pth target at the timeti and received by Ri + 1 sensors. The exact TDOA betweenthe reference sensor l and the kth sensor, k = 1, . . . , l 1, l +1, . . . , Ri + 1, is given by

    i,j,p =rl,p(ti) rk,p(ti)

    c, j =

    j = k, k < l

    j = k 1, k > l (3)

    where rk,p(ti) is the distance between the pth target and thekth sensor at the time ti

    rk,p(ti) = |Xp(ti)Sk(ti)| =

    (Xp(ti) Sk(ti))T(Xp(ti) Sk(ti)),(4)

    Sk(ti) = [Xk(ti), Yk(ti), Zk(ti)]T is the position of the kth

    sensor and c is the velocity of sound in water.The standard approach to estimating TDOA of a transient

    signal is to select the time lag that maximizes the cross-

    correlation between the outputs from the corresponding pair of

    sensors [3], [4]. In a multipath environment there are typically

    a number of paths reaching each sensor. In this situation the

    cross-correlation function has many peaks and the maximal

    peak may not correspond to the TDOA between arrivals

    travelling along direct paths [5]. One approach is to select

    a number of peaks (local maxima) of the cross-correlation

    assuming that one of them corresponds to the direct path signal

    arrivals. A similar technique is used in [28], [29] that considers

    the problem of tracking multiple speakers in the presence of

    multipath reflections. Here, TDOAs are measured based on

    (continuous) speech signals and are sampled at regularly spaced

    update times. At each update time the measurement set may

    contain the direct path TDOA measurements related to one ormore targets in addition to false measurements.

    By contrast, in our application target transient emissions

    are asynchronous and the signals received at the time ti arerelated to only one target. The emission (update) time instants

    ti are random (are not known in advance) and can be quiteseparated in time. A relatively small number of transients is

    1The generation times can be approximated by observer times, as is acommon practice in underwater target tracking. In this case the observer timesare the times of arrival of the transients at the reference sensor. The maineffect for tracking of relatively slow targets (compared to the velocity of soundpropagation in water) is the introduction of a near constant time offset intotarget location. For very slow targets this effect is negligible.

    expected to be emitted by each target. Besides, there may

    be a number of detected transient emissions that correspond

    to true locations within the observation area but are emitted

    by sources other then the desired targets. For these reasons,

    in this paper we propose to select TDOA measurements of

    transients using the method described by Spiesberger [5], [6],

    [7]. This method identifies robustly and efficiently the cross-

    correlation peak that corresponds to the difference betweenfirst (direct path) transient arrivals in the presence of multipath

    signal propagation. In addition to the cross-correlation between

    the two receivers it uses the two auto-correlation functions and

    assumes that the received signals contain only replicas from a

    single transient of unknown waveform and, also, that the spatial

    coordinates of the multipaths are unknown or are impractical

    to estimate.

    In this way, at the time ti only one measurement per a pair ofreceiving sensors is selected. Consequently, there are Ri TDOAmeasurements {Zi,j}Rij=1 at time ti obtained using Ri + 1sensors. Under the pth target assumption the measurement Zi,jis given by

    Zi,j = i,j,p + ei,j (5)

    where i,j,p is the exact TDOA for the pth target motion model(see Eq. (3)). In the cases where the measurement Zi,j iscorrectly taken using the transients that propagated along the

    direct paths, the TDOA measurement error ei,j is modelledas an independent random variable (rv) that is distributed

    according to a two-component Gaussian mixture pdf

    p(ei,j) = (1 i,j)N(ei,j ; 0, (1)i,j ) + i,jN(ei,j; 0, (2)i,j ) (6)where N(x; , ) denotes a Gaussian (normal) pdf in variablex with mean and standard deviation (std) , and i,j 1defines the proportion of the two pdfs in the mixture. The

    stds are related by (2)

    i,j > (1)

    i,j . In some situations Zi,j maycontain a gross measurement error. It may be far removed from

    the exact TDOA i,j,p so that the corresponding error ei,j isnot distributed according to the pdf in Eq. (6). This may occur

    in the cases where e.g., overlapping multipath reflections are

    received by the sensors.

    As a result, for the N transients, a set of cumulative (batch)measurements Z = {Zi,1, Zi,2, . . . , Z i,Ri}Ni=1 is obtained.It contains both the target related TDOA measurements and

    clutter. Clutter is assumed to arise under one of the following

    conditions: 1) if an existing transient is correctly detected by

    the transient detection algorithm but it originates from a source

    other than the desired targets (considered random) (clutter

    type 1); 2) if the transient detection algorithm recorded thedetection when no transient was present (transient false alarm)

    (clutter type 2); 3) if an existing target-related transient is

    correctly detected, but the TDOA measurement between a pair

    of sensors resulted in a gross error (clutter type 3).

    Our goal is to estimate position and velocity of the P targetsusing the cumulative measurements Z. However, there is noprior information about the origin of the measurements nor

    of the position and velocity of any of the targets. In addition

    to the specific TDOA measurement characteristics described

    above, the computational problems are related to the fact that

    the TDOA measurement errors are non-Gaussian and that the

  • 8/2/2019 Automatic Estimation of Multiple Target Positions and Velocities Using Passive TDOA Measurements of Transients

    4/13

    4

    relationship between the TDOA measurements and the target

    motion parameters is highly nonlinear. Under these limitations

    the following algorithm is deemed to be suitable.

    III. THE ALGORITHM FOR MULTIPLE TARGET

    LOCALIZATION USING TDOAS OF TRANSIENTS

    In this section a likelihood function of the observed data

    is first formulated. Using this formulation a DAEM algorithmfor multiple target localization is derived. Finally, a stochastic

    modification of the DAEM algorithm with an aim to increase

    the robustness with respect to unknown initial conditions and

    number of false measurements in the data set is presented.

    A. Likelihood Structure of the Observed Data

    The cumulative measurements Zrepresent incomplete (ob-served) data [22], [18]. The complete data can be obtained

    by associating with the batch Z a set of discrete valuedindices or labels that uniquely assign each measurement to

    one of the targets or to clutter. However, these assignment

    indices are unobservable and are called missing data. The

    standard approach described in [18], [21] estimates the motion

    parameters for the P targets and the missing assignment indicesby maximizing a likelihood function of the observed data Z.The derivation of the likelihood function used by the standard

    method is presented in Appendix I. An alternative approach

    that is tailored more specifically to our problem is discussed

    next.

    We first note that the Gaussian mixture model for the

    measurement error ei,j at time ti implies that ei,j can be

    generated either from the normal pdf N(ei,j; 0, (1)i,j ) or fromthe pdf N(ei,j ; 0, (2)i,j ), with probabilities (1 i,j) and i,j ,respectively (see Eq. (6)). Therefore, two models for the

    measurement Zi,j conditioned on i,j,p are possible, and thesecorrespond to each of the Gaussian pdfs N(Zi,j; i,j,p, (k)i,j ),k = 1, 2.

    Consequently, we introduce a set of discrete valued as-

    signment indices L = {li,1, li,2, . . . , li,Ri}Ni=1 and define thecomplete data by {Z, L}. The set L is unobservable andrepresents missing data. Each index li,j is modelled as a rvthat takes a value from the discrete set {n : 1 n 2P + 1}. It expresses an assignment hypothesis at time ti asfollows. Setting li,j = 2p 1, p = 1, 2, . . . , P , assigns themeasurement Zi,j to the pth target motion model, and chooses

    the measurement model N(Zi,j ; i,j,p, (1)i,j ). Setting li,j = 2p,however, selects the pth target model and the measurementmodel N(Zi,j; i,j,p, (2)i,j ). The relative prior probabilities ofthe two assignments are (1 i,j) and i,j, respectively. Whenthe assignment index is set to ki,j = 2P+ 1, the measurementZi,j is assigned to clutter.

    Let i = {i,j,1, i,j,2, . . . , i,j,2P+1}Rij=1 denote the proba-bilities associated with the assignment indices {li,j}Rij=1 for theith transient, where i,j,n = Prob[li,j = n]. Also denote by

    = {1, 2, . . . , N} (7)the corresponding assignment probabilities for the batch Z.The complete data probability conditioned on the target motion

    parameters V and the batch assignment probabilities isdefined by

    P(Z, L |V, ) =Ni=1

    Rij=1

    i,j,n qi,j,n|n=li,j(8)

    where

    qi,j,n =

    N(Zi,j; i,j,n, (1)i,j ) n = 2p 1, p = 1, . . . , P N(Zi,j; i,j,n, (2)i,j ) n = 2p, p = 1, . . . , P

    j n = 2P + 1(9)

    and where j is a constant that depends on the sensor pair j.The observed data pdf conditioned on the parameters V

    and is computed as the marginal distribution (over allpossible measurement to target/clutter assignments in L) ofP(Z, L |V, ) in Eq. (8), i.e., as

    P(Z |V, ) =L

    P(Z, L |V, ) =N

    i=1Ri

    j=12P+1

    n=1i,j,n qi,j,n.

    (10)It is required that the marginal pdf P(Z |V, ) and the pdf

    P(Z |V, ) in Eq. (37) are equal as they are derived for thesame data. This condition is satisfied when the assignment

    probabilities i and i are related as follows

    i,j,2p1 = i,p (1 i,j) (11)i,j,2p = i,p i,j (12)

    for p = 1, 2, . . . , P , and, also, when i,j,2P+1 = i,P+1.Eqs. (11) and (12) ensure that the measurement-to-target as-

    signment probabilities for all measurements {Zi,j}Rij=1 at timeti as per Eq. (10) are effectively equal to i,p in Eq. (37).

    In this way the probability that the jth measurement at timeti, Zi,j, originates from the target model p is equal for all

    measurements {Zi,j}Rij=1 taken at that time, that is, theseprobabilities are independent of the index j. Besides, since theprobabilities of clutter measurement i,j,2P+1 do not dependon the measurement index j, it also follows that

    i,j,2P+1 = i,2P+1 = i,P+1 for j = 1, . . . , Ri. (13)

    B. The DAEM Algorithm for Multiple Target Localization

    The proposed algorithm computes a ML estimate of the

    motion parameters for the P target models V and the batch

    assignment probabilities = {ti}N

    i=1. It applies the DAEMalgorithm [35], [36] which is an iterative procedure that

    attempts to minimize the effective cost function based on

    thermodynamic free energy given by

    F(Z |V, ; ) = 1

    logL

    P(Z, L |V, ) . (14)

    The iterative minimization is performed similarly as the max-

    imization of the incomplete log-likelihood function in the

    conventional EM algorithm [22]. Additionally, it incorporates

    an annealing loop due to the dependance of the cost function

    in Eq. (14) on the global control parameter that is, by

  • 8/2/2019 Automatic Estimation of Multiple Target Positions and Velocities Using Passive TDOA Measurements of Transients

    5/13

    5

    analogy to statistical mechanics, interpreted as the inverse of

    the computational temperature [35].

    Each iteration of the DAEM algorithm comprises an expecta-

    tion step (E step) and a maximization step (M step). For brevity

    we denote the parameters to be estimated by = {V, }. Alsodenote by m the parameters estimated during the mth iterationobtained via the DAEM algorithm. The iteration m m+1

    is carried out as follows:E-step: Compute the expectation Q( | m ; ) as a function

    of m defined by

    Q( | m; ) =L

    log[P(Z, L | )] P(L |Z, m ; ). (15)

    M-step: Find m+1 that maximizes Q( | m ; ).The function P(L |Z, ; ) in Eq. (15) represents the poste-

    rior distribution of the missing data L given the observed dataZand the parameters and . Using the analogy to statisticalmechanics and the principle of maximum entropy the posterior

    P(L |Z, ; ) is defined by [35], [36]

    P(L |Z, ; ) = P(Z, L | )

    L P(Z, L | ). (16)

    This distribution depends on the parameter as follows.For = 0 the temperature is high and the distributionP(L |Z, ; ) is uniform. At a low temperature, when = 1,P(L |Z, ; ) reduces to standard posterior and the DAEMalgorithm is equivalent to the EM algorithm. For 0 < < 1an increase of means a change in the form of P(L |Z, ; )from uniform to the standard posterior. Usually, is initiallyset to a small value min 0 (for m = 0) and is graduallyincreased in each iteration.

    Using Eqs. (8) and (10), and substituting m = {Vm, m},it can be shown that Eq. (16) produces

    P(L |Z, m; ) =Ni=1

    Rij=1

    wmi,j,n|n=li,j (17)

    where

    wmi,j,n =(mi,j,n q

    mi,j,n)

    k(

    mi,j,k q

    mi,j,k)

    (18)

    denotes the conditional posterior probability that the measure-

    ment Zi,j originates from the target/measurement model n,

    conditioned on the measured data, the parameters m

    estimatedin the mth iteration, and the global control parameter . InEq. (18), qmi,j,n for n {2p 1, 2p} is a Gaussian pdf givenby

    qmi,j,n =N(Zi,j; mi,j,p, (n)i,j ) (19)

    where mi,j,p is a function of the pth target motion parametersV

    mp estimated in the mth iteration (see also Eqs. (3), (8), (9)).

    The computation of the expectation Q( | m; ) in the E-step is analogous to the procedure described in [18], [21].

    Accordingly this function is broken into several independent

    sub-functions by separating the variables in the parameter

    vector = {V, } as follows

    Q( | m; ) =Ni=1

    Q(i | m; ) +P

    p=1

    QV(Vp | m; ) + O(20)

    where

    Q(i | m; ) =Rij=1

    2P+1n=1

    wmi,j,n log i,j,n (21)

    for i = 1, 2, . . . , N ,

    QV(Vp | m; ) =Ni=1

    Rij=1

    2pn=(2p1)

    wmi,j,n log qi,j,n (22)

    for p = 1, 2, . . . , P , and O is a remainder that dependson

    (q)i,j,n and j. In this way the M-step of the algorithm

    is decoupled into a maximization problem for each set of

    assignment probabilities i = {i,j,1, i,j,2 . . . , i,j,2P+1}Rij=1at the time ti, i = 1, 2, . . . , N , and for each target motionmodel Vp, p = 1, 2, . . . , P .

    The maximization ofQ(i | ; ) in Eq. (21) with respect toi is constrained by the requirement that

    2P+1n=1 i,j,n = 1 for

    j = 1, 2, . . . , Ri. Additional constraints are given by Eqs. (11)and (12) for p = 1, 2, . . . , P and j = 1, 2, . . . , Ri. The detailsof the maximization of Q(i | ; ) under these constraints arepresented in Appendix II. The resulting update equations for

    the assignment probabilities in i are given by

    m+1i,j,2p1 =1 i,j

    Ri

    Ris=1

    2pn=2p1

    wmi,s,n (23)

    m+1i,j,2p = i,jRi

    Ris=1

    2pn=2p1

    wmi,s,n (24)

    for the pth target motion model, p = 1, 2, . . . , P , and for j =1, 2, . . . , Ri, and by

    m+1i,2P+1 =1

    Ri

    Ris=1

    wmi,s,2P+1 (25)

    for clutter.

    The update for the pth target motion parameter vector Vpis defined by

    V

    m+1

    p = argmaxVp QV(Vp | m

    ; ). (26)

    Rearranging Eq. (22) yields

    QV(Vp | m; ) Ni=1

    Rij=1

    logN(Zi,j; i,j,p, mi,j,p) (27)

    whereN(Zi,j ; i,j,p, mi,j,p) is a Gaussian with the effective std

    mi,j,p =

    wmi,j,2p1

    ((1)i,j )

    2 +wmi,j,2p

    ((2)i,j )

    2

    1/2

    . (28)

  • 8/2/2019 Automatic Estimation of Multiple Target Positions and Velocities Using Passive TDOA Measurements of Transients

    6/13

    6

    The update for the pth target motion parameter vector Vm+1pcan be obtained by using a standard weighted nonlinear LS

    procedure, such as the Levenberg-Marquardt algorithm [42].

    This algorithm iterates between the computation of the

    conditional posterior probabilities wmi,j,n in Eq. (18) usingthe current parameter estimate m = {Vm, m} and thecomputation of the updated parameter estimates m+1 =

    {Vm+1

    , m+1

    } using the probabilities wm

    i,j,n as per Eqs. (23)-(28). In each iteration the computational temperature 1/ isdecreased by a small value.

    Remark 1: To understand the relationship between the

    DAEM algorithm and the variance deflation approach [18],

    [34] consider the posterior probability of missing data

    P(L |Z, m; ) in Eqs. (17)-(18). The numerator of the ex-pression in Eq. (18), for a case where n = 2p 1, can berewritten as follows (similar expression can be obtained for

    n = 2p)

    (mi,j,2p1 qmi,j,2p1)

    =

    mi,j,2p1

    2(1)i,j

    exp

    (Zi,j mi,j,p)

    2

    1

    (1)i,j

    2

    .

    (29)

    The denominator of the expression in Eq. (18) also consists

    of the subparts of the form as in Eq. (29). It can be seen

    that the variance in the exponent in Eq. (29) is multiplied

    by 1/. Since 1/ is initially a large number and graduallydecreases in each iteration, this, similar to the variance deflation

    method, has an effect of initially increasing the measurement

    variance. Consequently, the distribution P(L |Z, m; ) isuniform (flat) for high temperatures, so all data configurations

    defined by L are equally probable. This makes the algorithmsless dependant on the initial conditions. However, the approach

    based on variance deflation increases variances used in the

    maximization of Eq. (27), whereas the DAEM algorithm does

    not. Besides, contrary to the DAEM algorithm that has a

    theoretical justification as its cost function is derived based on

    statistical mechanics and the principle of maximum entropy

    [35], [36], variance deflation is an ad hoc method.

    Remark 2: The conventional EM algorithm in [22], obtained

    by setting = 1 in Eq. (18) in all iterations, can not be appliedto our problem. The relationship between the TDOA measure-

    ments and the target motion parameters is highly nonlinear

    and the values of the TDOA measurement error variances in

    Eq. (6) are very small. As a consequence, using, in Eq. (18),

    the parameter values that have even a small offset from the

    true target motion parameters for = 1 results in setting the

    exponential functions in Eq. (29) to zero, deeming Eq. (18)undetermined. For similar reasons the DAEM algorithm is

    not applicable to the maximization of the standard likelihood

    function P(Z |V, ) in Eq. (37) that is based on Gaussianmixture pdfs.

    C. Modified DAEM Algorithm for Multiple Target Localization

    The algorithm described in the previous section usually

    attains the desired ML solution when the initial values of the

    parameters are chosen to be close to this solution. As the

    temperature 1/ gets lower, the solution surface becomes morecomplicated, and the number of saddle points and suboptimal

    maxima increases. In the cases where the initial guess is poor

    (e.g., when the initial values are set at random) this algorithm

    may not obtain the desired global optimum.

    In order to make this method more robust for different initial

    conditions we propose the MDAEM algorithm that randomizes

    the DAEM algorithm by using stochastic imputations [39]. In

    each iteration the algorithm generates pseudo-complete data

    by sampling from a posterior distribution conditioned on themeasurements and the current approximation of the parameters.

    This data is used to obtain the updated parameter values in the

    subsequent iteration.

    The motivation for the MDAEM algorithm comes from the

    papers that describe stochastic imputation methods based on

    the EM algorithm [43], [44]. Celeux and Diebolt [43] apply

    a single stochastic imputation, and are concerned with the

    problem of estimating parameters from finite mixture densities.

    The approach described in [44] utilizes multiple imputations

    to compute the expectation in the E-step of the EM algorithm,

    followed by the maximization step. This approach is useful in

    situations where the computation of the expectation function

    is analytically intractable.The pseudo-complete data in the MDAEM algorithm is

    generated based on the assumption that the signals received by

    the sensors at the time ti contain replicas of transient emissionfrom a single target (this assumption is also used by the TDOA

    measurement process described in Section II-B). In this way all

    TDOA measurements {Zi,j}Rij=1 at the time ti are associatedwith the same target. The probability that the measurement

    subset {Zi,j}Rij=1 is generated by the pth target is computed as

    umi,p =1

    Ri

    Rij=1

    wmi,j,2p1 + w

    mi,j,2p

    for p = 1, 2, . . . , P ,

    (30)and as umi,P+1 =1Ri

    Rij=1 w

    mi,j,2P+1 for clutter. An auxiliary

    set of target assignment indices G = {gi}Ni=1 is next introduced.In the mth iteration of the algorithm an index gmi = p,gmi Gm assigns the measurements {Zi,j}Rij=1 to the pthtarget model. The value of the assignment index gmi Gm israndomly simulated from the distribution umi,p, p = 1, . . . , P +1in Eq. (30) and this is done for all i, i = 1, 2, . . . , N .

    The resulting assignments are used to obtain the functions

    QV(Vp | m; ) in the E-step of the algorithm as follows

    QV(Vp | m; ) iSmpRi

    j=1logN(Zi,j; i,j,p, mi,j,p) (31)

    where Smp = {r : gmr = p}. That is, only the measurementsassigned to the pth target motion model via Gm are used tocompute the new estimate Vm+1p . These measurements are now

    assigned to the pth target motion model with probability 1:they can either belong to one of the target/measurement models

    associated with the pth target or they are clutter obtained as theresult of a gross measurement error (clutter type 3, as described

    in Section II-B). This implies that the probabilities wmi,j,2p1and wmi,j,2p used to compute the effective stds of the Gaussiansmi,j,p in Eq. (31) need to be appropriately renormalized.Equivalently, using the existing probabilities wmi,j,n computed

  • 8/2/2019 Automatic Estimation of Multiple Target Positions and Velocities Using Passive TDOA Measurements of Transients

    7/13

    7

    1. Initialization: Set m = 0, = minSet initial value 0

    2. Iteration m m+1E-step: For all i, j, and n compute wmi,j,n using Eq. (18)M-step: Compute update m+1

    Update for {i}Ni=1 for all i, compute m+1i using Eqs. (23)-(25)

    Update for{Vp}

    P

    p=1 generate Gm = {gmi }Ni=1 by randomly simulatingsamples from umi,p

    for p = 1, . . . , P if Smp = {r : gmr = p} =

    compute Vm+1p using Eqs. (26), (31)-(32)

    else set Vm+1p = Vmp

    3. Set m = m + 1; increase 4. If > max end procedure

    Else go to 2.

    Fig. 1. Pseudo-code of the MDAEM algorithm for motion parameterestimation of P targets in clutter.

    as per Eq. (18) the effective stds can be obtained as

    mi,j,p =

    2p

    s=2p1

    wmi,j,s + wmi,j,2P+1

    1wmi,j,2p1(

    (1)i,j )

    2 +wmi,j,2p

    ((2)i,j )

    2

    1/2

    (32)

    for all {(i, j) : i Smp , j = 1, 2, . . . , Ri} and p = 1, 2, . . . , P .Remark 3: The values (1/mi,j,p)

    2 act as weights in the LS

    procedure used to maximize the function QV(Vp | m; ) inEq. (31). It can be seen from Eq. (32) that, for the correctly

    estimated motion parameters of the pth target, the weight that

    corresponds to a clutter measurement assigned to the pth targetwill be close to zero, since in this case wmi,j,2p1 0 andwmi,j,2p 0.

    Remark 4: In a situation where the true number of targets

    Pt is not known the number of targets used by the algorithmP should be selected so as to be greater than the maximumexpected number of targets in the data, P Pt. The algorithmestimates target positions and velocities for the Pt true targetsand for the PPt dummy targets. If the pth target is a dummytarget it may happen that in the iteration m m+1 nomeasurement is associated with its estimated parameters using

    stochastic imputation, so that Smp = . Consequently, for thistarget the parameters Vp are not updated but the estimates

    obtained in the previous iteration are retained, that is Vm+1p =V

    mp .

    The processing steps of the proposed algorithm are summa-

    rized in Fig. 1. The assignment probabilities i are computedusing the update formulas in Eqs. (23)-(25). The details of the

    procedure used to randomly simulate the auxiliary assignment

    index set Gm in the mth iteration of the algorithm are shownin Fig. 2.

    The sequence {m}m0 produced by the algorithm is anirreducible, inhomogeneous Markov chain [39]. If{m}m0 isalso ergodic then its distribution converges to a unique station-

    ary distribution of this Markov chain. Owing to the non-linear

    For i = 1 to N do {For p = 1, 2, . . . , P compute probabilities umi,p

    using Eq. (30)

    Generate a uniform random variable u on {0, 1},u U(0, 1)

    Set Q = 0, p = 0Repeat {

    set p = p + 1compute Q = Q + umi,p

    } until Q uSet gmi = p

    }

    Fig. 2. Simulation of the assignment indices Gm in the mth iteration of thealgorithm.

    TABLE I

    TARGET POSITIONS AT THE TIME tsq , q = 1, 2, 3, AND VELOCITIES FOR THE

    TRUE TARGETS MOTION MODELS USED IN THE SIMULATIONS.

    Target tsq X(tsq) Y(t

    sq) Z(t

    sq) vx vy vz

    q (s) (m) (m) (m) (m/s) (m/s) (m/s)1 0 4000 -3000 -1900 -3.0 5.5 0.1

    2 270 -3000 -400 -1200 4.2 3.8 -0.2

    3 420 -5000 1200 -50 5.0 -4.0 0.0

    maximization of the expression in Eq. (31) and the presence

    of clutter in the measurements, ergodicity of this chain is very

    hard to study. The results obtained by numerical simulations

    using test geometries with Pt {1, 2, 3} targets indicate thatwith the appropriately chosen cooling schedule and for random

    initial conditions this algorithm usually converges to the desired

    optimal solution. Some of these results are presented in the

    following section.

    IV. NUMERICAL SIMULATIONS AND DISCUSSION

    Three target motion models are used in the numerical

    simulations. For each model the transient emission starts at a

    different time instant tsq , q = 1, 2, 3. The positions of the targetsat the times tsq and their velocities are listed in Table I. Thecoordinate system is oriented such that xy plane overlaps withthe water surface and negative z axis corresponds to depth.The simulations are performed for the test scenarios where

    Pt {1, 2, 3} targets are observed in clutter. The test scenariowith Pt = 1 target uses the motion model q = 1 (Target 1) in

    Table I; for the scenario with Pt = 2 targets the motion modelsare q = 1 and q = 2 (Targets 1 and 2), and in the case wherePt = 3 the motion models are q = 1, q = 2 and q = 3.

    An array of five receiving sensors is used in the time delay

    analysis. The sensors are stationary and are positioned at

    U1 = [0, 0, D2]T, U2 = [D1, D1, D1 + D2]T, U3 =

    [D1, D1, D1 + D2]T, U4 = [D1, D1, D1 + D2]T andU5 = [D1, D1, D2 D1]T where D1 = 7 0/

    3) and

    D2 = 300 m.For each test scenario the transient emissions that correspond

    to the qth target, q {1, . . . , P t}, start at tsq and the timeincrements between two consecutive emissions are randomly

  • 8/2/2019 Automatic Estimation of Multiple Target Positions and Velocities Using Passive TDOA Measurements of Transients

    8/13

    8

    drawn from the interval [tmin, tmax], tmin = 10 s,tmax = 180 s. All TDOAs are measured with respect tothe reference sensor U1 and the number of measurements at

    the time ti is Ri = 4. The velocity of sound in water is takento be c = 1500 m/s. The observation area is limited to have amaximum range of 6000 m relative to the reference sensor andconstant depth z = 2300 m. The parameters of the TDOAerror distribution (see Eq. (6)) are:

    (1)

    i,j = (1)

    = 40 s and(2)i,j = 200 s, and i,j = 0.20. The measurements that fall

    outside the observation area are discarded so the real number

    of transients is NTq 12 for q = 1, . . . , P t.A target-originated data set obtained in this way is combined

    with a set of clutter TDOA measurements based on real tran-

    sients emitted by sources other than the desired targets (clutter

    type 1 described in Section II-B). The total number of clutter-

    related emissions per km3 per hour is taken to have a Poisson

    distribution with mean . These emissions occur randomly inthe time interval that starts 8 min. before and finishes 8 min.

    after the observation time for the target-originated measurement

    set, and the corresponding source locations are taken to be

    uniformly distributed within the observation area. Each emis-

    sion of this type is assigned TDOA measurements computed as

    per Eqs. (3) and (4). Clutter type 3 is simulated by assuming

    that every measurement in a target-related measurement set

    can result in a gross error with probability . Clutter type 2caused by false transient detections is not considered. Such

    measurements can be easily identified and removed as they

    usually do not correspond to meaningful physical locations.

    The MDAEM algorithm estimates P target motion models.No previous knowledge of the motion parameters is assumed.

    Therefore, on initialization the target position at the reference

    time t, Xp(t), and target velocity vp for the pth motion

    model, p {1, . . . , P }, are randomly selected from appropriateuniform distributions. In particular, x and y components of theposition are taken to be uniformly distributed within 3000 mto 3000 m, the z component within 0 m to 2300 m, and eachcomponent of the velocity within 5 m/s to 5 m/s. Similarly,initially, assignments to any of the target models or to clutter

    are assumed to be equally probable.

    Lowering the computational temperature (which is inverse

    to the control parameter in Eq. (18)), or cooling, in theMDAEM algorithm should be performed relatively slowly to

    enable search through the entire solution space and to allow for

    the convergence to globally optimal solution. In this paper, in

    the mth iteration of the algorithm, the parameter is computed

    as = min(m + 1)

    2 m = 0, 1, . . . (33)

    is limited to min < max 1, where min =C1(

    (1))2

    , C1 = const. The algorithm usually attains thesteady state condition for < 1 so the computation can bestopped before = 1 is reached. The stopping criterion isdefined when > max = C2 (

    (1))2 where C2 = const,C2 C1. In this paper, C1 = 3 and C2 = 22 106. Theparameter j in Eq. (9) is set to 10

    7 and the reference time

    at which the target positions Xp(t) are estimated is t = 900s.

    The nonlinear LS procedure based on the Levenberg-

    Marquardt algorithm [42] is used in each iteration of the

    MDAEM algorithm to compute the target motion parameters

    by maximizing the expression in Eq. (31). This LS algorithm

    is in itself iterative and requires convergence to obtain the

    solution. In our application, however, it is not necessary for

    this procedure to converge, that is, only few passes through

    this algorithm in each iteration of the MDAEM algorithm are

    sufficient for the overall convergence.A simulation that corresponds to the scenario with Pt =

    3 targets is shown in Fig. 3 (a). The figure shows thetarget-related transient-emission positions and clutter-related

    emissions generated as described above. The noisy TDOA

    measurements Zi,2 and Zi,4, i = 1 . . . , N are shown inFig. 4. The probability of gross measurement error is =0.1. The MDAEM algorithm is run by setting the numberof the estimated target motion models to P = Pt = 3. Thetarget trajectories obtained using the true target positions and

    velocities and the trajectories estimated by the algorithm are

    shown in Fig. 3 (b). True and estimated positions of the targets

    at the reference time t are denoted by the star symbols. Also,

    the symbol denotes the starting position of the target (trueand estimated) at the time tsq, for q = 1, 2, 3.

    Often the number of targets Pt in the measurement set is notknown. In this situation the number of target models estimated

    by the algorithm, P, should be chosen so as to be equal orgreater then the maximum expected number of targets, so that

    P Pt. The proposed algorithm performs robustly under suchmodel mismatch conditions. It estimates the motion parameters

    for the Pt true targets in addition to computing parameters forthe P Pt dummy targets.

    An example where P = 4 target motion models are esti-mated by the MDAEM algorithm using a data set that contains

    measurements from Pt = 1 target is shown in Table II. Themeasurements are derived using N = 53 transients of which 12are related to the target and the rest are clutter type 1 simulated

    as described above. The probability of gross measurement error

    is = 0.1. The last row in Table II shows the true targetparameters. It can be seen that the estimated parameters for

    the target model p = 2 are close to the true target parameterswhile the parameters for other models are far removed.

    The measurement-to-target assignment probabilities that cor-

    respond to the estimated target models p = 1, . . . , 4 inTable II and the measurement-to-clutter assignment proba-

    bilities computed by the MDAEM algorithm are shown in

    Figure 5 (a) and (b), respectively. In both figures the abscissa

    denotes the measurement index i (in time), i = 1, 2, . . . , N ,and the vertical lines mark the position of the target-related

    measurements. For the pth estimated target motion model theassignment probabilities shown in Figure 5 (a) are computed

    as i,j,2p1/(1 i,j) for all i (see Eq. (11)). As discussedin Section III-A these probabilities do not depend on index j.It can be seen from Figure 5 (a) that the target assignment

    probabilities that correspond to the model p = 2 are large forthe target-related measurements. They have value 1 for those

    time indices where the TDOA measurements from all sensor

    pairs associated with one target-related transient emission are

    correct and value 0.75 in the cases where one of the TDOA

  • 8/2/2019 Automatic Estimation of Multiple Target Positions and Velocities Using Passive TDOA Measurements of Transients

    9/13

    9

    60004000

    20000

    20004000

    6000

    60004000

    20000

    20004000

    6000

    2200

    2000

    1800

    1600

    1400

    1200

    1000

    800

    600

    400

    200

    0

    X (m)Y (m)

    Z(m)

    Target 1Target 2Target 3ClutterSensors

    (a)

    60004000

    20000

    20004000

    6000

    6000

    400020000

    20004000

    6000

    2200

    2000

    1800

    1600

    1400

    1200

    1000

    800

    600

    400

    200

    0

    X (m)Y (m)

    Z(m)

    True Target 1Estimated Target 1True Target 2Estimated Target 2True Target 3Estimated Target 3

    (b)

    Fig. 3. Simulation that corresponds to the tracking scenario with Pt = 3targets: (a) true target trajectories; (b) true and estimated target trajectories.True and estimated positions of the targets at the reference time t are denotedby the star symbol. The symbol o denotes the target position at the time tspfor p = 1 , 2, 3.

    measurements has a gross measurement error. The models

    p = 1, p = 3 and p = 4 correspond to dummy targets.They are associated only with a few (clutter) measurements

    in the data set and the association probabilities are in most

    cases small. Figure 5 (b) shows the measurement-to-clutter

    assignment probabilities i,2P+1 estimated by the algorithmfor this data set.

    A. Comparison to Other Methods

    The standard PDA method and the methods based on the EM

    algorithm that can be applied to this problem are very sensitive

    to initial conditions and would require a similar cooling scheme

    in order to assure convergence to a global optimum. Other

    approaches that treat related, but not identical, problems such

    as those described in [30], [31], [32], [33] are also inapplicable.

    Recently, Vo et al. [27], [28] and Ma et al. [29] describe

    a method for the localization of unknown number of speakers

    500 0 500 1000 1500 2000 25000.04

    0.03

    0.02

    0.01

    0

    0.01

    0.02

    0.03

    0.04

    0.05

    Time (s)

    Zi,2

    (s)

    Target 1Target 2Target 3Clutter

    (a)

    500 0 500 1000 1500 2000 25000.05

    0.04

    0.03

    0.02

    0.01

    0

    0.01

    0.02

    0.03

    0.04

    Time (s)

    Zi,4

    (s)

    Target 1Target 2Target 3Clutter

    (b)

    Fig. 4. The TDOA measurements used to obtain the results in Fig. 3 (b):(a) {Zi,2}Ni=1 (b) {Zi,4}

    Ni=1

    . Shown are both clutter and the target relatedmeasurements where = 0.1.

    TABLE II

    TARGET POSITIONS AT THE REFERENCE TIME t

    AND VELOCITIES FOR

    P = 4 TARGETS MOTION MODELS ESTIMATED BY THE MDAEM

    ALGORITHM WHEN THE TRUE NUMBER OF TARGETS IS Pt = 1 . THE TRUE

    TARGET PARAMETERS ARE SHOWED IN THE LAST ROW.

    Target Xp(t) Yp(t) Zp(t) vx vy vz

    p (m) (m) (m) (m/s) (m/s) (m/s)

    1 4495.2 1165.7 -3960.8 7.207 9.212 -5.014

    2 1317.4 1976.1 -1830.1 -3.035 5.570 0.097

    3 -792.1 807.1 -743.2 0.581 1.877 -2.907

    4 2213.8 -519.6 1216.2 -1.677 -7.232 7.654

    true 1300 1950 -1810 -3.0 5.5 0.1

    in multipath environment using TDOA measurements. They

    employ the random finite set (RFS) theory and a SMC imple-

    mentation to develop a Bayesian RFS filter that simultaneously

    tracks the time-varying speaker location in 2-D and the number

    of speakers. This approach treats a problem similar to the one

    considered in this paper and offers a possibility of an extension

    to tracking targets with non-constant velocities. Given that our

    measurements are sparse in time, computationally extensive

    tracking algorithms based on particle filters [45] could be

    applied. However the approach in [28], [29] assumes a known

    environment that is confined to a room enclosure. It uses a

  • 8/2/2019 Automatic Estimation of Multiple Target Positions and Velocities Using Passive TDOA Measurements of Transients

    10/13

    10

    0 10 20 30 40 50 600

    0.2

    0.4

    0.6

    0.8

    1

    Measurement Index

    Target

    AssignmetProbabilities

    Model 1

    Model 2

    Model 3

    Model 4

    (a)

    0 10 20 30 40 50 600

    0.2

    0.4

    0.6

    0.8

    1

    Measurement Index

    ClutterAssign

    metProbabilities

    Clutter

    (b)

    Fig. 5. Assignment probabilities estimated by the MDAEM algorithm forP = 4 target motion models using a data set that contains measurementsfrom Pt = 1 target in clutter: (a) target assignment probabilities for the targetmodels p = 1, . . . , 4 (b) clutter assignment probabilities. Vertical lines markthe position of the target-related measurements.

    Gaussian pdf for the TDOA measurement error and does not

    apply Gaussian mixture model. Other important differences

    between our problem and the problem described in [28], [29]

    are related to the TDOA measurement process as discussed in

    Section II-B. Most notably, the approach in [28], [29] assumes

    regularly spaced measurement update times, whereas the up-

    date times in our application are random and the measurements

    from different targets are asynchronous. For these reasons it is

    conjectured that the approach in [28], [29] can not be applied

    to our data without additional adjustments.

    B. Average Performance of the MDAEM Algorithm

    We test the ability of the MDAEM algorithm to correctly

    estimate motion parameters for Pt targets in a situation wherethe true number of targets in the data set is not exactly known.

    The number of the estimated target models P is taken tobe greater than the maximum expected number of targets, so

    that P > Pt. For all tracking scenarios with Pt {1, 2, 3}targets, the MDAEM algorithm is run by setting the number

    of estimated target motion models to P = 4. For each scenarioaverage performance of the algorithm is obtained by using

    100 sets of target-related measurements generated as described

    above. These sets are combined with clutter measurements that

    are simulated for several different values of the parameters (for clutter type 1) and (for clutter type 3). For each runthe best fitted results are selected by comparing the parameters

    estimated by the algorithm to the known parameters that corre-

    spond to the true targets. In particular, mean (per sample) abso-

    lute distances (MADs) are computed between the P trajectoriesthat correspond to the estimated target motion models and

    the trajectories obtained using the Pt true target parameters.The MADs are computed using the distances evaluated for

    every target-related measurement time (for more information

    on how MAD is computed see [13]). The known true target

    motion models are then associated with those estimated target

    parameters for which the corresponding MADs are found to

    be minimized.

    The means of the MADs related to the estimated target

    models associated with the true targets, over the 100 setsof simulated measurements and for the scenarios with Pt {1, 2, 3} targets, are shown in Tables III-V. The results arepresented as a function of the parameters and . For each

    value of the parameter Tables III-V also show the meannumber of clutter points (MNC) (for clutter type 1) in the

    simulated batch measurements over the 100 data sets.

    A ML estimate of the motion parameters is computed for

    each of the Pt {1, 2, 3} targets in a data set where themeasurement to target assignments are assumed to be exactly

    known, i.e., where only the (correct) measurements generated

    by a specific target are used to estimate the corresponding

    motion parameters. This technique is denoted as the indepen-

    dent ML estimation (IMLE). The results obtained using the

    IMLE are compared to the results obtained using the proposed

    MDAEM algorithm.

    The optimization in the IMLE is done using constrainednonlinear optimization [42]. This technique is also very sen-

    sitive to the initial parameter values that are, similarly as for

    the MDAEM algorithm, determined randomly. To increase the

    accuracy of the estimation this technique is applied iteratively

    in conjunction with variance deflation.

    The last row in Tables III-V shows the mean MAD computed

    for different tracking scenarios using the IMLE. It can be

    seen from Tables III-V that the results obtained using the

    MDAEM algorithm are close to those obtained using the IMLE.

    Moreover, the estimation accuracy of the MDAEM algorithm

    does not change much with the parameters and .

    The results of the experiments show that the MDAEM

    algorithm for multiple target localization performs robustly

    under the above model mismatch conditions (see also Table II).

    It provides correct estimates of the Pt target motion modelsthat correspond to the true targets, in addition to estimating

    parameters of P Pt dummy targets. In a more generalsituation these estimates can be passed on to an appropriately

    designed detection algorithm that can be used to determine

    the number of true targets in the data set and to select the

    corresponding motion parameter estimates. A technique that

    can verify the presence of a single target in the data set has

    recently been proposed in [13]. This technique can be extended

    and used for the detection in scenarios with multiple targets.

  • 8/2/2019 Automatic Estimation of Multiple Target Positions and Velocities Using Passive TDOA Measurements of Transients

    11/13

    11

    TABLE III

    MEA N MAD FOR INDIVIDUAL TARGETS COMPUTED USING 100

    SIMULATIONS THAT CORRESPOND TO THE TEST SCENARIO WITH Pt = 1

    TRUE TARGET. THE ESTIMATION METHODS ARE THE MDAEM

    ALGORITHM AND THE IMLE (LAST ROW).

    Mean MAD (m) MNC Pt = 1

    q = 1

    0 0 0 -

    0 1 105 18.44 83.58

    0 2 105 31.94 86.45

    0.1 0 0 89.38

    0.1 1 105 18.78 93.07

    0.1 2 105 33.51 99.32

    - - - 74.23

    TABLE IV

    MEA N MAD FOR INDIVIDUAL TARGETS COMPUTED USING 100

    SIMULATIONS THAT CORRESPOND TO THE TEST SCENARIO WITH Pt = 2

    TRUE TARGETS. THE ESTIMATION METHODS ARE THE MDAEM

    ALGORITHM AND THE IMLE (LAST ROW).

    Mean MAD (m)

    MNC Pt = 2

    q = 1 q = 2

    0 0 0 80.71 65.35

    0 1 105 20.04 87.31 72.07

    0 2 105 37.46 89.32 68.43

    0.1 0 0 97.83 80.80

    0.1 1 105 20.43 91.21 73.74

    0.1 2 105 37.15 100.25 74.17

    - - - 79.11 57.03

    TABLE V

    MEA N MAD FOR INDIVIDUAL TARGETS COMPUTED USING 100

    SIMULATIONS THAT CORRESPOND TO THE TEST SCENARIO WITH Pt = 3

    TRUE TARGETS. THE ESTIMATION METHODS ARE THE MDAEM

    ALGORITHM AND THE IMLE (LAST ROW).

    Mean MAD (m)

    MNC Pt = 3

    q = 1 q = 2 q = 3

    0 0 0 83.18 74.78 93.01

    0 1 105 19.95 90.22 77.59 91.67

    0 2 105 39.93 89.20 76.20 94.16

    0.1 0 0 93.59 77.86 99.13

    0.1 1 105 19.93 96.29 83.13 95.15

    0.1 2 105 40.43 90.10 83.44 100.73

    - - - 76.79 62.00 86.06

    V. CONCLUSION

    Under consideration has been the problem of estimating

    motion parameters of multiple targets using TDOA measure-

    ments based on target-emitted transient signals. The targets

    are moving linearly in a three-dimensional (3-D) observation

    area contaminated by clutter. The TDOA measurement errors

    are modelled as having (possibly different) Gaussian mixture

    probability density functions. An iterative ML optimization

    technique based on a deterministic annealing version of the

    EM algorithm is applied to this problem. For each target a ML

    estimate of the target motion parameters is obtained using a

    nonlinear LS method. The proposed algorithm generalizes the

    variance deflation method previously used for the initialization

    of several target tracking algorithms.

    The performance of the proposed algorithm is tested using

    simulated measurements related to tracking scenarios with one,

    two and three targets. The number of targets in the measure-

    ment data is supposed not to be known and the algorithm is

    run under the conditions that the number of estimated targets

    is greater than the number of the true targets. The simulationresults are presented for several different clutter densities and

    probabilities of gross error of the target related measurements.

    The proposed algorithm performs robustly under the model

    mismatch conditions. The results are found to be comparable

    to the estimation results obtained when the measurement-to-

    target assignments are exactly known.

    APPENDIX I

    Denote by {Z, K} the complete data where K ={ki,1, ki,2, . . . , ki,Ri}Ni=1 are (unobservable) discrete valuedassignment indices associated with the batch measurements Z.The indices {ki,j}Rij=1 K are modelled as independent andidentically distributed rvs that take a value from the discrete set

    {m : 1 m P + 1}. Following the standard method [18],[21], each index ki,j assigns the corresponding measurementZi,j either to the target motion model m, m = 1, 2, . . . , P (when ki,j = m) or to clutter (for ki,j = P+1). Next define byi = {i,1, . . . , i,P+1} the probability vector associated withthe assignment indices {ki,j}Rij=1, where i,m = Prob[ki,j =m], and where [18]

    {i,m 0}P+1m=1 andP+1

    m=1

    i,m = 1. (34)

    Also, denote the assignment probabilities for the batch Zby = {i}Ni=1. The probability i,m that the jth measurementat time ti, Zi,j, originates from the model m is equal for all

    measurements {Zi,j}Rij=1 taken at that time.The complete data probability of{Z, K}, under the assump-

    tion that the motion parameters V and the batch assignment

    probabilities are known, is given by

    P(Z, K |V, ) =Ni=1

    Rj=1

    i,m pi,j,m|m=ki,j(35)

  • 8/2/2019 Automatic Estimation of Multiple Target Positions and Velocities Using Passive TDOA Measurements of Transients

    12/13

    12

    where pi,j,m is the measurement model pdf corresponding tothe target assignment m, m = 1, 2, . . . , P ,

    pi,j,m = p(Zi,j; i,j,m) = (1i,j)N(Zi,j; i,j,m, i,j(1))+i,jN(Zi,j ; i,j,m, (2)i,j ).(36)

    For clutter assignment ki,j = P + 1 and the correspondingprobability is pi,j,P+1 = j , where j is a constant.

    The marginal distribution of P(

    Z,

    K |V, ) over

    Kis

    defined as

    P(Z |V, ) =K

    P(Z, K |V, ) =Ni=1

    Rij=1

    P+1m=1

    i,m pi,j,m.

    (37)

    APPENDIX II

    The maximization of the function Q(i | ; ) in Eq. (21)with respect to i = {i,j,1, i,j,2, . . . , i,j,2P+1}Rij=1 is con-strained by the requirement that

    2P+1

    n=1

    i,j,n = 1, for j = 1, 2, . . . , Ri (38)

    i.e., these probabilities must sum to unity, as each represents the

    fraction of the measurement produced by a particular model.

    Additional constraints are obtained by using Eqs. (11) and (12),

    asi,j,2p11 i,j =

    i,j,2pi,j

    (39)

    for p = 1, 2, . . . , P and j = 1, 2, . . . , Ri. From Eqs. (11) and(12) it also follows that

    i,1,2p11 i,1 =

    i,2,2p11 i,2 = . . . =

    i,Ri,2p11 i,Ri

    (40)

    and

    i,1,2pi,1

    = i,2,2pi,2

    = . . . = i,Ri,2pi,Ri

    . (41)

    Using the (P + 1)Ri constraints in Eqs. (38) and (39) theLagrangian for this maximization problem is formulated as

    Li = Q(i | ; )+Rij=1

    Pp=1

    j,p [(1 i,j) i,j,2p i,ji,j,2p1]+Rij=1

    j,P+1

    1

    2P+1n=1

    i,j,n

    (42)

    where j,p are Lagrange multipliers. Differentiating the La-grangian with respect to i,j,2p1 and i,j,2p and setting theresults to zero yields

    i,j,2p1 =wi,j,2p1

    i,jj,p + j,P+1(43)

    i,j,2p =wi,j,2p

    j,P+1 (1 i,j) j,p (44)

    for the P target motion models p = 1, 2, . . . , P . For the cluttermodel we have

    wi,j,2P+1 = i,j,2P+1j,P+1. (45)

    Summing both sides of Eq. (45) over j and using the assump-tion in Eq. (13) it follows that

    i,2P+1 =

    Rij=1 wi,j,2P+1

    Rij=1 j,P+1

    . (46)

    We next eliminate i,j,2p1 and i,j,2p from Eqs. (43) and(44) using Eq. (39) and apply the result to express j,p as afunction of j,P+1 as follows

    j,p = j,P+1i,jwi,j,2p1 (1 i,j)wi,j,2p

    i,j(1 i,j)(wi,j,2p1 + wi,j,2p) . (47)

    The substitution of Eqs. (43), (44), (46) and (47) in Eq. (38),

    after some manipulation, results inP

    p=1

    (wi,j,2p1 + wi,j,2p) + j,P+1

    Rij=1 wi,j,2P+1Rij=1 j,P+1

    = j,P+1.

    (48)

    Summing both sides of this expression over j we obtain that

    Rij=1

    j,P+1 =

    Rij=1

    2P+1n=1

    wi,j,n = Ri (49)

    where we noted that2P+1

    n=1 wi,j,n = 1 (according to Eq. (18)).We next substitute Eq. (47) in Eqs. (43) and (44) and obtain

    j,P+1 i,j,2p11 i,j = j,P+1 i,j,2pi,j = wi,j,2p1 + wi,j,2p (50)Again, after summing Eq. (50) over j and by using Eqs. (40),(41) and (49), the expressions for the target assignment prob-

    abilities i,j,2p1 and i,j,2p are obtained as

    i,j,2p1 =1 i,j

    Ri

    Ris=1

    2pn=2p1

    wi,s,n (51)

    i,j,2p =i,jRi

    Ris=1

    2pn=2p1

    wi,s,n (52)

    The expression for the probability of clutter assignment i,2P+1is obtained by substituting Eq. (49) in Eq. (46) as

    i,2P+1 =1

    Ri

    Ris=1

    wi,s,2P+1. (53)

    REFERENCES

    [1] G. C. Carter, Time delay estimation for passive sonar signal processing,IEEE Trans. Acoust., Speech, Signal Processing, vol. 29, no. 3, pp. 463470, 1981.

    [2] C. H. Knapp and G. C. Carter, The generalized correlation methodfor estimation of time delay, IEEE Trans. Acoust., Speech, SignalProcessing, vol. 24, no. 4, pp. 320327, 1976.

    [3] B. G. Ferguson and J. L. Cleary, In situ source level and source positionestimates of biological transient signals produced by snapping shrimp in

    an underwater environment, J. Acoust. Soc. Am., vol. 109, no. 6, pp.30313037, 2001.

    [4] J. C. Hassab and R. E. Boucher, Optimum estimation of time delay by ageneralized correlator, IEEE Trans. Acoust., Speech, Signal Processing,vol. 27, no. 4, pp. 373380, 1979.

    [5] J. L. Spiesberger, Identifying cross-correlation peaks due to multipathswith the application to optimal passive localization of transient signalsand tomographic mapping of the environment, J. Acoust. Soc. Am., vol.100, pp. 910917, 1996.

    [6] , Linking auto- and cross-correlation functions with correlationequations: Application to estimating the relative travel times and am-plitude in multipath, J. Acoust. Soc. Am., vol. 104, no. 1, pp. 300312,1998.

    [7] , Finding the right cross-correlation peak for locating sounds inmultipath environments with a fourth-moment function, J. Acoust. Soc.

    Am., vol. 108, no. 3, pp. 13491352, 2000.

  • 8/2/2019 Automatic Estimation of Multiple Target Positions and Velocities Using Passive TDOA Measurements of Transients

    13/13

    13

    [8] B. G. Ferguson, Improved time-delay estimates of underwater acousticsignals using beamforming and prefiltering techniques, IEEE J. Oceanic

    Eng., vol. 14, no. 5, pp. 238244, 1989.

    [9] J. P. Ianniello, Time delay estimation via cross-correlation in thepresence of large estimation errors, IEEE Trans. Acoust., Speech, SignalProcessing, vol. 30, no. 6, pp. 9981003, 1982.

    [10] L. A. Pflug, G. E. Ioup, J. W. Ioup, R. L. Field, and J. H. Leclere, Time-delay estimation for deterministic transients using second- and higher-order correlations, J. Acoust. Soc. Am., vol. 94, no. 3, pp. 13851399,1993.

    [11] B. G. Ferguson, Variability in passive ranging of acoustic sources inair using a wavefron curvature technique, J. Acoust. Soc. Am., vol. 108,no. 4, pp. 15351543, 2000.

    [12] D. Carevic, Robust estimation techniques for target-motion analysisusing passively sensed transient signals, IEEE J. Oceanic Eng., vol. 28,no. 2, pp. 250261, 2003.

    [13] , Tracking target in cluttered environment using multilateral time-delay measurements, J. Acoust. Soc. Am., vol. 115, no. 3, pp. 11981206,2004.

    [14] Y. Bar-Shalom and T. E. Fortman, Tracking and data association. NewYork: Academic Press, 1988.

    [15] H. M. Shertukde and Y. Bar-Shalom, Detection and estimation formultiple targets with two omnidirectional sensors in the presence of falsemeasurements, IEEE Trans. Acoust., Speech, Signal Processing, vol. 38,no. 5, pp. 749763, 1990.

    [16] C. Jauffret and Y. Bar-Shalom, Track formation with bearing andfrequency measurements in clutter, IEEE Trans. Aerosp. Electr. Syst.,vol. 26, no. 6, pp. 9991010, 1990.

    [17] T. Kirubarajan and Y. Bar-Shalom, Low observable target motionanalysis using amplitude information, IEEE Trans. Aerosp. Electon.Syst., vol. 32, no. 4, pp. 13671384, 1995.

    [18] H. Gauvrit, J. P. Le Cadre, and C. Jauffret, A formulation of multitargettracking as an incomplete data problem, IEEE Trans. Aerosp. Electron.Syst., vol. 33, no. 4, pp. 12421257, 1997.

    [19] D. Avitzour, A maximum likelihood approach to data association, IEEETrans. Aerosp. Electron. Syst., vol. 28, no. 2, pp. 560566, 1992.

    [20] R. L. Streit and T. E. Luginbuhl, Maximum likelihood method forprobabilistic multi-hypothesis tracking, in Proc. SPIE, vol. 2235, 1994,pp. 394405.

    [21] , Probabilistic multi-hypothesis tracking, Naval Undersea WarfareCenter, Tech. Rep. 10428, February 1995.

    [22] A. P. Dempster, N. M. Liard, and D. B. Rubin, Maximum likelihoodfrom incomplete data via EM algorithm, J. Royal Stat. Soc. B, vol. 39,

    pp. 138, 1977.[23] R. G. Hutchins and D. T. Dunham, Evaluation of a probabilisticmultihypothesis tracking algorithm in cluttered environment, in Proc.Thirtieth Asilomar Conf. on Signals, Systems and Computers, vol. 2, LosAlamitos, CA, 3-6 Nov. 1996, pp. 12601264.

    [24] J. C. Hassab, B. W. Guimond, and S. C. Nardone, Estimation of locationand motion parameters of a moving source observed from a linear array,

    J. Acoust. Soc. Am., vol. 70, no. 4, pp. 10541061, 1981.

    [25] S. K. Katsikas, A. K. Leros, and D. G. Lainiotis, Underwater tracking ofmaneouvring target using time delay mesurements, Signal Processing,vol. 41, no. 1, pp. 1729, 1995.

    [26] B. Xerri, J.-F. Cavassilas, and B. Borloz, Passive tracking in underwateracoustics, Signal Processing, vol. 82, no. 8, pp. 10671085, 2002.

    [27] B. Vo, S. Singh, and W. K. Ma, Tracking multiple speakers with randomsets, in Proc. ICASSP 2004, Montreal, Canada, 2004.

    [28] B. Vo, W. K. Ma, and S. Singh, Localizing an unknown time-varyingnumber of speakers: a Bayesian random finite set approach, in Proc.

    ICASSP 2005, Philadelphia, USA, 2005.[29] W. K. Ma, B. Vo, S. Singh, and A. Baddley, Tracking an unknown time-varying number of speakers using TDOA measurements: a random finiteset approach, IEEE Trans. Signal Processing, accepted for publication.

    [30] L. Frenkel and M. Feder, Recursive expectation-maximization (EM)algorithms for time-varying parameters with applications to multipletarget tracking, IEEE Trans. Signal Processing, vol. 47, no. 2, pp. 306320, 1999.

    [31] G. Pulford and B. La Scala, MAP estimation of target manoeuvresequence with the expectation-maximization algorithm, IEEE Trans.

    Aerosp. Electron. Syst., vol. 38, no. 2, pp. 367377, 2002.

    [32] D. B. Reid, An algorithm for tracking multiple targets, IEEE Trans.Automatic Control, vol. 24, no. 6, pp. 843854, 1979.

    [33] Y. Barniv, Dynamic programming solution for detecting dim movingtargets, IEEE Trans. Aerosp. Electron. Syst., vol. 21, no. 1, pp. 144156, 1985.

    [34] P. Willett, Y. Ruan, and R. Streit, PMHT: Problems and some solutions,IEEE Trans. Aerosp. Electron. Syst., vol. 38, no. 3, pp. 738754, 2002.

    [35] N. Ueda and R. Nakano, Deterministic annealing EM algorithm, NeuralNetworks, vol. 11, no. 2, 1998.

    [36] , Mixture density estimation via EM algorithm with deterministicannealing, in Proc. IEEE Workshop Neural Networks for Signal Pro-cessing, Ermioni, Greece, 6-8 Sept. 1994, pp. 6977.

    [37] M. Takada and R. Nakano, Multi-thread search with deterministicannealing EM algorithm, in Proc. Int. Joint Conf. Neural Networks,

    IJCNN 2002, vol. 1, Honolulu, HI ,USA, 12-17 May 2002, pp. 1034

    1038.[38] , Threshold-based dynamic annealing for multi-thread DEAM and

    its extreme, in Proc. Int. Joint Conf. Neural Networks, IJCNN 2003,vol. 1, Oregon, PO, 20-24 July 2003, pp. 501506.

    [39] M. A. Tanner, Tools for statistical inference. Springer-Verlag New YorkInc., 1996.

    [40] S. Kirkpatrick, J. C. D. Gelatt, and M. P. Vecchi, Optimization bysimulated annealing, Science, vol. 220, no. 4598, pp. 671679, 1983.

    [41] K. C. Ho and W. Xu, An acurate algebraic solution for moving sourcelocation using TDOA and FDOA measurements, IEEE Trans. SignalProcessing, vol. 52, no. 9, pp. 24532463, 2004.

    [42] P. R. Gill, W. Murray, and M. H. Wright, Practical optimization.London: Academic Press, 1981.

    [43] G. Celeux and J. Diebolt, The SEM algorithm: a probabilistic teacheralgorithm derived from the EM algorithm for mixture problem, Com-

    putational Statistics Quaterly, vol. 2, pp. 7382, 1985.[44] G. C. G. Wei and M. A.Tanner, A Monte Carlo implementation of the

    EM algorithm and poor mans data augmentation algorithm, J. Am. Stat.Assoc., vol. 85, no. 411, pp. 699704, 1990.

    [45] M. S. Arulampalam, S. Maskell, N. Gordon, and T. Clapp, A tutorialon particle filters for online nonlinear-non-Gaussian Bayesian tracking,

    IEEE Trans. Signal Processing, vol. 50, no. 2, pp. 174188, 2002.