Artifacts removal for GLS map makers - uniroma1.itinfocom.uniroma1.it/~lorenz/trep-01-04-11.pdf2011/01/04 · Artifacts removal for GLS map makers LorenzoPiazzo University of Rome

.

Artifacts removal for GLS map makers

Lorenzo Piazzo

University of Rome ”La Sapienza” - DIET Dept.

Internal Report no. 001-04-11 (no. 001, sector 04, year 2011)

and

HiGal Project Internal Report

25 January 2011

Lorenzo Piazzo

DIET Dept. - University of Rome “La Sapienza”.

V. Eudossiana 18, 00184 Rome, Italy.

Tel.: +39 06 44585480

Fax.: +39 06 4873300

E-mail: [email protected]

1

1 Introduction

Map making is the process of constructing an image from a sequence of georeferenced (or skyref-

erenced) data. A map maker is an algorithm employed to perform the process. An excellent

map maker for astronomical images is the one based on the Generalised Least Square (GLS)

approach [1, 2]. Indeed GLS map makers proved effective in removing the noise affecting the

data sequence and have a manageable computational complexity, which makes their use feasible

even on the huge amount of data produced by current days imaging photometers, like the PACS

and SPIRE instruments onboard the ESA Herschel satellite [3]. Furthermore the GLS approach

has nice features from a theoretical point of view. In fact it is a linear algorithm and therefore

amenable to analysis and it can be shown to yield the maximum likelihood (ML) map when

the noise affecting the data is a Gaussian process. However the approach also has a serious

drawback that became apparent when it was employed to process the data gathered within the

HiGal project [4], the aim of which is that of performing a survey of the galactic plane. Namely

it introduces artifacts in the form of crosses or stripes on strong point sources. These artifacts

are annoying and make the resulting images of little interest, which is a pity since, as we said, the

GLS approach is otherwise a very good map maker. Therefore a procedure to cure this problem

is of both theoretical and practical interest. In this report we present two such procedures.

The first one, termed Post-processed GLS (PGLS), is based on an artifact estimation algo-

rithm. Once the artifacts are estimated they can be subtracted from the GLS image to produce

a clean image. The PGLS algorithm is simple and robust. Its computational complexity is

affordable and comparable to that of the GLS map maker itself. The only drawback is that,

when the artifacts are subtracted, some noise is also added to the image. However the noise

increase is normally moderate and the cleaned image is surely more useful than the original GLS

one.

The second one, termed Split GLS (SGLS), is based on splitting the signal into a diffuse

emission plus correlated noise part and a point sources plus white noise part. Then, the diffuse

part is mapped with GLS, the point source part is mapped by simple rebinning and the SGLS

image is obtained by summing the two maps. Also the SGLS algorithm is simple and robust. Its

computational complexity is essentially that of the GLS map maker itself. Also SGLS introduces

some additional noise, but less than PGLS. However the signal seems to be reconstructed not

as well as by PGLS.

The report is organised as follows. We firstly introduce basic concepts concerning the map

making problem, in section 2, and the basic operations needed to explain the PGLS and SGLS

algorithms, which are the rebin and the unroll, section 3, the GLS map making, section 4, and

the median filter, section 5. Next we describe PGLS, in section 6, and SGLS, in section 7.

Section 8 reports the conclusion and some open points.

2 Data, map and pointing

To be specific we consider the map making problem as it arises in the processing of the data

produced by the scan mode of the PACS instrument of the Herschel satellite. The instrument

2

is composed of a grid of Nb bolometers and a scan amounts at pointing the instrument towards

a set of Nk given points in the sky (organised into a grid of parallel lines) and at sampling the

bolometers output at each point. The result is a set of time ordered data (TOD) which will be

denoted by t = {tb,k} for b = 1, ..., Nb and k = 1, ..., Nk, where tb,k is the readout of the b-th

bolometer at the k-th scan point. Each readout is accompanied by a skyreferencing information,

which is a pair (xb,k, yb,k) giving the right ascension and the declination of the readout.

The map making problem is that of constructing an image (map) of the observed sky from the

TOD. The typical approach is that of defining a pixelization of the observed sky, i.e. partitioning

the sky into a grid of Ni ·Nj non overlapping squares (pixels) si,j . The map maker has to produce

a map m = {mi,j} where mi,j is a measure of the flux in si,j , i.e. in the (i, j)-th pixel. We will

see several map makers in the following sections.

To conclude this section let us note that since each readout is skyreferenced and assuming

that the pixellization covers the whole observed sky, we can assign each readout (b, k) to one

and only one pixel. Therefore we can introduce a pointing function Pb,k yielding the pixel (i, j)

where the readout (b, k) was taken. In other words Pb,k = (i, j) if and only if (xb,k, yb,k) ∈ si,j .

Conversely, for each pixel (i, j) we can consider the (possibly void) set of readouts falling in the

pixel and denote this set by Ci,j . Formally we have Ci,j = {(b, k)|Pb,k = (i, j)}. Furthermore

we will denote by |Ci,j | the number of elements of Ci,j , that is the number of readouts falling in

pixel (i, j).

3 Rebinning and unrolling

Rebinning is a simple and important map making technique where the value of each pixel is set

equal to the mean value of all the readouts falling in the pixel. Namely, the map is computed as

mi,j =1

|Ci,j |

∑

(b,k)∈Ci,j

tb,k.

Rebinning is simple, robust and computationally cheap. It does not require any prior knowledge

about the noise affecting the TOD. Moreover, it is also the optimum solution when the TOD is

a made by a signal affected by zero mean, white noise, in the sense that it yields the ML map

and maximises the SNR. However when the noise affecting the TOD is coloured, rebinning is no

more optimum. Unfortunately the bolometers are typically affected by coloured noise, having

a power spectrum that grows as 1/f for f → 0. As a result the maps obtained by rebinning

the TOD are not of good quality. An example is shown in figure 1, where the rebinning for a

patch of tile L323 in the PACS blue band is shown. The effect of the 1/f noise is apparent and

manifests itself as stripes following the scan directions. Nevertheless rebinning is an important

tool in map making and it will be useful in the following.

A second operation that we will need in the following is the unrolling of a map into a TOD.

This operation assumes that a map m is given and that the pointing function Pb,k is assigned.

It then produces the TOD data by unrolling the map onto the TOD. Specifically, the data of

each readout is set equal to value of the pixel where the readout falls. Therefore, the unrolled

TOD is given by tb,k = mPb,k.

3

100 200 300 400 500 600 700 800 900 1000

100

200

300

400

500

600

700

800

900

1000

Figure 1: Example of rebinning from a TOD of PACS blue, tile L323. Note the stripes due to

the 1/f noise.

Notation: we will write m = R(t) to indicate that the map m is obtained by rebinning the

TOD t and we will write t = U(m) to indicate that the TOD t is obtained by unrolling the map

m.

4 GLS map making

GLS map making, in its simplest form, is based on the following model for the TOD

t = U(m) + n (1)

where n = {nb,k} is a noise affecting the TOD. The last equation models the TOD as obtained

by unrolling a map (the sky) onto a TOD and by adding noise to it. Based on this model, the

approach exploits the GLS technique and seeks for a map mg minimising the weighted sum of the

4

100 200 300 400 500 600 700 800 900 1000

100

200

300

400

500

600

700

800

900

1000

Figure 2: Example of GLS map. Tile L323 for the PACS blue band. Note the crosses centered

on the strong point sources.

squared noise estimate, given by t−U(mg). By properly selecting the weights and, specifically,

by setting them based on the noise covariance matrix, one can show that the resulting map mg

is the minimum variance linear estimator of m [1]. Furthermore, when the noise is Gaussian,

mg is also the ML estimate of m [1, 2].

GLS map making is a sophisticated and effective map making technique. While it is more

complex than the rebinning, its computational complexity can be kept manageable by exploiting

the Conjugate Gradient [5] approach to obtain the solution map [1, 2]. It requires the knowledge

of the noise correlation properties or, what is equivalent, of the noise power spectrum. However

simulations show that it is robust to a mismatch between the spectrum employed in the solution

and the actual spectrum. Its main advantage is that it keeps into account the noise correlation

and is therefore capable of removing the 1/f noise and of producing much better maps than

the rebinning. However it also has a major drawback, namely that it introduces artifacts in the

5

form of crosses centered on bright point sources, with arms parallel to the scan directions. An

example of GLS map is shown in figure 2 where one notes both the absence of the stripes due

to the 1/f noise and the artifacts.

The artifacts are due to the fact that the model of (1) is not correct and neglects several

important facts. The core mismatch of the model from the actual physical process is that the

TOD is not produced by unrolling a pixellised sky. On the contrary it is produced by taking

samples over a continuous sky. This mismatch is not a problem when the sky is a smooth, slowly

varying image, like happens with diffuse emissions or large, wide sources. However for pointed,

bright sources the mismatch is relevant. Two additional mismatches are 1) the fact that the in

the PACS instrument, in order to improve the SNR while reducing the data rate, the TOD are

obtained by coadding and subsamplig an higher resolution TOD, and 2) the fact that the TOD

is affected by a small Relative Pointing Error (RPE). By means of simulations it was verified

that any of these three factors causes the artifacts observed in the GLS maps.

To conclude let us note that, in principle, the model of (1) can be improved to obtain an

artifact-free GLS approach. This can be done also in practice, at least approximately, for the

artifacts due to the RPE, which can be estimated [6] and removed. And for the artifacts due to

the coaddition, following the approach proposed in [7]. However improving the model to keep

into account the continuous nature of the sky would require to change the pointing matrix used

in the implementation of the GLS algorithm [1, 2] from a sparse one to a fully populated one.

Such a change would increase the computational complexity of the GLS algorithm dramatically,

making it unfeasible with current days processing power.

Notation: we will write m = G(t) to indicate that the map m is obtained by GLS map

making from the TOD t.

5 Median filtering

Consider a sequence xn and assume an integer w is given. Denote by med(a1, ..., aN ) the median

of N numbers [a1, ..., aN ]. Then we can define the median filtering (with window length W =

w + 1) of the sequence xn, denoted by M(xn), as the sequence yn obtained as

yn = M(xn) = med(xn−w, ..., xn+w).

In words, yn is the median of the values [xn−w, ..., xn+w]. The sequence yn will be termed the

median of xn. We can also define the residual filtering (with window length W = 2w+1) of the

sequence xn, denoted by M(xn), as the sequence zn obtained as

zn = M(xn) = xn −M(xn)

The sequence zn will be termed the residual of xn.

Median and residual filtering are two key operations in the algorithms to be developed in the

rest of this report. Unfortunately they are non linear and therefore difficult to analyse. However

it is possible to gather a qualitative understanding of their functioning. To this end consider

firstly a deterministic signal and note that short signals, like pulses with duration less than w,

6

will be entirely removed by the median filtering and entirely preserved by the residual filtering.

On the other hand slowly varying signals, with a duration higher than w, will be preserved by

the median filtering and will be removed by the residual filtering. Therefore, we expect that

when median and residual filtering are applied to a signal TOD, diffuse emission will be moved

to the median and point sources will be moved to the residual.

Consider now the filtering of a noise process. One expects that high frequency, fastly varying

harmonic components, with a period lower than w, will be removed by the median filtering and

preserved by residual filtering. On the contrary low frequency harmonics will be preserved by

the median filtering and removed by the residual filtering. Therefore when the filtering is applied

to a noisy TOD, the low frequency noise will be moved to the median and the high frequency

noise will be moved to the residual.

In summary, roughly, one can consider the median filtering as a low pass filtering and expect

that the signal at its output is slowly varying and highly correlated. On the other hand, thinking

to the residual filtering as an high pass filtering is not entirely correct, since low frequency

components are reduced but not really removed from the residual. However the residual will be

fastly varying and weakly correlated. Therefore the residual filtering acts like a whitening filter.

To conclude note that median and residual filtering can be computationally demanding. A

way to reduce the computational complexity is that of performing a subsampling of the median

signal, i.e. compute the sequence hm = ymS where S ≥ 1 is the subsampling factor (or the step).

Then the median yn can be approximated by a sequence yn obtained by linear interpolation of

the samples of hm. And the residual zn can be approximated by the sequence zn = xn − yn.

As long as S << W the approximation will be good and the computational complexity of the

filtering is reduced by a factor S.

Notation: with a slight abuse, we will write f = M(t) to indicate that the TOD f is

obtained by median filtering each bolometer of the TOD t (properly zero padded so that the

size of f is identical to that of t). Similarly we will write r = M(t) to indicate that the TOD r

is obtained by residual filtering each bolometer of the TOD t (again properly zero padded).

6 PGLS map making

The Post-processed GLS (PGLS) map making algorithm starts from the original TOD t and the

GSL map mg = G(t). It is based on the following basic step aimed at producing an estimate of

the artifacts affecting the GLS map:

1. Unroll the GLS map: tg = U(mg).

2. Remove signal TOD: tn = tg − t.

3. Remove correlated noise: tw = M(tn).

4. Compute artifacts estimate ma = R(tw).

The functioning of the basic step can be qualitatively explained as follows. The original TOD

contains the signal S, the correlated noise Nc and the white noise Nw, so that, using the symbol

≈ to indicate the components of a signal, we can write t ≈ S + Nc + Nw. Assuming that the

7

100 200 300 400 500 600 700 800 900 1000

100

200

300

400

500

600

700

800

900

1000

Figure 3: Example of PGLS map. Note the absence of artifacts and the noise increase.

GLS map maker perfectly removes the noise but introduces artifacts, mg contains the signal S

and the artifacts A and we write mg ≈ S + A. The same components are there in the unrolled

TOD computed in step 1, i.e. tg ≈ S + A. By subtracting the original TOD from tg in step

2 we are left with a TOD tn where the signal is removed, the noise is changed polarity and

the artifacts are introduced, i.e. tn ≈ A − Nc − Nw. By performing the median filtering of tn,

provided that the window length is chosen as discussed next, we eliminate the correlated noise

while the artifacts are preserved since they are point sources, i.e. tw ≈ A − Nw. By rebinning

tw we eliminate the white noise so that ma ≈ A.

The basic step just described produces a rough estimate of the artifacts but it is not sufficient

to have a good estimate. However the procedure can be iterated in order to improve the artifact

estimate. We have therefore the following PGLS algorithm for producing the PGLS map mp

from the TOD t:

1. Init PGLS map: mp = G(t).

8

100 200 300 400 500 600 700 800 900 1000

100

200

300

400

500

600

700

800

900

1000

Figure 4: Example of artifacts map.

2. Repeat 2.1 - 2.5 until convergence:

2.1 Unroll: tg = U(mp).

2.2 Remove signal: tn = tg − t.

2.3 Remove correlated noise: tw = M(tn).

2.4 Estimate artifacts: ma = R(tw).

2.5 Improve PGLS map: mp = mp −ma.

The rationale of the iterative procedure is that, at each step, more artifacts are removed from

the map and eventually the PGLS map is obtained.

Results on real data. Examples of the PGLS map mp and of the total artifacts (i.e. the

sum of all the maps ma produced in step 2.4) are shown in figures 3 and 4 respectively. One

notes that the artifacts are perfectly removed. However, by comparing with figure 2 one also

notes that, where the signal is absent, the noise is higher in the PGLS map than in the GLS one.

This is expected since the artifact map ma is not noise free and every time that it is subtracted

9

to mp in step 2.5 its noise adds to the PGLS map. This noise increase is a price to pay to remove

the artifacts.

Results on simulated data. By using a system simulator we also tested PGLS map

making on artificial data. The greatest advantage of using a system simulator is that one knows

the target image and can verify how the algorithm approximates it. Using this approach we

verified that when the PGLS is fed with a noiseless TOD, it produces exactly (within numerical

precision limits) the target image, while the GLS does not.

Parameters and complexity. The only parameter of the procedure is the window length

of the median filter of step 2.3. This should be set at least equal to twice the width of the arms

of the crosses to be removed. Higher values uselessly pass more correlated noise into the artifact

estimation while lower values reduce the quality of the estimation. However this parameter is

not critical and the procedure works well in a wide range of window lengths. In the examples on

real data given above the window length was set to 81 pixels. Furthermore the computational

complexity was reduced by using a step of 20 pixels in the median filtering. Another point to

be discussed is how determine the convergence. Again, it turns out that this is not a critical

point and that any meaningful criterion can be used. In the examples given we simply carried

out 8 iterations. All in all the PGLS procedure is quite robust and effective. Its computational

complexity is comparable to that of the original GLS.

7 SGLS map making

The Split GLS (SGLS) map making algorithm starts from the original TOD t and splits it

into a correlated and a white part. Namely it performs a median filtering of t to produce a

correlated TOD tc containing the diffuse emission and the correlated noise. In then performs

residual filtering of t to produce a white TOD tw containing point sources and white noise.

The correlated TOD can be safely processed by the GSL map maker since it contains no point

sources: one obtains a correlated map mc = G(tc). The white TOD can be safely rebinned since

it contains no correlated noise: one obtains a white map mw = G(tw). The final SGLS map is

obtained by summing the correlated and white maps. In summary we have the following SGLS

algorithm:

1. Extract correlated TOD: tc = M(t).

2. Extract white TOD: tw = M(t).

3. Compute correlated map: mc = G(tc)

4. Compute white map: mw = R(tw)

5. Compute SGLS map: ms = mc +mw

Results on real data. An example of the SGLS map ms is shown in figure 5. One notes

that the artifacts are perfectly removed. Again, by comparing with figure 2 one notes that,

where the signal is absent, the noise is higher in the SGLS map than in the GLS one. This

is expected since the rebin performed in step 4 is not capable of removing the correlated noise

that leaks into the white TOD due to the non perfect suppression of this noise by the residual

10

100 200 300 400 500 600 700 800 900 1000

100

200

300

400

500

600

700

800

900

1000

Figure 5: Example of SGLS map.

filtering. This is a price to pay to remove the artifacts. However, by comparing with figure 3 one

also notes that the additional noise is lower than the additional noise introduced by the PGLS

approach. This is due to the fact that there is no iterative increase of the noise in the SGLS

approach.

Results on simulated data. Using a system simulator we verified that when the SGLS

is fed with a noiseless TOD, it does not produce exactly the target image but only a good

approximation. This is the disadvantage of SGLS with respect to PGLS. In any case the SGLS

estimate is much better than the GLS one.

Parameters and complexity. The only parameter of the procedure is the window length

of the median and residual filters. This should be set at least equal to twice the width of the point

sources that produce artifacts. Higher values will remove artifacts but will uselessly pass more

correlated noise into the white map. Lower values will jeopardize the artifacts removal. In the

example on real data given above the window length was set to 81 pixels and the computational

11

complexity was reduced by using a step of 20 pixels in the filtering. All in all the SGLS procedure

is quite robust and effective. Its computational complexity is comparable to that of the original

GLS.

8 Conclusion and future work

We presented PGLS and SGLS which are two GSL-based map makers capable of suppressing

the artifacts introduced by the GLS while retaining most of the noise removal capabilities of

the GLS. Both algorithms are simple, robust and have a computational complexity similar to

that of the original GLS. With respect to GLS, both algorithms slightly increase the noise in

the final image. The noise increase is higher for the PGLS than for SGLS but the PGLS seems

capable of better treating the signal part.

Future work shall be aimed at better characterizing the signal reconstruction capabilities of

PGSL and SGLS. This shall be done by comparing their results on simulated data, where the

target image is known. A large simulation campaign is needed.

A second, promising research line is that of performing some signal processing of the artifacts

image (see figure 4) in order to automatically identify the artifacts position and dimension. When

this is done, the PGLS map could be produced from the GLS map by subtracting the artifacts

only, thereby avoiding the noise increase.

References

[1] P. Natoli, G. De Gasperis, C. Gheller, N. Vittorio, ”A map-making algorithm for the Planck

surveyor”, Astronomy and Astrophysics 372 (1), pp. 346-356, 2001.

[2] C. M. Cantalupo, J. D. Borrill, A. H. Jaffe, T. S. Kisner, R. Stompor, ”MADmap: A

massively parallel maximum likelihood cosmic microwave background map-maker”, Astro-

physical Journal, Supplement Series 187 (1), pp. 212-227, 2010.

[3] G. Pilbratt et al., ”Herschel Space Observatory”, Astronomy and Astrophysics, Vol. 518,

No. 7-8, July 2010.

[4] S. Molinari et al.: ”Clouds, filaments, and protostars: The Herschel Hi-GAL Milky Way”,

Astronomy and Astrophysics, Volume 518, Issue 3, 16 July 2010

[5] W. T. Vetterling, W. H. Press, S. A. Teukolsky, and B. P. Flannery, ”Numerical Recipes in

C”, Cambridge University Press, 1992.

[6] L. Piazzo et al., ”Study of the PACS and SPIRE Pointing Error”, HI-GAL Internal Report,

10 July 2010.

[7] F. Piacentini et al., ”Map making for PACS in HI-GAL”, HI-GAL Internal Report, 4 July

2009.

12

Documents

Artifacts removal for GLS map makers - uniroma1.itinfocom.uniroma1.it/~lorenz/trep-01-04-11.pdf2011/01/04 · Artifacts removal for GLS map makers LorenzoPiazzo University of Rome