33
Effects of the observing technique on the data Ian McCrea

Effects of the observing technique on the data Ian McCrea

Embed Size (px)

Citation preview

Page 1: Effects of the observing technique on the data Ian McCrea

Effects of the observing technique on the data

Ian McCrea

Page 2: Effects of the observing technique on the data Ian McCrea

Effects of the Radar System on the Data

Although we receive a signal at the antenna that comes from ionospheric incoherent scatter, by the time that signal gets sampled and digitised we have

• amplified and/or attenuated it several times.

• added thermal noise from the radar system.

These processes change the signal by adding system noise. It is important to ensure that the system is as LINEAR as possible.

A linear system has the following properties:

)()()()( taytaxtytx

)()()()()()(),()( 21212211 tytytxtxtytxtytx

Page 3: Effects of the observing technique on the data Ian McCrea

Even in the ideal case of a perfectly linear system, we still have to worry about:

•The transmitter pulse shape (which affects what we can receive)

•The method used to correlate the samples (which imposes numerical weightings)

•The impulse response of the system (which imposes further time/range effects on our measured data)

More things to worry about……..

Page 4: Effects of the observing technique on the data Ian McCrea

What kind of filters to choose ?

We need to choose a filter which:

1. Lets the whole scattered spectrum through without any attenuation

2. Allows for a Doppler shift in the spectrum due to high plasma velocities

3. Allows the extra bandwidth imposed by the transmitter pulse modulation to pass through the filter.

How wide is the spectrum ?

If the speed of the ion-acoustic wave is

i

i

m

kTv

2

Then the Doppler shift is given by

i

i

m

kT

cf

22 0

For an ion mass of 16 amu (oxygen) and Ti of 2000K, f ~ 9 kHz

Note that this is the spectral HALF-WIDTH!

Δf

0

Page 5: Effects of the observing technique on the data Ian McCrea

How big can the Doppler shift get ?

Assuming a 100 mV/m E-field, giving a 2000m/s drift

kHzc

vf 4.12

2 0

So it may be necessary to consider a 12 kHz wide spectrum that has been shifted by 12 kHz.

This explains why it might be necessary to use filters with a half-width of 25 kHz.

Page 6: Effects of the observing technique on the data Ian McCrea

What about the modulation bandwidth?

If our transmitted signal is modulated by pulsing (which it has to be to get any range information) then the output of our quadrature detector is also modulated by the same pulse.

For a stochastic signal y(t) of bandwidth y, modulated by a pulse of

length T, the bandwidth of the modified signal becomes

yx T

2

1

So now we need a filter that is not just wider than y, but needs to be wider than x as well. For long pulses, this turns out not to be too demanding, but for short pulse codes, the modulation bandwidth is much wider than the ion line spectrum.

Page 7: Effects of the observing technique on the data Ian McCrea

Pulse Length (μs) Modulation BW (kHz) Potential Use 2.0 250.0 Alt. Code (D/E region) 4.0 125.0 Alt Code (D/E region) 10.0 50.0 Alternating or Barker Code100.0 5.0 Long Pulse (low F region)300.0 1.7 Long Pulse (F region)

One might think that ideally the filter width should only be just as wide as needed. Making the filter wider would allow through unnecessary noise. However, this is not necessarily a problem, provided it doesn’t affect the signal to noise ratio in the target bandwidth.

Even in the most linear system, what we end up with is not just the incoherent scatter signal. Our signal is deformed by the impulse response of the system, and at some time we’ll have to remove those effects.

Page 8: Effects of the observing technique on the data Ian McCrea

Since we have to pass our signal through filters and other hardware, we should be aware that all this electronics has some associated impulse response. The impulse response of a system is defined as the output signal arising from a delta-function input.

Impulse Response Functions

The resultant measurement is a convolution of the original signal x(t) and the impulse response of the system h(t), written as (x*h)(t).

If two functions are convolved, then their Fourier transforms are multiplied

}{}.{}*{ vuvu and we will see that we can use this fact to correct for the effect of the impulse response, provided it is known.

Page 9: Effects of the observing technique on the data Ian McCrea
Page 10: Effects of the observing technique on the data Ian McCrea

 Impulse Responses for Real Filters in the (old) EISCAT System

Page 11: Effects of the observing technique on the data Ian McCrea

Effects of the transmitted pulse

The pulse shape that we use has an effect similar to that of the impulse response function

The power spectrum FI(f) of the received ionospheric signal is convolved with that of the modulating pulse FT(f)

Fs(f) = FI(f)*FT(f)

which means that their autocorrelation functions are multiplied

As(t) = AI(t).AT(t)

Since the pulse shape is known, the effect can again be removed prior to analysis.

Page 12: Effects of the observing technique on the data Ian McCrea

Ambiguity Functions

•Ambiguity functions can be mathematically complex, but are conceptually simple:

•Because we use finite length pulses, all of our received signals are inherently spread out in range

•The ionospheric signal is further modified by the impulse response function of the radar system .

•Ambiguities in range and time are two sides of the same coin !

•We define ambiguity mathematically in terms of the two-dimensional ambiguity function (which has dimensions of range and lag)

•Provided that we know the ambiguity functions associated with any transmission and reception, we can reconstruct the original ionospheric signal.

Page 13: Effects of the observing technique on the data Ian McCrea

Where (in the ionosphere) do data come from?

For a monostatic radar system. At time t0, we begin to transmit a pulse of length τ.

At time t1, such that t1>t0+ τ+tr, where tr is the receiver recovery time, it is safe for us to begin to sample.

We can represent this situation with a RANGE-TIME DIAGRAM

Ran

ge

Timeτ

½cτ

R The first sample contains signal from the leading edge of thepulse at range R and from the trailing edge at range R- ½cτ, and all ranges in between.

t0 t1 t2

Suppose we now take a second sample at time t2

The range spread is still ½cτ,but the nominal range has moved out by c(t2-t1)/2

Page 14: Effects of the observing technique on the data Ian McCrea

Where (in the ionosphere) do data come from?

Suppose we want to probe the range R with a series of samples

We would take our first sample when the leading edge of the pulse was atrange R, and the last sample when the trailing edge of the pulse was at R.

In between these times, we make a series of closely-separated samples

Ran

ge

Timeτ

½cτ

R over a time, τ, signal is receivedfrom ranges (R- ½cτ) to (R+ ½cτ)

t0 τ

Page 15: Effects of the observing technique on the data Ian McCrea

Where (in the ionosphere) do data come from?

Ran

ge

Time

We then use these samples to calculate the various lags of the ACF.

Contributions to ‘lag 1’ come from almost the whole scattering volume.Contributions to ‘lag 5’ come only from the mean range, R.

½cτR

½cτ

The correlator performs cross-correlations and combines all examples of a given lag. Returns can be correlated only when there is spatial overlap between samples.

Page 16: Effects of the observing technique on the data Ian McCrea

Where (in the ionosphere) do data come from?

Ran

ge

Time

The previous example was somewhat extreme:

Note that the choice of pulse length and sampling interval are dictated by altitudevariation of the target ionosphere and the ion line (or modulation) bandwidth.

½cτR

½cτ

For real F-region sounding, we might use a long pulse of ~300 us, and a samplinginterval of 10 us, so the number of available lags would be ~30

Page 17: Effects of the observing technique on the data Ian McCrea

Creating the Autocorrelation Function

Let’s assume that we have two streams of samples, from the two outputs of the hybrid detector.

•From the in-phase output we have X1, X2, X3, X4 … and so on.

•From the quadrature output we have Y1, Y2, Y3, Y4 … and so on.

•From these, we can calculate the different lags of the real and imaginary parts of the autocorrelation function as follows:

R(i,j) = (XiXj + YiYj)

I(i,j) = (XiYj – XjYi)

If i=j, then we are calculating lag 0. If |i-j| =1, then we are calculating lag 1, |i-j| = 2 is lag 2 and so on …

We often refer to these computations as CROSS PRODUCTS or LAG PRODUCTS of the ith and jth samples, written as sisj

*.

Although each sample is used many times, data from a stochastic process only makes sense when integrated over a relatively long time. The data are therefore heavily post-integrated and each individual cross-product does not need to be stored (although storing the variances does help in error-analysis).

Page 18: Effects of the observing technique on the data Ian McCrea

Zero-Lag Profiles (“Power Profiles”)

If we take every sample pair (xi and yi for I=1:n) and use them to compute n estimates of the zero lag, this generates the zero lag profile.

Remember, because this is a stochasticprocess, multiple estimates of the zerolag profile have to be added before thesum approaches the expectation value and the measurement becomes meaningful.

The zero lag is an estimator of the received POWER. Hence we have a measure of power as a function of range.

Although this estimate contains no information about spectral shape (and hence temperature, velocity etc) we canuse it to calculate RAW ELECTRON DENSITY.

This first estimate is spread in range. At first, a shorter pulse might seem better, since this would minimise the range ambiguity, but it isn’t. (Remember the radar equation).

Page 19: Effects of the observing technique on the data Ian McCrea

Non-Zero Lags: Spatial Weightings

To calculate non-zero lags, what we are doing is:

jiji YYXXjiR ),(

ijji YXYXjiI ,jiwhere

To form lag 1 for example, we have to form the cross product of two adjacent samples. What effect does this have on the correlation?

The range spreads of the two samples are not the same (because the signal has moved on in range by δt/2 between the times at which the samples were made.

As we increase the spacing between samples, the correlated volume shrinks untilit comes from the centre of the volume.

This is bad, because there is less correlated signal in the data and so longer integration times are needed to form the expectation value.

It follows that there is no point calculating a lag longer than the pulse length sincethe correlated volume will have shrunk to zero.

Ran

ge

Time

lag

1

lag

3

lag 5

Page 20: Effects of the observing technique on the data Ian McCrea

Non-Zero Lags: Numerical Weightings

In this simplified scheme, it’s not only the spatial weighting that is a function of lag number. Imagine we had a 300 s pulse. If we are sampling at 10 s intervals, we can use a sample together with 29 subsequent samples to form our ACF, without completely losing our spatial correlation. From our group of 30 samples, s1 to s30, we have 30 potential estimates of zero lag;

s1s1*, s2s2

*, s3s3*, s4s4

*, ……….s29s29*, s30s30

*

and there is only one way of making lag 29;

s1s30*

s1s2*, s2s3

*, s3s4*, s4s5

*, ……….s29s30*

However there are only 29 ways of making lag 1

In addition to the spatial weighting, there is also a numerical weighting factor that we need to consider. Both of these conspire to make the longer lags very ill-determined.

Considering how primitive this scheme is, it’s surprising that this technique – sometimes called pre-gating - was used at EISCAT for many years!

Page 21: Effects of the observing technique on the data Ian McCrea

How can we do better?Let’s consider lag 29 again. We can see that it can be formed using the cross-product s1s30

* but it can also be formed by s2s31*, s3s32

* etc. Although the area of spatial correlation between these samples is still small and different every time, each of these small correlated volumes is still a subset of the correlated volume for lag 0, lag 1 etc.

If we consider the whole series of samples that we take, without restricting ourselves to groups of 30, then we could arrange the cross-products which we use to calculate the various lags of the ACF in such a way as to make the numerical and spatial weightings approximately even as a function of lag. The price we pay is that adjacent gates are not independent!

This flexible way of handling cross-products and re-using the same samples many times, was first introduced in the GEN correlator algorithms in the early 1990s.

Page 22: Effects of the observing technique on the data Ian McCrea

The Lag Profile Matrix

The lag profile matrix is essentially a matrix of sample cross-products. If the x and y axes are defined by samples 1 to n, then the leading diagonal consists of the cross-products sisj* where i=j. In other words, the leading diagonal represents the zero lag profile and the axis running parallel to this diagonal represents increasing range.

In the same way, the next diagonal is sisi+1*, i.e. it is the range profile of lag 1.

In reality of course the matrix is much bigger than 9x9 samples. Suppose we sample the pulse at 10 us intervals between heights of 170 and 900 km. The time taken for the pulse to propagate through the ionosphere is about half a millisecond, and the number of samples per pulse is:

487~1010

)70800(2 36

xc

n

If we were to post-integrate the correlated data at a resolution of 5 seconds (which is typical) then we are effectively adding around 10,000 of these lag profile matrices together, which turns out to be much more than adequate to form the expectation value.

Page 23: Effects of the observing technique on the data Ian McCrea

Gating StrategiesWe can now think of how to form “range gates” from our lag profile matrices. Within a range gate, the simplest assumption is that our lag profile is constant, so we can average all the cross-products together into a single estimate.

A pre-gated ACF, made from a closed set of samples, has a triangular weighting function like this:

The GEN-type correlation schemes, which use an increasing number of samples to determine the longer lags in order to achieve more even spatial and numerical weightings have functions like this:

In GEN-type schemes, different numbers of samples are used in the calculation of every lag. The number of samples used to calculate the zero lag is sometimes called the VOLUME INDEX.

Both of these approaches have theproperty that different lags come from different spreads of ranges. This concept is defined mathematically by the RANGE AMBIGUITYFUNCTION.

Page 24: Effects of the observing technique on the data Ian McCrea

Range Ambiguity FunctionA knowledge of the range ambiguity is essential when working with any kind of radar data, and indeed it is the requirement for certain levels of range ambiguity which drives the pulse coding techniques which we’ll see later on.Range Ambiguity arises because:

a) All the sounding pulses we use have a finite length (and thus occupy a finite volume of ionosphere).

b) The receiver systems we use have finite (i.e. not function) impulse responses, so that the system’s response to an input signal is also smeared in time (range).

Let’s assume that the spectral shape (or ACF shape) is constant throughout the range over which our gate is calculated. In this case, range ambiguity is the only ambiguity which we need to worry about.

It turns out that the range ambiguity function is given by the convolution of the modulation envelope with the receiver’s impulse response. Note that the range ambiguity is lag-dependent.

Page 25: Effects of the observing technique on the data Ian McCrea

e.g. for the lag t-t|

)])(*)].[()(*[( || StenvpStenvpW r

tt

where p is the impulse response and S is the travel time of the signal (2r/c). Note that the ambiguity function must vanish if there is no longer any spatial correlation between the samples, so the range ambiguity function for different lags of a long pulse with old-style pre-gating looks like this:

What’s happening here is that p*env is shifted by the range delay of the lag. Then the shifted p*env is multiplied by the unshifted one. The products are the shaded regions, which are the regions where the spatial correlation doesn’t vanish to zero.

Page 26: Effects of the observing technique on the data Ian McCrea

Some range ambiguity fuctions for a real experimentThe pictured range ambiguity functions are for the CP1K experiment, where we use the GEN-type approach, including additional samples in the calculation of the longer lags.  For the long pulse, the pulse length is 350 us, the impulse response is that of a 50 kHz Butterworth filter, sampling is done at 10 us intervals and lags are calculated every 10 us to a length of 250 us. The volume index is 15.

Note that all the range ambiguity functions are now approximately the same length, though the detailed shapes are different.

Note how long these ambiguities are: 500 s = (500 x 10-6 x 3 x 108 x 10-3 )/2 = 75 km.

This kind of long pulse might be acceptable for probing the F-region, but not for the E-region as the ionosphere will change significantly within the pulse length, and the range ambiguity will “smear” these variations.

Page 27: Effects of the observing technique on the data Ian McCrea

Lag Ambiguity Functions

Range ambiguities are not the only ones we have to worry about. So far we’ve assumed that all our samples are taken in an infinitely short time, and a cross-product of two samples gives the value of the acf at precisely that lag. In fact this is not the case either.We also have to worry about the lag ambiguity function, which is given by the product of the autocorrelation functions of the modulation envelope and the impulse response:

)()(2

)( || ttRR

cW henv

S

tt

What this means is that ambiguity functions are two-dimensional, having finite width in both range and lag.  In other words, every cross-product we compute is not a precise point measurement. Rather it is a measurement distributed over a finite range and a finite time.For this reason, the full ambiguity function is often called the TWO-DIMENSIONAL AMBIGUITY FUNCTION, and the range and lag ambiguity functions are called ONE-DIMENSIONAL AMBIGUITY FUNCTIONS or REDUCED AMBIGUITY FUNCTIONS.

Page 28: Effects of the observing technique on the data Ian McCrea

Two-dimensional ambiguity function for a power profile

Two dimensional and reduced ambiguity functions for the 40 μs pulse of EISCAT CP1K-experiment.

Zero lag of the two-dimensionalambiguity function of a single pulse

Page 29: Effects of the observing technique on the data Ian McCrea

A Timing Diagram for a Simple Radar Experiment

ms

Transmitted pulse

Reception windows

Calibration signal

Calibration reception

Page 30: Effects of the observing technique on the data Ian McCrea

Writing out the data

In principle, it doesn’t matter in what sequence we write the output data, provided we know which bits are which. In practise, it is usually done in one of two ways.

1) As Pre-correlated ACFs

i.e. lag 0, lag 1, lag 2……lag n from range gate 1, followed bylag 0, lag 1, lag 2…....lag n from range gate 2 ………

…….. lag 0, lag 1, lag 2……lag n from range gate n.

This was usually the data format for EISCAT mainland, pre-renovation data.

Note that most data dumps contain data dump from more than one pulse scheme, and most pulse schemes contain separate sets of gates for signal, background and calibration.  All the pre-gated data are written out in EISCAT LDR (Logical Data Record) binary format.

Page 31: Effects of the observing technique on the data Ian McCrea

2) As lag profiles

The structure is implied from the name:

Lag 0 from range gate 1, gate 2, gate 3………. gate n, followed by,Lag 1 from range gate 1, gate 2, gate 3………. gate n, followed byLag 2 from range gate 1, gate 2, gate 3………. gate n, …………to….Lag n from range gate 1, gate 2, gate 3………. gate n.

This is equivalent to writing out the lag profile matrix, and is probably the preferred method, as it allows the data analyst to make his/her own decision about gating. This format can be found in ESR data and data from the post-renovation mainland system. All the lag profile data dumps (almost) are written out in MATLAB (v5) format. During this course, we’ll see several examples of each type.

Page 32: Effects of the observing technique on the data Ian McCrea

Data Dump with Pre-gated ACFs (e.g. CP1KT)

Data Dump with Lag Profiles (e.g. GUP3)

Page 33: Effects of the observing technique on the data Ian McCrea