43
Data Challenges in Astronomy: NASA’s Kepler Mission and the Search for Extrasolar Earths Jon M. Jenkins SETI Institute/NASA Ames Research Center Thursday September 22, 2011 STScI SAO

Data Challenges in Astronomy: NASA’s Kepler Mission and the Search for Extrasolar Earths

Embed Size (px)

DESCRIPTION

SAO. Data Challenges in Astronomy: NASA’s Kepler Mission and the Search for Extrasolar Earths. STScI. Jon M. Jenkins SETI Institute/NASA Ames Research Center Thursday September 22, 2011. The Kepler Mission. - PowerPoint PPT Presentation

Citation preview

Page 1: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

Data Challenges in Astronomy: NASA’s Kepler Mission and

the Search for Extrasolar Earths

Jon M. JenkinsSETI Institute/NASA Ames Research Center

Thursday September 22, 2011

STScISAO

Page 2: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

The Kepler Mission

What fraction of sun-like stars in our galaxy host potentially habitable Earth-size planets?

Page 3: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

How Hard is it to Find Good Planets?

Earth or Venus0.01% area of the Sun (1/10,000)

Page 4: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

Kepler Field Of View

Credit: Carter Roberts

Page 5: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

Kepler: Big Data, Big Challenges

Big Processing Challenges Instrument effects are large compared to signal of interest Observational noise is non-white and non-stationary ~100×106 tests per star for planetary signatures [O(N2)] Stellar variations are higher than expected

Big Data: >150,000 target stars 6x106 pixels collected and stored per ½ hour ~40 GB downlinked each month >40×109 points in the time series over 3.5 years

Page 6: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

The Kepler Science Pipeline: From Pixels To Planets

CALPixel Level

Calibrations

PAPhotometricAnalysis

Sums Pixels Together/Measures Star Locations

TPSTransiting

PlanetSearch

RawData

TCEs: Threshold Crossing Events

CorrectedLight Curves

CalibratedPixels

RawLight

Curves/Centroids

DVData

Validation

Diagnostic

Metrics

CALPixel Level

Calibrations

PAPhotometricAnalysis

Sums Pixels Together/Measures Star Locations

TPSTransiting

PlanetSearch

DVData

Validation

Page 7: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

Image Data

0.09x0.09 degrees80x80 pixels6400 pixels total

6.6x6.6 millidegrees28 pixels collectedBlack = no data

Scaled to show faint detail1.13 (h) x1.22 (w) degrees

Page 8: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

Pixel Time Series

Page 9: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

What Do Stars Sound Like?

HAT-P-7B Another Star

Page 10: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

Data Challenge Number 1

Dealing with Instrumental Systematic Errors

Page 11: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

Correcting Systematic Errors

CALPixel Level

Calibrations

PAPhotometricAnalysis

Sums Pixels Together/Measures Star Locations

TPSTransiting

PlanetSearch

RawData

TCEs: Threshold Crossing Events

CorrectedLight Curves

CalibratedPixels

RawLight

Curves/Centroids

DVData

Validation

Diagnostic

Metrics

Page 12: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

PDC Often Does a Good Job

Bayesian approaches look promising!

Page 13: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

PDC Often Over-Fits Variable Stars

Page 14: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

PDC Is Fundamentally Flawed

PDC co-trends against instrumental signatures using least squares (LS) approach

LS attempts to explain all of a given time series, not just the part the model can explain well

There is no way a simple LS fit can “put on the brakes”

PDC often trades bulk RMS for increased noise at short time scales

Page 15: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

A Bayesian Solution

Examine behavior of ensemble of stars responding to systematics Formulate prior probability distributions for model coefficients Maximize Posterior Distribution:

“A Bayesian is one who, vaguely expecting a horse, and catching a glimpse of a donkey, strongly believes he has seen a mule.”

ˆ θ = argmaxθ

log p x c( )[ ] + log p c( )[ ]{ }Maximum Likelihood Prior PDF

Page 16: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

A Much Better Result

Page 17: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

PDC MAP Example

Page 18: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

PDC MAP Example 2

Page 19: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

Data Challenge Number 2

Detecting Weak Transits Against Non-White, Non-Stationary Noise

Page 20: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

Detecting Transiting Planets

CALPixel Level

Calibrations

PAPhotometricAnalysis

Sums Pixels Together/Measures Star Locations

TPSTransiting

PlanetSearch

RawData

TCEs: Threshold Crossing Events

CorrectedLight Curves

CalibratedPixels

RawLight

Curves/Centroids

DVData

Validation

Diagnostic

Metrics

Page 21: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

21

Matched Filtering: What Does This Mean?

Page 22: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

Detection StatisticsDefine

Under H0:

Under H1:

If T < , then choose H0, if T > , then choose H1

T =xT s

σ w sT s

T = 0, σ T2 =1

T = 1σ w

sT s, σ T2 =1

s

ws+w

TT

Page 23: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

Detection Statistics For Colored Noise

w is (colored) Gaussian noise with autocorrelation matrix Rx is the datas is the signal of interest

Decide s is present if

How do we determine R?

If the noise is stationary, we can work in the frequency domain:€

T =xT R−1s

sT R−1s=

Hx( )T

Hs( )

Hs( )T

Hs( )=

˜ x T ˜ s

˜ s T ˜ s >γ

T =X( f )S*( f )

P( f )df∫ S( f )S*( f )

P( f )∫ df

Page 24: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

Solar Variability

Page 25: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

PSDs for Solar-Like Variability

Is stellar variability stationary?

No!

We must work in a joint time-frequency domain

Wavelets are a natural choice

High Solar Activity

Low Solar Activity

Detect

able E

nergy

Page 26: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

A Wavelet-Based Approach

Filter-Bank Implementation of an Overcomplete Wavelet Transform The time series x(n) is partitioned (filtered) into complementary channels:

WX(i,n) = {h1(n) x(n), h2(n) x(n),…, hM(n) x(n)} = {x1(n), x2(n),…, xm(n)}

Page 27: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

A Wavelet-Based Approach

Page 28: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

Kepler-like Noise + Transits

Page 29: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

Single Transit Statistics

Page 30: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

Folded Transit Statistics

Page 31: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

Folded Statistics at Best-Matched Period

Page 32: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

Data Challenge Number 3

Excess Stellar Variability

Page 33: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

Image by Carter Roberts (1946-2008)

Excess Stellar VariabilityOriginal Noise Budget (Kp=12):

14 ppm Shot Noise10 ppm Instrument Noise10 ppm Stellar Variability

=> 20 ppm Total Noise

Reality (11.5 ≤ Kp ≤ 12.5)17 ppm Shot Noise13 ppm Instrument Noise20 ppm Stellar Variability

=> ~29 ppm Total Noise

Page 34: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

Original expectations yielded ~65% completeness for Earth analogs at 3.5 years

Completeness Vs. Time

Expected

Page 35: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

Current expectations yield <5% completeness for Earth analogs at 3.5 years

Expected Reality

Completeness Vs. Time

Page 36: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

~65% completeness for 1.2-Re planets in same orbits at 3.5 years

Expected Reality

Completeness Vs. Time

Page 37: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

Kepler will recover >60% completeness for Earth analogs after 8 years

Expected Reality

Completeness Vs. Time

Page 38: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

Completeness Vs. Time

20 ppm

30 ppm

Kepler will detect virtually all Venus analogs within 8 years

Page 39: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

Kepler is revolutionizing the field of exoplanets Kepler data are in a class of their own with

significant data challenges Huge dynamic range for measurements requires

sophisticated Bayesian techniques for correcting systematic errors

Planet detection requires an efficient, adaptive

Conclusions

method that accounts for non-white noise: wavelets fit the bill Kepler can reach its goal of detecting Earth-Sun analogs with an extended 8 year

mission Each day we are getting closer and closer to finding an Earth-Sun analog

Page 40: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

Image by Carter Roberts (1946-2008)

Music From the Stars

Page 41: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

Image by Carter Roberts (1946-2008)

Music From the Stars (2)

41

Page 42: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

Image by Carter Roberts (1946-2008)

Music From the Stars (3)

42

Page 43: Data Challenges in Astronomy: NASA’s  Kepler Mission  and the Search for Extrasolar Earths

Image by Carter Roberts (1946-2008)

Music From the Stars (4)

43