44
Photo-z's with Bayesian priors on physical properties of galaxies Masayuki Tanaka 0 – Photo-z primer 1 – A new photo-z code 2 – Some results

0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

Photo-z's with Bayesian priors on physical properties of galaxies

Masayuki Tanaka

0 – Photo­z primer

1 – A new photo­z code

2 – Some results

Page 2: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

0 – photo-z primer

Info from an astronomical observation is fundamentally 2D.

Photon energy

Our physical understandingSpectral stretch (= redshift)

Cosmology

distance

Page 3: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

Spectral energy distribution of galaxies

A sample BOSS/SDSS-III spectrum from Thomas+ 2013

Page 4: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

Why don't you get spec-z's for all the galaxies?

That's impossible! Spec-z's are VERY expensive for two reasons.

1 – Depth:To get unbiased redshfits for i~22.5 objects on 8m telescopes, you need

~1 hour

Probably not a fair comparison, but if you just want to detect an i=22.5 object with imaging, you just need

10 seconds!

Page 5: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

Why don't you get spec-z's for all the galaxies?

2 – Multiplexity:e.g., VLT/VIMOS: ~150 galaxies in one go

Again as a non-fair comparison, a single pointing with HSC (1.5sqdeg) detects >10^4 objects.

Page 6: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

Spectroscopy vs. imaging

Technique spectral resolution depth Ngal (~redshift accuracy) (1h on 8m)

Low-res spectroscopy R~1000 i<~23 ~100

Broad-band imaging R~5 i~26 ~10^5

Spectroscopy is a technique to precisely measure redshifts for relatively bright galaxies.

Photometric redshift based on imaging data is a technique to infer redshifts of a large number of objects with a limited accuracy.

Page 7: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

Three categories for photo-z estimators

1 – template fitting : old-fashioned but powerful technique

2 – numerical fitting (polynomial, random forest, ANN) : relatively new technique

3 – spatial clustering : very new technique

Page 8: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

Template fitting technique

Use SEDs of galaxies as a priori knowledge and infer redshifts.

SEDs can be either observed ones, theoretical ones, PCA templates. But, it is often a problem whether you cover all the diversity of spectral types of galaxies at all redshifts.

Advantage is that you can go deeper than spec-z.

Page 9: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

Template fitting technique

Page 10: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

Numerical technique

Polynomial fitting by Connolly+ 1996.

There are many numerical techniques: artificial neural network, random forest, etc.

An unbiased training spec-z sample that uniformly covers the multi-color space is essential. Obviously, you cannot go beyond the spectroscopic limit. A good thing is that there is no physics involved here.

Page 11: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

Numerical technique (machine learning)

There are many numerical techniques: artificial neural network, random forest, etc.

An unbiased training spec-z sample that uniformly covers the multi-color space is essential. Obviously, you cannot go beyond the spectroscopic limit. A good thing is that there is no physics involved here.

Collister and Lahav 2004

Page 12: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

Numerical technique (machine learning)

Kind et al. 2013

There are many numerical techniques: artificial neural network, random forest, etc.

An unbiased training spec-z sample that uniformly covers the multi-color space is essential. Obviously, you cannot go beyond the spectroscopic limit. A good thing is that there is no physics involved here.

Page 13: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

Cross-correlation with spec-z sample

Cross-correlation between photometric and spectroscopic sample:

If the spectroscopic objects are located within a narrow redshift slice,

Integrated angular cross-correlation:

measure this from data measurable UNmeasurable

Integrated dark matter correlation functiondN/dz of photometric sample

Refer to Menard+ 2013 for details

Page 14: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

Cross-correlation with spec-z sample - continued

z

dN/dz

z

spectroscopic sample

If photometric sample has a narrow, single peak in dN/dz, then bias can be considered as a constant and its evolution can be ignored.

Page 15: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

Cross-correlation with spec-z sample - continued

z

dN/dz

z

spectroscopic sample photometric sample

If photometric sample has a narrow, single peak in dN/dz, then bias can be considered as a constant and its evolution can be ignored.

dN/dz

Page 16: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

Cross-correlation with spec-z sample - continued

z

dN/dz

z

spectroscopic sample

..but, if the photometric sample has multiple redshift peaks, then the bias evolution of the photometric sample is an issue.

Page 17: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

Cross-correlation with spec-z sample - continued

z

dN/dz

z

spectroscopic sample

..but, if the photometric sample has multiple redshift peaks, then the bias evolution of the photometric sample is an issue.

dN/dz

photometric sample

Need bp(z) at these peaks to translate the clustering signal into dn/dz

Page 18: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

Three categories for photo-z estimators

Template fitting: Pros: Can be applied to beyond spectroscopic flux limit. Generally works slow. Cons: Need to be calibrated against a 'reasonably' representative spec-z sample

Numerical fitting: Pros: No need to worry about physics. Works fast Cons: Need to be calibrated against 100% representative spec-z sample. Cannot be applied beyong spectroscopic limit.

Spatial clustering: Pros: Works for very faint galaxies. Cons: Need a large number of spec-z sample over a wide redshift range. But it does not have to be a representative sample. Cannot get redshift for individual galaxies. Bias evolution is an issue.

Page 19: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

Three categories for photo-z estimators

Template fitting: Pros: Can be applied to beyond spectroscopic flux limit. Generally works slow. Cons: Need to be calibrated against a 'reasonably' representative spec-z sample

Numerical fitting: Pros: No need to worry about physics. Works fast Cons: Need to be calibrated against a representative spec-z sample. Cannot be applied beyond spectroscopic limit.

Spatial clustering: Pros: Works for very faint galaxies. Cons: Need a large number of spec-z sample over a wide redshift range. But it does not have to be a representative sample. Cannot get redshift for individual galaxies. Bias evolution is an issue.

Page 20: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

Three categories for photo-z estimators

Template fitting: Pros: Can be applied to beyond spectroscopic flux limit. Generally works slow. Cons: Need to be calibrated against a 'reasonably' representative spec-z sample

Numerical fitting: Pros: No need to worry about physics. Works fast Cons: Need to be calibrated against 100% representative spec-z sample. Cannot be applied beyong spectroscopic limit.

Spatial clustering: Pros: Works for very faint galaxies. Cons: Need a large number of spec-z sample over a wide redshift range. But it does not have to be a representative sample. Cannot get redshift for individual galaxies. Bias evolution is an issue.

Page 21: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

1 – A new photo-z code: overview

Page 22: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

Motivation

You are probably not interested in muddy details of the photo-z business, but in short:

1 – as a galaxy person, I would like to know individual redshifts and physical properties. → clustering redshift is not an option

2 – I would like to go below the spectroscopic flux limit → template fitting is the only option

3 – Most templates used in the literature are observed ones at z=0, but we know galaxies evolve → generate SPS templates and apply Bayesian priors to let those templates evolve

The code outputs: -- P(galaxy), P(AGN), P(star) with corresponding redshifts -- stellar mass, SFR, and dust extinction fully marginalized over all the other parameters including redshift

Page 23: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

The code finally got a name!

The code did not have a name for a long time. It has been called TANAKA just to distinguish it from other codes, but it is not a unique name – there are 1.3million Tanaka's in Japan! It is now tentatively called MIZUKI: MasayukI's photo-Z compUting KIt

Page 24: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

a-1) Sample SED

Page 25: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

a-2) Priors: N(z), SFR vs M*, extinctin vs SFR

Assume that we can reconstruct N(z) reasonably well with the 30-band photo-z in COSMOS and also assume this functional form:

data

functional fit

18<i<19 24<i<25

A 'floor' suggested by Hildebrandt+2012

Page 26: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

a-2) Priors: N(z), SFR vs M*, extinctin vs SFR

SFR and stellar mass are correlated in star forming galaxies. That relation is known to evolve with redshift.

Wuyts et al. 2011

Page 27: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

a-2) Priors: N(z), SFR vs M*, extinctin vs SFR

Sobral et al. 2012

SFR and extinction are known to correlate. It seems that correlation evolves with redshift.

Page 28: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

d) Template error function

Page 29: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

3 – Results

Page 30: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

Assumed data set

The observed COSMOS grizY photometry as an input.

Assume the 30-band photo-z's as the truth table.

Consider objects down to i=25 for now.

It is REALLY BAD that we do not have the u-band photometry.

DO NOT EXPECT TO SEE GOOD PHOTO-Z'S IN THE FOLLOWING SLIDES!!!

Page 31: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

'Raw' photo-z

dispersion f_outlier

0.100 33.5 %

Page 32: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

Photo-z with template errfn

dispersion f_outlier

0.100 33.5 %

0.083 29.5 %

Page 33: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

Photo-z with errfn + M*-SFR prior

dispersion f_outlier

0.100 33.5 %

0.083 29.5 %

0.073 26.9 %

Page 34: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

Photo-z with errfn + M*-SFR + tau_V-SFR

dispersion f_outlier

0.100 33.5 %

0.083 29.5 %

0.073 26.9 %

0.065 22.3 %

Page 35: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

Photo-z with errfn + M*-SFR + tau_V-SFR + N(z)

dispersion f_outlier

0.100 33.5 %

0.083 29.5 %

0.073 26.9 %

0.065 22.3 %

0.059 19.3 %

Page 36: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

Is this code any better than the existing codes?

In this particular case, we got

i=24.0: f_outliers ~20% (Mizuki), ~30% (LePhare) i=25.0: f_outleirs ~35% (Mizuki), ~50% (LePhare)

So, it seems Mizuki is slightly better than LePhare.

Page 37: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

HSC survey

Miyazaki-san made all of my points. I would encourage you to use data from HSC!

Page 38: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

Systematic biasDahlen et al. 2013

Mizuki uncalibrated Mizuki calibrated

Page 39: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

Systematic bias

Hildebrandt et al. 2013

(CFHTLS)

Mizuki calibrated

Page 40: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

Is P(z) reliable?

Dahlen et al. 2013

Page 41: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

dN/dz reconstruction

black: input

red: sum z_phot

green: sum P(z)

Page 42: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

Use point estimates and discard z_phot>1.4...?

dN/dz reconstruction

black: input

red: sum z_phot

green: sum P(z)

<0.02

Observing condition dependent...

Nishizawa+ 2010:

Page 43: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

4 – Summary

Page 44: 0 – Photoz primer 1 – A new photoz code 2 – Some resultsth.nao.ac.jp/MEMBER/hamanatk/obs_workshop13/tanaka.pdf · Need bp(z) at these peaks to translate the clustering signal

Summary

Evolving priors on physical properties of galaxies are useful!

A template error function is useful, too.

We now can get physical properties measured in a self-consistent manner.

Systematic photo-z offset. Why?

P(z) is not very precise at this point, but I will wait for real HSC photometry off the pipeline.

Need further work on dn/dz reconstruction and outlier clipping.