Download and reformat benchmark data set from NASA/WMAP page #93

bnord · 2023-07-05T18:41:03Z

https://lambda.gsfc.nasa.gov/product/foreground/fg_sz_cluster.html

Download the simulated SZ halo catalogs
Do they connect to submaps that are also at that link?

kbanker · 2023-07-17T15:52:29Z

I've been working on this today: I downloaded the simulated SZ halo catalogs and have been exploring them. I think the catalogs connect to the submaps at the same link, but I haven't confirmed it independently yet. The submap covers an octant of the sky, but my code keeps treating it as a full-sky map, so I need to convert the long/lat positions.
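One quick way to test whether the catalog positions all sit inside a single octant is to check the coordinate ranges directly. This is a minimal sketch, assuming the catalog gives RA and Dec in degrees and that the octant spans 0 to 90 degrees in both; those bounds, the function name, and the toy positions are assumptions, not taken from the LAMBDA page.

```python
import numpy as np

def fraction_in_octant(ra_deg, dec_deg, ra_range=(0.0, 90.0), dec_range=(0.0, 90.0)):
    """Return the fraction of positions inside the assumed octant bounds."""
    ra = np.mod(np.asarray(ra_deg, dtype=float), 360.0)  # wrap RA into [0, 360)
    dec = np.asarray(dec_deg, dtype=float)
    inside = (
        (ra >= ra_range[0]) & (ra <= ra_range[1])
        & (dec >= dec_range[0]) & (dec <= dec_range[1])
    )
    return inside.mean()

# Toy positions (hypothetical): three inside the assumed octant, one outside.
ra = [10.0, 45.0, 89.9, 200.0]
dec = [5.0, 60.0, 30.0, -20.0]
frac = fraction_in_octant(ra, dec)
```

A fraction of 1.0 on the real catalog would be consistent with the haloes living in one octant; anything lower would mean the positions need remapping before they can be matched to the submap.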

bnord · 2023-07-17T16:25:46Z

Could you post some stats of the catalog? E.g., distributions/histograms of masses, redshifts, and SZ data

bnord · 2023-07-17T16:26:37Z

Does it look like the octant of the sky will need to be sliced around individual objects?

bnord · 2023-07-17T16:27:30Z

We did a paper on classification using this data in 2021: https://arxiv.org/abs/2102.13123

There may be some good info on data processing there.

bnord · 2023-07-17T16:27:45Z

Could we plan to put this into an h5 data format for posterity?

kbanker · 2023-07-18T17:52:26Z

These are a few histograms showing parts of the dataset. Notice the log scale for all of them other than the redshift.
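For reference, log-scaled histograms like these can be built with logarithmically spaced bin edges. The sketch below uses toy masses drawn at random; the function name and the toy data are illustrative assumptions, not the actual catalog columns.

```python
import numpy as np

def log_histogram(values, n_bins=20):
    """Histogram strictly positive values in logarithmically spaced bins."""
    values = np.asarray(values, dtype=float)
    values = values[values > 0]  # log-spaced bins need positive values
    edges = np.geomspace(values.min(), values.max(), n_bins + 1)
    counts, edges = np.histogram(values, bins=edges)
    return counts, edges

# Toy masses (hypothetical), log-uniform between 1e13 and 1e15 solar masses.
rng = np.random.default_rng(0)
masses = 10 ** rng.uniform(13.0, 15.0, size=1000)
counts, edges = log_histogram(masses, n_bins=10)
```

Redshift is the exception: it spans a narrow linear range, so ordinary linear bins are the natural choice there.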

kbanker · 2023-07-18T18:12:00Z

> Does it look like the octant of the sky will need to be sliced around individual objects?

What exactly do you mean by this? I don't believe the octant needs to be sliced at all. In fact, there may be no need to slice the full-sky map either: further reading has led me to believe that the full-sky map is generated from the single octant via reflections, so we can probably just work with that one octant. I'm trying to verify this with the data, though.

bnord · 2023-07-18T19:11:44Z

We'll need a little cut-out image of each individual halo/cluster to use as input to the inference process. Is that available already, or will we need to make cut-outs from already-existing larger maps?

kbanker · 2023-07-19T14:04:14Z

We would need to do cut-outs from the larger octant maps, since the maps are all full-size.
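A cut-out step could look roughly like the sketch below. It assumes the larger map has already been projected onto a flat 2D pixel grid and that each halo position has been converted to a (row, col) pixel; both are assumptions, since the maps on the page may be stored in a spherical pixelization that needs projecting first. The function name is hypothetical.

```python
import numpy as np

def cutout(flat_map, row, col, half_size):
    """Return a (2*half_size + 1)-pixel square stamp centred on (row, col).

    Pixels that fall off the edge of the map are filled with NaN.
    """
    size = 2 * half_size + 1
    stamp = np.full((size, size), np.nan)
    r0, r1 = row - half_size, row + half_size + 1
    c0, c1 = col - half_size, col + half_size + 1
    # Clip the requested window to the map bounds.
    rr0, rr1 = max(r0, 0), min(r1, flat_map.shape[0])
    cc0, cc1 = max(c0, 0), min(c1, flat_map.shape[1])
    stamp[rr0 - r0:rr1 - r0, cc0 - c0:cc1 - c0] = flat_map[rr0:rr1, cc0:cc1]
    return stamp

# Toy 10x10 map (hypothetical) with one interior and one edge cut-out.
big = np.arange(100, dtype=float).reshape(10, 10)
stamp = cutout(big, row=5, col=5, half_size=2)
edge_stamp = cutout(big, row=0, col=0, half_size=1)
```

Using an odd stamp size keeps the halo on a single, well-defined central pixel, which makes later pixel-by-pixel comparisons simpler.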

kbanker · 2023-07-27T15:44:02Z

I managed to figure out how to do cut-outs and put the maps in a comparable state to our sims. Here's an example:

A couple of things to note: since these are simulated maps, this specific cutout is from the map that describes only the SZ effect, so it has no noise from galactic dust or even from the CMB. However, it does include the kSZ effect, which our simulations do not include. We may want to use a summed map of the tSZ effect + CMB + noise for a better comparison with our simulations.

bnord · 2023-07-27T15:55:25Z

I can imagine there being KSZ-related physics in the simulation, but shouldn't the KSZ signal itself be a different signal and thus not in the map? Maybe I'm forgetting some of my SZ physics.

kbanker · 2023-07-27T16:24:25Z

> I can imagine there being KSZ-related physics in the simulation, but shouldn't the KSZ signal itself be a different signal and thus not in the map? Maybe I'm forgetting some of my SZ physics.

It is a different signal, but the map that I had downloaded and worked with was the tSZ map + the kSZ map + relativistic corrections, which is why the map shown has both kSZ and tSZ signals. The map that I'm switching to now is just the tSZ map.

bnord · 2023-07-27T19:01:37Z

ah cool cool. Yeah, let's go with the pure tSZ map for now.

Later, we can add in kSZ when we want to get spicy and upgrade.

kbanker · 2023-07-28T15:50:48Z

I downloaded the new pure tSZ map, and here's a comparison of a similar-sized cluster from this data:

vs from our simulation

bnord · 2023-07-31T16:36:59Z

very cool.

I'm guessing ours with noise will look similar.

@evavagiakis What do you think of the following for a diagnostic plot (for benchmark comparisons): choose haloes in our simulation and the NASA sim that are comparable in some variables (e.g., mass and redshift), and then subtract the two images and divide by one of the images (pixel by pixel)? This would give a residual map for each of our objects. We could potentially further summarize by taking the average over the pixels in the plot, or by looking at the distribution over the pixels in the residual image.
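The proposed diagnostic can be sketched as follows. The choice to divide by the benchmark image, the `floor` threshold that masks near-zero denominators, and the toy arrays are all assumptions made for illustration.

```python
import numpy as np

def fractional_residual(ours, benchmark, floor=1e-12):
    """Pixel-by-pixel (ours - benchmark) / benchmark.

    Pixels where |benchmark| <= floor are returned as NaN to avoid
    dividing by zero.
    """
    ours = np.asarray(ours, dtype=float)
    benchmark = np.asarray(benchmark, dtype=float)
    safe = np.where(np.abs(benchmark) > floor, benchmark, np.nan)
    return (ours - benchmark) / safe

# Toy 2x2 stamps (hypothetical values).
ours = np.array([[1.0, 2.0], [3.0, 4.0]])
bench = np.array([[2.0, 2.0], [0.0, 8.0]])
res = fractional_residual(ours, bench)
mean_res = np.nanmean(res)  # one-number summary over the valid pixels
```

Far from the cluster centre a pure-signal map tends toward zero, so the fractional residual becomes unstable there; restricting the summary to the central pixels, or using the plain difference instead, avoids that.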

evavagiakis · 2023-07-31T20:09:58Z

Sounds good to me. If we can do a component-by-component residual plot (just y, or just CMB, or just noise, for example), that might help with diagnostics as well. An average over the pixels sounds like the aperture photometry filter to me, which we can also use for comparison.
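For concreteness, one common form of an aperture photometry filter takes the mean inside a disk and subtracts the mean in a surrounding annulus of equal area. The sketch below assumes that form, a radius given in pixels, and a source at the geometric centre of the stamp; it is not necessarily the filter used in this project.

```python
import numpy as np

def aperture_photometry(stamp, radius_pix):
    """Mean inside a disk minus mean in the equal-area annulus around it."""
    ny, nx = stamp.shape
    y, x = np.indices(stamp.shape)
    r = np.hypot(y - (ny - 1) / 2.0, x - (nx - 1) / 2.0)
    disk = r <= radius_pix
    ring = (r > radius_pix) & (r <= np.sqrt(2.0) * radius_pix)
    return stamp[disk].mean() - stamp[ring].mean()

# Toy stamps (hypothetical): a constant background, and the same with a central bump.
flat = np.full((21, 21), 3.0)
bump = flat.copy()
bump[10, 10] += 5.0
ap_flat = aperture_photometry(flat, radius_pix=4)
ap_bump = aperture_photometry(bump, radius_pix=4)
```

Because the annulus is subtracted, a constant offset such as a large-scale background cancels, which is why the flat stamp returns zero.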

kbanker · 2023-08-01T16:07:20Z

Here's another cluster comparison + residuals. I've been comparing clusters with just the tSZ signal from the data, and just the dT map from our sims, but I'm wondering if I should add noise/beam convolution? I don't think it makes sense to compare with the CMB included, as it's always generated differently and could cause the residuals to be really large.

kbanker · 2023-08-01T16:26:06Z

The same cluster at 148 GHz

kbanker · 2023-08-01T16:30:58Z

Another cluster at 148 GHz

kbanker · 2023-08-01T16:32:53Z

I'm going to continue looking at more examples, but based on these two tests there doesn't seem to be a systematic high or low bias in our profile: our sim came out low in the first one and high in the second.

evavagiakis · 2023-08-01T16:40:03Z

We could start looking at sample averages, so for example take a set of sources with masses within one of the mass bins in Battaglia 2012, generate our sims for the same (z, M) sample, and either stack (average) the maps together and look at the residual between those two stacked maps, or look at the average of the residuals between each of those submap pairs (and plot a histogram of the residuals, either from a central value or from aperture photometry). Since the B12 profiles are fits to average profiles, it wouldn't surprise me too much if there's some scatter as long as the scatter is around 0.

I'd also like to understand better what is present in the Sehgal Compton-y maps (any instrument noise modeled in? beam assumptions?) to better advise on whether we should be beam convolving/adding noise ourselves in this comparison. There's probably a paper describing what is included; could you link that here if you have it?
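The two stacking options can be sketched with toy data. Note that for plain differences they give the same map by linearity, so they only diverge for fractional residuals or when the per-pair distribution is what we want to histogram. The array names, stamp size, and random toy values below are assumptions for illustration.

```python
import numpy as np

rng = np.random.default_rng(1)
n, size = 50, 9
m200 = rng.uniform(1.0e14, 2.0e14, size=n)          # toy masses in solar masses
ours = rng.normal(0.0, 1.0, size=(n, size, size))   # toy stamps from our sims
bench = ours + rng.normal(0.5, 1.0, size=(n, size, size))  # toy benchmark stamps

# Select one mass bin, then compare the two ways of combining the pairs.
in_bin = (m200 > 1.1e14) & (m200 < 1.7e14)
residual_of_stacks = bench[in_bin].mean(axis=0) - ours[in_bin].mean(axis=0)
mean_of_residuals = (bench[in_bin] - ours[in_bin]).mean(axis=0)

# Per-pair central-pixel residuals, ready to be histogrammed.
central = (bench[in_bin] - ours[in_bin])[:, size // 2, size // 2]
```

The `central` array is the piece that carries extra information: its spread shows the object-to-object scatter that the stacked maps average away.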

kbanker · 2023-08-02T14:11:45Z

I think the Sehgal sims used N-body simulations, but I don't see any mention of instrument noise/beam effects in the paper: https://ui.adsabs.harvard.edu/abs/2010ApJ...709..920S/abstract

I think getting an average of the residuals is doable, so I think I'll do that for one of the mass bins today.

bnord · 2023-08-02T14:23:21Z

When we used this data set in the past, I think we added our own noise and other stuff. I think there's some code for that here: https://github.com/deepskies/deepsz

kbanker · 2023-08-02T14:40:48Z

Here's the average over 100 loops, where each loop makes a residual map for one (z, M, R). This is the average of those residual maps. Specifically, this is for the mass bin 1.1e14 < M200 < 1.7e14 solar masses.

kbanker · 2023-08-02T15:04:37Z

Here it is with 1000 loops, but I think this makes it obvious that there is a centering error, where the Sehgal sims are centered but ours are off by 1 pixel. I'm going to work on fixing that now.
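A simple way to detect this kind of offset is to compare the brightest pixel with the stamp centre. The sketch below assumes the peak of the absolute signal marks the cluster centre, which holds for a smooth noise-free profile but not necessarily for noisy maps or maps with substructure. The function names are hypothetical.

```python
import numpy as np

def peak_offset(stamp):
    """Return (d_row, d_col) of the strongest pixel relative to the stamp centre."""
    peak = np.unravel_index(np.argmax(np.abs(stamp)), stamp.shape)
    centre = (stamp.shape[0] // 2, stamp.shape[1] // 2)
    return int(peak[0]) - centre[0], int(peak[1]) - centre[1]

def recentre(stamp):
    """Shift the stamp so that its strongest pixel lands on the centre pixel."""
    d_row, d_col = peak_offset(stamp)
    # np.roll wraps pixels around the edges, acceptable only for small shifts.
    return np.roll(stamp, shift=(-d_row, -d_col), axis=(0, 1))

# Toy stamp (hypothetical) with the signal one pixel below the centre.
stamp = np.zeros((11, 11))
stamp[6, 5] = -40.0
offset = peak_offset(stamp)
fixed = recentre(stamp)
```

One possible source of a one-pixel shift is an even-sized stamp, which has no single central pixel, so the two codes may pick different ones.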

bnord · 2023-08-02T15:12:28Z

I'm glad to see the smoothing (reduction in noise)

kbanker · 2023-08-03T14:54:52Z

After fixing the centering, I ran the loops again for the mass bin 1.1e14 < M200 < 1.7e14 solar masses. It seems that the signal from the Sehgal sims is generally higher by an average of 5.6 uK, and the Sehgal sims have a larger spread, likely due to the substructures included.

bnord · 2023-08-03T14:58:56Z

How big a difference do we expect for haloes in that mass range?
Have we figured out if $dT$ and $\mu\mathrm{K}$ are the right variables and units to be using here?
Could you remind me what to expect when it comes to the absolute units for clusters at a given mass?
Should we also plot our scaling relations of mass for TSZ to make sure we know what to expect?

bnord · 2023-08-03T14:59:29Z

Since this issue is about downloading and formatting the benchmark data set, we should probably move this analysis to a different issue or to a GitHub discussion.

kbanker · 2023-08-03T16:54:08Z

I think the central tSZ signal that we expect to see in that mass range is approximately 30-50 uK, so an average difference of 5.6 uK is not too bad. I do think this would still be dT and uK, since we are talking about the difference in temperature that the tSZ signal creates relative to the background, but I'm not sure.

I'm unclear on what you mean by the absolute units for clusters at a given mass. Also, how would we plot our scaling relations for mass vs tSZ, given that the profile is a function of mass, radius and redshift? I could plot mass vs tSZ signal for a specific z, R200 and radius, if that's what you mean?
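On the units question, the standard non-relativistic tSZ relation connects the dimensionless Compton-y parameter to a temperature shift: $\Delta T / T_{\rm CMB} = y\,(x \coth(x/2) - 4)$ with $x = h\nu / (k_B T_{\rm CMB})$. The sketch below evaluates it at 148 GHz. It ignores relativistic corrections and the instrument bandpass, and the value $y = 10^{-5}$ is an illustrative assumption, not a number from either simulation.

```python
import math

H_PLANCK = 6.62607015e-34  # Planck constant, J s
K_BOLTZ = 1.380649e-23     # Boltzmann constant, J / K
T_CMB = 2.7255             # CMB temperature, K

def tsz_spectral_factor(freq_ghz):
    """Non-relativistic tSZ factor f(x) = x * coth(x / 2) - 4."""
    x = H_PLANCK * freq_ghz * 1e9 / (K_BOLTZ * T_CMB)
    return x / math.tanh(x / 2.0) - 4.0

def y_to_delta_t_uk(y, freq_ghz):
    """Convert Compton-y to a temperature shift in microkelvin."""
    return y * tsz_spectral_factor(freq_ghz) * T_CMB * 1e6

f_148 = tsz_spectral_factor(148.0)        # close to -1 at this frequency
dt_148 = y_to_delta_t_uk(1e-5, 148.0)     # negative: a decrement below the null
```

With these inputs the shift comes out at roughly -27 uK, so a central signal of a few tens of uK corresponds to a central y of order 1e-5 at 148 GHz.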

kbanker · 2023-08-03T16:55:21Z

> Since this issue is about downloading and formatting the benchmark data set, we should probably move this analysis to a different issue or to a GitHub discussion.

I just saw this but have to agree. Do you know of a good way to get all the comments from here into a new issue/discussion, or should we maybe just rename this issue to something that encompasses downloading + analyzing the data set?

bnord · 2023-08-03T17:00:42Z

I think it's okay to create a new issue and reference this one.
We could also rename some things. I don't think I have a big preference.

But we should probably split these tasks up so that we can keep track a little better.

bnord · 2023-08-03T17:03:56Z

You answered my question about the absolute value being 30-50 uK.

The scaling relation would have to be something like:

1. for a given redshift range, plot mass vs TSZ at the center of the cluster (so literally the central pixel)
2. for a given redshift range, plot mass vs TSZ within some finite aperture (this is what Elaine and Eve are working on; aperture photometry)
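The central-pixel version of this scaling relation could be sketched as below: select a redshift slice, read off the central pixel of each stamp, and take the median in log-spaced mass bins. The function name, the binning choice, and the toy linear scaling of signal with mass are assumptions for illustration only.

```python
import numpy as np

def central_signal_vs_mass(m200, z, stamps, z_range, n_bins=5):
    """Median central-pixel signal in log-spaced mass bins for one redshift slice."""
    m200 = np.asarray(m200, dtype=float)
    z = np.asarray(z, dtype=float)
    sel = (z >= z_range[0]) & (z < z_range[1])
    row, col = stamps.shape[1] // 2, stamps.shape[2] // 2
    central = stamps[sel, row, col]
    edges = np.geomspace(m200[sel].min(), m200[sel].max(), n_bins + 1)
    # Bin index per object; clip so the extreme masses fall in the end bins.
    which = np.clip(np.digitize(m200[sel], edges) - 1, 0, n_bins - 1)
    medians = np.array([
        np.median(central[which == i]) if np.any(which == i) else np.nan
        for i in range(n_bins)
    ])
    return edges, medians

# Toy catalog (hypothetical): signal made proportional to mass, in uK.
m200 = np.geomspace(1e14, 1e15, 200)
z = np.tile([0.1, 0.3, 0.5, 0.7], 50)
stamps = np.zeros((200, 9, 9))
stamps[:, 4, 4] = -30.0 * (m200 / 1e14)
edges, medians = central_signal_vs_mass(m200, z, stamps, z_range=(0.2, 0.4), n_bins=4)
```

The aperture version follows the same pattern, with the central-pixel lookup replaced by an aperture measurement on each stamp.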

kbanker · 2023-08-04T15:25:07Z

We can continue in #123. I thought a discussion would be better, since that seems more appropriate for this sort of broad analysis.

bnord added the enhancement label (New feature or request) Jul 5, 2023

bnord assigned kbanker Jul 17, 2023
