-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Download and reformat benchmark data set from NASA/WMAP page #93
Comments
I've been working on this today, downloaded and have been playing around with the simulated SZ halo catalogs. I think the catalogs connect to submaps that are at the same link, but I haven't confirmed it independently yet, as the submap is for an octant of the sky, but my code keeps treating it as a full sky map (so I need to convert the long/lat positions) |
Could you post some stats of the catalog? E.g., distributions/histograms of masses, redshifts, and SZ data |
Does it look like the octant of the sky will need to be sliced around individual objects? |
We did a paper on classification using this data in 2021: https://arxiv.org/abs/2102.13123 There may be some good info on data processing there. |
Could we plan to put this into an h5 data format for posterity? |
What exactly do you mean by this? I don't believe the octant needs to be sliced at all. In fact, there might be no need to slice the full-sky either as doing some more reading has led me to believe that the full-sky map is generated from the single octant via reflections, so we can probably just work with that one octant. I am working on trying to verify this with the data though |
We'll need a little cut-out image of each individual halo/cluster to use as input to the inference process. Is that available already, or will we need to make cut-outs from already-existing larger maps? |
We would need to do cut-outs from the larger octant maps, since the maps are all full-size. |
I can imagine their being KSZ-related physics in the simulation, but shouldn't the KSZ signal itself be a different signal and thus not in the map? Maybe I'm forgetting some of my SZ physics. |
It is actually a different signal, but the map that I had downloaded/worked with was actually the tSZ map + the kSZ map + relativistic corrections, and that's why the map shown has KSZ and TSZ signals. The map that I'm switching to now would be just the tSZ map. |
ah cool cool. Yeah, let's go with the pure tSZ map for now. Later, we can add in kSZ when we want to get spicy and upgrade. |
very cool. I'm guessing ours with noise will look similar. @evavagiakis What do you think of the following for a diagnostic plot (for benchmark comparisons): choose haloes in our simulation and the NASA sim that are comparable in some variables (e.g., mass and redshift), and then subtract the two images and divide by one of the images (pixel by pixel)? This would give a residual map for each our objects. We could potentially further summarize by taking the average over the pixels in the plot, or looking at the distribution over the pixels in the residual image. |
Sounds good to me. If we can do a component by component residual plot (just y, or just cmb, or just noise for example) that might help in diagnostics as well. An average over the pixels sounds like the aperture photometry filter to me which we can also use for comparison |
I'm still going to continue looking at more examples, but it seems as though there isn't a specific higher/lower bias in our profile based off of these 2 tests since the first one our sim went low, but it went high in the second one. |
We could start looking at sample averages, so for example take a set of sources with masses within one of the mass bins in Battaglia 2012, generate our sims for the same (z, M) sample, and either stack (average) the maps together and look at the residual between those two stacked maps, or look at the average of the residuals between each of those submap pairs (and plot a histogram of the residuals, either from a central value or from aperture photometry). Since the B12 profiles are fits to average profiles, it wouldn't surprise me too much if there's some scatter as long as the scatter is around 0. I'd also like to understand better what is present in the Seghal Compton-y maps (any instrument noise modeled in? beam assumptions?) to better advise on whether we should be beam convolving/adding noise ourselves in this comparison. There's probably a paper describing what is included, could you link that here if you have it? |
I think the Seghal sims used N-body simulations, but I don't see any mention of instrument noise/beam effects in the paper here https://ui.adsabs.harvard.edu/abs/2010ApJ...709..920S/abstract. I think getting an average of the residuals is doable, so I think I'll do that for one of the mass bins today. |
When we used this data set in the past, I think we added our own noise and other stuff. I think there's some code for that here: https://github.com/deepskies/deepsz |
I'm glad to see the smoothing (reduction in noise) |
How big a difference do we expect for haloes in that mass range? |
Since this issue is about downloading and formatting the benchmark data set, we should probably move this analysis to a different issue or a to a github discussion. |
I think the central tSZ signal that we expect to see in that mass range is approximately 30-50 uK, so an average of 5.6 uK is not too bad. I do think this would still be dT and uK, since we are talking about the difference in temperature that the tSZ signal creates from the background, but I'm not sure? I'm unclear on what do you mean by the absolute units for clusters at a given mass? Also, how would we plot our scaling relations for mass vs tSZ given that the profile is a function of mass, radius and redshift? I could plot mass vs tSZ signal given a specific z, and R200 and radius if that's what you mean? |
I just saw this but have to agree. Do you know of a good way to get all the comments from here into a new issue/discussion, or should we maybe just rename this issue to something that encompasses downloading + analyzing the data set? |
I think it's okay to create a new issue and reference this one. but, we should probably split these tasks up so tha twe can keep track a little better. |
You answered my question about the absolute value being 30-50. The scaling relation would have to be something like
|
We can continue the discussion in #123, as I thought a discussion might be better since that seems appropriate for this sort of broad analysis |
https://lambda.gsfc.nasa.gov/product/foreground/fg_sz_cluster.html
The text was updated successfully, but these errors were encountered: