-
Notifications
You must be signed in to change notification settings - Fork 183
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Ability to define merged channels in PostFitShapesFromWorkspace (+ a few random fixes) #295
base: main
Are you sure you want to change the base?
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks, this looks good.
I was not familiar with LimitCompare.cc
, but I gues it was unused and being removed for keeping things clean?
@@ -1191,6 +1192,9 @@ def FixTopRange(pad, fix_y, fraction): | |||
if ymin == 0.: | |||
print('Cannot adjust log-scale y-axis range if the minimum is zero!') | |||
return | |||
if fix_y <= 0: | |||
print('Cannot adjust log-scale y-axis range if the maximum is zero!') |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should we maybe put "less than or equal to zero" here in the printout, just since that's what the check is doing?
For the merging would it make sense to check if the channel that a user wants to merge have the same binning and if it is not the case print out a warning message? I think TH1.Add() adds histograms by bin number and completely disregards the differences in bin labels |
@anigamova I think it's a good idea. It's true it probably does something undesirable if this is the case |
Have been rethinking this one a bit, and #267. We might be able to make a bigger improvement, that will also help performance, by also doing some restructuring. One of the issues often reported is that PostFitShapesFromWorkspace is quite slow for more complex models. This is partly due to the fact that each call to A better way could be to add a new function that returns a list of shapes/yields according to some input specification. By default, this could be a list of each category, within which a list of processes, as well as the usual signal/background/total sums. But we could allow custom specs, e.g. defining a new category based on a list of actual categories, or a regex to match them - similarly for sums of processes. Then all the shapes/yields the user has requested can be computed with just one loop. We might also build into this a mechanism to optionally return the full set of variations. |
Hi @ajgilbert , Yes, this seems like a good way to increase performance! Since the changes you propose are mostly relevant for the backend of the harvester, it might make sense to still go ahead with this PR and #267 in my personal opinion. This way, we could already use the improvements to |
Adds option
--merged-channels
to PostFitShapesFromWorkspace that lets the user define additional channel labels that will be calculated as the sum over all the channels matching a particular regex pattern. E.g., given a card with channels:chn_ee_2016, chn_ee_2017, chn_ee_2018, chn_mm_2016, chn_mm_2017, chn_mm_2018
, we could add:--merged-channels 'chn_ee_tot=chn_ee_201.' 'chn_mm_tot=chn_mm_201.'
, and these additional merged channels will appear as directories in the ROOT file.Partially addresses #291.
This PR also fixes a few assorted bugs, mostly cases where we need to avoid divide by zero.