Filter efficiency from fragments should be dropped once and for all #3269

efeyazgan · 2022-09-30T08:45:44Z

Since Si hyun and Sitian are sure (see https://cms-talk.web.cern.ch/t/filter-efficiency-check-in-gen-check-script/15409) that the filter efficiency is not used at all I removed the check in the request checking script (see:
#3268) so this filter should be removed from all fragments. Please do that soon and then I will adapt the script not to cause inconsistency in UL consistency check. However, if it is not removed soon, I think we should revert back the request checking script.

In any case, I do not take any responsibility for any problems that this change may cause: #3268.

Thanks.

agrohsje · 2022-10-03T13:43:43Z

Hi Efe. The concern I had was that this info is written in our edm files. So we cannot know if someone is using or not. Technically, it is not relevant. Only this statement is true. That's why I like the solution of Sihyun, i.e. consider 1 or -1 as some default that is ok. Allowing again random numbers is not so great. We are just too many people to assure no one is using.

efeyazgan · 2022-10-03T15:58:17Z

Hi Alexander, yes, but following the same reasoning we could have kept the check. If something is used, we should check how it is used. I am sure that check was there for a reason...

sihyunjeon · 2022-10-05T19:30:34Z

Hi so there are several things to think about here.

This creates somewhat unnecessary entropy for the MC contacts because
a. We don't know whether there are any real users for this metadata so we might be putting our efforts to something without real gain.
b. Removing the filter efficiency fragment still works (avoids gen checking script errors) - which means subset of the samples would for sure have broken filter efficiency written in GEN files as metadata. So subset of our samples are already broken.
c. It's not so easy to modify this through python scripts because efficiency from the run log (for filter efficiency) only gets delivered through email. I don't think the run log results are stored somewhere in McM, at least I am not aware. If I am correct, one needs to crop out the values from email boxes and tweak it into the fragments and I would hardly imagine MC contacts doing this. So in the end it would return to b. where MC contacts will just remove the line to avoid the problem, breaking the variables that should not be used.

But as Alexander said, SOME might use this and it MIGHT be not totally useless to store correct values. So allowing default settings (filter efficiency >= 1.0 or <=0.0 which doesn't make sense) is sort of compromise in between.

agrohsje · 2022-10-11T10:32:49Z

Hi Efe,
maybe this was not clear in my message: I think we should have kept the check (and the motivation now is the same as back then: it is stored in EDM and we don't know if someone is using it or not) but just slightly modify it as proposed by Sihyun:
But as Alexander said, SOME might use this and it MIGHT be not totally useless to store correct values. So allowing default settings (filter efficiency >= 1.0 or <=0.0 which doesn't make sense) is sort of compromise in between.
Cheers, Alexander.

sihyunjeon · 2022-10-11T10:37:08Z

Just to add a bit more

(filter efficiency >= 1.0 or <=0.0 which doesn't make sense)

This means that "filter efficiency itself doesn't make sense already and the users if they exist, they would know it's not trustable so they would avoid using it. but if it's some realistic value e.g. 0.48, people might believe the value is true and mistakenly use it if wrong values are stored."

So my proposal was, avoid checking unrealistic values BUT check realistic values to make sure people don't use them.

efeyazgan · 2022-10-11T11:45:03Z

OK, done. See #3280

efeyazgan · 2022-10-12T11:32:13Z

See the update: #3282

tvami · 2023-12-06T10:07:09Z

I run into this now too, if the eff is set to -1 it's still failing the script. I even understand from the thread above that it was discussed that in case of the eff is negative we should not give an error. Anyway, I made a PR to add that patch #3572

efeyazgan mentioned this issue Sep 30, 2022

filter efficiency from fragment and from generator field consistency … #3268

Merged

tvami mentioned this issue Dec 6, 2023

Fix the case when filter eff is negative in the fragement #3572

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Filter efficiency from fragments should be dropped once and for all #3269

Filter efficiency from fragments should be dropped once and for all #3269

efeyazgan commented Sep 30, 2022

agrohsje commented Oct 3, 2022

efeyazgan commented Oct 3, 2022

sihyunjeon commented Oct 5, 2022

agrohsje commented Oct 11, 2022

sihyunjeon commented Oct 11, 2022

efeyazgan commented Oct 11, 2022

efeyazgan commented Oct 12, 2022

tvami commented Dec 6, 2023

Filter efficiency from fragments should be dropped once and for all #3269

Filter efficiency from fragments should be dropped once and for all #3269

Comments

efeyazgan commented Sep 30, 2022

agrohsje commented Oct 3, 2022

efeyazgan commented Oct 3, 2022

sihyunjeon commented Oct 5, 2022

agrohsje commented Oct 11, 2022

sihyunjeon commented Oct 11, 2022

efeyazgan commented Oct 11, 2022

efeyazgan commented Oct 12, 2022

tvami commented Dec 6, 2023