I tried to reproduce the error for a wf, at least to see what goes on inside the loop, and noticed that the loop is exited before the rejectedWork variable is assigned.
Now, I need to understand:
- whether the wfs are stuck due to this failure or not
- why it only affects ReReco workflows
How to reproduce it
Steps to reproduce the behavior:
Use any of the following wfs with the following piece of code
INFO:reqmgrInteraction:Added 0 new elements for request: cmsunified_ACDC1_Run2024G_Muon1_ZMu_PromptMUODPGNano_241105_094516_3263
INFO:reqmgrInteraction:Added 0 new elements for request: cmsunified_ACDC1_Run2024G_Muon1_ZMu_PromptMUODPGNano_241105_094536_6961
INFO:reqmgrInteraction:Added 0 new elements for request: cmsunified_ACDC2_Run2024F_Muon1_PromptMUOPOGNano_241105_094752_6353
INFO:reqmgrInteraction:Added 0 new elements for request: cmsunified_ACDC3_Run2024D_Muon0_PromptMUOPOGNano_241105_094735_2323
INFO:reqmgrInteraction:Added 0 new elements for request: cmsunified_ACDC3_Run2024E_EGamma0_PromptEGMNano_v2_241105_094838_4316
INFO:reqmgrInteraction:Added 0 new elements for request: cmsunified_ACDC3_Run2024E_JetMET0_PromptBTVJMENano_v2_241105_094856_6606
from WMCore.WMSpec.WMWorkload import WMWorkloadHelper, getWorkloadFromTask
from WMCore.WorkQueue.Policy.Start import startPolicy
from WMCore.Services.Rucio.Rucio import Rucio

# NOTE: rmr (the client used to fetch the spec) and rucioObj (a Rucio instance)
# are not defined in this snippet; they must be created beforehand.

## Fetch spec file
spec_file = rmr.getSpec('cmsunified_ACDC3_Run2024E_JetMET0_PromptBTVJMENano_v2_241105_094856_6606')
wh = WMWorkloadHelper(spec_file)
for topLevelTask in wh.taskIterator():
    spec = getWorkloadFromTask(topLevelTask)
    policyName = spec.startPolicy()
    print(policyName)
    policy = startPolicy(policyName, {'ResubmitBlock': {'args': {}, 'name': 'ResubmitBlock'}},
                         rucioObj=rucioObj)
    print(policy.supportsWorkAddition())
    # When the policy does not support work addition, the iteration is skipped
    if not policy.supportsWorkAddition():
        print('continue - loop exits before rejectedWork is assigned')
Expected behavior
rejectedWork to be defined outside of the loop.
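To illustrate the expected behavior, here is a minimal, self-contained sketch of the pattern. It is not the WMCore code: FakePolicy, add_work_buggy and add_work_fixed are hypothetical names, and the assumption that a policy call returns a (units, rejectedWork, badWork) triple is made only for the illustration. It shows how a loop that skips every iteration leaves the variable unassigned, and how initializing it before the loop avoids that.

```python
class FakePolicy:
    """Hypothetical stand-in for a WorkQueue start policy (not WMCore code)."""

    def __init__(self, supports_addition):
        self._supports_addition = supports_addition

    def supportsWorkAddition(self):
        return self._supports_addition

    def __call__(self):
        # Assumed return shape for this sketch: (units, rejectedWork, badWork)
        return ["unit"], [], []


def add_work_buggy(policies):
    for policy in policies:
        if not policy.supportsWorkAddition():
            continue  # skips the assignment below
        units, rejectedWork, badWork = policy()
    # Raises UnboundLocalError if every iteration hit `continue`
    return rejectedWork


def add_work_fixed(policies):
    # Defined outside the loop, so they exist even if all iterations are skipped
    rejectedWork = []
    badWork = []
    for policy in policies:
        if not policy.supportsWorkAddition():
            continue
        units, rejected, bad = policy()
        rejectedWork.extend(rejected)
        badWork.extend(bad)
    return rejectedWork
```

With a single policy that does not support work addition, add_work_buggy fails on the return statement, while add_work_fixed returns an empty list. Whether an empty list is the right default for the real WorkQueue code is a separate question for the maintainers.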
I think I saw some conversation about this topic in recent days, but I cannot find it now. I remember Alan saying that yes, this error is not clear and the message can be improved, but that it means something is broken with the workflow itself and that the cause of this error must be searched for elsewhere.
@hassan11196 thank you for creating this ticket.
As Dario correctly pointed out, this ticket is a duplicate of #11681
Given that you provided more content and some useful information, I will link this ticket in the previous one and close this as a duplicate. Thank you again for taking the time to report this!
Impact of the bug
Global WorkQueue
Describe the bug
I was debugging ACDCs of ReReco wfs and while going through the Global WorkQueue logs, I found the following error
which is thrown by the following line,
https://github.com/hassan11196/WMCore/blob/master/src/python/WMCore/WorkQueue/WorkQueue.py#L1096
FYI @mapellidario