-
Notifications
You must be signed in to change notification settings - Fork 39
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
submission is broken when asking for GPU #8784
Comments
|
note this maybe we need to keep |
maybe all of this is useless CRABServer/src/python/TaskWorker/Actions/DagmanSubmitter.py Lines 96 to 109 in 95ce26a
Note this ! CRABServer/src/python/TaskWorker/Actions/DagmanSubmitter.py Lines 31 to 35 in 95ce26a
Lack of cleanup strikes back :-( |
for reference, here's the user's config file config = config()
# General settings
config.General.requestName = 'gpu_test_job'
config.General.workArea = 'testcrabgpu_nov12_1'
config.General.transferOutputs = True
config.General.transferLogs = True
# JobType settings
config.JobType.pluginName = 'PrivateMC'
config.JobType.psetName = 'PSet.py'
config.JobType.allowUndistributedCMSSW = True
config.JobType.scriptExe = './run_job.sh' # Shell script that runs the Python job
config.JobType.inputFiles = ['gpu_test.py', 'run_job.sh', 'FrameworkJobReport.xml'] # Include Python code and shell script
config.JobType.outputFiles = ['gpu_output.txt'] # Expected output file
config.JobType.maxMemoryMB = 2000
config.JobType.maxJobRuntimeMin = 100
config.Data.outputPrimaryDataset = 'GPU_Test_Dataset'
config.Data.splitting = 'EventBased' # Splitting type for non-CMSSW jobs
config.Data.unitsPerJob = 1
config.Data.totalUnits = 1
#config.Data.outLFNDirBase = '/store/user/aherrera' # Output directory for job results
config.Data.publication = False
#config.Data.secondaryInputFiles = ['root://cmseos.fnal.gov//store/user/aherrera/JOBMERGED/ttboosted/ttboosted_01/tt_jj0p5.root']
# Site settings
config.section_("Site")
config.Site.storageSite = 'T3_US_FNALLPC'
#config.Site.whitelist = ['T2_US_Caltech', 'T2_US_Florida', 'T2_US_Purdue', 'T2_US_Wisconsin']
config.Site.requireAccelerator = True # Specify supports GPUs |
removing the lines indicated above made dag bootstrap run and submit jobs. I have asked SI for help: https://mattermost.web.cern.ch/cms-o-and-c/pl/yi4eoususjgo8gg8k616qu6m9r |
there is some special problem with KIT. Once I extended the possible site list job ran immediately at T2_US_Wisconsin. |
closed via #8796 |
see https://cms-talk.web.cern.ch/t/crab-jobs-requesting-gpu-stay-idle-forever/61932/1
The problem is that the initial dag bootstrap job submitted to scheduler universe requires one GPU.
Need to convert "Request_GPUs" to "CRAB_Request_GPUs".
so the dag boostrap stay idle forever
The text was updated successfully, but these errors were encountered: