Skip to content

Commit

Permalink
[amd] regenerate all processes with the workaround for HIP FPE madgra…
Browse files Browse the repository at this point in the history
  • Loading branch information
valassi committed Oct 4, 2024
1 parent 13ebdbe commit dcf3a99
Show file tree
Hide file tree
Showing 45 changed files with 258 additions and 202 deletions.
16 changes: 8 additions & 8 deletions epochX/cudacpp/ee_mumu.mad/CODEGEN_mad_ee_mumu_log.txt
Original file line number Diff line number Diff line change
Expand Up @@ -58,7 +58,7 @@ generate e+ e- > mu+ mu-
No model currently active, so we import the Standard Model
INFO: load particles
INFO: load vertices
DEBUG: model prefixing takes 0.006308317184448242 
DEBUG: model prefixing takes 0.006434440612792969 
INFO: Restrict model sm with file models/sm/restrict_default.dat .
DEBUG: Simplifying conditional expressions 
DEBUG: remove interactions: u s w+ at order: QED=1 
Expand Down Expand Up @@ -150,7 +150,7 @@ INFO: Checking for minimal orders which gives processes.
INFO: Please specify coupling orders to bypass this step.
INFO: Trying process: e+ e- > mu+ mu- WEIGHTED<=4 @1
INFO: Process has 2 diagrams
1 processes with 2 diagrams generated in 0.005 s
1 processes with 2 diagrams generated in 0.004 s
Total: 1 processes with 2 diagrams
output madevent_simd ../TMPOUT/CODEGEN_mad_ee_mumu --hel_recycling=False --vector_size=32
Output will be done with PLUGIN: CUDACPP_OUTPUT
Expand Down Expand Up @@ -180,19 +180,19 @@ INFO: Finding symmetric diagrams for subprocess group epem_mupmum
DEBUG: iconfig_to_diag =  {1: 1, 2: 2} [model_handling.py at line 1551] 
DEBUG: diag_to_iconfig =  {1: 1, 2: 2} [model_handling.py at line 1552] 
Generated helas calls for 1 subprocesses (2 diagrams) in 0.004 s
Wrote files for 8 helas calls in 0.070 s
Wrote files for 8 helas calls in 0.069 s
DEBUG: self.vector_size =  32 [export_v4.py at line 7023] 
ALOHA: aloha starts to compute helicity amplitudes
ALOHA: aloha creates FFV1 routines
ALOHA: aloha creates FFV2 routines
ALOHA: aloha creates FFV4 routines
ALOHA: aloha creates 3 routines in 0.199 s
ALOHA: aloha creates 3 routines in 0.201 s
ALOHA: aloha starts to compute helicity amplitudes
ALOHA: aloha creates FFV1 routines
ALOHA: aloha creates FFV2 routines
ALOHA: aloha creates FFV4 routines
ALOHA: aloha creates FFV2_4 routines
ALOHA: aloha creates 7 routines in 0.252 s
ALOHA: aloha creates 7 routines in 0.255 s
<class 'aloha.create_aloha.AbstractRoutine'> FFV1
<class 'aloha.create_aloha.AbstractRoutine'> FFV1
<class 'aloha.create_aloha.AbstractRoutine'> FFV2
Expand Down Expand Up @@ -232,9 +232,9 @@ Type "launch" to generate events from this process, or see
Run "open index.html" to see more information about this process.
quit

real 0m2.042s
user 0m1.792s
sys 0m0.243s
real 0m2.097s
user 0m1.775s
sys 0m0.272s
Code generation completed in 2 seconds
************************************************************
* *
Expand Down
5 changes: 4 additions & 1 deletion epochX/cudacpp/ee_mumu.mad/src/HelAmps_sm.h
Original file line number Diff line number Diff line change
Expand Up @@ -478,7 +478,10 @@ namespace mg5amcCpu
}
else
{
const fptype emp = pvec0 / ( vmass * pp );
//printf( "DEBUG1011 (before emp): pvec0=%f vmass=%f pp=%f vmass*pp=%f\n", pvec0, vmass, pp, vmass * pp );
//const fptype emp = pvec / ( vmass * pp ); // this may give a FPE #1011 (why?! maybe when vmass=+-epsilon?)
const fptype emp = pvec0 / vmass / pp; // workaround for FPE #1011
//printf( "DEBUG1011 (after emp): emp=%f\n", emp );
vc[2] = cxmake( hel0 * pp / vmass, 0. );
vc[5] = cxmake( hel0 * pvec3 * emp + hel * pt / pp * sqh, 0. );
if( pt != 0. )
Expand Down
16 changes: 8 additions & 8 deletions epochX/cudacpp/ee_mumu.sa/CODEGEN_cudacpp_ee_mumu_log.txt
Original file line number Diff line number Diff line change
Expand Up @@ -58,7 +58,7 @@ generate e+ e- > mu+ mu-
No model currently active, so we import the Standard Model
INFO: load particles
INFO: load vertices
DEBUG: model prefixing takes 0.006402015686035156 
DEBUG: model prefixing takes 0.0062215328216552734 
INFO: Restrict model sm with file models/sm/restrict_default.dat .
DEBUG: Simplifying conditional expressions 
DEBUG: remove interactions: u s w+ at order: QED=1 
Expand Down Expand Up @@ -150,7 +150,7 @@ INFO: Checking for minimal orders which gives processes.
INFO: Please specify coupling orders to bypass this step.
INFO: Trying process: e+ e- > mu+ mu- WEIGHTED<=4 @1
INFO: Process has 2 diagrams
1 processes with 2 diagrams generated in 0.004 s
1 processes with 2 diagrams generated in 0.005 s
Total: 1 processes with 2 diagrams
output standalone_cudacpp ../TMPOUT/CODEGEN_cudacpp_ee_mumu
Output will be done with PLUGIN: CUDACPP_OUTPUT
Expand All @@ -169,13 +169,13 @@ INFO: Creating files in directory /data/avalassi/GPU2023/madgraph4gpuX/MG5aMC/TM
FileWriter <class 'MG5aMC_PLUGIN.CUDACPP_OUTPUT.model_handling.PLUGIN_CPPWriter'> for /data/avalassi/GPU2023/madgraph4gpuX/MG5aMC/TMPOUT/CODEGEN_cudacpp_ee_mumu/SubProcesses/P1_Sigma_sm_epem_mupmum/./CPPProcess.h
FileWriter <class 'MG5aMC_PLUGIN.CUDACPP_OUTPUT.model_handling.PLUGIN_CPPWriter'> for /data/avalassi/GPU2023/madgraph4gpuX/MG5aMC/TMPOUT/CODEGEN_cudacpp_ee_mumu/SubProcesses/P1_Sigma_sm_epem_mupmum/./CPPProcess.cc
INFO: Created files CPPProcess.h and CPPProcess.cc in directory /data/avalassi/GPU2023/madgraph4gpuX/MG5aMC/TMPOUT/CODEGEN_cudacpp_ee_mumu/SubProcesses/P1_Sigma_sm_epem_mupmum/.
Generated helas calls for 1 subprocesses (2 diagrams) in 0.004 s
Generated helas calls for 1 subprocesses (2 diagrams) in 0.003 s
ALOHA: aloha starts to compute helicity amplitudes
ALOHA: aloha creates FFV1 routines
ALOHA: aloha creates FFV2 routines
ALOHA: aloha creates FFV4 routines
ALOHA: aloha creates FFV2_4 routines
ALOHA: aloha creates 4 routines in 0.265 s
ALOHA: aloha creates 4 routines in 0.267 s
<class 'aloha.create_aloha.AbstractRoutine'> FFV1
<class 'aloha.create_aloha.AbstractRoutine'> FFV1
<class 'aloha.create_aloha.AbstractRoutine'> FFV2
Expand All @@ -194,7 +194,7 @@ INFO: Created files Parameters_sm.h and Parameters_sm.cc in directory
INFO: /data/avalassi/GPU2023/madgraph4gpuX/MG5aMC/TMPOUT/CODEGEN_cudacpp_ee_mumu/src/. and /data/avalassi/GPU2023/madgraph4gpuX/MG5aMC/TMPOUT/CODEGEN_cudacpp_ee_mumu/src/.
quit

real 0m0.659s
user 0m0.600s
sys 0m0.042s
Code generation completed in 0 seconds
real 0m0.781s
user 0m0.590s
sys 0m0.053s
Code generation completed in 1 seconds
5 changes: 4 additions & 1 deletion epochX/cudacpp/ee_mumu.sa/src/HelAmps_sm.h
Original file line number Diff line number Diff line change
Expand Up @@ -478,7 +478,10 @@ namespace mg5amcCpu
}
else
{
const fptype emp = pvec0 / ( vmass * pp );
//printf( "DEBUG1011 (before emp): pvec0=%f vmass=%f pp=%f vmass*pp=%f\n", pvec0, vmass, pp, vmass * pp );
//const fptype emp = pvec / ( vmass * pp ); // this may give a FPE #1011 (why?! maybe when vmass=+-epsilon?)
const fptype emp = pvec0 / vmass / pp; // workaround for FPE #1011
//printf( "DEBUG1011 (after emp): emp=%f\n", emp );
vc[2] = cxmake( hel0 * pp / vmass, 0. );
vc[5] = cxmake( hel0 * pvec3 * emp + hel * pt / pp * sqh, 0. );
if( pt != 0. )
Expand Down
12 changes: 6 additions & 6 deletions epochX/cudacpp/gg_tt.mad/CODEGEN_mad_gg_tt_log.txt
Original file line number Diff line number Diff line change
Expand Up @@ -58,7 +58,7 @@ generate g g > t t~
No model currently active, so we import the Standard Model
INFO: load particles
INFO: load vertices
DEBUG: model prefixing takes 0.006418943405151367 
DEBUG: model prefixing takes 0.0059719085693359375 
INFO: Restrict model sm with file models/sm/restrict_default.dat .
DEBUG: Simplifying conditional expressions 
DEBUG: remove interactions: u s w+ at order: QED=1 
Expand Down Expand Up @@ -181,12 +181,12 @@ INFO: Finding symmetric diagrams for subprocess group gg_ttx
DEBUG: iconfig_to_diag =  {1: 1, 2: 2, 3: 3} [model_handling.py at line 1551] 
DEBUG: diag_to_iconfig =  {1: 1, 2: 2, 3: 3} [model_handling.py at line 1552] 
Generated helas calls for 1 subprocesses (3 diagrams) in 0.006 s
Wrote files for 10 helas calls in 0.071 s
Wrote files for 10 helas calls in 0.072 s
DEBUG: self.vector_size =  32 [export_v4.py at line 7023] 
ALOHA: aloha starts to compute helicity amplitudes
ALOHA: aloha creates VVV1 set of routines with options: P0
ALOHA: aloha creates FFV1 routines
ALOHA: aloha creates 2 routines in 0.153 s
ALOHA: aloha creates 2 routines in 0.150 s
ALOHA: aloha starts to compute helicity amplitudes
ALOHA: aloha creates VVV1 set of routines with options: P0
ALOHA: aloha creates FFV1 routines
Expand Down Expand Up @@ -226,9 +226,9 @@ Type "launch" to generate events from this process, or see
Run "open index.html" to see more information about this process.
quit

real 0m1.900s
user 0m1.626s
sys 0m0.264s
real 0m1.997s
user 0m1.613s
sys 0m0.278s
Code generation completed in 2 seconds
************************************************************
* *
Expand Down
10 changes: 5 additions & 5 deletions epochX/cudacpp/gg_tt.sa/CODEGEN_cudacpp_gg_tt_log.txt
Original file line number Diff line number Diff line change
Expand Up @@ -58,7 +58,7 @@ generate g g > t t~
No model currently active, so we import the Standard Model
INFO: load particles
INFO: load vertices
DEBUG: model prefixing takes 0.006226539611816406 
DEBUG: model prefixing takes 0.006254673004150391 
INFO: Restrict model sm with file models/sm/restrict_default.dat .
DEBUG: Simplifying conditional expressions 
DEBUG: remove interactions: u s w+ at order: QED=1 
Expand Down Expand Up @@ -174,7 +174,7 @@ Generated helas calls for 1 subprocesses (3 diagrams) in 0.006 s
ALOHA: aloha starts to compute helicity amplitudes
ALOHA: aloha creates VVV1 set of routines with options: P0
ALOHA: aloha creates FFV1 routines
ALOHA: aloha creates 2 routines in 0.143 s
ALOHA: aloha creates 2 routines in 0.144 s
<class 'aloha.create_aloha.AbstractRoutine'> VVV1
<class 'aloha.create_aloha.AbstractRoutine'> FFV1
<class 'aloha.create_aloha.AbstractRoutine'> FFV1
Expand All @@ -189,7 +189,7 @@ INFO: Created files Parameters_sm.h and Parameters_sm.cc in directory
INFO: /data/avalassi/GPU2023/madgraph4gpuX/MG5aMC/TMPOUT/CODEGEN_cudacpp_gg_tt/src/. and /data/avalassi/GPU2023/madgraph4gpuX/MG5aMC/TMPOUT/CODEGEN_cudacpp_gg_tt/src/.
quit

real 0m0.559s
user 0m0.480s
sys 0m0.044s
real 0m0.532s
user 0m0.478s
sys 0m0.045s
Code generation completed in 0 seconds
5 changes: 4 additions & 1 deletion epochX/cudacpp/gg_tt.sa/src/HelAmps_sm.h
Original file line number Diff line number Diff line change
Expand Up @@ -478,7 +478,10 @@ namespace mg5amcCpu
}
else
{
const fptype emp = pvec0 / ( vmass * pp );
//printf( "DEBUG1011 (before emp): pvec0=%f vmass=%f pp=%f vmass*pp=%f\n", pvec0, vmass, pp, vmass * pp );
//const fptype emp = pvec / ( vmass * pp ); // this may give a FPE #1011 (why?! maybe when vmass=+-epsilon?)
const fptype emp = pvec0 / vmass / pp; // workaround for FPE #1011
//printf( "DEBUG1011 (after emp): emp=%f\n", emp );
vc[2] = cxmake( hel0 * pp / vmass, 0. );
vc[5] = cxmake( hel0 * pvec3 * emp + hel * pt / pp * sqh, 0. );
if( pt != 0. )
Expand Down
20 changes: 10 additions & 10 deletions epochX/cudacpp/gg_tt01g.mad/CODEGEN_mad_gg_tt01g_log.txt
Original file line number Diff line number Diff line change
Expand Up @@ -58,7 +58,7 @@ generate g g > t t~
No model currently active, so we import the Standard Model
INFO: load particles
INFO: load vertices
DEBUG: model prefixing takes 0.006343364715576172 
DEBUG: model prefixing takes 0.006289482116699219 
INFO: Restrict model sm with file models/sm/restrict_default.dat .
DEBUG: Simplifying conditional expressions 
DEBUG: remove interactions: u s w+ at order: QED=1 
Expand Down Expand Up @@ -159,7 +159,7 @@ INFO: Please specify coupling orders to bypass this step.
INFO: Trying coupling order WEIGHTED<=3: WEIGTHED IS QCD+2*QED
INFO: Trying process: g g > t t~ g WEIGHTED<=3 @2
INFO: Process has 16 diagrams
1 processes with 16 diagrams generated in 0.020 s
1 processes with 16 diagrams generated in 0.019 s
Total: 2 processes with 19 diagrams
output madevent_simd ../TMPOUT/CODEGEN_mad_gg_tt01g --hel_recycling=False --vector_size=32
Output will be done with PLUGIN: CUDACPP_OUTPUT
Expand Down Expand Up @@ -201,23 +201,23 @@ INFO: Finding symmetric diagrams for subprocess group gg_ttx
DEBUG: len(subproc_diagrams_for_config) =  3 [model_handling.py at line 1527] 
DEBUG: iconfig_to_diag =  {1: 1, 2: 2, 3: 3} [model_handling.py at line 1551] 
DEBUG: diag_to_iconfig =  {1: 1, 2: 2, 3: 3} [model_handling.py at line 1552] 
Generated helas calls for 2 subprocesses (19 diagrams) in 0.045 s
Wrote files for 46 helas calls in 0.191 s
Generated helas calls for 2 subprocesses (19 diagrams) in 0.042 s
Wrote files for 46 helas calls in 0.189 s
DEBUG: self.vector_size =  32 [export_v4.py at line 7023] 
ALOHA: aloha starts to compute helicity amplitudes
ALOHA: aloha creates VVV1 routines
ALOHA: aloha creates FFV1 routines
ALOHA: aloha creates VVVV1 set of routines with options: P0
ALOHA: aloha creates VVVV3 set of routines with options: P0
ALOHA: aloha creates VVVV4 set of routines with options: P0
ALOHA: aloha creates 5 routines in 0.326 s
ALOHA: aloha creates 5 routines in 0.338 s
ALOHA: aloha starts to compute helicity amplitudes
ALOHA: aloha creates VVV1 routines
ALOHA: aloha creates FFV1 routines
ALOHA: aloha creates VVVV1 set of routines with options: P0
ALOHA: aloha creates VVVV3 set of routines with options: P0
ALOHA: aloha creates VVVV4 set of routines with options: P0
ALOHA: aloha creates 10 routines in 0.306 s
ALOHA: aloha creates 10 routines in 0.311 s
<class 'aloha.create_aloha.AbstractRoutine'> VVV1
<class 'aloha.create_aloha.AbstractRoutine'> VVV1
<class 'aloha.create_aloha.AbstractRoutine'> FFV1
Expand Down Expand Up @@ -265,10 +265,10 @@ Type "launch" to generate events from this process, or see
Run "open index.html" to see more information about this process.
quit

real 0m2.598s
user 0m2.282s
sys 0m0.314s
Code generation completed in 3 seconds
real 0m2.618s
user 0m2.304s
sys 0m0.310s
Code generation completed in 2 seconds
************************************************************
* *
* W E L C O M E to *
Expand Down
5 changes: 4 additions & 1 deletion epochX/cudacpp/gg_tt01g.mad/src/HelAmps_sm.h
Original file line number Diff line number Diff line change
Expand Up @@ -478,7 +478,10 @@ namespace mg5amcCpu
}
else
{
const fptype emp = pvec0 / ( vmass * pp );
//printf( "DEBUG1011 (before emp): pvec0=%f vmass=%f pp=%f vmass*pp=%f\n", pvec0, vmass, pp, vmass * pp );
//const fptype emp = pvec / ( vmass * pp ); // this may give a FPE #1011 (why?! maybe when vmass=+-epsilon?)
const fptype emp = pvec0 / vmass / pp; // workaround for FPE #1011
//printf( "DEBUG1011 (after emp): emp=%f\n", emp );
vc[2] = cxmake( hel0 * pp / vmass, 0. );
vc[5] = cxmake( hel0 * pvec3 * emp + hel * pt / pp * sqh, 0. );
if( pt != 0. )
Expand Down
20 changes: 10 additions & 10 deletions epochX/cudacpp/gg_ttg.mad/CODEGEN_mad_gg_ttg_log.txt
Original file line number Diff line number Diff line change
Expand Up @@ -58,7 +58,7 @@ generate g g > t t~ g
No model currently active, so we import the Standard Model
INFO: load particles
INFO: load vertices
DEBUG: model prefixing takes 0.006256103515625 
DEBUG: model prefixing takes 0.0062618255615234375 
INFO: Restrict model sm with file models/sm/restrict_default.dat .
DEBUG: Simplifying conditional expressions 
DEBUG: remove interactions: u s w+ at order: QED=1 
Expand Down Expand Up @@ -151,7 +151,7 @@ INFO: Please specify coupling orders to bypass this step.
INFO: Trying coupling order WEIGHTED<=3: WEIGTHED IS QCD+2*QED
INFO: Trying process: g g > t t~ g WEIGHTED<=3 @1
INFO: Process has 16 diagrams
1 processes with 16 diagrams generated in 0.021 s
1 processes with 16 diagrams generated in 0.022 s
Total: 1 processes with 16 diagrams
output madevent_simd ../TMPOUT/CODEGEN_mad_gg_ttg --hel_recycling=False --vector_size=32
Output will be done with PLUGIN: CUDACPP_OUTPUT
Expand Down Expand Up @@ -180,23 +180,23 @@ INFO: Finding symmetric diagrams for subprocess group gg_ttxg
DEBUG: len(subproc_diagrams_for_config) =  15 [model_handling.py at line 1527] 
DEBUG: iconfig_to_diag =  {1: 1, 2: 2, 3: 3, 4: 4, 5: 5, 6: 6, 7: 7, 8: 8, 9: 9, 10: 10, 11: 11, 12: 12, 13: 13, 14: 14, 15: 15} [model_handling.py at line 1551] 
DEBUG: diag_to_iconfig =  {1: 1, 2: 2, 3: 3, 4: 4, 5: 5, 6: 6, 7: 7, 8: 8, 9: 9, 10: 10, 11: 11, 12: 12, 13: 13, 14: 14, 15: 15} [model_handling.py at line 1552] 
Generated helas calls for 1 subprocesses (16 diagrams) in 0.037 s
Wrote files for 36 helas calls in 0.120 s
Generated helas calls for 1 subprocesses (16 diagrams) in 0.038 s
Wrote files for 36 helas calls in 0.123 s
DEBUG: self.vector_size =  32 [export_v4.py at line 7023] 
ALOHA: aloha starts to compute helicity amplitudes
ALOHA: aloha creates VVV1 routines
ALOHA: aloha creates FFV1 routines
ALOHA: aloha creates VVVV1 set of routines with options: P0
ALOHA: aloha creates VVVV3 set of routines with options: P0
ALOHA: aloha creates VVVV4 set of routines with options: P0
ALOHA: aloha creates 5 routines in 0.362 s
ALOHA: aloha creates 5 routines in 1.397 s
ALOHA: aloha starts to compute helicity amplitudes
ALOHA: aloha creates VVV1 routines
ALOHA: aloha creates FFV1 routines
ALOHA: aloha creates VVVV1 set of routines with options: P0
ALOHA: aloha creates VVVV3 set of routines with options: P0
ALOHA: aloha creates VVVV4 set of routines with options: P0
ALOHA: aloha creates 10 routines in 0.311 s
ALOHA: aloha creates 10 routines in 0.315 s
<class 'aloha.create_aloha.AbstractRoutine'> VVV1
<class 'aloha.create_aloha.AbstractRoutine'> VVV1
<class 'aloha.create_aloha.AbstractRoutine'> FFV1
Expand Down Expand Up @@ -237,10 +237,10 @@ Type "launch" to generate events from this process, or see
Run "open index.html" to see more information about this process.
quit

real 0m2.525s
user 0m2.188s
sys 0m0.271s
Code generation completed in 3 seconds
real 0m3.568s
user 0m2.185s
sys 0m0.276s
Code generation completed in 4 seconds
************************************************************
* *
* W E L C O M E to *
Expand Down
5 changes: 4 additions & 1 deletion epochX/cudacpp/gg_ttg.mad/src/HelAmps_sm.h
Original file line number Diff line number Diff line change
Expand Up @@ -478,7 +478,10 @@ namespace mg5amcCpu
}
else
{
const fptype emp = pvec0 / ( vmass * pp );
//printf( "DEBUG1011 (before emp): pvec0=%f vmass=%f pp=%f vmass*pp=%f\n", pvec0, vmass, pp, vmass * pp );
//const fptype emp = pvec / ( vmass * pp ); // this may give a FPE #1011 (why?! maybe when vmass=+-epsilon?)
const fptype emp = pvec0 / vmass / pp; // workaround for FPE #1011
//printf( "DEBUG1011 (after emp): emp=%f\n", emp );
vc[2] = cxmake( hel0 * pp / vmass, 0. );
vc[5] = cxmake( hel0 * pvec3 * emp + hel * pt / pp * sqh, 0. );
if( pt != 0. )
Expand Down
10 changes: 5 additions & 5 deletions epochX/cudacpp/gg_ttg.sa/CODEGEN_cudacpp_gg_ttg_log.txt
Original file line number Diff line number Diff line change
Expand Up @@ -58,7 +58,7 @@ generate g g > t t~ g
No model currently active, so we import the Standard Model
INFO: load particles
INFO: load vertices
DEBUG: model prefixing takes 0.006231069564819336 
DEBUG: model prefixing takes 0.006242036819458008 
INFO: Restrict model sm with file models/sm/restrict_default.dat .
DEBUG: Simplifying conditional expressions 
DEBUG: remove interactions: u s w+ at order: QED=1 
Expand Down Expand Up @@ -177,7 +177,7 @@ ALOHA: aloha creates FFV1 routines
ALOHA: aloha creates VVVV1 set of routines with options: P0
ALOHA: aloha creates VVVV3 set of routines with options: P0
ALOHA: aloha creates VVVV4 set of routines with options: P0
ALOHA: aloha creates 5 routines in 0.330 s
ALOHA: aloha creates 5 routines in 0.326 s
<class 'aloha.create_aloha.AbstractRoutine'> VVV1
<class 'aloha.create_aloha.AbstractRoutine'> VVV1
<class 'aloha.create_aloha.AbstractRoutine'> FFV1
Expand All @@ -197,7 +197,7 @@ INFO: Created files Parameters_sm.h and Parameters_sm.cc in directory
INFO: /data/avalassi/GPU2023/madgraph4gpuX/MG5aMC/TMPOUT/CODEGEN_cudacpp_gg_ttg/src/. and /data/avalassi/GPU2023/madgraph4gpuX/MG5aMC/TMPOUT/CODEGEN_cudacpp_gg_ttg/src/.
quit

real 0m0.806s
user 0m0.706s
sys 0m0.070s
real 0m0.777s
user 0m0.714s
sys 0m0.058s
Code generation completed in 1 seconds
5 changes: 4 additions & 1 deletion epochX/cudacpp/gg_ttg.sa/src/HelAmps_sm.h
Original file line number Diff line number Diff line change
Expand Up @@ -478,7 +478,10 @@ namespace mg5amcCpu
}
else
{
const fptype emp = pvec0 / ( vmass * pp );
//printf( "DEBUG1011 (before emp): pvec0=%f vmass=%f pp=%f vmass*pp=%f\n", pvec0, vmass, pp, vmass * pp );
//const fptype emp = pvec / ( vmass * pp ); // this may give a FPE #1011 (why?! maybe when vmass=+-epsilon?)
const fptype emp = pvec0 / vmass / pp; // workaround for FPE #1011
//printf( "DEBUG1011 (after emp): emp=%f\n", emp );
vc[2] = cxmake( hel0 * pp / vmass, 0. );
vc[5] = cxmake( hel0 * pvec3 * emp + hel * pt / pp * sqh, 0. );
if( pt != 0. )
Expand Down
Loading

0 comments on commit dcf3a99

Please sign in to comment.