MYNN SFC OpenACC Acceleration #1891
Conversation
I'm rerunning RTs on Hera since the first set apparently ran into memory issues.
Sounds good @grantfirl. Keep us posted. Once those Hera logs are ready, I think we can start working this PR.
@zach1221 OK, RTs on Hera finished successfully, and I attached the log in the description.
ORTs passed. Jenkins-ci logs attached.
Automated RT Failure Notification
Jet system issue. The system is very crowded. Skip Jet!
Sure. I should have the Hera logs ready soon.
The Jet queue is not moving even today, so we confirm skipping Jet. @grantfirl, we can start the merging process. @zach1221 FYI
I've followed up with Grant on ccpp-physics PR #97.
@zach1221 Thanks!
PR Author Checklist:
I have linked PRs from all sub-components involved in the section below.
I am confirming reviews are completed in ALL sub-component PRs.
I have run the full RT suite on either Hera or Cheyenne AND have attached the log to this PR below this line:
RegressionTests_hera.log
I have added the list of all failed regression tests to the "Anticipated changes" section.
I have filled out all sections of the template.
Description
See ufs-community/ccpp-physics#97 for a complete description.
Overview from the developer:
With minimal changes to the original code, the MYNN surface scheme has been enhanced with OpenACC directives that enable offloading of its computation to OpenACC-supported accelerator devices (e.g., NVIDIA GPUs). Because the scheme loops repeatedly over independent vertical columns, its computational pattern maps well to GPU hardware, where many iterations of each loop can run in parallel. Data movement has been optimized so that transfers between host and device memory are kept to a minimum, since such transfers are a significant source of latency when offloading to accelerator devices. Performance on a GPU ranged from a 3.3x slowdown to a 41.9x speedup relative to CPU execution (see the Performance section for more information).
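As a minimal sketch of the pattern described above (not code from this PR; the array names, sizes, and the toy column computation are illustrative), the example below shows how independent horizontal columns are offloaded with an OpenACC parallel loop, and how a single enclosing data region keeps arrays resident on the device so host-to-device transfers happen once rather than inside every loop:

```fortran
! Illustrative sketch of the OpenACC strategy: offload independent
! columns and hoist data movement out of the compute loops.
program columns_offload
  implicit none
  integer, parameter :: ni = 1024   ! number of horizontal columns (illustrative)
  integer, parameter :: nk = 64     ! number of vertical levels (illustrative)
  real :: t(ni, nk), flux(ni)
  integer :: i, k

  t = 300.0

  ! One enclosing data region: inputs copied to the device once,
  ! results copied back once, instead of per-kernel transfers.
  !$acc data copyin(t) copyout(flux)

  ! Each column i is independent, so the i loop parallelizes cleanly
  ! across GPU threads; the k loop is a serial sum within each column.
  !$acc parallel loop
  do i = 1, ni
     flux(i) = 0.0
     do k = 1, nk
        flux(i) = flux(i) + 0.001 * t(i, k)
     end do
  end do
  !$acc end parallel loop

  !$acc end data

  print *, 'flux(1) =', flux(1)
end program columns_offload
```

Compiled with an OpenACC-capable compiler (e.g., nvfortran -acc), the same source still builds and runs serially on a CPU when the directives are ignored, which is why this approach requires only minimal changes to the original scheme.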
Linked Issues and Pull Requests
Associated UFSWM Issue to close
Subcomponent Pull Requests
NOAA-EMC/fv3atm#693
ufs-community/ccpp-physics#97
Blocking Dependencies
None
Subcomponents involved:
Anticipated Changes
Input data
Regression Tests:
Tests affected by changes in this PR:
Libraries
Code Managers Log
Testing Log: