Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

rasdaemon: Add page offline support for cxl memory #182

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

thsrini
Copy link

@thsrini thsrini commented Nov 20, 2024

CXL Type 3 device implements a threshold for corrected errors as described in CXL 3.1 specification section 8.2.9.2.1.2 and 8.2.9.9.11.3. Device can set the threshold field in the DRAM event descriptor when it detects corrected errors that meet or exceed the threshold value.

This patch is intended to offline pages for corrected memory errors when the device sets the threshold in the DRAM event descriptor. This helps prevent corrected errors from becoming uncorrected.

Record the hpa for given dpa, then do pageoffline for hpa when corrected errors threshold is set.

CXL Type 3 device implements a threshold for corrected errors as described in
CXL 3.1 specification section 8.2.9.2.1.2 and 8.2.9.9.11.3.
Device can set the threshold field in the DRAM event descriptor when
it detects corrected errors that meet or exceed the threshold value.

This patch is intended to offline pages for corrected memory errors when the
device sets the threshold in the DRAM event descriptor.
This helps prevent corrected errors from becoming uncorrected.

Record the hpa for given dpa, then do pageoffline for hpa when corrected
errors threshold is set.

Signed-off-by: Srinivasulu Thanneeru <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant