Commit

minor fixes
Katy Sadowski committed Oct 24, 2024
1 parent 4f4c416 commit 4a2e2aa
Showing 6 changed files with 22 additions and 23 deletions.
15 changes: 7 additions & 8 deletions docs/articles/checks/plausibleGenderUseDescendants.html

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

12 changes: 6 additions & 6 deletions docs/articles/checks/plausibleUnitConceptIds.html

2 changes: 1 addition & 1 deletion docs/articles/index.html

2 changes: 1 addition & 1 deletion docs/pkgdown.yml
@@ -34,7 +34,7 @@ articles:
standardConceptRecordCompleteness: checks/standardConceptRecordCompleteness.html
Thresholds: Thresholds.html
withinVisitDates: checks/withinVisitDates.html
-last_built: 2024-10-24T13:49Z
+last_built: 2024-10-24T14:34Z
urls:
reference: https://ohdsi.github.io/DataQualityDashboard/reference
article: https://ohdsi.github.io/DataQualityDashboard/articles
6 changes: 3 additions & 3 deletions vignettes/checks/plausibleGenderUseDescendants.Rmd
@@ -1,5 +1,5 @@
---
-title: "plausibleGender"
+title: "plausibleGenderUseDescendants"
author: "Katy Sadowski and Dmytry Dymshyts"
date: "`r Sys.Date()`"
output:
@@ -54,9 +54,9 @@ GROUP BY 1,2,3
```
The above query should help to identify if a mapping issue is the cause of the failure. If the source value and source concept ID are correctly mapped to a standard concept, then the issue may be that the person has the incorrect gender, or that the finding is a true data anomaly. Examples of true anomalies include:

-- Occasional stray code (e.g., due to typo in EHR). This gives an idea how noisy the data is.
+- Occasional stray code (e.g., due to typo in EHR).
- Newborn codes recorded in the mother’s record (e.g., circumcision).
-- Gender reassignment procedures (e.g., penectomy and prostatectomy in patients with acquired (female) gender). **NOTE** that this is technically a violation of the OMOP CDM specification, since the CDM actually requires that the `gender_concept_id` represents the person's sex at birth. For more information on this convention, see https://ohdsi.github.io/Themis/populate_gender_concept_id.html
+- Gender reassignment procedures (e.g., penectomy and prostatectomy in patients with acquired female gender). **NOTE** that this scenario is technically a violation of the OMOP CDM specification, since the CDM requires that the `gender_concept_id` represents a person's sex at birth. For more information on this convention, see https://ohdsi.github.io/Themis/populate_gender_concept_id.html

### ETL Developers

8 changes: 4 additions & 4 deletions vignettes/checks/plausibleUnitConceptIds.Rmd
@@ -11,7 +11,7 @@ output:
## Summary

**Level**: CONCEPT\
-**Context**: Verification\
+**Context**: Validation\
**Category**: Plausibility\
**Subcategory**: Atemporal\
**Severity**: Characterization ✔
@@ -43,7 +43,7 @@ The above issues could either be due to incorrect data in the source system or i
### Violated rows query
```sql
SELECT
-m.unit_concept_id,
+m.unit_concept_id,
m.unit_source_concept_id,
m.unit_source_value,
COUNT(*)
@@ -54,12 +54,12 @@ WHERE m.@cdmFieldName = @conceptId
GROUP BY 1,2,3
```

-Inspect the output of the violated rows query to identify the root cause of the issue. If the `unit_source_value` and/or `unit_source_concept_id` are populated, check them against the list of plausible unit concept IDs to understand if they should have been mapped to one of the plausible standard concepts. If the `unit_source_value` is NULL but the list of plausible unit concept IDs includes -1, then you may need to check your source data to understand whether or not a unit is available in the source.
+Inspect the output of the violated rows query to identify the root cause of the issue. If the `unit_source_value` and/or `unit_source_concept_id` are populated, check them against the list of plausible unit concept IDs to understand if they should have been mapped to one of the plausible standard concepts. If the `unit_source_value` is NULL and the list of plausible unit concept IDs does not include -1, then you may need to check your source data to understand whether or not a unit is available in the source.
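
The triage described in the updated paragraph can be sketched as a small decision function. This is an illustrative sketch only and is not part of DataQualityDashboard: the function name `triage_violated_row`, the returned labels, and the choice to treat a `unit_source_concept_id` of 0 as "not populated" are all assumptions made for the example.

```python
from typing import Optional, Set


def triage_violated_row(
    unit_source_value: Optional[str],
    unit_source_concept_id: Optional[int],
    plausible_unit_concept_ids: Set[int],
) -> str:
    """Suggest a next step for one row of the violated rows query (hypothetical helper)."""
    has_source_value = unit_source_value is not None and unit_source_value.strip() != ""
    # Assumption: NULL and 0 both mean "no source concept recorded".
    has_source_concept = unit_source_concept_id not in (None, 0)

    if has_source_value or has_source_concept:
        # Source information exists: compare it with the plausible list to see
        # whether it should have been mapped to a plausible standard concept.
        return "review_mapping"
    if -1 not in plausible_unit_concept_ids:
        # No source unit and -1 is not on the plausible list: go back to the
        # source data to find out whether a unit is available there.
        return "check_source_for_unit"
    # Case not addressed by the guidance above.
    return "not_covered"
```

For example, a row with `unit_source_value = "mg/dL"` is routed to a mapping review regardless of the plausible list, while a row with no source information is routed to a source-data check only when -1 is absent from the list.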

### ETL Developers

Ensure that all units available in the source data are being pulled into the CDM and mapped correctly to a standard concept ID. If a unit is available in the source and is being correctly populated & mapped in your ETL but is *not* present on the list of plausible unit concept IDs, you should verify whether or not the unit is actually plausible - you may need to consult a clinician to do so. If the unit is plausible for the given measurement, please report this as a DataQualityDashboard bug here: https://github.com/OHDSI/DataQualityDashboard/issues. If the unit is not plausible, do not change it! Instead, you should document the issue for users of the CDM and discuss with your data provider how to handle the data.

### Data Users

-It is generally recommended not to use measurements with implausible units in analyses as it is impossible to determine whether the unit is wrong; the value is wrong; and/or the measurement code is wrong in the source data. If a measurement is missing a `unit_concept_id` due to an ETL issue, and the `unit_source_value` or `unit_source_concept_id` is available, you can utilize these values to perform your analysis. If `unit_source_value` and `unit_source_concept_id` are missing, you may consider consulting with your data provider as to if and when you may be able to infer what the missing unit should be.
+It is generally not recommended to use measurements with implausible units in analyses as it is impossible to determine whether the unit is wrong; the value is wrong; and/or the measurement code is wrong in the source data. If a measurement is missing a `unit_concept_id` due to an ETL issue, and the `unit_source_value` or `unit_source_concept_id` is available, you can utilize these values to perform your analysis. If `unit_source_value` and `unit_source_concept_id` are missing, you may consider consulting with your data provider as to if and when you may be able to infer what the missing unit should be.
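
One way a data user might apply this guidance is to label each measurement before analysis. The sketch below runs against a toy in-memory SQLite table, not a real CDM. The `plausible_unit` helper table, the concept IDs 1001 and 2002, the `handling` labels, and the decision to treat a `unit_concept_id` of NULL or 0 as "missing" are assumptions made for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE measurement (
    measurement_id INTEGER,
    unit_concept_id INTEGER,
    unit_source_concept_id INTEGER,
    unit_source_value TEXT
);
-- Hypothetical helper table holding the plausible unit concept IDs.
CREATE TABLE plausible_unit (unit_concept_id INTEGER);
INSERT INTO plausible_unit VALUES (1001);
INSERT INTO measurement VALUES
    (1, 1001, NULL, 'mg/dL'),  -- plausible unit
    (2, 2002, NULL, 'kg'),     -- implausible unit
    (3, NULL, NULL, 'mg/dL'),  -- unit_concept_id missing, source value available
    (4, NULL, NULL, NULL);     -- no unit information at all
""")

query = """
SELECT m.measurement_id,
       CASE
           WHEN m.unit_concept_id IN (SELECT unit_concept_id FROM plausible_unit)
               THEN 'use'
           WHEN COALESCE(m.unit_concept_id, 0) = 0
                AND (m.unit_source_value IS NOT NULL
                     OR m.unit_source_concept_id IS NOT NULL)
               THEN 'use_source_unit'
           WHEN COALESCE(m.unit_concept_id, 0) = 0
               THEN 'ask_data_provider'
           ELSE 'exclude'
       END AS handling
FROM measurement m
ORDER BY m.measurement_id
"""

# Map each measurement_id to its suggested handling label.
handling = dict(conn.execute(query).fetchall())
print(handling)
```

In a real analysis the plausible list would come from the check's configuration rather than a hand-built table, and the `use_source_unit` rows would still need their source units interpreted before use.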
