cleaning up measurement/unit pairs #517

dimshitc · 2023-12-22T16:15:26Z

Instead of 2000 measurements, we now check only the 180 most used measurements (as per the JnJ database network). Since the count was small, I curated them manually, so the results are more reliable.
But people can have their own most common measurements, so once this is merged, I'm going to create a wiki page that explains how to submit new measurement-unit pairs, and plausible numbers as well.
The wiki already has this note: "It will eventually house measurement/unit pair definitions and descriptions on plausibleValueLow and plausibleValueHigh values."
People can submit their measurement-unit pairs, with plausible numbers optional, as a table attached to a GitHub issue.
I don't expect these to get as much attention as vocabulary contributions, so we don't need centralized storage for these requests.

codecov · 2023-12-22T16:26:41Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (2883027) 86.42% compared to head (88c2775) 87.30%.
Report is 13 commits behind head on develop.

Additional details and impacted files

@@             Coverage Diff             @@
##           develop     #517      +/-   ##
===========================================
+ Coverage    86.42%   87.30%   +0.87%     
===========================================
  Files           16       16              
  Lines          884      945      +61     
===========================================
+ Hits           764      825      +61     
  Misses         120      120

katy-sadowski · 2023-12-26T22:35:41Z

inst/sql/sql_server/concept_plausible_unit_concept_ids.sql

-		  }:{
-		  m.unit_concept_id NOT IN (@plausibleUnitConceptIds)
-		  }
+		  AND COALESCE (m.unit_concept_id, -1) NOT IN (replace (@plausibleUnitConceptIds, 'NA', '-1')) -- '-1' stands for the cases when unit_concept_id is null


Why don't we put -1 in the threshold file instead of NA to skip this replace? (It will simplify the query, and I also worry about the translation of REPLACE; we'd probably need to add it to the SqlRender translation rules if we want to use it.)

dimshitc · Jan 9, 2024

It was done intentionally, so that in the output you'll see a list of unit concept_ids and NA, while -1 can be confusing.
On the other hand, we can add a note that '-1' stands for a NULL unit being a permitted one, to avoid changes in SqlRender. What do you think?

katy-sadowski · Jan 10, 2024

Makes sense. I do think it'd be better just to use -1 in the threshold file and provide an explanation (in the check descriptions and the documentation). We probably want an explanation either way, and I do think it's better to keep the SQL simple 😃 Thanks!

dimshitc · Jan 12, 2024

I changed it to -1 and added an explanation to the Description files.
It looks like this branch originates from an older DQD version. Should I recreate this branch from the latest develop branch and reapply the changes?

katy-sadowski · Jan 14, 2024

OK great, thank you!! And yes, I was thinking the same regarding the branch. It's probably easiest to move your changes to a new branch based off the latest version of develop, because there are all of these auto-generated docs files whose conflicts will be really hard to resolve.
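
For readers following the NA versus -1 discussion above, here is a minimal sketch of the underlying SQL behavior. It uses Python's built-in sqlite3 module rather than the package's SqlRender-parameterized SQL, and the table contents, unit ids, and the helper name count_violations are made up for illustration. It shows that a NULL unit_concept_id is never flagged by a plain NOT IN, and that with COALESCE a NULL unit is flagged unless -1 appears in the plausible list.

```python
import sqlite3

# Illustrative rows only: (measurement_concept_id, unit_concept_id).
# None models a measurement whose unit_concept_id is NULL.
ROWS = [(1, 8840), (1, 8753), (1, None), (1, 9999)]


def count_violations(plausible_units, use_coalesce=True):
    """Count rows whose unit is outside the plausible list (hypothetical helper)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE measurement ("
        "measurement_concept_id INTEGER, unit_concept_id INTEGER)"
    )
    conn.executemany("INSERT INTO measurement VALUES (?, ?)", ROWS)
    # With COALESCE, a NULL unit is compared as the sentinel value -1.
    unit_expr = (
        "COALESCE(m.unit_concept_id, -1)" if use_coalesce else "m.unit_concept_id"
    )
    placeholders = ", ".join("?" for _ in plausible_units)
    sql = (
        "SELECT COUNT(*) FROM measurement m "
        f"WHERE {unit_expr} NOT IN ({placeholders})"
    )
    (violations,) = conn.execute(sql, list(plausible_units)).fetchone()
    conn.close()
    return violations


print(count_violations([8840, 8753]))                      # NULL unit not permitted
print(count_violations([8840, 8753, -1]))                  # -1 in the list permits NULL
print(count_violations([8840, 8753], use_coalesce=False))  # plain NOT IN skips the NULL row
```

Under these assumptions the three calls print 2, 1 and 1: the unknown unit 9999 is always flagged, the NULL unit is flagged only when COALESCE is used and -1 is absent from the list, and without COALESCE the NULL row drops out because NULL NOT IN (...) evaluates to NULL rather than true.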

Dmitry Dymshyts added 4 commits November 27, 2023 10:30

in measurement/unit checks included 180 most frequently used concepts and units. units were manually curated. denominator now includes cases when source_unit is null

b985708

fix typo

1edde33

- replaced -1 to NA, so it's better readable
- synced the other CDM versions CSVs

224e1b1

Merge branch 'meas_unit_check_clean' of https://github.com/OHDSI/DataQualityDashboard into meas_unit_check_clean # Conflicts: # inst/sql/sql_server/concept_plausible_unit_concept_ids.sql

1b66e64

dimshitc requested review from clairblacketer and katy-sadowski December 22, 2023 16:15

dimshitc changed the base branch from main to develop December 22, 2023 16:17

katy-sadowski reviewed Dec 26, 2023

Dmitry Dymshyts added 2 commits January 12, 2024 08:29

removed NA, simply use -1

b40451e

CDMv53 and CDMv52 fixed as well

88c2775

dimshitc changed the base branch from develop to main January 26, 2024 10:50

dimshitc closed this Feb 6, 2024
