-
Notifications
You must be signed in to change notification settings - Fork 1.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add vital signs mappings for MIMIC-IV v2.0 - SSSOM format #1417
base: main
Are you sure you want to change the base?
Conversation
Thanks @a-chahin! Same comments as: #1418 and #1419 (comment). Also tagging @matentzn in case he would like to add his thoughts. |
mimic:227632,Arctic Sun/Alsius Temp #1 C,skos:closeMatch,loinc:8310-5,Body temperature,HumanCurated,orcid:0000-0001-8822-1884,orcid:0000-0002-9348-9284,0.8, | ||
mimic:227634,Arctic Sun/Alsius Temp #2 C,skos:closeMatch,loinc:8310-5,Body temperature,HumanCurated,orcid:0000-0001-8822-1884,orcid:0000-0002-9348-9284,0.8, | ||
mimic:226707,Height,skos:exactMatch,loinc:8302-2,Body height,HumanCurated,orcid:0000-0001-8822-1884,orcid:0000-0002-9348-9284,1, | ||
mimic:226730,Height (cm),skos:exactMatch,loinc:8302-2,Body height,HumanCurated,orcid:0000-0001-8822-1884,orcid:0000-0002-9348-9284,1, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I made some comments in #1418 so I don't repeat them here, but I am just nothing that there is something odd about using exactMatch for unit-restriced terms mapped to unit-unrestricted terms. Not necessarily wrong, just.. Should be considered!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you @matentzn! Do you think "skos:closeMatch" is a better predicate_id
in this case?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I am not sure. Probably better, but broadMatch
may also be worthy of consideration, if you consider:
height (cm)
---> height
it almost seems like height (cm)
is simply a narrower interpretation of a height. But you may be right as well.. I don't know, sorry!
Thanks again @matentzn |
@danamouk it looks like your new pull request at #1660 is duplicating a lot of the work in this pull request. I think it would be best to:
If Northwestern aren't able to provide their mapping in SSSOM format, then we should, if possible, convert to SSSOM format for them. If there are columns that are difficult to populate, then we can leave them empty and complete them later. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @tompollard for the suggestions. I have added the new mappings for both mimic and northwestern across both loinc and omop to comply with the SSSOM data structure.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- CSV is not a well defined format, I would suggest TSV.
HumanCurated
is from an old version of sssom, usesemapv:ManualMappingCuration
instead. See example here: https://mapping-commons.github.io/sssom/tutorial/#automated-processing-1-creating-an-embedded-sssom-file
Otherwise looks great!
Thanks @matentzn for the feedback! I have updated the mapping justification to |
This looks great! For the future, you may want to consider to set up a proper mapping registry (https://mapping-commons.github.io/sssom/mapping-commons/). This allows you to get features like mapping set validation (schema validation, and more) as part of your pull requests. |
@matentzn, please could you say more about the following comment?
We've always used CSV as our default format and it is fairly well defined as part of RFC 4180. @danamouk, TSV is inconsistent with (1) the other data files in this repository and (2) the data files that you will be releasing as part of the Northwestern project. |
It's not a big deal, but we have always pushed for TSV from everything, because, among others, of the need to escape the (frequent) commas in entity labels.. you can scroll through here for details: https://mapping-commons.github.io/sssom/spec/#serialisation It also has the advantage that standard tools (esp. validators) can process the files more easily, but I guess this is debatable. |
Following the RFC spec by quoting strings (or at least strings with commas) solves the issue of commas in labels. Tabs appear pretty frequently in the strings that we work with, and the benefit of commas over tabs is that they are easier to see! |
Sure,sounds good! If you get make an issue at https://github.com/mapping-commons/sssom/issues explaining why you prefer to, and will, push for CSv, that will be helpful in prioritising respective tool support! Should not be an issue. Sorry about the suggestion wit TSV! |
Thanks, I'm glad you raised this point and it is helpful to talk this through. It's not so much that I think one is better than the other, more that we made a decision to use CSV in the past and I'm trying to understand whether there are good reasons to switch. |
Thanks @tompollard & @matentzn for your suggestions on our initial vital signs mapping! This will be really helpful as we map the rest of the mappings to SSSOM format. I have retained the csv file format and updated the mapping justification to @tompollard if we are happy with the changes, feel free to merge this pull request #1417 , and close #1660 and we can update any issues or changes that come up in another pull request! |
This pull request adds two mapping csv files for vital signs concepts from the
d_items
andchartevents
tables in MIMIC-IV v2.0. The first filechartevents_to_loinc.csv
containsitemid
toLOINC
mappings. The second filechartevents_to_omop.csv
containsitemid
toOMOP
mappings.Both files use the simple standard for sharing ontology mappings
SSSOM
format @sssom