Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Matching numeric files with ICU Admission ids using time stamp from numeric files #123

Open
phaniparsa opened this issue Jul 14, 2020 · 1 comment

Comments

@phaniparsa
Copy link

Hello Team,

I'm currently working on a project on MIMIC-III matched subset and trying to link numeric record files with ICU stays from clinical aspect.

Below is the logic I'm using to combine numeric records with ICU Stays:
Step#1: Extracting SUBJECT_ID and DateTime stamp from numeric files.
Step#2: Joining the [SUBJECT_ID, DateTime] with ICU Stays clinical file on SUBJECT_ID.
Step#3: For each row in the merged data, I'm further checking if the DateTime on file is in range of an ICUSTAY INTIME and OUTTIME. If yes, I'm retrieving those. If not, I'm not considering the row.

After completing this process, I'm ending up with 15860 files out of 22247. I'm wondering if this approach is valid or not, as nearly 7000 files are missed in this way.

Also, as per: https://archive.physionet.org/mimic2/mimic2_matching.shtml , there is a chance for some records to not find a match. So I'm a bit curious if I can validate the numeric files based on the ICUSTAYS IN and OUTTIMEs, and retrieve only those files which lie in the [INTIME, OUTTIME] interval. Could you please confirm if this approach is fine and it is acceptable to not find matches for nearly 7000 files?

@alistairewj
Copy link
Member

The approach is what I would do - and I think I similarly found around ~70-75% of icustay_id could be matched. You may want to add some fuzziness to the intime/outtime windows (+- 6 hours perhaps), as those times aren't exact (sometimes patients are put on the monitoring half an hour before they are "admitted" to the ICU).

The waveform matching process was done retrospectively so it was pretty difficult to do and unfortunately wasn't perfect for all patients. I don't actually know why so many files aren't matched but at least what you have found is roughly consistent with what I've found.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants