Add a CLI, add some light unit testing, and change config from Python to Yaml/Json #2

mikix · 2023-10-12T19:33:54Z

OK on my first pass at interacting with the chart-review code, I wrote down some possible improvements here: #1

This is the PR to solve them. Specifically:

Add a wrapping CLI so that researchers don't need to touch Python if they don't want to.
- This CLI currently only offers one subcommand: accuracy which does the 3-way accuracy calculation from paper.py in the covid folder. 🤷 it seemed like a reasonable place to start, but we should talk about what top-level operations make sense.
Add some very brief initial unit tests (just one method right now)
Biggest change is converting the config file format from Python to Yaml/Json
- Researchers will touch this, so static config is easier to understand/explain and there's less room to shoot yourself in the foot
- It's awkward to import arbitrary Python from a random project dir safely/wisely
- I'm generally preferring Yaml where possible because you can have comments in it, but config.json will work too.
Bumped minimum python to 3.10 -- just for convenience of using some nicer type hinting it has. (Cumulus ETL requires 3.10, so this doesn't seem crazy to me)

I've split this PR up into different commits, which should hopefully make it easier to read. But there's still a fair bit.

Also deletes a duplicate method and refreshes pyproject.toml

CLI: - New `chart-review` script gets installed along with Python module. - One sub-command right now: `accuracy` which calculates accuracy matrixes across labels for two reviewers and a base third Config: - Switch away from Python config files and towards yaml/json files. - I've added yaml versions of the two studies in the repo, as examples.

This will make it easier for someone just using the python to call it, if they want to.

mikix · 2023-10-12T19:34:15Z

chart_review/commands/accuracy.py

@@ -0,0 +1,41 @@
+"""Methods for high-level accuracy calculations."""


This file is basically a generic version of the calculation in paper.py

comorbidity · 2023-10-13T16:14:49Z

README.md

+Chart Review operates on a project folder that holds your config & data.
+1. Make a new folder.
+2. Export your Label Studio annotations and put that in the folder as `labelstudio-export.json`.
+3. Add a `config.yaml` file (or `config.json`) that looks something like this (read more on this format below):


I like your idea and I strongly prefer config.json over yaml

Ah fair - but note:

Json is technically a subset of yaml. (That is, a yaml parser can also read json)

So what I've done here is use a yaml parser and look for both config.yaml and config.json -- it will read either one

The reason I personally prefer yaml for config files is that you can have comments, which are often very useful for explaining why a config is the way it is (and also json can be annoyingly fussy about stuff like trailing commas, but that's less important than the comments thing)

So the way I made this PR, either yaml or json works - whichever the researcher in question is more comfy with.

How do you feel about that? (Or do you feel like standardizing on a specific syntax is worth disallowing yaml?)

mikix added 4 commits October 11, 2023 13:54

refactor: rename module from chartreview to chart_review

e640a12

Also deletes a duplicate method and refreshes pyproject.toml

ci: add initial test case

eb33d5a

refactor: move accuracy logic to separate file

7ae1ce8

This will make it easier for someone just using the python to call it, if they want to.

mikix commented Oct 12, 2023

View reviewed changes

mikix added 2 commits October 12, 2023 15:35

build: bump minimum python to 3.10

82ea4d6

ci: and bump the CI python too, whoops

2c816ff

comorbidity reviewed Oct 13, 2023

View reviewed changes

Add a config file test suite

06c56d4

mikix merged commit 8aab0c2 into main Oct 13, 2023
1 check passed

mikix deleted the mikix/cli branch October 13, 2023 19:43

mikix mentioned this pull request Oct 13, 2023

Initial robustification thoughts #1

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add a CLI, add some light unit testing, and change config from Python to Yaml/Json #2

Add a CLI, add some light unit testing, and change config from Python to Yaml/Json #2

mikix commented Oct 12, 2023 •

edited

Loading

mikix Oct 12, 2023

comorbidity Oct 13, 2023

mikix Oct 13, 2023

		@@ -0,0 +1,41 @@
		"""Methods for high-level accuracy calculations."""

Add a CLI, add some light unit testing, and change config from Python to Yaml/Json #2

Add a CLI, add some light unit testing, and change config from Python to Yaml/Json #2

Conversation

mikix commented Oct 12, 2023 • edited Loading

mikix Oct 12, 2023

Choose a reason for hiding this comment

comorbidity Oct 13, 2023

Choose a reason for hiding this comment

mikix Oct 13, 2023

Choose a reason for hiding this comment

mikix commented Oct 12, 2023 •

edited

Loading