[FDS-1797] input graph api #1481

afwillia · 2024-08-28T16:35:10Z

This PR adds a graph_url parameter to the manifest/generate, validate, and submit API endpoints. graph_url is a URL to a pickled data model graph. Supplying it should make the request run faster. It depends on PR1425 and PR1396 which I have already merged into this branch to facilitate development.

graph_url will be added to other endpoints that also accept schema_url, but the three endpoints above are the most relevant to DCA and the current improvement sprint.

Linking #1425 , #1396

… all.

… scan.

…export-pickle

…gic in the CLI function

…t_type

…th is pickle. Also test both parameters are empty.

schematic/utils/io_utils.py

schematic/manifest/generator.py

schematic/schemas/commands.py

linglp · 2024-09-10T22:12:40Z

tests/test_cli.py

+    def test_schema_convert_cli(self, runner, output_path, output_type):
+        model = "tests/data/example.model.csv"
+        label_type = "class_label"
+        expected = 0

-        output_path = helpers.get_data_path("example.model.jsonld")
+        resultOne = runner.invoke(schema, ["convert", model])

-        label_type = "class_label"
+        assert resultOne.exit_code == expected
+        # check output_path file is created then remove it
+        assert os.path.exists(output_path)
+
+        resultTwo = runner.invoke(
+            schema, ["convert", model, "--output_path", output_path]
+        )
+
+        assert resultTwo.exit_code == expected
+        # check output_path file is created then remove it
+        assert os.path.exists(output_path)
+
+        resultThree = runner.invoke(
+            schema, ["convert", model, "--output_type", output_type]
+        )
+
+        assert resultThree.exit_code == expected
+        # check output_path file is created then remove it
+        assert os.path.exists(output_path)
+
+        resultFour = runner.invoke(
+            schema,
+            [
+                "convert",
+                model,
+                "--output_type",
+                output_type,
+                "--output_jsonld",
+                output_path,
+            ],
+        )
+
+        assert resultFour.exit_code == expected
+        # check output_path file is created then remove it
+        assert os.path.exists(output_path)

        result = runner.invoke(
            schema,
            [
                "convert",
-                data_model_csv_path,
+                model,
                "--output_jsonld",
                output_path,
                "--data_model_labels",
                label_type,
            ],
        )

-        assert result.exit_code == 0
+        assert result.exit_code == expected
+        # check output_path file is created then remove it
+        assert os.path.exists(output_path)

-        expected_substr = (
-            "The Data Model was created and saved to " f"'{output_path}' location."
+        resultFive = runner.invoke(
+            schema,
+            [
+                "convert",
+                model,
+                "--output_jsonld",
+                "tests/data/example.model.pickle",
+                "--output_path",
+                "tests/data/example.model.pickle",
+            ],
        )

-        assert expected_substr in result.output
+        assert resultFive.exit_code == expected
+        # check output_path file is created then remove it
+        assert os.path.exists(output_path)
+
+        resultSix = runner.invoke(
+            schema, ["convert", model, "--output_jsonld", "", "--output_path", ""]
+        )
+
+        assert resultSix.exit_code == expected
+        # check output_path file is created then remove it
+        assert os.path.exists(output_path)


1 to Tom's suggestion. Please split this up so that it is more clear what has been tested.

tests/test_metadata.py

sonarqubecloud · 2024-09-10T22:21:51Z

Quality Gate passed

Issues
5 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarCloud

schematic/manifest/generator.py

schematic/visualization/attributes_explorer.py

linglp · 2024-09-11T17:00:28Z

@afwillia another concern that I have about the PR is that I am seeing a lot of:

        if data_model_graph_pickle:
            self.graph_data_model = read_pickle(data_model_graph_pickle)

and also a lot of classes have both graph_data_model parameter and data_model_graph_pickle parameter. If we always get graph_data_model from the pickle file, then wouldn't it make sense to just keep one parameter?

Here's an example of what I meant: #1499

linglp · 2024-09-11T18:59:29Z

@afwillia another point is that if you change parameter create_manifests, I am thinking most likely that you will need to change test_api.py because test_api.py test the process of generating a manifest by hitting the manifest/generate endpoint, and in this case, it needs to test the process of generating a manifest using a pickle file. But I am not seeing any changes totest_api.py. Can changes be added to test_api.py too?

sonarqubecloud · 2024-10-30T03:58:54Z

Quality Gate passed

Issues
5 New issues
0 Accepted issues

Measures
0 Security Hotspots
91.9% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarCloud

thomasyu888 · 2024-11-01T16:15:14Z

Just taking a note here, sophia will reach out to discuss the PR/state of this feature at large prior to merge. Thanks for all the work!

afwillia added 30 commits March 25, 2024 12:33

WIP: add export_as_graph flag to convert CLI command.

2527224

Add convert CLI test for exporting graph as a pickle.

70bfc19

Add export_as_graph to help.py and update the CLI click docs.

5b41aa5

Change export_as_graph to output_format and specify graph, jsonld, or…

76a5567

… all.

Add tests for CLI convert output_type options.

670b5c7

Change text of CLI convert output_type help.

a87d2fe

Test that jsonld is created correctly by convert CLI

07dbd70

Add type hints to convert, clean up formatting issues from sonarcloud…

421c59a

… scan.

Add convert CLI test for not specifying --output_type

2f5abf1

Merge branch 'Sage-Bionetworks:develop' into FDS-1796-schema-convert-…

119cf40

…export-pickle

parameterize the convert CLI test.

c6c5cb8

remove unused test code for the convert CLI.

dd8b2cd

run black on the updated files.

6ec81db

Use logger.info and logger.error instead of click.echo.

bbe6c98

Trim pickle from end of filename, in addition to csv and jsonld.

b99d9eb

add output_path alias for output_jsonld

1189107

Add combination of arguments to convert CLI tests

86f7af8

add export_graph to schema_utils.py to dump a pickle file

d3c3769

use export_graph util to save pickle file instead of handling that lo…

215cc86

…gic in the CLI function

add docstrings to graph_export

242c885

Update tests to run different combinations of output_jsonld and outpu…

a1cfcce

…t_type

Add data_model_graph_pickle to generator

1c67b4a

Add ata_model_graph_pickle to metadata.py

a986b39

Add data_model_graph to attributes_explorer

0390b69

Add data_model_graph_pickle to tangled_tree

37bbf74

Add convert CLI test case where output_jsonld is pickle and output_pa…

441b865

…th is pickle. Also test both parameters are empty.

move tests into one function

fe4fffb

Merge branch 'develop' into FDS-1796-schema-convert-export-pickle

8c64981

Run black on commands.py and schema_utils.py

f20f88b

fix single-letter variable names for pylint error

d56c965

afwillia added 4 commits September 10, 2024 15:09

document schema convert tests

3197abf

turn get_temp_ functions into one single function

abd1630

remove unhelpful comments and whitespace

445aad5

clarify test cases for attributes explorer

0c4fad7

linglp requested changes Sep 10, 2024

View reviewed changes

run black

f0b266d

andrewelamb reviewed Sep 10, 2024

View reviewed changes

schematic/manifest/generator.py Outdated Show resolved Hide resolved

andrewelamb reviewed Sep 10, 2024

View reviewed changes

schematic/manifest/generator.py Outdated Show resolved Hide resolved

andrewelamb reviewed Sep 10, 2024

View reviewed changes

schematic/visualization/attributes_explorer.py Show resolved Hide resolved

Merge branch 'develop' into FDS-1797-input-graph-api

559581c

afwillia requested review from GiaJordan, jaymedina and BWMac as code owners October 28, 2024 19:37

afwillia added 10 commits October 28, 2024 12:38

merge develop updates

9808f2b

simpify message when export graph as pickle.

ec5e121

add tests for read_pickle

5d59890

add error handling and messaging to read_pickle

79ad3d8

add explanation to display label test

763dbdd

remove extra import and update docstring for create_manifests

288431c

add pickle file to test_create_manifests

9ec9ddb

run black

86e1646

run black

d20c1fc

fix pylint issues

5a42f8a

thomasyu888 changed the title ~~Fds 1797 input graph api~~ [FDS-1797] input graph api Nov 2, 2024

thomasyu888 marked this pull request as draft November 29, 2024 00:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FDS-1797] input graph api #1481

[FDS-1797] input graph api #1481

afwillia commented Aug 28, 2024 •

edited by thomasyu888

Loading

linglp Sep 10, 2024

sonarqubecloud bot commented Sep 10, 2024

linglp commented Sep 11, 2024 •

edited

Loading

linglp commented Sep 11, 2024

sonarqubecloud bot commented Oct 30, 2024

thomasyu888 commented Nov 1, 2024

[FDS-1797] input graph api #1481

Are you sure you want to change the base?

[FDS-1797] input graph api #1481

Conversation

afwillia commented Aug 28, 2024 • edited by thomasyu888 Loading

linglp Sep 10, 2024

Choose a reason for hiding this comment

sonarqubecloud bot commented Sep 10, 2024

Quality Gate passed

linglp commented Sep 11, 2024 • edited Loading

linglp commented Sep 11, 2024

sonarqubecloud bot commented Oct 30, 2024

Quality Gate passed

thomasyu888 commented Nov 1, 2024

afwillia commented Aug 28, 2024 •

edited by thomasyu888

Loading

linglp commented Sep 11, 2024 •

edited

Loading