-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CAM KP does not respond to any of ICEES KG-derived input CURIES #101
Comments
Thanks for this identifiers! I've added these identifiers to the brand new Automat-CAM-KP test suite (#111), and here are the results I have:
|
@balhoff Do you have thoughts on figuring out how to plug in the gaps we see here in node coverage? I'm guessing we need new data sources. |
Thanks, @gaurav! While we don't have a 1:1 match between CURIEs, the matches that we do have are representative, with two drugs and two chemical exposures, and will allow us to move this effort along. |
This Swagger example query runs successfully, but it returns 0 results. If I replace the input CURIES with PUBCHEM.COMPOUND:5865 from the table above, the query also runs successfully, but it returns 0 results. I think the Automat example queries are standardized and not tailored to the underlying KGs, so perhaps you can send me an example query that returns results from CAM KP? Thanks!
|
Hi Kara! Sorry about the confusion: that Swagger example query can't currently be configured for individual platers, so we share a single Swagger with all the platers on Automat. That one isn't relevant to us, and has two main problems:
So the following query will work: {
"message": {
"query_graph": {
"nodes": {
"n0": {
"categories": [
"biolink:ChemicalEntity"
]
},
"n1": {
"categories": [
"biolink:GeneOrGeneProduct"
],
"ids": [
"NCBIGene:2099"
]
}
},
"edges": {
"e01": {
"subject": "n0",
"object": "n1",
"predicates": [
"biolink:affects"
]
}
}
}
},
"workflow": [
{
"id": "lookup"
}
]
} |
No confusion, I was aware that the Swagger examples aren't really "examples" for most of the Automats, including cam-kp and icees-kg. Thanks for an actual example query! |
This query returns results when sent directly to automat-icees-kg at https://automat.renci.org/#/.
And this query returns responses when sent directly to automat-cam-kp at https://automat.renci.org/#/.
But this query, while able to run successfully, returns an empty response when sent to WFR at https://translator-workflow-runner.renci.org/docs#/trapi/run_workflow_query_post.
|
This comes from going through ARAs that have strict kp timeouts vs sending queries directly to kps. I also wasn't able to get any results from the WFR, but sending directly to Aragorn with an extended timeout returns a 16.6MB response. 12k results in total. ICEES-KG took 35 seconds to respond to the first hop (normal timeout is 10s) and returned 106 results, and then CAM-KP took 90 seconds to respond with the 12k results. If you want, I can share entire response. |
Thanks, Max. Given your findings, then the revised query below should run when sent to WFR and return results. However, while it runs successfully, it returns an empty KG.
|
Your query doesn't have the extended timeout that I'm able to set directly in Aragorn. So WFR is returning nothing because icees-kg is timed out on the first hop. This is a performance issue, and I'm only able to get results back because I can peek behind the curtain and turn some hidden knobs. |
Oh, I see. That makes sense. In that case, perhaps you can send me the full response? |
Just so everyone is clear, the goal of this effort is three-fold:
|
Also see this GitHub folder and slide 9 in this slide deck. |
Per decision on 01.03.2024: Max will rerun the above queries with extended timeouts in ARAGORN and cache the results. Kara will then test. |
From Meisha, 01/17/2024: Title: Peptide Oxidation Leading to Hypertension Description from the wiki: Here we present the supporting information on an AOP describing how vascular endothelial peptide oxidation leads to hypertension via perturbation of endothelial nitric oxide (NO) bioavailability. The molecular initiating event is oxidation of amino acid (AA) residues on critical peptides of the NO pathway, notably protein kinase B (AKT), guanosine triphosphate cyclohydrolase-1 (GTPCH-1), endothelial nitric oxide synthase (eNOS), and also the cellular ROS scavenger; glutathione. Oxidation of the enzymic components of the pathway lead to reduced expression of the phosphorylated proteins, and protein loss via proteasomal degradation. Oxidation of reduced glutathione to GSSG promotes bonding of GSSG to critical AA residues on eNOS, and the reduced expression of GTPCH-1 reduces bioavailability of tetrahydrobiopterin (BH4), both of which lead to uncoupling of eNOS (reduced NO production, increased superoxide production). The combination of these molecular events lead to reduced bioavailabilty of NO, which in turn reduces the potential for vasodilation and shifts the balance of vascular tone towards vasoconstriction. Repeated perturbation of this pathway via chronic exposure to toxicants, ultimately increases vascular resistance and contributes towards the development of hypertension. |
From Max, 02/05/2024: cam_kp_integration_response.json - CF - ChemicalEntity - GeneOrGeneProduct ChemicalEntity = propranolol https://pubmed.ncbi.nlm.nih.gov/23539159/ https://www.uspharmacist.com/article/advances-in-the-management-of-cystic-fibrosis https://www.journal-of-hepatology.eu/article/S0168-8278(15)00349-9/fulltext |
This PR adds a test suite in Python for CAM-KP-API to cam-pipeline. Some of this code has been moved over from https://github.com/ExposuresProvider/cam-kp-api and the rest has been newly written for this. There are three tests here: - test_api.py: test the Automat-CAM-KP API endpoints. - test_examples.py: tests the example files in `examples/`. - test_curies.py: test a set of CURIEs to see if Automat-CAM-KP has information or not about them. This currently includes failing tests from #101 I tried to move over the integration tests from CAM-KP-API, but I couldn't work out the easiest way to figure out how to move them over as a Scala project. I think these Python tests are easier to read and maintain, but I'm happy to be proved wrong. Closes #94
I took another stab at the CURIEs I couldn't figure out previously, and found three more of them in CAM-KP. Most of these are NodeNorm issues in one way or another, but at least one of them could be fixed by turning on drug conflation when processing CAM-KP. I propose we use the alternate CURIEs I listed below while I try to figure out the NodeNorm issues.
|
…ated-pubchem-compound Updated PubChem identifiers to the correct ones for the test as per #101 (comment)
This issue is to report that CAM KP does not respond to any of the ICEES KG-derived CURIES in this sheet and also appended below. Is this expected behavior? Is this a normalization issue? Is this something else?
The text was updated successfully, but these errors were encountered: