-
Notifications
You must be signed in to change notification settings - Fork 20
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Omics Integrator 2 Testing Error #133
Comments
This may not be a problem with the Omics Integrator 2 container or run function but rather have to do with its input data and parameters. I modified the test function to use the parameters (b=4, g=0) and input data from the workflow. I was able to get a complete output file with the in_solution column when running the
I'll need to run more tests or read through its source code to understand why this column is sometimes missing. |
I pushed the branch I had started looking at where Omics Integrator 2 sets I was testing |
When I was grid searching the parameters for the EGFR dataset, oi2 kept breaking due to this error. For parameter tuning, it will be helpful to find a solution for this issue soon. |
Examples of headers from the raw pathway file
|
quick fix, if the first 3 column headers are there, then write an empty file and say it is corrupted then try to figure what the code is doing |
I resumed investgating the Omics Integrator 2 adds that information here using In addition, if Omics Integrator 2 adds None of this explains why |
#182 Pull request for error |
The cost is always missing when the raw pathway is empty. |
For the in_solution and cost swapping situation, this might be a problem with the version of python used as well. For python 3.7 and newer, dictionaries are guaranteed insertion order. For 3.6, it is not officially guaranteed by the language specification that insertion order is guaranteed. So, there is a chance that Python 3.6 could be potentially changing the order. |
Based on my understanding, if the forest is empty, the code will attempt to iterate through the edges, but since there are none, no exception will be raised, and the function will simply end without making any changes. Therefore, in_solution will never be added as an edge attribute. |
Still need to figure out what is an empty file for OI2. Completely empty files vs files for 'protein1 protein2 cost' that also are technically empty because the forest is empty. |
#182 has been merged. It looks like you want to leave it open until we understand the |
When I was writing the tests for parse outputs, I kept getting an error on this line of the code for oi2: df = df[df['in_solution'] == True] # Check whether this column can be empty before revising this line
I was taking the output from the oi2 test suite, but it was failing because it was missing the 'in_solution' column, yet the column 'in_solution' appears in the raw-pathways for oi2 when SPRAS is run over the datasets.
There might be an issue with the test suite or merely the result. I haven't checked it out, but it's something we should look at because it's not a current test.
The text was updated successfully, but these errors were encountered: