Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adding Header Lines #142

Merged
merged 29 commits into from
Jul 10, 2024
Merged

Adding Header Lines #142

merged 29 commits into from
Jul 10, 2024

Conversation

ntalluri
Copy link
Collaborator

No description provided.

@ntalluri
Copy link
Collaborator Author

ntalluri commented Dec 22, 2023

For some reason I am not able to remove the header line of each files from the cytoscape files.

@ntalluri ntalluri changed the title Adding Header Files Adding Header Lines Dec 22, 2023
@ntalluri
Copy link
Collaborator Author

I still cannot get the header line to be skipped, it might be something on my end version wise, but I'm not sure

@ntalluri
Copy link
Collaborator Author

rebuild and retag the image for docker

@ntalluri ntalluri requested a review from agitter January 19, 2024 00:59
@ntalluri ntalluri self-assigned this Jan 19, 2024
@agitter
Copy link
Collaborator

agitter commented Jan 19, 2024

I resolved a merge conflict that occurred after updating the py4cytoscape image tag.

Were you able to get Cytoscape working now?

@ntalluri
Copy link
Collaborator Author

ntalluri commented Jan 19, 2024 via email

Copy link
Collaborator

@agitter agitter left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The changes look good overall. My initial review focused only on the file updates. I will still test the updated code locally before we merge.

I built the v3 py4cytoscape image and pushed it to DockerHub.

docker-wrappers/Cytoscape/cytoscape_util.py Show resolved Hide resolved
spras/analysis/cytoscape.py Show resolved Hide resolved
spras/analysis/ml.py Outdated Show resolved Hide resolved
spras/analysis/ml.py Outdated Show resolved Hide resolved
spras/domino.py Outdated
edges_df.columns = ['Node1', 'Node2', 'Rank', 'Direction']
edges_df.to_csv(standardized_pathway_file, sep='\t', header=True, index=False)
else:
edges_df.to_csv(standardized_pathway_file, sep='\t', header=None, index=False)
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Will this output a completely blank file? Should we output the header row now instead? We could do that without pandas if it is hard to do with pandas.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Randomly the output file for domino (and for only domino) will add a '\n' to the file when header = True. This was our version of a quick fix to allow for that to not happen.

spras/allpairs.py Outdated Show resolved Hide resolved
@agitter agitter mentioned this pull request Feb 7, 2024
spras/analysis/ml.py Outdated Show resolved Hide resolved
test/ml/test_ml.py Show resolved Hide resolved
@ntalluri
Copy link
Collaborator Author

I've finished the review/comments of the pull request. When you review it again, could you pay special attention to the domino section? I'm not sure that my changes are that good but I was hesitant to make extensive changes to the code.

@ntalluri
Copy link
Collaborator Author

Ready for Code review

@ntalluri
Copy link
Collaborator Author

ntalluri commented Jun 13, 2024

I need to update the contributing guide

@agitter
Copy link
Collaborator

agitter commented Jun 14, 2024

We need to revisit #143 (comment) before merging this because I expect to merge #143 first

@agitter
Copy link
Collaborator

agitter commented Jun 14, 2024

We should add documentation of the standard pathway output format now that it includes rank, direction, and universal header rows. Where would that go? In the doc directory?

It can copy the input file example from input/README.md and add the header row.

@ntalluri
Copy link
Collaborator Author

the doc directory seems like the right place for now.

@agitter
Copy link
Collaborator

agitter commented Jul 4, 2024

@ntalluri I re-reviewed all of the files and pushed a few commits with code cleanup. Can you please review my changes to see if they look okay?

If the GitHub actions tests pass, then I will run the workflow locally to inspect if the output pathways with headers look good. It is hard to be fully confident all of the parse output logic will work as intended with empty and non-empty pathways for all algorithms.

@ntalluri
Copy link
Collaborator Author

ntalluri commented Jul 5, 2024

The changes look good. It seems like the parse output logic is working how I thought it would, but I'm sure what you mean on it not being fully confident on it working.

@agitter
Copy link
Collaborator

agitter commented Jul 5, 2024

I meant that even though the parse output code changes look good, I don't fully remember all of the behaviors of the various pathway reconstruction algorithms. There could be some corner cases I'm forgetting that would affect how they write headers or deal with empty pathways. It's hard to consider all the possibilities in a big change like this.

@ntalluri
Copy link
Collaborator Author

ntalluri commented Jul 5, 2024

One case to consider is if the raw pathway file only containing a header line and nothing else.

It seems like the code already deals with this for omics integrator 2, but I haven't written any test cases for it.

@agitter
Copy link
Collaborator

agitter commented Jul 5, 2024

That's a good example of algorithm-specific behavior. Some algorithms may output an empty file. Others may output a header only. Their output parsers prior to this change should have already handled that though.

Copy link
Collaborator

@agitter agitter left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I ran the config and egfr workflows locally and reviewed the Cytoscape files, raw pathways, and standardized pathways. Everything looked okay.

@agitter agitter merged commit 98d7e35 into Reed-CompBio:master Jul 10, 2024
5 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants