New metabolic pathway #1361

ioanagherman · 2023-03-16T21:40:40Z

The changes made on this branch correspond to the addition of an external metabolic pathway, where reactions are catalyzed by exogenous genes. The main changes include:

Two additional files have to always be present in every new_gene_data folder (wcEcoli/reconstruction/new_gene_data), metabolites.tsv and metabolic_reactions_external.tsv. These files will contain details about the external metabolic pathway and the corresponding metabolites involved in the pathway.
If the files mentioned above are empty, the new genes will be added to the chromosome, but their corresponding monomer products will not catalyze any reaction.
If the files mentioned above are not empty, the new genes will be added to the chromosome as well as the new (external) reactions. This will trigger a new process called metabolism_external_pathway. The corresponding scripts for the process were added in models/ecoli/processes and dataclasses/ecoli/process.
This process is modelled using Michaelis Menten kinetics and it uses the reactions and parameters specified in the metabolic_reactions_external.tsv file.
The flux listener FBAResults is updated to store the fluxes of the external pathway reactions as well.
Four analysis scripts were created for the external metabolic process:
a.models/ecoli/analysis/single/flux_external_metabolic_pathway.py - creates a plot of the fluxes corresponding
to the external reactions added. This will be one plot for each cell.
b. models/ecoli/analysis/multigen/flux_external_metabolic_pathway.py - creates a plot of the fluxes corresponding
to the external reactions added. This will show all these fluxes for all generations simulated in one variant.
c. models/ecoli/analysis/single/molecules_external_pathway.py - creates plots of molecule counts for each
molecule involved in the external reactions, for each cell separately, from every generation.
d. models/ecoli/analysis/multigen/molecules_external_pathway.py - creates plots of molecule counts for each
molecule involved in the external reactions, for all cells from all generations corresponding to a variant.

ggsun

Hi Ioana,
I'm submitting the first round of comments I had, I haven't looked at the main file in reconstruction, I'll try to have a look at the file tomorrow.

ggsun · 2023-03-28T22:38:44Z

models/ecoli/processes/metabolism_new_pathway.py

I think "external pathway" would be a more appropriate name for this file/class to make it clear that we are treating this pathway separately from the existing metabolic network.

ggsun · 2023-03-28T22:39:40Z

models/ecoli/processes/metabolism_new_pathway.py

+"""
+Metabolism for new(external) pathway
+Metabolism for new pathway sub-model
+
+"""


I think adding good explanation on why this process needs to exist (why we are treating added pathways to be separate from the existing metabolic network) would be good to have here.

ggsun · 2023-03-28T22:40:05Z

models/ecoli/processes/metabolism_new_pathway.py

+Metabolism for new pathway sub-model
+
+"""
+from __future__ import absolute_import, division, print_function


This line is no longer needed after completely transitioning to Python 3.

ggsun · 2023-03-28T22:41:01Z

models/ecoli/processes/metabolism_new_pathway.py

+
+import wholecell.processes.process
+from wholecell.utils import units
+from wholecell.utils.constants import REQUEST_PRIORITY_METABOLISM_NEW


I think it would be okay to just use the existing REQUEST_PRIORITY_METABOLISM value here, since they have the same value anyway. Do you think there will be cases when this value will be different?

ggsun · 2023-03-28T22:41:34Z

models/ecoli/processes/metabolism_new_pathway.py

+from wholecell.utils.constants import REQUEST_PRIORITY_METABOLISM_NEW
+
+
+class MetabolismNewPathway(wholecell.processes.process.Process):


Same comment as the filename. "New" pathway seems a little bit too vague.

ggsun · 2023-03-28T23:55:56Z

reconstruction/ecoli/flat/new_gene_data/vioAE/genes.tsv

+"NG002"	"vioB"	["vioB"]	1257	4253	"+"	["NG002_RNA"]
+"NG003"	"vioC"	["vioC"]	4254	5543	"+"	["NG003_RNA"]
+"NG004"	"vioD"	["vioD"]	5544	6665	"+"	["NG004_RNA"]
+"NG005"	"vioE"	["vioE"]	6666	7241	"+"	["NG005_RNA"]


Same comment about the newline character here, and many of the new vioAE files.

ggsun · 2023-03-28T23:59:42Z

reconstruction/ecoli/flat/new_gene_data/vioAE_meta/metabolic_reactions_new.tsv

@@ -0,0 +1,2 @@
+"id"	"stoichiometry"	"direction"	"catalyzed_by"	"forward_rate"	"reverse_rate"	"kcat"	"km"


What are the purposes of the forward and reverse rates columns?

ggsun · 2023-03-29T00:03:22Z

reconstruction/ecoli/knowledge_base_raw.py

-			# Join datasets
-			for row in added_data:
-				data.append(row)
+			if added_data:


I don't think this change would be necessary if we remove the empty file requirements for the metabolism files?

ggsun · 2023-03-29T00:03:48Z

runscripts/fireworks/fw_queue.py

This change should not be included in this PR.

ggsun · 2023-03-29T00:04:24Z

wholecell/fireworks/nohup.out

All fireworks files should not be part of this PR.

ggsun · 2023-03-29T21:09:13Z