-
Notifications
You must be signed in to change notification settings - Fork 40
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
biosynthesis processes that are single-step in some organisms and multi-step in others #29672
Comments
Aside: this is my technique for initial review:
Using this template Results: The biological process in focus is the thiamine diphosphate biosynthetic process. Thiamine diphosphate (ThDP) is an essential cofactor for several enzymatic reactions critical to cellular metabolism. The gene products associated with this process play roles in either the synthesis or transport of components crucial for ThDP biosynthesis.
The intricate interplay of these gene products outlines the biological context of thiamine and ThDP within cellular metabolism, from uptake and transport to modification into active cofactor form. There's nothing particularly unclear about this model, but it is worth noting that the involvement of THTPA might be seen as more indirect unless explicitly considering organism-specific regulatory needs for ThDP and related metabolites. Overall, this path includes dependent and supportive roles rather than a strict linear A -> B style synthesis route. |
fission yeast GO-CAM curated by @PCarme We would not annotate anything upstream of thiamine to thiamine diphosphate biosynthetic process, and these annotations are not included in the model (although we have lots of inferred annotations to this term which are upstream, (I will query these if we agree) However, we currently have tnr3 annotated to thiamine salvage, which I'm not sure is quite right (I have only used salvage to apply to the output molecule, this would be salvage in the opposite direction). I have been thinking that there is a case for single-step pathway for some functions. Quite often, we have 2 clearly modular pathways and a 'linking reaction'. There seems no other way to describe these, and it seems natural that the MF really is a single-step process. Also, since thiamine diphosphate is the active form of vitamin B1, wouldn't it seem weird not to have a biosynthesis term for it? |
I agree with Val, the 1 step = MF, > 1 step = BP is a misleading rule. |
Except here is an argument, put out for sanity testing, there are not really two different uses of a single term here. Our prototrophic distant ancestors evolved a pathway to synthesize thiamine de novo from simple starting materials. More recent ancestors evolved the ability to consume prototrophs and their products, relieving the selective pressure to maintain the full, metabolically expensive, de novo pathway. As a result, many descendants including us have lost some or all of the molecular functions needed for the de novo process. But, as here, we have retained a few. I don't know the genomics in this case, but it's easy, for other recently truncated processes like purine catabolism, to find pseudogene relics of now-lost function steps, reinforcing the link between the current stub and the ancestral process. Now, how do we label this few- or single-step stub that remains of the multistep de novo process. The founding statement for GO describes it as a "tool for the unification of biology" [over large evolutionary distances]. With that statement in mind, the present-day stub is still thiamine metabolism, viewed in terms of both present day chemistry and evolutionary origin. We need a label that ties it to the large de novo process it is descended from. And setting some minimum amount of the original process that needs to be preserved (here, two or more functions), seems like tidy bookkeeping but not good evolutionary biology. This is not an argument for allowing single-function processes in general, but only for preserving the "process" label in cases where evolution has partly dismantled an ancestral multi-step process. |
There is a pathway boundary issue here. I think that by default we treat metabolism / biosynthesis / catabolism of a chemical entity as including only the steps that transform a starting chemical into the final product or products, but not the steps that move the starting chemical to the place where the metabolism will happen, nor the steps that move the products away. Thus glycolysis starts with cytosolic glucose and ends with cytosolic pyruvate. Should we try to be consistent about this? Maybe variation in this aspect of defining process specific boundaries is OK as long as there is consensus on how to define these various boundaries correctly / consistently in each case? |
Consider this process
the start point is undefined (formally or informally)
Some euks, such as plants and yeast have a whole pathway for this, and the annotations in these species look more or less what you might expect (with a few annotations that seem to be indirect roles). In all cases the penultimate step is making thiamine, when is then turned into ThDP by TPK1.
But we mammals rely on other organisms to make thiamine, and our "thiamine diphosphate biosynthetic process (sensu stricto)" is trivially the activity of a single gene (TPK1). Ideally we would see only one human gene annotated to this "pathway" -- and in fact we do, if we look only at IBAs. But there is a lot of annotation of genes (THTPA, SLC19A2, SLC19A3, SLC25A19) that are related in some other way.
These other annotations make sense when we look at what Reactome calls "thiamin metabolism" https://reactome.org/PathwayBrowser/#/R-HSA-196819
I think Reactome's use of the catch-all "metabolism" is right here, because really it's only one step here that is biosynthesis (TPK1).
IMO it's clear the PAINT annotators and other annotators are annotating to a different concept here.
My proposal here is to ban single-step pathways, restrict ThDP biosynthesis to be bacteria, plants, yeast, etc. However, I do think it's useful to have a concept for what is shown in the Reactome diagram and retain the same genes. ThDP transport (no such term in GO) isn't quite right since it's really uptake-modification-transport. Metabolism is not wrong but we use metabolism as a general grouping.
This would be a bit of work and reannotation. But right now the concept we have in GO is being interpreted in at least two different ways which is not good.
The text was updated successfully, but these errors were encountered: