Saved variant timeout #3510

Merged: 5 commits into dev on Jul 25, 2023

Conversation

@hanars hanars commented Jul 24, 2023

The issue appears to be that loading TPM==0 rows increased the number of TPM models, which increased the size of the response, and we are actually losing most of the time serializing and deserializing it. This updates the behavior so we only return TPM genes for a family if that family has a variant in that gene, instead of if any family has a variant in that gene. For RGP this reduces the response size by over 100Mb.
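To illustrate the change described above, here is a minimal sketch contrasting the two behaviors. It is not the code from this PR: the sample data, the gene IDs, and the function names `tpm_genes_any_family` and `tpm_genes_per_family` are hypothetical, and the "any family" version is reconstructed from the description rather than taken from the old implementation. Only the field names (`familyGuids`, `transcripts`, `genes`, `sample__individual__family__guid`, `tpmGenes`) come from the diff excerpts in this thread.

```python
from collections import defaultdict

# Hypothetical saved variants, using only the fields visible in this PR.
variants = [
    {'familyGuids': ['F1'], 'transcripts': {'ENSG_A': [], 'ENSG_B': []}},
    {'familyGuids': ['F2'], 'transcripts': {'ENSG_C': []}},
]

# Hypothetical aggregated rows: one per family, listing genes that have TPM data.
tpm_family_genes = [
    {'sample__individual__family__guid': 'F1', 'genes': ['ENSG_A', 'ENSG_C']},
    {'sample__individual__family__guid': 'F2', 'genes': ['ENSG_A', 'ENSG_B', 'ENSG_C']},
]


def tpm_genes_any_family(variants, tpm_family_genes):
    """Assumed old behavior: keep a gene if ANY family has a variant in it."""
    all_genes = set()
    for var in variants:
        all_genes.update(var.get('transcripts', {}).keys())
    return {
        agg['sample__individual__family__guid']: {
            'tpmGenes': [gene for gene in agg['genes'] if gene in all_genes]
        } for agg in tpm_family_genes
    }


def tpm_genes_per_family(variants, tpm_family_genes):
    """New behavior: keep a gene only if THIS family has a variant in it."""
    family_genes = defaultdict(set)
    for var in variants:
        for family_guid in var['familyGuids']:
            family_genes[family_guid].update(var.get('transcripts', {}).keys())
    return {
        agg['sample__individual__family__guid']: {
            'tpmGenes': [
                gene for gene in agg['genes']
                if gene in family_genes[agg['sample__individual__family__guid']]
            ]
        } for agg in tpm_family_genes
    }


before = tpm_genes_any_family(variants, tpm_family_genes)
after = tpm_genes_per_family(variants, tpm_family_genes)
```

In this toy data the per-family version returns one gene per family instead of two and three, which is the same kind of shrinkage that reduces the response size for large projects.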

@hanars hanars requested a review from ShifaSZ July 24, 2023 22:23
Comment on lines +95 to +96
for family_guid in var['familyGuids']:
    family_genes[family_guid].update(var.get('transcripts', {}).keys())
Contributor

Do we need to include the families who have no transcripts?

Collaborator Author

Yes overall, as this is used for the list of families for filtering which projects to return, etc. However, I can add an extra filter before we get the family list for RNA data, as there's no reason to include RNA data for families with no genes.
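A rough sketch of the extra filter described in this reply, under stated assumptions: the helper name `split_family_guids` and the sample data are hypothetical, and the actual change in the PR may be structured differently. The idea is to keep every family for the general family list while passing only families with at least one gene on to the RNA data lookup.

```python
from collections import defaultdict


def split_family_guids(variants):
    """Return (all family guids, family guids that have at least one gene).

    Hypothetical helper: the first set would feed project filtering,
    the second would be the family list used for RNA data.
    """
    family_genes = defaultdict(set)
    for var in variants:
        for family_guid in var['familyGuids']:
            # Indexing the defaultdict registers the family even when
            # the variant has no transcripts.
            family_genes[family_guid].update(var.get('transcripts', {}).keys())

    all_family_guids = set(family_genes)
    rna_family_guids = {guid for guid, genes in family_genes.items() if genes}
    return all_family_guids, rna_family_guids


# Hypothetical data: F2 has a variant but no transcripts.
sample_variants = [
    {'familyGuids': ['F1'], 'transcripts': {'ENSG_A': []}},
    {'familyGuids': ['F2']},
]
all_guids, rna_guids = split_family_guids(sample_variants)
```

Here `F2` stays in the overall family list but is skipped for RNA data, since it has no genes to match against.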

Collaborator Author

added this below

Comment on lines 162 to 165
return {
    agg['sample__individual__family__guid']: {'tpmGenes': [
        gene for gene in agg['genes'] if gene in family_genes[agg['sample__individual__family__guid']]
    ]} for agg in tpm_family_genes
Contributor

Should we filter out the families that have no genes in family_genes?
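One way this suggestion could look, as a sketch only: the function name `build_tpm_response` and the sample data are hypothetical, and the thread does not show whether the merged code does this at the response level. It builds the same mapping as the excerpt above but omits any family whose filtered gene list ends up empty.

```python
def build_tpm_response(tpm_family_genes, family_genes):
    """Map family guid -> {'tpmGenes': [...]}, skipping families with no matching genes."""
    response = {}
    for agg in tpm_family_genes:
        family_guid = agg['sample__individual__family__guid']
        wanted = family_genes.get(family_guid, set())
        genes = [gene for gene in agg['genes'] if gene in wanted]
        if genes:  # omit families that would otherwise get an empty list
            response[family_guid] = {'tpmGenes': genes}
    return response


# Hypothetical inputs: F2 has TPM data but no variant genes.
example_rows = [
    {'sample__individual__family__guid': 'F1', 'genes': ['ENSG_A', 'ENSG_B']},
    {'sample__individual__family__guid': 'F2', 'genes': ['ENSG_C']},
]
example_family_genes = {'F1': {'ENSG_A'}, 'F2': set()}
example_response = build_tpm_response(example_rows, example_family_genes)
```

Whether an absent key or an empty `tpmGenes` list is preferable depends on what the client expects for families without TPM genes.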

@hanars hanars merged commit 092dbe0 into dev Jul 25, 2023
4 checks passed