Skip to content
This repository has been archived by the owner on Nov 8, 2021. It is now read-only.

Using regression to estimate the probabilities for each gene to be essential or not given the SATAY data #20

Open
leilaicruz opened this issue Aug 19, 2020 · 1 comment
Assignees

Comments

@leilaicruz
Copy link
Member

leilaicruz commented Aug 19, 2020

See HERE the web visualization of the code :-)

@leilaicruz
Copy link
Member Author

Go HERE to see the details of the python program.

If we plot the reads and insertions per gene and highlight if they are essential or not from published data , we see this 👇
image

Since both datasets sort of overlap (after truncating the datasets and removing outliers) the regression model can not predict essential genes with more than 0.5 probability .
image

However, if we go deep into the probabilites we can see that if the probability of being essential is bigger than 0.3 already 76% of all essential genes fall inside it .
image

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

No branches or pull requests

1 participant