Using regression to estimate the probabilities for each gene to be essential or not given the SATAY data #20

leilaicruz · 2020-08-19T13:51:26Z

See HERE the web visualization of the code :-)

leilaicruz · 2020-08-19T14:07:49Z

Go HERE to see the details of the python program.

If we plot the reads and insertions per gene and highlight if they are essential or not from published data , we see this 👇

Since both datasets sort of overlap (after truncating the datasets and removing outliers) the regression model can not predict essential genes with more than 0.5 probability .

However, if we go deep into the probabilites we can see that if the probability of being essential is bigger than 0.3 already 76% of all essential genes fall inside it .

leilaicruz self-assigned this Aug 19, 2020

leilaicruz added the data processing label Aug 19, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Using regression to estimate the probabilities for each gene to be essential or not given the SATAY data #20

Using regression to estimate the probabilities for each gene to be essential or not given the SATAY data #20

leilaicruz commented Aug 19, 2020 •

edited

Loading

leilaicruz commented Aug 19, 2020

Using regression to estimate the probabilities for each gene to be essential or not given the SATAY data #20

Using regression to estimate the probabilities for each gene to be essential or not given the SATAY data #20

Comments

leilaicruz commented Aug 19, 2020 • edited Loading

leilaicruz commented Aug 19, 2020

leilaicruz commented Aug 19, 2020 •

edited

Loading