Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

a question about the formula of computing $e_{i,j,k_{\text{alt}}}$ #1

Open
Chiyasa opened this issue Oct 28, 2024 · 4 comments
Open

Comments

@Chiyasa
Copy link

Chiyasa commented Oct 28, 2024

very excellent work! After reading the paper, I have a question about the formula of $e_{i,j,k_{\text{alt}}}$:
$$e_{i,j,k_{\text{alt}}} = \max_{k} \left| \log_2 \left( \frac{\text{odds}(n_j = k | n_1, \ldots, n_i = k_{\text{alt}}, \ldots, n_{N})}{\text{odds}(n_j = k | n_1, \ldots, n_i = k_{\text{ref}}, \ldots, n_{N})} \right) \right| \quad \text{with} \quad k \in {A, C, G, T}$$
in the formula, it seems like $n_{j}$ could be any one nucleotide of A/C/G/T, but should be the same nucleotide between the numerator and denominator. My question is: Why 1) ${n_{j}}$ not be fixed to $k_{ref}$ on position $j$ 2) ${n_{j}}$ cannot be different nucleotide on the numerator and denominator?

@Chiyasa Chiyasa changed the title a question about the formula of computing e_{i,j,k_{\text{alt}}} a question about the formula of computing $e_{i,j,k_{\text{alt}}}$ Oct 28, 2024
@pedrotomazsilva
Copy link
Collaborator

thank you very much @Chiyasa! On the second point, that is right n_j is the same nucleotide on the numerator and denominator.
On the first point, for this metric our rationale was to capture the highest “potential” an alternative (k_alt) nucleotide has to influence the prediction at position j. That means looking at any nucleotide at j and not only its reference.

@Chiyasa
Copy link
Author

Chiyasa commented Oct 29, 2024

thank you very much @Chiyasa! On the second point, that is right n_j is the same nucleotide on the numerator and denominator. On the first point, for this metric our rationale was to capture the highest “potential” an alternative (k_alt) nucleotide has to influence the prediction at position j. That means looking at any nucleotide at j and not only its reference.

Another question is that, in this formula, both in the numerator and denominator, in the condition of probability, $n_{j}$ should always be $k_{ref}$ at position $j$, does not change with $k$ in its outcomes?

@pedrotomazsilva
Copy link
Collaborator

yes, the odds are conditioned on position n_j being its reference nucleotide, regardless of k.

@Chiyasa
Copy link
Author

Chiyasa commented Oct 29, 2024

yes, the odds are conditioned on position n_j being its reference nucleotide, regardless of k.

Thank you for your response! I have no questions now.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants