Sankoff with multiple criteria #55

Open
marybarker opened this issue Dec 18, 2022 · 1 comment

Comments

@marybarker
Contributor

It might be useful to generalize the Sankoff algorithm to admit both sequence and geographic data.

This can be done naively for the 2-region case by appending a binary-valued character to the existing sequences to represent the geographic region of interest. However, this approach has two limitations that we would like to get around:

  • We would like to allow for more than 2 geographic regions.
  • We might want to weight transitions between geographic regions differently from transitions between bases in the actual sequences.

One way to get around this would be to create separate node attributes, each with its own cost matrix, as @willdumm suggests in this comment.
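As a rough illustration of the separate-attribute idea, here is a minimal sketch of the Sankoff recursion for a single character with an arbitrary cost matrix, run once for a base and once for a 3-region geography attribute with its own, heavier weights. The function `sankoff_cost`, the toy tree, the leaf labels, and the weight of 2.0 are all assumptions made for this example; none of this is existing project code.

```python
import numpy as np

def sankoff_cost(tree, root, leaf_states, cost):
    """Return the Sankoff cost vector at `root` for one character.

    tree:        dict mapping each internal node to a list of its children
    leaf_states: dict mapping each leaf to its observed state index
    cost:        (k, k) array, cost[i, j] = cost of parent state i -> child state j
    """
    k = cost.shape[0]

    def cost_vector(node):
        children = tree.get(node, [])
        if not children:
            # Leaf: only the observed state is allowed.
            vec = np.full(k, np.inf)
            vec[leaf_states[node]] = 0.0
            return vec
        total = np.zeros(k)
        for child in children:
            child_vec = cost_vector(child)
            # For each parent state i, take the cheapest child state j.
            total += (cost + child_vec[None, :]).min(axis=1)
        return total

    return cost_vector(root)

# Toy tree (hypothetical): root -> (n1, C), n1 -> (A, B)
tree = {"root": ["n1", "C"], "n1": ["A", "B"]}

# One base position, states indexed 0..3, unit transition cost.
base_cost = 1.0 - np.eye(4)
base_leaves = {"A": 0, "B": 0, "C": 1}

# Geography with three regions, each move weighted 2.0 (assumed weight).
geo_cost = 2.0 * (1.0 - np.eye(3))
geo_leaves = {"A": 0, "B": 1, "C": 2}

base_total = sankoff_cost(tree, "root", base_leaves, base_cost).min()
geo_total = sankoff_cost(tree, "root", geo_leaves, geo_cost).min()
combined_total = base_total + geo_total
```

Because each attribute carries its own cost matrix, the number of regions and their relative weight can be changed without touching the sequence costs.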

In this case, we would need to couple the attributes carefully. A potential problem when optimizing cost using multiple criteria is that the overall cost can reach its optimum at a combination that is not a local optimum for any single attribute. That is, the cost of a choice at a given node should be evaluated jointly over the entire set of attributes, on each tree below it.
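A small made-up example shows how this can happen. The candidate labelings, their per-attribute costs, and the equal weights below are all hypothetical numbers chosen only to illustrate the point.

```python
# Hypothetical per-attribute costs for three candidate labelings of one subtree.
candidates = {
    "labeling_1": {"sequence": 3, "geography": 10},
    "labeling_2": {"sequence": 10, "geography": 3},
    "labeling_3": {"sequence": 5, "geography": 5},
}

# Assumed relative weights of the two criteria.
weights = {"sequence": 1.0, "geography": 1.0}

def total_cost(costs):
    """Weighted sum of the per-attribute costs of one candidate."""
    return sum(weights[name] * value for name, value in costs.items())

best_for_sequence = min(candidates, key=lambda c: candidates[c]["sequence"])
best_for_geography = min(candidates, key=lambda c: candidates[c]["geography"])
best_overall = min(candidates, key=lambda c: total_cost(candidates[c]))
```

Here `labeling_3` has the lowest combined cost even though it is not the best choice for either attribute alone, so keeping only the per-attribute optima at a node would discard the overall optimum.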

@marybarker
Contributor Author

With some care, this might also be generalized to allow a constrained Sankoff-geography method, where the sequences are resolved first, and then geography is inferred based on the optimal sequence labeling.
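One way to read the constrained variant is as a two-stage selection: keep only the labelings with minimal sequence cost, then pick the cheapest geography among those. The sketch below assumes that reading; the function name `constrained_choice` and the candidate costs are hypothetical.

```python
# Hypothetical candidates as (name, sequence_cost, geography_cost).
candidates = [
    ("labeling_1", 3, 10),
    ("labeling_2", 3, 4),
    ("labeling_3", 5, 1),
]

def constrained_choice(candidates):
    """Resolve sequences first, then minimize geography cost among the survivors."""
    best_sequence_cost = min(seq for _, seq, _ in candidates)
    sequence_optimal = [c for c in candidates if c[1] == best_sequence_cost]
    return min(sequence_optimal, key=lambda c: c[2])

def unconstrained_choice(candidates):
    """For comparison: minimize the plain sum of both costs."""
    return min(candidates, key=lambda c: c[1] + c[2])

constrained = constrained_choice(candidates)
unconstrained = unconstrained_choice(candidates)
```

With these numbers the constrained method selects `labeling_2`, while minimizing the summed cost selects `labeling_3`, so the two approaches can disagree.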
