support for reference sequences with lowercase characters (a, c, g, t) #19
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This addresses issue #18. It fixes problems rendering reference sequences with lowercase characters.
The fix comes in two parts. First is an expansion of the
COLOR
dictionary to associate hex codes to each of the lowercase bases. All the colors are defined separately. This is makes the code long, but leaves the option to tune the colors later.The second part is an adjustment in the drawing of the coverage track. Without this edit, the coverage track signals discrepancies between bases in reads (uppercase) and the reference. The images below show a random sequence with made-up reads, before and after implementing the edit.
To reproduce, here is a random reference sequence and two reads in sam format.