glennhickey
Collaborator

This adds the --rgfa option to cactus-pangenome. When set, it produces three additional outputs:

  • graph.rgfa.gz: Like the usual GFA file, but with no paths, only rGFA tags. These tags cover off-reference sequence as determined by a greedy cover. By default, only intervals >= 50bp are included. (graph.gfa.gz no longer has rank-0 tags.)
  • graph.rgfa.gbz: The usual GBZ, but with the full rGFA cover added as reference path fragments (under the special sample name _rGFA).
  • graph.rgfa.vcf.gz: This VCF contains nested calls inside insertions by means of the rGFA cover. Unlike the usual VCF, it is not run through vcfbub.
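
As a rough way to inspect the rGFA cover in graph.rgfa.gz, the sketch below tallies segment bases per rank. It assumes the segment lines carry the standard rGFA tags (SN:Z stable sequence name, SO:i offset, SR:i rank); the helper names `parse_rgfa_segment` and `bases_per_rank` are hypothetical and not part of cactus.

```python
from collections import defaultdict


def parse_rgfa_segment(line):
    """Return (name, length, SN, SO, SR) for an rGFA S-line, else None.

    Assumes tab-separated GFA with rGFA tags SN:Z, SO:i and SR:i.
    """
    fields = line.rstrip("\n").split("\t")
    if len(fields) < 3 or fields[0] != "S":
        return None
    tags = {}
    for field in fields[3:]:
        parts = field.split(":", 2)  # TAG:TYPE:VALUE
        if len(parts) == 3:
            tags[parts[0]] = parts[2]
    if not {"SN", "SO", "SR"} <= tags.keys():
        return None  # segment has no rGFA annotation
    seq = fields[2]
    # GFA allows '*' in place of the sequence, with the length in LN:i.
    length = int(tags["LN"]) if seq == "*" and "LN" in tags else len(seq)
    return fields[1], length, tags["SN"], int(tags["SO"]), int(tags["SR"])


def bases_per_rank(lines):
    """Sum segment lengths by rGFA rank (SR) over an iterable of GFA lines."""
    totals = defaultdict(int)
    for line in lines:
        seg = parse_rgfa_segment(line)
        if seg is not None:
            totals[seg[4]] += seg[1]
    return dict(totals)
```

To run it on the real output, open the file with `gzip.open("graph.rgfa.gz", "rt")` and pass the handle to `bases_per_rank`; ranks above 0 then give the amount of off-reference sequence captured by the cover.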

It'd be nice to not have different GBZ/VCF outputs, but keeping them separate will make it easier to benchmark. There's also still some work needed regarding VCF tags...
