-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fix!: SequenceLocation
start
/end
for TranscriptSegmentElement
#172
Conversation
This reverts commit 24e1a4c.
This reverts commit 3e52ff2.
Co-authored-by: Kori Kuzma <[email protected]>
Co-authored-by: Kori Kuzma <[email protected]>
Co-authored-by: Kori Kuzma <[email protected]>
Co-authored-by: Kori Kuzma <[email protected]>
@ahwagner My question pertains for how to use def transcript_segment_element():
"""Create transcript segment element test fixture"""
params = {
"type": "TranscriptSegmentElement",
"exonEnd": 8,
"exonEndOffset": 0,
"exonStart": 1,
"exonStartOffset": 0,
"gene": {
"id": "hgnc:12012",
"label": "TPM3",
"type": "Gene",
},
"transcript": "refseq:NM_152263.3",
"elementGenomicStart": {
"id": "ga4gh:SL.2K1vML0ofuYrYncrzzXUQOISRFJldZrO",
"type": "SequenceLocation",
"sequenceReference": {
"id": "refseq:NC_000001.11",
"refgetAccession": "SQ.Ya6Rs7DHhDeg7YaOSg1EoNi3U_nQ9SvO",
"type": "SequenceReference",
},
"end": 154192135
},
"elementGenomicEnd": {
"id": "ga4gh:SL.rtR6x2NnJEpROlxiT_DY9C-spf6ijYQi",
"type": "SequenceLocation",
"sequenceReference": {
"id": "refseq:NC_000001.11",
"refgetAccession": "SQ.Ya6Rs7DHhDeg7YaOSg1EoNi3U_nQ9SvO",
"type": "SequenceReference",
},
"start": 154170399
},
}
return TranscriptSegmentElement(**params) |
@ahwagner here is what UTA provides. Note that UTA is inter-residue based
|
@ahwagner Was wondering if you could also comment on the correct use of
|
Yes.
Depending on what is meant by "reversed", yes. Specifically, you should always expect that the lower genomic coordinate is defined by a SequenceLocation
These are VRS SequenceLocations, so it should be interresidue following VRS conventions. I want to be clear that 0- or 1- indexing generally corresponds to interresidue or residue, respectively, but I don't use these terms interchangeably as one could very well 1-index interresidue coordinates or 0-index residue.
This follows the conventions outlined in the VICC Fusions Spec. Specifically, this image: Offset is always with respect to the exon-based representation, and therefore with respect to the transcript (not genomic) sequence. To get here from a genomic location, you need coordinates, a direction (specified by use of Also, is the mixed use of camel case and snake case intentional for To answer your question about the correct use of offset, I am assuming the use case is the Note Incidentally, the interface for this tool might be clarified by replacing "start position" and "end position" with "Coordinate" and "Junction Direction", and removing the toggle for "Strand" (which should be captured from the transcript alignment). This tool should support whether the returned object represents a transcript segment starting at a given genomic coordinate (genomic SeqLoc uses Under these conditions, a genomic coordinate falling within an exon represents a positive offset from the beginning of that exon if it is starting at that coordinate, or a negative offset from the end of that exon if it is ending at that coordinate. Likewise, an RNA that includes exon-adjacent intronic sequence would have a genomic coordinate with a negative offset from the beginning of that exon if it is starting at that coordinate, or a positive offset from the end of that exon if it is ending at that coordinate. |
This PR is stale because it has been open 1 day(s) with no activity. Please review this PR. |
This PR is stale because it has been open 1 day(s) with no activity. Please review this PR. |
If #176 is pulling SequenceLocations directly from Cool-Seq-Tool, then we do not need this. I'm going to be adding @jarbesfeld as a reviewer to #176 just in case and will close this. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
looks good to me!
Ah rip. Didn't realize I had an exclamation in PR title |
…172) close #171 The tests for transcript segment elements are kind of confusing IMO. `transcript_segments` test fixture in `test_models` leverages `sequence_locations`. Some of these sequence locations have both start/end, but we only use start or end. I didn't change these because I don't think the values are actually tested. We can revisit in the future.
close #171
The tests for transcript segment elements are kind of confusing IMO.
transcript_segments
test fixture intest_models
leveragessequence_locations
. Some of these sequence locations have both start/end, but we only use start or end. I didn't change these because I don't think the values are actually tested. We can revisit in the future.