Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

BCR-tutorial #199

Open
grst opened this issue Oct 7, 2020 · 4 comments · May be fixed by #542
Open

BCR-tutorial #199

grst opened this issue Oct 7, 2020 · 4 comments · May be fixed by #542
Labels
documentation Improvements or additions to documentation

Comments

@grst
Copy link
Collaborator

grst commented Oct 7, 2020

Add a tutorial for BCR (or joint BCR/TCR analysis).

Either as additional tutorial, or replace the TCR tutorial as the main "getting started" tutorial.

The latter may make particularly sense, as we could show a joint BCR/TCR analysis.

@grst grst mentioned this issue Oct 7, 2020
14 tasks
@ktpolanski
Copy link
Contributor

Another signal boost post. I just tried to analyse some BCR, so I mimicked the TCR tutorial.

@grst
Copy link
Collaborator Author

grst commented Nov 5, 2020

Hi @ktpolanski,

the main reason why I didn't prioritize the tutorial so far is that the analysis steps don't differ a lot between TCR and BCR... So following the TCR tutorial should be fine. Would still be nice to have it covered, of course.

Do you have something particular in mind with respect to BCRs?

Best,
Gregor

@ktpolanski
Copy link
Contributor

Having had a chat with a biologist, I believe Scirpy's existing functionality covers BCR-specific phenomena (non-identity CDR3 comparisons for somatic hypermutation, and .obs['..._c_gene'] for class-switch recombination). As such, it should actually be fine, yeah!

Given how vital a role the neighbour/clonotype calling seems to have in the process, do you have any suggestions for Levenshtein parameterisation for BCRs?

@grst
Copy link
Collaborator Author

grst commented Nov 12, 2020

We don't have any evidence, what the optional threshold is. For TCR, I would say that anything with a levenshtein distance of >1-2 is unlikely to recognize the same antigen. @szabogtamas, our immunology expert, suggested that for BCR it may make sense to relax this threshold a bit.

In the introduction of this preprint, they write:

The authors in [14], have noticed that the distribution of distances between sequences and their nearest neighbors (distance-to-nearest) tends to be bi-modal, with a first mode corresponding to clonally related sequences and second mode corresponding to sequences without clonal relationship (singletons). Using this bi-modality, [14] proposes to set a threshold that separates the two modes. Following this observation, [1, 17] use the bi-modality of this distribution to suggest an automatic way to set the threshold. A recent method by [2] uses spectral clustering with an adaptive threshold to identify the groups of clonally related sequences.

I gave that a quick shot with the Maynard 2020 dataset, and coudn't find a bimodal distribution. However, the number of B-cells in that dataset is rather small (~1,700), so it might very well be that this pattern only starts to emerge with 10,000s or even 100,000s of cells.

In any case, this is a topic that interests me myself. If you find something useful, let me know and I'm happy to add it to the documentation.

@grst grst added the documentation Improvements or additions to documentation label Nov 12, 2020
@grst grst linked a pull request Aug 30, 2024 that will close this issue
3 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation
Projects
Status: In progress
Development

Successfully merging a pull request may close this issue.

2 participants