Add Pipistrellus hanaki #84

SMangenot · 2024-10-03T06:48:44Z

Assembly review request

ToLID: mPipHan1
Species: Pipistrellus hanaki
Project: ERGA-BGE
Affiliation: Genoscope

erga-ear-bot · 2024-10-03T06:49:10Z

Hi @SMangenot, thanks for sending the EAR of Pipistrellus hanaki.
I added the corresponding tag to the PR and will contact a supervisor and a reviewer ASAP.

erga-ear-bot · 2024-10-03T06:49:13Z

Hi @tbrown91, do you agree to supervise this assembly?
Please reply to this message only with OK to acknowledge.

tbrown91 · 2024-10-03T07:05:18Z

ok

erga-ear-bot · 2024-10-03T07:05:45Z

*****
EAR Reviewer Selection Process
Date: 2024-10-03 07:05

All Eligible Candidates:

Github ID  | Full Name       | Institution | Total Reviews | Last Review | Active | Busy | Calling Score | Adjusted Score
-------------------------------------------------------------------------------------------------------------------------
talioto    | Tyler Alioto    | CNAG        | 2             | 2024-09-30  | Y      | N    | 1004          | 1054          
epaule     | Michael Paulini | Sanger      | 2             | 2024-09-05  | Y      | N    | 1002          | 1052          
DomAbsolon | Dom Absolon     | Sanger      | 2             | 2024-09-23  | Y      | N    | 1002          | 1052          
additive3  | Jo Wood         | Sanger      | 3             | 2024-06-20  | Y      | N    | 1001          | 1051          
tommathers | Tom Mathers     | Sanger      | 3             | 2024-09-30  | Y      | N    | 1001          | 1051          
tbrown91   | Tom Brown       | IZW         | 8             | 2024-07-05  | Y      | N    | 994           | 989           
diegomics  | Diego De Panis  | IZW         | 7             | 2024-07-05  | Y      | N    | 992           | 987           

Selected reviewer: Tyler Alioto (talioto)
The decision was based on:
- different institution ('CNAG')
- active ('Y')
- not busy ('N')
- highest adjusted calling score in this particular selection (1054)
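
For readers following the thread, the selection summarised above can be sketched in a few lines of Python. This is only an illustration under assumptions: the bot's actual code is not shown in this conversation, `Candidate` and `select_reviewer` are hypothetical names, and treating the submitter's affiliation (Genoscope) and the supervisor's institution (IZW) as the excluded institutions is a guess at what "different institution" means here. The scores are copied from the table above.

```python
from dataclasses import dataclass
from typing import Iterable, Optional, Set


@dataclass(frozen=True)
class Candidate:
    """One row of the candidate table (hypothetical structure)."""
    github_id: str
    institution: str
    active: bool
    busy: bool
    adjusted_score: int


def select_reviewer(candidates: Iterable[Candidate],
                    excluded_institutions: Set[str]) -> Optional[Candidate]:
    """Pick the active, non-busy candidate from a non-excluded institution
    with the highest adjusted score; return None if nobody qualifies."""
    eligible = [
        c for c in candidates
        if c.active and not c.busy and c.institution not in excluded_institutions
    ]
    if not eligible:
        return None
    return max(eligible, key=lambda c: c.adjusted_score)


# Adjusted scores as listed in the selection table.
CANDIDATES = [
    Candidate("talioto", "CNAG", True, False, 1054),
    Candidate("epaule", "Sanger", True, False, 1052),
    Candidate("DomAbsolon", "Sanger", True, False, 1052),
    Candidate("additive3", "Sanger", True, False, 1051),
    Candidate("tommathers", "Sanger", True, False, 1051),
    Candidate("tbrown91", "IZW", True, False, 989),
    Candidate("diegomics", "IZW", True, False, 987),
]

# Assumption: exclude the submitter's and the supervisor's institutions.
chosen = select_reviewer(CANDIDATES, {"Genoscope", "IZW"})
print(chosen.github_id if chosen else "no eligible reviewer")
```

Ties are not handled specially in this sketch; how the bot breaks ties is not stated in the thread.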

erga-ear-bot · 2024-10-03T07:05:47Z

Hi @talioto, do you agree to review this assembly?
Please reply to this message only with Yes or No by 10-Oct-2024 at 09:05 CET

tbrown91 · 2024-10-03T07:08:56Z

@SMangenot Could you please add BUSCO scores from a more appropriate lineage, for example mammalia or laurasiatheria? I am concerned about the number of duplicated genes, which didn't seem to decrease even though you removed quite a number of sequences during curation.

@tbrown91

Hi @tbrown91, here's the new EAR report with the BUSCO scores from laurasiatheria lineage

erga-ear-bot · 2024-10-04T12:42:59Z

The researcher has updated the EAR PDF. Please review the assembly @tbrown91.

talioto · 2024-10-09T13:28:59Z

Yes

erga-ear-bot · 2024-10-09T13:29:27Z

Thanks for agreeing!
I appointed you as the EAR reviewer.
I will keep your status as Busy until you finish this review.
Please check the Wiki if you need to refresh something (and remember that you must download the EAR PDF to be able to click on the link to the contact map file!).
Contact the PR assignee for any issues.

tbrown91 · 2024-10-14T07:59:29Z

Hi @talioto, have you had a chance to look through the assembly? We need to get this submitted by the end of the month unless there are major issues.

talioto

I was not quite done, but here are my notes so far:

Contact density is in general very low, making decisions somewhat difficult. It's too bad the library was not sequenced to higher depth.

Telomeric and subtelomeric sequence is often incorporated but there is a lot left in the chaff/shrapnel at the end with non-specific signal. I don't know how much to trust the placement of this stuff. This is why my review has taken some time.

SUPER_1: some faint telomeric sequence in the middle, around 91.1-91.8 Mb. BUT it seems to lie within a contig, and the contacts support this being a relic signal from a chromosome fusion. Perhaps nothing to do here.

Join SUPER_7 and SUPER_5? The signal is similar to that seen in SUPER_1. There is a telomeric repeat region, though.

SUPER_8: 6.5-7 Mb contig repeat that is maybe misplaced. Better to unloc it.

SUPER_14: 0-3.8 Mb. I don't think the signal is specific enough to keep this attached to this chromosome. Based on the pattern of contacts to other subtelomeric regions, it seems like this is the odd one out. I would place it in the chaff.

SUPER_15: beginning subtelomeric region. I'm not sure I trust YaHS in putting this together.

SUPER_17: interior telomeric region around 32-26 Mb. Not sure how to handle.

"Y" should probably be X. It is a male specimen: https://www.ebi.ac.uk/biosamples/samples/SAMEA115120470 so I assume XY. Other species in genus have 104 Mb X and 4 Mb Y. or 106 Mb X and 6 Mb Y.
"Y": interior telomeric region 54.3-61.8 Mb. I don't think there's enough support to keep it in the middle of the X. In fact I think this is the actual Y sequence. Keep the part next to it that has contacts with the X and is higher coverage. This is likely part of the PAR. See my savestate.

Perhaps minimap2 alignments to other species in the genus would help sort some of this out.
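
As a rough illustration of how such alignments could be summarised, the sketch below reads minimap2's PAF output (tab-separated; column 1 is the query name, column 6 the target name, column 10 the number of matching bases) and reports, for each query sequence, the target sequence that collects the most matching bases. The function name `best_target_per_query` and the example records are hypothetical and are not taken from this assembly; this is a sketch, not what was actually run in this review.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple


def best_target_per_query(paf_lines: Iterable[str]) -> Dict[str, Tuple[str, int]]:
    """For each query, return (target, matching_bases) for the target with
    the largest summed number of matching bases across all alignments."""
    totals: Dict[str, Dict[str, int]] = defaultdict(lambda: defaultdict(int))
    for line in paf_lines:
        if not line.strip():
            continue  # skip blank lines
        fields = line.rstrip("\n").split("\t")
        query, target = fields[0], fields[5]
        matches = int(fields[9])  # PAF column 10: number of residue matches
        totals[query][target] += matches
    return {
        query: max(per_target.items(), key=lambda kv: kv[1])
        for query, per_target in totals.items()
    }


# Made-up PAF records, for illustration only.
example = [
    "query_a\t1000\t0\t500\t+\tref_chr1\t5000\t100\t600\t450\t500\t60",
    "query_a\t1000\t500\t900\t+\tref_chr1\t5000\t600\t1000\t380\t400\t60",
    "query_a\t1000\t900\t1000\t-\tref_chr2\t4000\t0\t100\t90\t100\t20",
    "query_b\t800\t0\t800\t-\tref_chr2\t4000\t200\t1000\t700\t800\t60",
]

for query, (target, matches) in sorted(best_target_per_query(example).items()):
    print(query, target, matches)
```

A summary like this only shows which reference sequence a scaffold mostly matches. Checking collinearity would additionally require looking at the order and strand of the individual alignments.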

talioto · 2024-10-14T11:14:45Z

Here's my savestate: https://ccnag-my.sharepoint.com/:u:/g/personal/tyler_alioto_cnag_eu/EfjGxTn7WFhGlJ8ovoUgzvsBCqk3wB01soLHeweT8NyVUQ?e=4fDcGT

tbrown91 · 2024-10-15T11:22:27Z

Thank you for the review @talioto!

@SMangenot can you please look through Tyler's suggested changes and see if they make sense in the context of the Hi-C map? I wonder if looking at synteny to other Pipistrellus genomes would also help, e.g. these 4 are all at chromosome level, including kuhlii, which is listed as scaffold: https://www.ncbi.nlm.nih.gov/datasets/genome/?taxon=27671&reference_only=true

talioto · 2024-10-20T20:09:56Z

It would be good to know if there's been any progress on this. We're in a time crunch here. Could use this genome span.

ldemirdj · 2024-10-21T10:35:13Z

Hi @talioto,

I'll reach out to Sophie for an update, but she's still working on the map. We should have news soon, and we'll be able to submit it before the October 31st deadline.

Thanks!

Lola

A new update for Pipistrellus hanaki: I made the corrections according to your remarks but, after comparing with the reference genome, I did not join SUPER_7 and SUPER_5, or SUPER_14 and SUPER_15.

erga-ear-bot · 2024-10-28T07:37:30Z

The researcher has updated the EAR PDF. Please review the assembly @talioto.

ldemirdj · 2024-10-30T10:47:19Z

Hi everyone,

Given tomorrow's deadline, it won’t be possible to submit this genome on time. Sophie and I are awaiting Tyler's feedback for a final review, and we will submit it once that's completed 👍 .

Thank you for your understanding.

Best,

Lola

talioto · 2024-10-30T14:11:00Z

Man, this one is not easy, but I assume you did some alignments to other Pipistrellus bats. Is the X colinear? Are there any Y's to align to? I'm not sure the piece labeled Y is the Y. I broke off the little bit that matches the X and placed it in the gap in the middle of the X.

SUPER_13: from 53.6 to the end, I would break it off and unplace it into the chaff.

Here's a link to the folder where I have the pretextmap and a savestate.

ldemirdj · 2024-10-30T14:37:21Z

Yes, it's a hard genome. The message was mostly for Tom, to keep him informed. Sophie will answer your question. Thanks @talioto.

additive3 · 2024-10-30T15:43:55Z

My 2 cents on the Y chromosome.
What is currently annotated as Y looks to be just satellite.. perhaps centromeric, and looks like placement is to SUPER_3 (at a guess ~91.4Mb).
I agree that there is a small bit that clips off and is X centromere (also scaffold_56).

So Y... looking at the map, I would suggest that scaffold_32, scaffold_37, scaffold_58, scaffold_34 (in that order) are it.

Hi-C coverage is really not high enough, but that is for another discussion.

additive3 · 2024-10-30T16:12:45Z

Y chrom.

erga-ear-bot · 2024-11-05T10:01:32Z

Attention @talioto, the EAR PDF was updated.

tbrown91 · 2024-11-05T11:21:05Z

Hi @SMangenot Thank you for the new EAR. Could you please detail here the changes that you have made? I'm finding it a little difficult to go through the conversation here and find everything.

Thanks

SMangenot · 2024-11-05T14:51:05Z

The Y chromosome was wrong; I made a mistake in my last map.
I aligned the X chromosome against a reference genome and it now looks correct.
I followed @talioto's instructions for SUPER_13.
I've organized scaffold_32, scaffold_37, scaffold_58, scaffold_34 (SUPER_22) to reconstruct the Y chromosome, but the alignments against a reference don't seem conclusive.

additive3 · 2024-11-05T16:55:48Z

I wouldn't necessarily expect to see alignment between Y, esp. from different species.
While gene content is likely conserved, copy number, structure and composition are likely quite different.

A new map with the Y chromosome tagged

erga-ear-bot · 2024-11-07T08:27:07Z

Attention @talioto, the EAR PDF was updated.

tbrown91 · 2024-11-08T15:32:26Z

Thanks @SMangenot

@talioto @additive3 let's try to get this one finalised. I don't see much more room for improvement

talioto

Go ahead. There is not much else to improve. We really need higher Hi-C coverage from Genoscope so that we can scaffold and curate better and spend less time doing it. The agreed-upon target is 50x coverage minimum. For bad libraries we go to 100x.
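
To make the coverage target concrete, here is a back-of-the-envelope sketch. It assumes paired-end reads, so sequenced bases = 2 x read pairs x read length, and coverage = sequenced bases / genome size. The 2 Gb genome size and the 150 bp read length are placeholder values chosen for illustration, not figures from this assembly, and the function names are hypothetical.

```python
import math


def hic_coverage(read_pairs: int, read_length: int, genome_size: int) -> float:
    """Raw sequencing coverage from paired-end reads (no filtering applied)."""
    return 2 * read_pairs * read_length / genome_size


def pairs_needed(target_coverage: float, read_length: int, genome_size: int) -> int:
    """Smallest number of read pairs that reaches the target raw coverage."""
    return math.ceil(target_coverage * genome_size / (2 * read_length))


GENOME_SIZE = 2_000_000_000  # placeholder: 2 Gb, NOT the measured genome size
READ_LENGTH = 150            # placeholder: 150 bp paired-end reads

for target in (50, 100):
    print(target, pairs_needed(target, READ_LENGTH, GENOME_SIZE))
```

This counts raw bases only. The usable contact density after duplicate removal and filtering will be lower, which is presumably part of why a higher target is used for bad libraries.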

erga-ear-bot · 2024-11-10T16:20:37Z

Thanks @talioto for the review.
I will add a new reviewed species for you to the table when @tbrown91 merges the PR ;)

Congrats on the assembly @SMangenot!
Please make sure that the fasta file to upload to ENA is generated based on the final reviewed version of the assembly.

After @tbrown91 confirmation, you can start with the assembly submission to save time.
The PR will be merged only when the final version of the EAR pdf is available.

diegomics · 2024-11-10T16:52:19Z

Hi @SMangenot, out of curiosity, do you know why HiC throughput was so low?

Add Pipistrellus hanaki

2fceb6a

erga-ear-bot bot added the ERGA-BGE label Oct 3, 2024

erga-ear-bot bot assigned tbrown91 Oct 3, 2024

Add files via upload

a62676a

erga-ear-bot bot requested a review from talioto October 9, 2024 13:29

talioto requested changes Oct 14, 2024

Add files via upload

72fb781

erga-ear-bot bot requested a review from talioto October 28, 2024 07:37

Add files via upload

e12c75e

Add files via upload

53fc795

talioto approved these changes Nov 10, 2024

tbrown91 merged commit a95e7ed into ERGA-consortium:main Nov 11, 2024
2 checks passed
