Inquiry and question about data usage in pretraining the foundation model #56

wuwusky · 2024-09-18T01:35:33Z

Thank you for the excellent work!
I am interested in learning more about the pretraining process. Based on the paper, it appears that the scFoundation model was trained exclusively on single-cell gene expression matrices, including the count values calculated internally and the RDA training strategy. Could you clarify if any metadata or additional prior knowledge was incorporated during the pretraining process?

WhirlFirst · 2024-12-03T03:54:27Z

Hi, thank you for your interest! We didn't use any metadata or prior knowledge for pre-training.

wuwusky closed this as completed Dec 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Inquiry and question about data usage in pretraining the foundation model #56

Inquiry and question about data usage in pretraining the foundation model #56

wuwusky commented Sep 18, 2024

WhirlFirst commented Dec 3, 2024

Inquiry and question about data usage in pretraining the foundation model #56

Inquiry and question about data usage in pretraining the foundation model #56

Comments

wuwusky commented Sep 18, 2024

WhirlFirst commented Dec 3, 2024