including invalid transcripts in training data #461

sparthib · 2025-01-27T18:51:57Z

Hi, I have a question about how XGboost is used to train TPS prediction for read classes. I can see how true transcripts are readily available from a reference annotation (y_ij = 1) ? But it is unclear to me how invalid transcript observations are input for training (y_ij = 0) ?

Thanks,
Sowmya

cying111 · 2025-02-03T05:27:32Z

Hi @sparthib ,

As described in our paper, TPS prediction for all RCs is performed using a supervised machine learning algorithm. During training, the labels for these RCs are determined based on whether their intron junctions align exactly with those of annotated transcripts—in other words, only annotated reference transcripts are used for labeling during training. For more details, please check out our paper here.

I hope this clarifies your question! Let me know if you need any further clarification.

Thank you
Warm regards,
Ying

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

including invalid transcripts in training data #461

including invalid transcripts in training data #461

sparthib commented Jan 27, 2025 •

edited

Loading

cying111 commented Feb 3, 2025

including invalid transcripts in training data #461

including invalid transcripts in training data #461

Comments

sparthib commented Jan 27, 2025 • edited Loading

cying111 commented Feb 3, 2025

sparthib commented Jan 27, 2025 •

edited

Loading