Textpresso category queries are too slow, and pap_gene already contains many gene names. Future automated extraction pipelines should populate pap_gene with gene names matching those extracted by tpc.
For now we can continue to use the list of genes from Textpresso, but for papers with a high proportion of genes (# genes in paper / total # C. elegans genes), we could remove genes mentioned only once. This would take care of high-throughput experiments. Reading genes from Postgres would still be faster, but we need to wait for a pipeline that can extract genes from full text and not only abstracts.
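
A minimal sketch of the filtering idea described above, in case it helps the discussion. The function name `filter_genes`, the cutoff `HIGH_GENE_RATIO`, and the genome size `TOTAL_C_ELEGANS_GENES` are all hypothetical placeholders; the issue does not specify a threshold or a gene count, and the input is assumed to be a flat list of gene mentions already extracted from the paper.

```python
from collections import Counter

# Assumed placeholder values: neither number comes from the issue itself.
TOTAL_C_ELEGANS_GENES = 20000  # hypothetical total gene count
HIGH_GENE_RATIO = 0.01         # hypothetical cutoff for "high number of genes"


def filter_genes(mentions, total_genes=TOTAL_C_ELEGANS_GENES,
                 max_ratio=HIGH_GENE_RATIO):
    """Return the sorted gene names to keep for one paper.

    mentions: list of gene names, one entry per mention in the paper.
    If the share of distinct genes exceeds max_ratio, genes mentioned
    only once are dropped; otherwise every gene is kept.
    """
    counts = Counter(mentions)
    ratio = len(counts) / total_genes  # distinct genes in paper / total genes
    if ratio <= max_ratio:
        return sorted(counts)
    # Likely a high-throughput paper: keep only genes mentioned more than once.
    return sorted(gene for gene, n in counts.items() if n > 1)
```

With this shape, the cutoff can be tuned against a few known high-throughput papers before settling on a value.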