Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Write single script that runs the ner_text_extraction pipeline end to end #20

Closed
Tracked by #16
jmelot opened this issue Nov 5, 2023 · 1 comment
Closed
Tracked by #16
Assignees

Comments

@jmelot
Copy link
Owner

jmelot commented Nov 5, 2023

We have a description of the method in TheStackDataset.md but I don't think we have any code committed to this repo that allows us to run this end to end. Given a directory of jsonls from The Stack, can we write a python script that does the affiliation extraction and (using #23) outputs mappings between rors and software? Would you be interested in contributing this @dtkaczyk ?

@jmelot jmelot changed the title Write single script that executes the ner_text_extraction pipeline end to end Write single script that runs the ner_text_extraction pipeline end to end Nov 5, 2023
@jmelot jmelot added this to the Paper submission milestone Nov 5, 2023
@dtkaczyk dtkaczyk self-assigned this Nov 7, 2023
dtkaczyk added a commit that referenced this issue Nov 14, 2023
dtkaczyk added a commit that referenced this issue Nov 14, 2023
@jmelot
Copy link
Owner Author

jmelot commented Dec 7, 2023

Closed via #28

@jmelot jmelot closed this as completed Dec 7, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants