Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Index page dicts - links to masterlist tropes only #18

Closed
jwzimmer-zz opened this issue Nov 2, 2020 · 4 comments
Closed

Index page dicts - links to masterlist tropes only #18

jwzimmer-zz opened this issue Nov 2, 2020 · 4 comments
Assignees

Comments

@jwzimmer-zz
Copy link
Owner

jwzimmer-zz commented Nov 2, 2020

From: #17

Make new dicts from the index pages (need a masterlist of indices too) that only include tropes that are in our masterlist.

@jwzimmer-zz
Copy link
Owner Author

Masterlist of indices - things in main (https://tvtropes.org/pmwiki/index_report.php) that aren't listed as tropes in our tropes masterlist?

@nguyenhphilip
Copy link
Collaborator

Done! Script here: https://github.com/jwzimmer/tv-tropes/blob/main/pull-index.ipynb

Index master dictionary as well as individual files: https://github.com/jwzimmer/tv-tropes/tree/main/index-list

Have ~4218 indices. Will probably need some way to filter these as we likely don't want this many in our visualization.

@jwzimmer-zz
Copy link
Owner Author

Oh whoops I did not see your comment here, my bad! This is great, thanks!

So this is: for every entry in Main Indices (https://tvtropes.org/pmwiki/index_report.php), if it links to tropes in our masterlist of tropes, then we now have a dictionary of those tropes? (In other words, no indices that have no links to tropes in our masterlist, and no tropes that are not in our masterlist?)

What about the case when an index linked to another index (if that happened ever)?

@jwzimmer-zz
Copy link
Owner Author

Resolved by @nguyenhphilip

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants