check random subset of dicts to verify they match each other and the website #12
Comments
Revised plan for sanity checking due to #13 (comment):
Will do this tn!
Problems
I think I fixed the issues I found above ^ in 2485da7 and 1c0110a? === Before I re-run my script to get updated dicts: ===
|
@nguyenhphilip I'll check this issue again later/tmrw to see what changes I should make based on whatever you find, too. : )
Hm, so this is written on the UsefulNotes page: "Useful Notes articles are not tropes and are not to be included in a work's trope list. See, however, Historical Domain Character. Similarly, tropes are not to be used to describe the subject of a Useful Notes article directly. You may, however, list tropes that are commonly found in media portraying the subject." The links on the UsefulNotes page GunsOfFiction bear this out: it just links to different types of guns like 'revolver' and 'sniper rifle', but there isn't really any content inside them, which makes me think these pages aren't meant to be meta tropes. One UsefulNote, MuhammadAli, does link to other tropes that are in The World War II UsefulNote links link to other Useful Notes (i.e. 1 and 2 and 3). So maybe UsefulNotes won't be so useful for us :\
Hmmmm ok, well, that's easy to take back out! : )
Made a dict of links from the articles in every subfolder in Indices Also while spot checking ABoyAGirlAndABabyFamily using my dict and Julia's dict, it looks like Julia's script captures links that are nested in the Example folders that you have to click to expand on their website that mine doesn't. I think this is because J loops through Doing some further digging, it looks like AlwaysMale IS in Everything in Julia's FightSceneFailure dict lines up with what's in mine except for an extra link to FightSceneFailure. Looks like there are some other
Cool, super helpful! So my script is currently doing this "Maybe a better filtering strategy would be to grab all links within Allowing self-links and duplicates: I think there are analyses that could care about this, especially the duplicates part (e.g. it would be cool to make a network only showing edges above a certain threshold, like articles that connect to each other more than 10 times or whatever)... maybe we should keep duplicates but get rid of self-links?
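To make the duplicates idea concrete, here is a minimal sketch of how repeated links could become edge weights, with only the edges above a threshold kept. It assumes each dict maps an article name to a list of link targets with duplicates preserved; the function name `weighted_edges`, the `min_count` parameter, and the toy data are hypothetical and not taken from the repo's scripts.

```python
from collections import Counter


def weighted_edges(link_dicts, min_count=1):
    """Count how often each (source, target) link occurs, skipping self-links.

    link_dicts is assumed to map article name -> list of linked article
    names, duplicates included. Returns only edges seen >= min_count times.
    """
    counts = Counter()
    for source, targets in link_dicts.items():
        for target in targets:
            if target != source:  # drop self-links
                counts[(source, target)] += 1
    return {edge: n for edge, n in counts.items() if n >= min_count}


# Toy data, invented purely for illustration (not real scrape results).
example = {
    "FightSceneFailure": ["AlwaysMale", "AlwaysMale", "FightSceneFailure"],
    "AlwaysMale": ["FightSceneFailure"],
}
strong = weighted_edges(example, min_count=2)
```

With the toy data, only the pair linked twice survives the `min_count=2` cut; the self-link never counts.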
Yeah the idea i had in mind was to grab tropes that were in Main but not listed in trope_list so that we can capture as many tropes as we can. self-links and duplicates: sorry i forgot to think about this last night! yes i agree, i think having duplicates could be interesting depending on the analysis and that we probably don't care about self-links. side note: Once we do some final filtering/update of scripts I feel like we probably have a large enough sample to begin looking at some questions! :)
ok so i should make the dicts such that: (0) one dict for every trope in the masterlist of tropes, (1) include links to things not on the masterlist, which we can ignore or include as needed, (2) only include links in the Main namespace or in the masterlist of tropes, (3) include duplicates, (4) do not include self-links
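As a rough sketch of rules (1) through (4), assuming the scraper has already reduced each page to a list of `(namespace, name)` pairs in page order. That input shape, the function name `build_link_list`, and the sample data are assumptions for illustration, not what the script in the repo actually does. Rule (0) would amount to calling this once per trope on the masterlist.

```python
def build_link_list(article, raw_links, masterlist):
    """Filter one article's links according to the rules above.

    article    -- name of the trope page the links were scraped from
    raw_links  -- assumed list of (namespace, name) tuples, in page order
    masterlist -- set of names that TV Tropes labels as tropes
    """
    kept = []
    for namespace, name in raw_links:
        if name == article:
            continue  # (4) no self-links
        if namespace == "Main" or name in masterlist:
            # (1)/(2) Main-namespace links are kept even when they are
            # not on the masterlist; other namespaces only if listed.
            kept.append(name)  # (3) duplicates are preserved
    return kept


# Invented sample input, for illustration only.
masterlist = {"AlwaysMale", "ABoyAGirlAndABabyFamily"}
raw = [
    ("Main", "AlwaysMale"),
    ("UsefulNotes", "MuhammadAli"),
    ("Main", "ABoyAGirlAndABabyFamily"),
    ("Main", "AlwaysMale"),
    ("Main", "NotOnMasterlist"),
]
links = build_link_list("ABoyAGirlAndABabyFamily", raw, masterlist)
```

Here the UsefulNotes link and the self-link are dropped, the repeated AlwaysMale link is kept twice, and the Main-namespace page that is off the masterlist is retained so it can be ignored or included later.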
Remade in 026cfc1 such that:
To reiterate from elsewhere: the trope masterlist is all the articles they've labelled as tropes, https://github.com/jwzimmer/tv-tropes/tree/main/trope_list/tropes i will manually check that these pages #15 are fine to exclude (if not, I'll add them as new dicts).
@nguyenhphilip I think these dicts look reasonable - not too different from the ones we had before but with the revisions above - so at some point would you mind sanity-checking a few and making sure they look like what you expect? Thanks.
QCd random 3 articles in list
I feel confident that Julia's script got the things we wanted!
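For future spot checks, the dict-versus-dict half of the comparison could be sketched as below. It assumes both scripts produce a mapping of article name to list of links; `compare_links` and `spot_check` are hypothetical helper names, and checking against the website itself still has to be done by hand.

```python
import random
from collections import Counter


def compare_links(links_a, links_b):
    """Return the links (with multiplicity) that appear in only one list."""
    a, b = Counter(links_a), Counter(links_b)
    return {
        "only_in_a": sorted((a - b).elements()),
        "only_in_b": sorted((b - a).elements()),
    }


def spot_check(dicts_a, dicts_b, k=3, seed=None):
    """Compare k randomly chosen articles present in both sets of dicts."""
    rng = random.Random(seed)
    shared = sorted(set(dicts_a) & set(dicts_b))
    sample = rng.sample(shared, min(k, len(shared)))
    return {name: compare_links(dicts_a[name], dicts_b[name]) for name in sample}


# Invented lists for illustration: one extra duplicate on each side.
diff = compare_links(["A", "B", "B"], ["A", "B", "C"])
```

Using `Counter` subtraction rather than set difference means an extra duplicate link shows up as a mismatch, which matters since duplicates are being kept on purpose.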
Great! Thanks, @nguyenhphilip!
from issue #8 - before we close the issue, i think we should randomly spot check our results: