Codebook/ what are all the things #13
Comments
So far we are using the titles given to the trope pages as unique identifiers of a trope. We're storing them as strings in lists or dicts. We're assuming the filename of the trope article represents that article's title. (1) https://github.com/jwzimmer/tv-tropes/blob/main/trope_list/tropes/tropes_dict.json: a single list containing one dict for every trope in https://github.com/jwzimmer/tv-tropes/tree/main/trope_list/tropes; each dict holds every trope that the trope links to on its article page
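To make the convention above concrete, here is a minimal sketch of how a filename could be turned into a trope identifier and stored alongside its links. The exact layout of `tropes_dict.json` is not spelled out here, so the `{title: [linked titles]}` shape, the `.html` extension, and the function names `title_from_filename` and `build_link_records` are all assumptions for illustration.

```python
from pathlib import Path


def title_from_filename(filename):
    """Use the article's filename (minus its extension) as the trope identifier."""
    return Path(filename).stem


def build_link_records(links_by_file):
    """Build a list with one dict per trope: {title: [titles it links to]}.

    ASSUMPTION: this one-key-per-dict layout is a guess at the structure,
    not the confirmed format of tropes_dict.json.
    """
    return [
        {title_from_filename(name): sorted(set(links))}
        for name, links in sorted(links_by_file.items())
    ]


# Hypothetical input: links scraped from a single article file.
records = build_link_records(
    {"ChekhovsGun.html": ["Foreshadowing", "BrickJoke", "Foreshadowing"]}
)
print(records)
```

Deduplicating and sorting the links is a design choice of this sketch; it makes two runs of a scraper easier to diff, but it discards how often a link appears.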
It turns out (6) is definitely not equivalent to (1); we interpreted these things differently. That might turn out to be good, though: it complicates sanity checking a little, but it lets us compare what the community of contributors explicitly thinks of as a related trope with what they relate the trope to while writing about it.
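One way to look at that difference for a single trope is a plain set comparison. This is only a sketch: the two input lists stand in for the explicitly listed related tropes and the tropes linked in the article text, and the function name and result keys are made up for illustration.

```python
def compare_link_sets(listed_related, linked_in_text):
    """Split two link lists into shared, listed-only, and text-only titles."""
    listed = set(listed_related)
    linked = set(linked_in_text)
    return {
        "both": sorted(listed & linked),
        "only_listed": sorted(listed - linked),
        "only_linked": sorted(linked - listed),
    }


# Hypothetical titles, not real data from either source.
result = compare_link_sets(["A", "B"], ["B", "C"])
print(result)
```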
Relevant to issue #7 too:
Now my script in the pic above also captures links that are in lists in the main article, not just links in the paragraphs.
After more discussion, comparison, etc., we have decided that, for our purposes, a "trope" is a page in https://github.com/jwzimmer/tv-tropes/tree/main/trope_list/tropes (this is the list they've identified as being tropes at https://tvtropes.org/pmwiki/pagelist_having_pagetype_in_namespace.php?n=Main&t=trope, reached via https://tvtropes.org/pmwiki/pmwiki.php/Administrivia/NotATrope > https://tvtropes.org/pmwiki/pmwiki.php/Main/Trope > https://tvtropes.org/pmwiki/pmwiki.php/Main/Tropes > https://tvtropes.org/pmwiki/pagelist_having_pagetype_in_namespace.php?n=Main&t=trope).
All the pages in that folder (so equivalent to the masterlist) are in: https://github.com/jwzimmer/tv-tropes/blob/main/in_Masterlist.json
All the pages in the Main folder (so including tropes not in the masterlist, metatropes, indices, and other article types) are in: https://github.com/jwzimmer/tv-tropes/blob/main/in_pmwiki_Main.json
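Under that definition, checking whether a page from Main counts as a trope is a membership test against the masterlist. The sketch below assumes both JSON files are flat lists of title strings, which is not confirmed here; the inline JSON strings are hypothetical stand-ins for the real files.

```python
import json


def split_by_masterlist(main_titles, masterlist_titles):
    """Separate Main pages into masterlist tropes and everything else."""
    masterlist = set(masterlist_titles)
    tropes = [t for t in main_titles if t in masterlist]
    other = [t for t in main_titles if t not in masterlist]
    return tropes, other


# ASSUMPTION: stand-ins for in_pmwiki_Main.json and in_Masterlist.json,
# each treated as a flat JSON list of page titles.
main_titles = json.loads('["ChekhovsGun", "Tropes", "SisterTrope", "BrickJoke"]')
masterlist_titles = json.loads('["ChekhovsGun", "BrickJoke"]')

tropes, other = split_by_masterlist(main_titles, masterlist_titles)
print(tropes)
print(other)
```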
Description of the dicts that are the links within each trope article: #12 (comment)
A GML file (for Gephi) of the network in the Sister Tropes page (https://tvtropes.org/pmwiki/pmwiki.php/Main/SisterTrope). There are unweighted, undirected edges between every pair of tropes given as "sister tropes" in the Examples section of the page; only tropes that are in the trope masterlist are included. A version with all the links given in the Examples section, whether they're tropes from the masterlist or not, is at: https://github.com/jwzimmer/tv-tropes/blob/main/sistertropes.gml
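The pairing rule described above can be sketched roughly as follows. This is not the script that produced the GML file; it assumes each example on the page has already been parsed into a list of trope titles, and `sister_edges` and `to_gml` are hypothetical helper names. The GML writer is deliberately minimal and does not escape quotes inside titles.

```python
from itertools import combinations


def sister_edges(groups, masterlist=None):
    """Return one undirected edge per pair of tropes sharing an example.

    If a masterlist is given, titles outside it are dropped first.
    """
    allowed = None if masterlist is None else set(masterlist)
    edges = set()
    for group in groups:
        members = sorted({t for t in group if allowed is None or t in allowed})
        # Sorting first means each pair always appears as (smaller, larger),
        # so the set removes duplicates across examples.
        edges.update(combinations(members, 2))
    return sorted(edges)


def to_gml(edges):
    """Serialize an unweighted, undirected edge list as minimal GML text."""
    nodes = sorted({name for edge in edges for name in edge})
    ids = {name: i for i, name in enumerate(nodes)}
    lines = ["graph [", "  directed 0"]
    for name in nodes:
        lines.append(f'  node [ id {ids[name]} label "{name}" ]')
    for a, b in edges:
        lines.append(f"  edge [ source {ids[a]} target {ids[b]} ]")
    lines.append("]")
    return "\n".join(lines)


# Hypothetical sister-trope groups; "X" is not in the masterlist.
groups = [["A", "B", "C"], ["B", "A"], ["C", "X"]]
print(to_gml(sister_edges(groups, masterlist=["A", "B", "C"])))
```

Calling `sister_edges(groups)` without a masterlist corresponds to the unfiltered version of the network.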
A GML file (for Gephi) of the network in the Super Tropes page (https://tvtropes.org/pmwiki/pmwiki.php/Main/SuperTrope). There are unweighted, undirected edges between a "Super Trope" root node and each example given in the "samples" section of the page, and then from each example to its listed subtropes. The edgelist is listed in #16 (comment). (I included super tropes NOT in the trope masterlist; I did not include subtropes that were not in the trope masterlist.)
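A rough sketch of that edge rule, assuming the "samples" section has already been parsed into a mapping from each super trope to its listed subtropes. The function name, the input shape, and the example titles are hypothetical; the asymmetric filter (keep every super trope, keep only masterlist subtropes) follows the note above.

```python
def super_trope_edges(samples, masterlist, root="Super Trope"):
    """Build root -> super trope and super trope -> subtrope edges."""
    allowed = set(masterlist)
    edges = []
    for super_trope, subtropes in samples.items():
        # Super tropes are kept even when they are not in the masterlist.
        edges.append((root, super_trope))
        for sub in subtropes:
            # Subtropes are kept only when they are in the masterlist.
            if sub in allowed:
                edges.append((super_trope, sub))
    return edges


# Hypothetical parsed samples; "SuperA" and "NotListed" are not in the masterlist.
samples = {"SuperA": ["Sub1", "NotListed"], "SuperB": ["Sub2"]}
edges = super_trope_edges(samples, masterlist=["Sub1", "Sub2", "SuperB"])
print(edges)
```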
The folder https://github.com/jwzimmer/tv-tropes/tree/main/Stanford_Neighborhoods has CSV files for the neighborhoods found in https://dhs.stanford.edu/social-media-literacy/tvtropes-pt-2-trope-but-not-troper-communities/
The list of all the tropes (in the masterlist) and the tropes they link to: github.com/jzimmer/tv-tropes/all-tropes-with-links.json
For imperfect answers to questions from the Datasheets for Datasets paper (https://arxiv.org/abs/1803.09010), see jwzimmer-zz/tv-tropening#3 (comment).
It probably is a good idea to keep track of what everything is so we don't forget, like Prof Cheney said this morning.