tsm_test2.py
# Test script for the TSM Python module by Deen Freelon.
#
# For all functions to operate as intended, you need at least one CSV file
# containing Twitter data to use as input. You can use the same file for
# tweet_file_A and tweet_file_B, in which case match_communities compares two
# separate runs of the Louvain method on that file.
#
# Unfortunately, Twitter's terms of service prevent me from making such files
# publicly available. However, you can pull your own Twitter data using Martin
# Hawksey's excellent Google-spreadsheet-based TAGS system, which requires no code:
# http://mashe.hawksey.info/2013/02/twitter-archive-tagsv5/
#
# For instructions on how to format your data files, and for a description of
# what the functions below output, see tsm.py at https://github.com/dfreelon/TSM
import tsm
# Replace the next two paths with the locations of your own TSM-formatted CSV files.
tweet_file_A = './path/to/formatted_tsm_file_A.csv'
tweet_file_B = './path/to/formatted_tsm_file_B.csv'
# Convert each tweet file into an edge list.
edgelist_A = tsm.t2e(tweet_file_A)
edgelist_B = tsm.t2e(tweet_file_B)
# Find the top communities in each network.
top_communities_A = tsm.get_top_communities(edgelist_A)
top_communities_B = tsm.get_top_communities(edgelist_B)
# Compute E-I indices for the top communities.
ei_indices_A = tsm.calc_ei(top_communities_A.node_list, edgelist_A, 'ON')
ei_indices_B = tsm.calc_ei(top_communities_B.node_list, edgelist_B, 'ON')
# Collect the top retweets for the top communities.
top_rts_A = tsm.get_top_rts(tweet_file_A, top_communities_A.node_list)
top_rts_B = tsm.get_top_rts(tweet_file_B, top_communities_B.node_list)
# Match the communities found in A against those found in B.
community_matches = tsm.match_communities(top_communities_A.node_list, top_communities_B.node_list)
# Identify intermediaries in each network.
intermediaries_A = tsm.get_intermediaries(top_communities_A.node_list, edgelist_A)
intermediaries_B = tsm.get_intermediaries(top_communities_B.node_list, edgelist_B)
# Collect the top hashtags and links for the top communities.
top_hashtags_A = tsm.get_top_hashtags(tweet_file_A, top_communities_A.node_list)
top_hashtags_B = tsm.get_top_hashtags(tweet_file_B, top_communities_B.node_list)
top_links_A = tsm.get_top_links(tweet_file_A, top_communities_A.node_list)
top_links_B = tsm.get_top_links(tweet_file_B, top_communities_B.node_list)
# Build the shared-ties grids from the E-I results.
grid_A = tsm.shared_ties_grid(ei_indices_A, calc_propor=True)
grid_B = tsm.shared_ties_grid(ei_indices_B, calc_propor=True)
print("Now feel free to inspect the variables...")