Skip to content

finished assignment #2

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 2 commits into
base: master
Choose a base branch
from
Open

finished assignment #2

wants to merge 2 commits into from

Conversation

smortime
Copy link

In line 20 of tests.py, I changed the file path to '../data/us_cities.geojson' so that it will move to the parent directory then into data.'

smortime added 2 commits February 22, 2016 15:53
if (i['properties']['pop_max'] > max_population):
max_population = i['properties']['pop_max']
city = i['properties']['name']

return city, max_population
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I image that something along these lines, but more robust will be part of your twitter work?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Kind of, so far I am pulling all the tweets from the the Twitter API and parsing the JSON for a handful of attributes and then writing them to a .csv (one column per tweet). However, when I start pulling from the Twitter Stream, it might be beneficial to just save summary stats from the JSON instead of every individual tweet, since I am not interested in everything from the stream data.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I pulled maybe 30 days of geoloacated data off the twitter stream. The approach I landed on was to use MongoDB (it has a good Python API) to store the data (the raw tweet). Then I analyzed the data in an iPython notebook using a combination of MongoDB database queries and raw Python. That might be a viable workflow that preserves the raw data?

@jlaura
Copy link
Contributor

jlaura commented Mar 1, 2016

👍

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants