wibidota

Tools for working with Valve's Dota 2 data set for the WibiData team. Presently consists only of dota_slurp.py, a tool for efficiently pulling in a lot of data.

Requirements

Python's requests library (pip install requests)
Recommended: The ujson library (pip install ujson)

You'll also need a steam api key. Get one from http://steamcommunity.com/dev/apikey and store it in the environmental variable DOTA2_API_KEY (I added mine to ~/.bash_profile).

`dota_slurp.py` Usage

Install requirements and API key as described above
Coordinate with team to decide who is grabbing which range and set up config.json accordingly.
Run python dota_slurp.py and go grab a sandwich.

The script will create files named things like matches_0-1000000.gz, matches_1000000-2000000.gz, etc. in the current directory. The files are gzip compressed.

config.json

This file is both read and written by dota_slurp.py and used to maintain state betwee runs. It's a json file containing one field, "ranges" with two element arrays containing a list of seq id ranged to retrieve.

Hence the example config.json:

{"ranges":[[1000000,2000000],[3000000,4000000]]}

will grab a block of matches from 1000000 (inclusive) to 2000000 (exclusive) then skip ahead and grab a block of matches from 3000000 to 4000000.

Known Issues and TODOs

More info while running would be nice. Currently only prints data every completed file and if those are set large, it could take a while.
Revisit config.json design. DONE
Compression, compression, compression. I'm seeing a factor of savings of over 10x using bz2 and a little under 8x using gzip. Files are pretty large (about a gigabyte for a million matches), so this is probably a good idea. DONE
It's unclear what to do to continue work later when we reach the end of match history. But this will probably need a different tool.
Obviously, and unfortunately, we don't have skill tags for the matches.
Write 5th Pick program.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
wibidota-mr		wibidota-mr
.gitignore		.gitignore
README.md		README.md
config.json		config.json
dota_slurp.py		dota_slurp.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

wibidota

Requirements

`dota_slurp.py` Usage

config.json

Known Issues and TODOs

About

Releases

Packages

Languages

sqrl/wibidota

Folders and files

Latest commit

History

Repository files navigation

wibidota

Requirements

dota_slurp.py Usage

config.json

Known Issues and TODOs

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

`dota_slurp.py` Usage

Packages