Skip to content
This repository has been archived by the owner on Mar 12, 2019. It is now read-only.
/ arachas Public archive

Gwent web crawler to extract card data from popular gwent community websites.

License

Notifications You must be signed in to change notification settings

GwentAPI/arachas

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

43 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

arachas

As of 14/07/17 Arachas is no longer being maintained

Arachas is a web crawler that is capable of extracting card data related to the GWENT®: The Witcher Card Game from community websites. Currently it only crawl the popular gwentify website.

The extracted data is then saved in a json file. It's also capable of downloading the card images but it will not do it by default.

Arachas have rudimentary diff capabilities which allow a user to keep track of what changed between consecutive runs. It will print a message if a card was added/removed or modified (it can't make the distinction by itself).

Dependencies

You will need Python 3.6.

Install the dependencies:

pip install beautifulsoup4
pip install requests
pip install pytz
pip install termcolor
pip install unidecode

How to use

python arachas.py

If you want to download the full sized card images:

python arachas.py --image

The images will be saved under ./media.

If you want to save the output data under a different name:

python arachas.py --output <name>

The file is saved under the local directory.

About

Gwent web crawler to extract card data from popular gwent community websites.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages