Ahmia is the search engine for .onion
domains on the Tor anonymity
network. It is led by Juha Nurmi and is based
in Finland. This repository contains crawlers used by Ahmia search engine.
Ahmia-index should be installed and running
python3 -m virtualenv venv3
source venv3/bin/activate
pip install -r requirements.txt
Look fleet installation here.
ahmia/ahmia/example.env
contains some default values that should work out of the box.
Copy this to .env
to create your own instance of environment settings:
cp ahmia/ahmia/example.env ahmia/ahmia/.env
In order to execute the crawler to run permanently:
source venv/bin/activate
./run.sh &> crawler.log
scrapy crawl ahmia-tor -s DEPTH_LIMIT=1 -s LOG_LEVEL=DEBUG
or
scrapy crawl ahmia-tor -s DEPTH_LIMIT=1 -O items.json:json
or
scrapy crawl ahmia-tor -s DEPTH_LIMIT=3
# Every day
PATH=/usr/local/bin:/usr/bin:/bin:/home/juha/.local/bin
37 09 * * * cd /home/juha/ahmia-crawler/ && bash run_daily.sh > ./daily.log 2>&1
# First day of each month
PATH=/usr/local/bin:/usr/bin:/bin:/home/juha/.local/bin
30 01 01 * * cd /home/juha/ahmia-crawler/ && bash run.sh > ./monthly.log 2>&1