Additions from fork

This fork fixes several spiders and other bugs related to Python 3 compatibility. The included spiders were confirmed to work correctly as of 2020.

Introduction

This is a firmware scraper that aims to download firmware images and associated metadata from supported device vendor websites.

Dependencies

Usage

  1. Configure the firmware/settings.py file. Comment out SQL_SERVER if metadata about downloaded firmware should not be inserted into a SQL server.

  2. To run a specific scraper, e.g. dlink:

scrapy crawl dlink

To run all scrapers, with at most 4 running in parallel, using GNU Parallel:

parallel -j 4 scrapy crawl ::: `for i in ./firmware/spiders/*.py; do basename ${i%.*}; done`
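
If GNU Parallel is not available, the same idea can be sketched in Python. This is a minimal sketch, not part of this repository: the function names `spider_names` and `crawl_all` are hypothetical, and it makes the same assumption as the shell command above, namely that each spider's name matches its file name in `firmware/spiders`. Unlike the shell loop, it skips `__init__.py`, which is not a spider.

```python
import subprocess
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path


def spider_names(spider_dir):
    """Return the sorted module names of the .py files in spider_dir.

    Assumption: each file name (minus .py) is also the spider's name.
    __init__.py is skipped because it is a package marker, not a spider.
    """
    return sorted(
        p.stem for p in Path(spider_dir).glob("*.py") if p.stem != "__init__"
    )


def crawl_all(spider_dir="./firmware/spiders", jobs=4):
    """Run `scrapy crawl NAME` for every spider, at most `jobs` at a time.

    Returns a dict mapping each spider name to its process exit code.
    """
    names = spider_names(spider_dir)

    def run(name):
        # Each worker thread blocks on one scrapy process until it exits.
        return subprocess.run(["scrapy", "crawl", name]).returncode

    with ThreadPoolExecutor(max_workers=jobs) as pool:
        return dict(zip(names, pool.map(run, names)))
```

Calling `crawl_all()` from the repository root would start the crawls. Threads are enough here because each worker only waits on an external process. If file names and spider names ever diverge, the `scrapy list` command prints the names Scrapy itself recognizes.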