Never had such a pure crawler like this nacf.
Although I often write crawlers, I don’t like to use huge frameworks such as scrapy; I prefer
the simple requests+bs4 or the more general requests_html. However, both are inconvenient for a
crawler: things such as error retrying or parallel crawling have to be written by hand. That is
not very difficult, but writing it again and again gets tedious. Hence I started writing nacf
(Nasy Crawler Framework), hoping to simplify error retrying and parallel crawling when writing
crawlers.
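
To make the tedium concrete, here is a minimal sketch of the kind of boilerplate that usually gets handwritten around requests. It is not nacf's API: the names `retry` and `crawl_all` are hypothetical and use only the Python standard library.

```python
import time
from concurrent.futures import ThreadPoolExecutor
from functools import wraps


def retry(times=3, delay=0.0, exceptions=(Exception,)):
    """Hypothetical helper: call the wrapped function up to `times` times in total."""
    def decorator(func):
        @wraps(func)
        def wrapper(*args, **kwargs):
            for attempt in range(1, times + 1):
                try:
                    return func(*args, **kwargs)
                except exceptions:
                    # Out of attempts: re-raise the last error to the caller.
                    if attempt == times:
                        raise
                    time.sleep(delay)
        return wrapper
    return decorator


def crawl_all(fetch, urls, workers=4):
    """Hypothetical helper: run `fetch` on every URL in a thread pool.

    Results are returned in the same order as `urls`.
    """
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(fetch, urls))
```

With requests installed, something like `crawl_all(retry(times=3, delay=1.0)(requests.get), urls)` would fetch a list of pages in parallel and retry each failed request. Every crawler ends up repeating a variation of this, which is the part nacf hopes to simplify.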
Package | Version | Description |
---|---|---|
requests-html | 0.10.0 | HTML Parsing for Humans. |
nalude | 0.3.0 | A standard module. Inspired by Haskell’s Prelude. |
See tests.