Crawler test is a web application developed in Python that extracts image URLs from the web. It is built on Scrapy, a high-level web crawling and web scraping framework. All further details can be found here.
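For orientation, a Scrapy spider that extracts image URLs typically looks like the minimal sketch below. The spider name, class, and output field are illustrative assumptions, not this repository's actual code.

```python
# Illustrative sketch only: a minimal Scrapy spider that extracts image URLs.
# The spider name, class, and field names are assumptions, not this repo's code.
import scrapy


class ImageUrlSpider(scrapy.Spider):
    name = "image_urls"
    # In the real application the URLs to crawl are supplied per job;
    # one is hard-coded here only to keep the sketch self-contained.
    start_urls = ["https://golang.org/"]

    def parse(self, response):
        # Collect every <img src=...> on the page, resolved to an absolute URL.
        for src in response.css("img::attr(src)").getall():
            yield {"image_url": response.urljoin(src)}
```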
To run the test on your localhost:
$ git clone https://github.com/bamal/MIMS-test.git
$ cd MIMS-test
$ pip install virtualenv
$ virtualenv venv
$ source venv/bin/activate
$ pip install -r requirements.txt
$ python -m flask run
Open another Terminal and start the scrapyd server (by default it runs on port 6800).
$ cd MIMS-test
$ source venv/bin/activate
$ scrapyd
This application has three endpoints:
- To start crawling your URLs and parsing their image URLs, this is an example of a POST request:
curl -X POST 'http://localhost:5000/jobs' -H "Content-Type: application/json" -d '{"urls":["http://4chan.org/", "https://golang.org/"], "workers":2}'
- To get the status of your job, this is an example of a GET request:
curl -X GET 'http://localhost:5000/jobs/c25b020a19b011ecb282a504b09ff6d8/status'
- To get the result of your job, this is an example of a GET request:
curl -X GET 'http://localhost:5000/jobs/c25b020a19b011ecb282a504b09ff6d8/result'
Here, jobid = c25b020a19b011ecb282a504b09ff6d8.
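The same workflow can also be scripted end to end. The sketch below mirrors the curl examples above; the endpoint paths and request payload come from this README, while the response handling and polling are illustrative assumptions.

```python
# Minimal client sketch mirroring the curl examples in this README.
# Response shapes and the polling loop are assumptions for illustration.
import time
import requests

BASE = "http://localhost:5000"

# Submit a crawl job (same payload as the POST example above).
submit = requests.post(
    f"{BASE}/jobs",
    json={"urls": ["http://4chan.org/", "https://golang.org/"], "workers": 2},
)
print("submit response:", submit.json())

# Assuming the response contains a job id, poll its status and fetch the result.
jobid = "c25b020a19b011ecb282a504b09ff6d8"  # example id from this README
for _ in range(10):
    status = requests.get(f"{BASE}/jobs/{jobid}/status").json()
    print("status:", status)
    time.sleep(2)

result = requests.get(f"{BASE}/jobs/{jobid}/result").json()
print("result:", result)
```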