============= Overview:

A spider to scrape mashape.com.

mashape.com is a marketplace to consume, distribute, manage, and monitor both private and public APIs from developers all over the world.

This project will create a spider to fetch the interfaces of the public APIs on mashape.com and store the data.

============= Requirements:

Scrapy: To install Scrapy, follow the official guide: http://doc.scrapy.org/en/0.24/intro/install.html

Make sure its dependency libraries are installed and working.

Selenium: Selenium is needed to scrape the JS-rendered content. Steps for setting up the Selenium environment:

a. Download the Selenium server (selenium-server-standalone-2.37.0.jar) from http://www.seleniumhq.org/download/

b. Install the Selenium WebDriver extension for Safari (or another web browser). Note: the WebDriver extension is not listed in Apple Extensions; a workaround is to install it manually, see: https://code.google.com/p/selenium/issues/detail?id=7933

c. Install the Selenium Client & WebDriver Language Bindings for Python: pip install selenium

=============== mashape_spider

Usage: scrapy crawl mashape -a readurl=https://www.mashape.com/george-vustrey/ultimate-weather-forecasts

It crawls the given URL and collects all of the REST APIs listed on that page. Sample output: api.json
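A malformed `readurl` is easy to pass by accident, so it can help to check the argument before crawling. The sketch below is not part of this project: `split_api_url` is a hypothetical helper, and it assumes that an API page URL always has the form `/<owner>/<api>`, as in the usage example above.

```python
from urllib.parse import urlparse


def split_api_url(readurl):
    """Split a mashape API page URL into (owner, api_name).

    Hypothetical helper: assumes the path has exactly two segments,
    as in the README's example URL.
    """
    parsed = urlparse(readurl)
    if parsed.netloc not in ("www.mashape.com", "mashape.com"):
        raise ValueError(f"not a mashape.com URL: {readurl}")
    # Drop empty segments produced by leading or trailing slashes.
    segments = [s for s in parsed.path.split("/") if s]
    if len(segments) != 2:
        raise ValueError(f"expected /<owner>/<api> in path: {readurl}")
    return segments[0], segments[1]


if __name__ == "__main__":
    url = "https://www.mashape.com/george-vustrey/ultimate-weather-forecasts"
    print(split_api_url(url))
```

A check like this could run at spider start-up so that a listing URL is rejected early instead of producing an empty api.json.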

================ urls

Usage: scrapy crawl urls -a pageno=10

It crawls "https://www.mashape.com/explore?sort=developers&page=10" and fetches the URLs of all the apps listed on that page. Sample output: urls.txt
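The mapping from `pageno` to the listing URL can be sketched as follows. This is an illustration and not the spider's actual code: `build_explore_url` and `EXPLORE_URL` are hypothetical names, and the query layout is assumed from the single example URL above.

```python
from urllib.parse import urlencode

# Base listing URL taken from the usage example; treated as an assumption here.
EXPLORE_URL = "https://www.mashape.com/explore"


def build_explore_url(pageno, sort="developers"):
    """Return the listing URL for one page (hypothetical helper)."""
    # Command-line values arrive as strings, so normalise to an integer first.
    query = urlencode({"sort": sort, "page": int(pageno)})
    return f"{EXPLORE_URL}?{query}"


if __name__ == "__main__":
    print(build_explore_url("10"))
```

Calling the helper in a loop over page numbers would be one way to cover the whole listing, which is the first item under Next Step.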

============== Next Step:

Make the spider fetch the whole set of mashape APIs; store the data in MongoDB; test and validate on Chrome.

