Ruby Capstone Project of Microverse, which students have to complete a real-world-like project within 72 hours according to this project specifications
It's a 3-in-1 Web Scraper, which allows users to parse all courses from udacity.com and jobs from indeed.com and remote.io into CSV file.
Job Scraped from Remote.io Job Scraped from indeed.com Courses Scraped from udacity.com- Ruby
- Nokogiri gem
- HTTParty gem
├── README.md
├── bin
│ └── main.rb
└── lib
└── scraper.rb
└── udacity_scraper.rb
└── indeed_scraper.rb
└── remoteio_scraper.rb
└── rspec
└── scraper_spec.rb
└── spec_helper.rb
Feel free to check out this link for a 3min video walkthrough :)
- Git clone this repo and cd the to the
web_scraper
directory. - Run
bundle install
in command line to install Nokogiri and HTTParty Gem. - Run
bin/main.rb
. - Input either 'udacity', 'indeed', or 'remote.io' and follows the respective commands.
- Tada! 'udacity_courses.csv', 'indeed_jobs.csv', or 'remote_io.csv' would be created at the root directory respectively :)
- Git clone this repo and cd the to the
web_scraper
directory. - Install rspec with
gem install rspec
. - Run
rspec
in Command Line. - You would see failures because all 3 scraped files haven't been created yet.
- To solve it, run
ruby bin/main.rb
and input 'udacity', 'indeed', and 'remote.io' for every execution. - Run
rspec
in CLI again. The test cases would success upon each file created :)
👤 Kyle Law
Contributions, issues and feature requests are welcome!
Feel free to check the issues page.
Give a ⭐️ if you like this project!
- Microverse
- Nokogiri gem
- HTTParty Parser
- Udacity.com
- Indeed.com
- Remote.io
This project is MIT licensed.