Web Scraping Resources

Developer Tools

To lift the hood on a website and see how it works, you'll need to use your browser's developer tools.

Python libraries

Here are a few of the most popular workhorses in the Python community for web scraping tasks:

  • requests to fetch web pages/files
  • BeautifulSoup for parsing HTML
  • lxml for more serious HTML/XML parsing firepower (it's a "heavy" install but offers more speed and features than BeautifulSoup)
  • Selenium for stateful web scraping and working with dynamic page content (among other thorny issues)
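
As a rough illustration of how the first two libraries fit together, here is a minimal sketch: requests downloads a page and BeautifulSoup pulls the links out of it. The helper names (`fetch_html`, `extract_links`) and the sample markup are made up for this example, and it uses Python's built-in `html.parser` rather than lxml so nothing beyond the two libraries needs installing.

```python
import requests
from bs4 import BeautifulSoup


def fetch_html(url: str, timeout: float = 10.0) -> str:
    """Download a page and return its HTML text, raising on HTTP errors."""
    response = requests.get(url, timeout=timeout)
    response.raise_for_status()
    return response.text


def extract_links(html: str) -> list[tuple[str, str]]:
    """Return (link text, href) pairs for every <a> tag that has an href."""
    soup = BeautifulSoup(html, "html.parser")
    return [
        (a.get_text(strip=True), a["href"])
        for a in soup.find_all("a", href=True)
    ]


# Hypothetical sample markup, used here so the parsing step runs offline.
SAMPLE_HTML = """
<html><body>
  <h1>Reports</h1>
  <a href="/reports/2023.csv">2023 report</a>
  <a href="/reports/2024.csv">2024 report</a>
  <a name="top">No href here</a>
</body></html>
"""

links = extract_links(SAMPLE_HTML)
for text, href in links:
    print(text, href)
```

To try it on a live page, pass the result of `fetch_html(...)` to `extract_links` with a URL you are allowed to scrape. Swapping `"html.parser"` for `"lxml"` should work the same way if lxml is installed.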

Relevant Reading