Skip to content

CryptoPunk/SecurityCrawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SecurityCrawler

Crawl the web for fun stuff

Scrapers

  • Implemented

  • HTML5

  • svg unimplemented due to confusion on xlink in html5

  • XHTML1.1 (partial)

  • TODO: Alert for unrecognized namespaces

  • Sitemap.xml (partial)

  • robots.txt

  • Unimplemented

  • https://developer.mozilla.org/en-US/docs/XML_in_Mozilla

  • Crossdomain.xml

  • SVG

  • PDF (Open that can of worms)

  • Xforms

  • Javascript

  • MS Office Files (Even worse, so many incompatable versions)

  • Open Office Files

  • Flash

  • WSDL

  • Crawler should use gevent.queue.JoinableQueue

About

Crawls the web for forms and resources.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages