Skip to content

Latest commit

 

History

History
59 lines (44 loc) · 1.11 KB

scrape_params.md

File metadata and controls

59 lines (44 loc) · 1.11 KB

Scrape Information

Search

Base URL: http://www.petharbor.com/results.asp

URL Parameters

  • searchtype - LOST
  • friends - 1
  • samaritans - 1
  • nosuccess - 0
  • rows - 1000 (can be arbitrarily increased so don’t have to deal with pagination, currently ~1000 animals in system)
  • imght - 120
  • imgres - thumb
  • view - sysadm.v_animal
  • shelterlist - %27ASTN%27
  • atype - cat, dog
  • page - 1
  • where -
    • type_x
      x = CAT, DOG

    • gender_x
      x = m, f (male, female)

    • size_x
      x = s, m, l (small, medium, large)

    • age_x
      x = y, o (young < 1 year, old > 1 year)

    • color_x
      x = b, br, w (black, brown, white)

    • breed_x
      x = breed name with spaces replaced with %20

      Cat breed names

      Dog breed names

Data to scrape and save

  • Name
  • ID from shelter
  • Gender
  • Color
  • Breed - can be split into primary and secondary
  • Age - can be split into years, months and days
  • Found date

Pictures

Base URL: http://www.petharbor.com/get_image.asp

URL Parameters

  • RES - thumb
  • ID - from search scrape
  • LOCATION - ASTIN