This repository has been archived by the owner on Jun 20, 2022. It is now read-only.

HouseSpider


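To be honest, this is only a few lines of code with little technical depth. The crawl itself is fairly crude: it collects coarse data, and the finer-grained information extraction is done locally afterwards.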

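First, use a browser to find the listing page for the city you are interested in, then copy its link into the corresponding place in the code, for example: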

start_urls = ['http://esf.nanjing.fang.com/house/h316/']

How to run:

Crawl data from Fang.com ("房天下"):

scrapy runspider ftx.py -o ftx.csv

Crawl data from Lianjia (链家):

scrapy runspider lianjia.py -o lianjia.csv

For details on how to extract information and analyze it locally, see:

Fang.com (房天下) information extraction

Lianjia (链家) information extraction
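
As a rough illustration of what "local extraction" could look like, here is a minimal sketch that post-processes a CSV written by `scrapy runspider ... -o ftx.csv`. The column names (`title`, `raw`, `price`), the sample rows, and the text layout inside `raw` are all assumptions made up for this example; the real spider output may differ, so adapt the patterns to the actual file.

```python
import csv
import io
import re

# Hypothetical sample of coarse spider output. The real columns and text
# layout of ftx.csv are not documented here; these rows are invented.
SAMPLE_CSV = """title,raw,price
Example listing A,3室2厅 | 89.5㎡ | 南北向,210万
Example listing B,2室1厅 | 60㎡ | 南向,135.5万
"""

ROOMS_RE = re.compile(r"(\d+)室(\d+)厅")        # e.g. "3室2厅"
AREA_RE = re.compile(r"(\d+(?:\.\d+)?)㎡")      # e.g. "89.5㎡"
PRICE_RE = re.compile(r"(\d+(?:\.\d+)?)万")     # e.g. "210万"


def parse_row(row):
    """Turn one coarse CSV row into structured fields (None if not found)."""
    rooms = ROOMS_RE.search(row["raw"])
    area = AREA_RE.search(row["raw"])
    price = PRICE_RE.search(row["price"])
    return {
        "title": row["title"],
        "bedrooms": int(rooms.group(1)) if rooms else None,
        "living_rooms": int(rooms.group(2)) if rooms else None,
        "area_m2": float(area.group(1)) if area else None,
        "price_wan": float(price.group(1)) if price else None,
    }


def parse_csv(text):
    """Parse CSV text with a header row into a list of structured listings."""
    return [parse_row(r) for r in csv.DictReader(io.StringIO(text))]


listings = parse_csv(SAMPLE_CSV)
for item in listings:
    print(item)
```

To run this against a real file, read it with `open("ftx.csv", encoding="utf-8").read()` and pass the text to `parse_csv`, after adjusting the column names and patterns to match what the spider actually writes.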