基于Scrapy的python爬虫,爬取新浪新闻国内财经频道的所有新闻存入json文件,单线程爬取。
Source: http://roll.finance.sina.com.cn/finance/gncj/gncj/index_1.shtml
http://scrapy-chs.readthedocs.org/zh_CN/latest/intro/install.html#intro-install-platform-notes
python 2.7
scrapy
python run.py# SinaNewsCrawl