A distributed Sina Weibo Search spider base on Scrapy and Redis.
tpeng [email protected]
$ sudo apt-get install mysql-server
$ sudo apt-get install redis-server
$ sudo apt-get install python-mysqldb
$ sudo pip install -r requirements.txt
- put your keywords in items.txt
scrapy crawl weibosearch -a username=your_weibo_account -a password=your_weibo_password
- add another spider with scrapy crawl weibosearch -a username=another_weibo_account -a password=another_weibo_password
or
scrapy crawl weibosearch -a username=your_weibo_account -a password=your_weibo_password -a savedb=False
to avoid save to mysql db. (easier for experiment)