Update README.md
lanbing510 committed Jan 12, 2016
1 parent 2982ea9 commit b97ddd0
Showing 1 changed file with 9 additions and 5 deletions.
14 changes: 9 additions & 5 deletions README.md
@@ -1,12 +1,14 @@
A crawler for Douban Books, written in Python, to help everyone hunt down all kinds of great books
###Douban Books Crawler

Update:
**A crawler for Douban Books, written in Python, to help everyone hunt down all kinds of great books!**

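While crawling books recently, I found that Douban's page structure had changed, so the number of raters could no longer be scraped. I updated the code accordingly and crawled the books under the '数学' (mathematics) tag.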

<hr>
####Update

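Features:
While crawling books recently, I found that Douban's page structure had changed, so the number of raters could no longer be scraped. I updated the code accordingly, then crawled the books under the '数学' (mathematics) tag and uploaded the results.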


####Features

1 Crawls all books under a given Douban Books tag

@@ -16,6 +18,8 @@ A crawler for Douban Books, written in Python, to help everyone hunt down all kinds of great books

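4 Sends a browser User-Agent header and adds random delays between requests to better imitate browser behavior, so the crawler does not get blocked (updated 2015-5-20)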
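
The User-Agent masquerading and random delay described in item 4 could look roughly like the sketch below. It is not the repository's actual code: it assumes Python 3 and the standard-library `urllib.request`, and the names `USER_AGENTS`, `build_request`, `random_delay`, and `fetch`, as well as the 1 to 5 second delay range, are hypothetical choices made for illustration.

```python
import random
import time
import urllib.request

# Hypothetical browser User-Agent strings (assumption: any realistic ones would do).
USER_AGENTS = [
    "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 "
    "(KHTML, like Gecko) Chrome/120.0 Safari/537.36",
    "Mozilla/5.0 (X11; Linux x86_64; rv:121.0) Gecko/20100101 Firefox/121.0",
]


def build_request(url):
    """Return a Request that carries a randomly chosen browser User-Agent."""
    headers = {"User-Agent": random.choice(USER_AGENTS)}
    return urllib.request.Request(url, headers=headers)


def random_delay(low=1.0, high=5.0):
    """Pick a delay in seconds, drawn uniformly between low and high."""
    return random.uniform(low, high)


def fetch(url, low=1.0, high=5.0, timeout=10):
    """Sleep for a random interval, then fetch the page body as bytes."""
    time.sleep(random_delay(low, high))
    with urllib.request.urlopen(build_request(url), timeout=timeout) as resp:
        return resp.read()
```

Calling `fetch(page_url)` once per listing page spaces the requests irregularly instead of at a fixed rate; the delay bounds are a guess and would need tuning against the site's actual rate limits.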

####Screenshot

A short trial run crawled some 70,000 to 80,000 books; the results are in book_list.xlsx, shown in the screenshot below:

![Aaron Swartz](https://github.com/lanbing510/DouBanSpider/raw/master/screenshots/douban.jpg)
