-
Notifications
You must be signed in to change notification settings - Fork 777
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update quotesbot #8
base: master
Are you sure you want to change the base?
Conversation
@@ -16,75 +14,75 @@ | |||
|
|||
|
|||
# Crawl responsibly by identifying yourself (and your website) on the user-agent | |||
#USER_AGENT = 'quotesbot (+http://www.yourdomain.com)' | |||
# USER_AGENT = 'holidaygems (+http://www.yourdomain.com)' |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
?
|
||
# Obey robots.txt rules | ||
ROBOTSTXT_OBEY = True | ||
|
||
# Configure maximum concurrent requests performed by Scrapy (default: 16) | ||
#CONCURRENT_REQUESTS = 32 | ||
# CONCURRENT_REQUESTS = 32 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Although not PEP8, I believe the comment spacing in lines like this one is correct for its purpose, which is marking commented out code, as opposed to code comments. I think we need to think of this file as a template for users, and hence leave it like that.
Also, I’m guessing this may be the result of scrapy startproject
or similar, in which case the file contents should match those generated by Scrapy.
I updated the spider code to follow modern guidelines, and the auto-generated files with the current docs URLs.
After that, I just realized that there was an old PR changing the
extract()/extract_first()
toget()/getall()
.Closes: #7