Skip to content

Commit

Permalink
fix regex for usproxy page
Browse files Browse the repository at this point in the history
  • Loading branch information
asthman committed Sep 5, 2017
1 parent 3b923c6 commit a711496
Showing 1 changed file with 1 addition and 2 deletions.
3 changes: 1 addition & 2 deletions ipproxytool/spiders/proxy/usproxy.py
Original file line number Diff line number Diff line change
Expand Up @@ -35,8 +35,7 @@ def __init__(self, *a, **kwargs):

def parse_page(self, response):
pattern = re.compile(
'<tr><td>(.*?)</td><td>(.*?)</td><td>(.*?)</td><td>(.*?)</td><td>(.*?)</td><td>(.*?)</td><td>('
'.*?)</td><td>(.*?)</td></tr>',
'<tr><td>(.*?)</td><td>(.*?)</td><td>(.*?)</td><td.+?>(.*?)</td><td>(.*?)</td><td.+?>(.*?)</td><td.+?>(.*?)</td><td.+?>(.*?)</td></tr>',
re.S)
items = re.findall(pattern, response.body)

Expand Down

0 comments on commit a711496

Please sign in to comment.