An effective way is widely used in the mobile development sector to build APIService, which I now bring to the data-related sector.
- Delegation Pattern In Python To Crawl Data With API
- The problem it solved
- Knowledge Base
- Structure
- Application
- Crawl data easily
- Collect any possible data you want (check robots.txt to make sure you are not violent with any rules), even Tinder!
- Wide range use
I have a demonstration in this repo (Crawl data section).
- APIService
- ItemModel
- GetData
📢 Up-to-date: 07/10/2022
💡 TODO: This is the code from November 2021, now October 2022, for some reason, looks like it crawls all the data available with per_page = 3, max_page=1
, so:
- Check the API call
- If you decide to use this code, make sure to check attribute
all_items
to not have any duplicate value, and limit it, because calling the methodcrawlImage()
will take time.
Should I publish it 🤔
Not done yet! Now it works on a JSON file you already downloaded, use the function on GetData
, need to change the APIService.