Skip to content

An effective way is widely used in the mobile development sector to build APIService, which I now bring to the data-related sector.

License

Notifications You must be signed in to change notification settings

nguyenkhangme/Delegation-Pattern-In-Python-To-Crawl-Data-With-API

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation


banner

An effective way is widely used in the mobile development sector to build APIService, which I now bring to the data-related sector.

Delegation Pattern In Python To Crawl Data With API

The problem it solved

  • Crawl data easily
  • Collect any possible data you want, even Tinder! (check robots.txt, the license page,... to make sure you are not violent with any rules)
  • Wide range use

Knowledge Base

Catch that API call

⚠️ Make sure you check robots.txt, the license page,... to not be violent with any rules.

The demonstration is from this repo (Crawl data section).

Here I demonstrate how to catch Shopee API, you could use the same technique for every other site:

Delegation Pattern

Structure

  • APIService
  • ItemModel
  • GetData

Application

⚠️ You should change the request headers (self.headers) on file APIService.ipynb to match yours.

Shopee

📢 Up-to-date: 07/10/2022

How to use the crawler

Go to the Shopee folder.

Artstation

Go to the Artstation folder.

💡 TODO: This is the code from November 2021, now October 2022, for some reason, looks like it crawls all the data available with per_page = 3, max_page=1, so:

  • Check the API call
  • If you decide to use this code, make sure to check attribute all_items to not have any duplicate value, and limit it, because calling the method crawlImage() will take time.

Tinder

Should I publish it 🤔

Pexels

Go to the Pexels folder.

Not done yet! Now it works on a JSON file you already downloaded, use the function on GetData, need to change the APIService.

About

An effective way is widely used in the mobile development sector to build APIService, which I now bring to the data-related sector.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published