A simple web scraping tool for recipe sites.
pip install recipe-scrapers
then:
from recipe_scrapers import scrape_me
# Pass the URL as a string; it can be a URL from any of the sites listed below.
scraper = scrape_me('http://allrecipes.com/Recipe/Apple-Cake-Iv/Detail.aspx')
scraper.title()
scraper.total_time()
scraper.yields()
scraper.ingredients()
scraper.instructions()
scraper.image()
scraper.links()
Note: scraper.links() returns a dictionary object containing all of the <a> tag attributes. The attribute names are the dictionary keys.
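As a rough illustration of working with that output, the sketch below pulls the href values out of whatever scraper.links() returns. The helper name link_hrefs is hypothetical and not part of the package, and the exact container shape (one attribute dictionary or a list of them) is an assumption, so the helper accepts both.

```python
def link_hrefs(links):
    """Return the href values found in the output of scraper.links().

    Hypothetical helper, not part of recipe-scrapers.
    Assumption: `links` is either a single dictionary of <a> tag
    attributes or a list of such dictionaries.
    """
    if isinstance(links, dict):
        links = [links]
    # Keep only the entries that actually carry an href attribute.
    return [attrs["href"] for attrs in links if "href" in attrs]
```

With a scraper created as shown above, link_hrefs(scraper.links()) would give a flat list of link targets.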
- http://101cookbooks.com/
- http://allrecipes.com/
- http://bbc.com/
- http://bbc.co.uk/
- http://bbcgoodfood.com/
- http://bettycrocker.com/
- http://bonappetit.com/
- https://www.budgetbytes.com/
- http://closetcooking.com/
- http://cookstr.com/
- http://copykat.com/
- https://cybercook.com.br/
- https://en.wikibooks.org/
- http://delish.com/
- http://epicurious.com/
- http://finedininglovers.com/
- https://food.com/
- http://foodnetwork.com/
- http://foodrepublic.com/
- https://geniuskitchen.com/
- https://greatbritishchefs.com/
- http://giallozafferano.it/
- http://gonnawantseconds.com/
- https://healthyeating.nhlbi.nih.gov/
- https://heinzbrasil.com.br/
- https://www.hellofresh.com/
- https://www.hellofresh.co.uk/
- https://receitas.ig.com.br/
- https://inspiralized.com/
- http://jamieoliver.com/
- https://justbento.com/
- https://kennymcgovern.com/
- https://www.thekitchn.com/
- https://www.marmiton.org/
- https://www.matprat.no/
- http://www.mindmegette.hu/
- https://www.misya.info/
- http://mybakingaddiction.com/
- https://panelinha.com.br/
- http://paninihappy.com/
- http://przepisy.pl/
- http://realsimple.com/
- https://www.seriouseats.com/
- http://simplyrecipes.com/
- https://www.southernliving.com/
- http://steamykitchen.com/
- https://www.tastesoflizzyt.com
- http://tastykitchen.com/
- http://thepioneerwoman.com/
- https://www.thespruceeats.com/
- http://thehappyfoodie.co.uk/
- http://thevintagemixer.com/
- http://tine.no/
- https://tudogostoso.com.br/
- http://twopeasandtheirpod.com/
- http://whatsgabycooking.com/
- http://yummly.com/
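Since scrape_me expects a URL from one of the sites above, it can help to check the host before scraping. The following is a minimal sketch using only the standard library; the names SAMPLE_SUPPORTED_HOSTS, normalize_host and is_listed are hypothetical, and the host set is only a small sample copied from the list above, not the package's own registry.

```python
from urllib.parse import urlparse

# Sample only: a few hosts copied from the list above, not the full set.
SAMPLE_SUPPORTED_HOSTS = {
    "allrecipes.com",
    "bbc.co.uk",
    "budgetbytes.com",
    "thekitchn.com",
}


def normalize_host(url):
    """Return the lowercased host of `url` without a leading 'www.'."""
    host = urlparse(url).hostname or ""
    return host[4:] if host.startswith("www.") else host


def is_listed(url, supported_hosts=SAMPLE_SUPPORTED_HOSTS):
    """Tell whether the URL's host appears in the given set of hosts."""
    return normalize_host(url) in supported_hosts
```

This compares hosts exactly, so an entry such as en.wikibooks.org would have to be stored with its subdomain.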
Part of the reason I want this open sourced is that when a site changes its design, the scraper for it has to be modified.
If you spot a design change (or anything else) that stops the scraper from working for a given site, please open an issue as soon as possible.
If you are a programmer, PRs with fixes are warmly welcomed and acknowledged with a virtual beer.
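One way to notice that a design change has broken a scraper is to call each documented method and record which ones fail or come back empty. The sketch below does that for any object exposing the methods shown earlier; find_broken_fields and FIELDS are hypothetical names, and treating an exception or an empty value as "broken" is an assumption about how failures show up.

```python
# Method names taken from the usage example above.
FIELDS = ("title", "total_time", "yields", "ingredients", "instructions", "image")


def find_broken_fields(scraper, fields=FIELDS):
    """Return {field: reason} for every field that raises or is empty.

    Hypothetical helper, not part of recipe-scrapers.
    """
    problems = {}
    for name in fields:
        try:
            value = getattr(scraper, name)()
        except Exception as exc:  # any failure counts as a broken field here
            problems[name] = "raised {}".format(type(exc).__name__)
            continue
        # A value of 0 (for example a total time) is not treated as empty.
        if value is None or value == "" or value == []:
            problems[name] = "empty"
    return problems
```

The resulting dictionary, together with the recipe URL, could be pasted into an issue report.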
- Open an Issue providing us the site name, as well as a recipe link from it.
- You are a developer and want to code the scraper on your own:
  - If Schema is available on the site - you can do this
  - Otherwise, scrape the HTML - like this
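To give a feel for the Schema route, here is a minimal standard-library sketch that looks for a schema.org Recipe object embedded as JSON-LD in a page. It is not the project's own scraper base class; JsonLdCollector and find_recipe_schema are hypothetical names, and it assumes the site publishes its recipe data in a script tag of type application/ld+json.

```python
import json
from html.parser import HTMLParser


class JsonLdCollector(HTMLParser):
    """Collect the text of every <script type="application/ld+json"> block."""

    def __init__(self):
        super().__init__()
        self._in_json_ld = False
        self._buffer = []
        self.blocks = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_json_ld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_json_ld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_json_ld:
            self.blocks.append("".join(self._buffer))
            self._in_json_ld = False


def _is_recipe(item):
    """Tell whether a JSON-LD object declares the schema.org Recipe type."""
    kind = item.get("@type")
    return kind == "Recipe" or (isinstance(kind, list) and "Recipe" in kind)


def find_recipe_schema(html):
    """Return the first schema.org Recipe object in the page, or None."""
    collector = JsonLdCollector()
    collector.feed(html)
    for block in collector.blocks:
        try:
            data = json.loads(block)
        except ValueError:
            continue  # skip blocks that are not valid JSON
        # JSON-LD may be one object, a list, or objects wrapped in "@graph".
        candidates = data.get("@graph", [data]) if isinstance(data, dict) else data
        if not isinstance(candidates, list):
            candidates = [candidates]
        for item in candidates:
            if isinstance(item, dict) and _is_recipe(item):
                return item
    return None
```

If find_recipe_schema returns None for a site's recipe page, that site probably needs the HTML-scraping route instead.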
Assuming you have python3 installed, navigate to the directory where you want this project to live and run the following:
git clone git@github.com:hhursev/recipe-scrapers.git &&
cd recipe-scrapers &&
python3 -m venv .venv &&
source .venv/bin/activate &&
pip install -r requirements.txt &&
coverage run -m unittest &&
coverage report
Thanks to all the contributors who helped improve the package. You are awesome!