Skip to content

Miscellaneous Projects I'm working on, mostly scrapers

Notifications You must be signed in to change notification settings

driscoll42/miscProjects

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Miscellaneous Projects

In this repo I have various projects I've worked on which don't worry a repo for themselves, but I still want to share. Details on each one are below.

If you find any of these useful and make a fork or use them, I'd love to hear about your use case!

GarminScraper

drawing

The code in this directory is used to create this Google Sheet here.

In essence, it scrapes garmin.com to get the details on every Garmin device made, allowing for comparison between devices and over time. I quite love Garmin products and was curious how they have evolved over time. I encourage anyone to get a Garmin, they're great!

Running the code will create an Excel file with details on every Garmin device made that is listed in the ids.

I have excluded a few files as I'm hesitant to share to much direct data in the interest of being respectful to Garmin. Namely the data used to sort the rows, the release dates, and the DCRainmaker review links. Basically anything which cannot be found by directly scraping the website. The code should work without these, it just won't be in a perfect state.

IronmanMap

drawing

The code in this directory is used to create this Google Map

It scrapes ironman.com to get the details on every Ironman race and outputs a series of excel files that are then loaded into a My Maps on Google to create a map of all the Ironman races around the world.

I'm frankly a bit surprised there's no map on Ironman's website. If there is one created, I'd probably delete this.

OverdriveScraper

drawing

The code in this directory is used to create this Google Sheet, which is a scraping of every single library on Overdrive.com, getting stats on the number of eBooks, Audiobooks, Magazines, and Videos one can check out from the library, along with the count for each category within (e.g. Fiction eBooks, Juvenile Non-Fiction Biology eBooks, Young Adult Poetry Audiobooks etc.)

It iterates over all the possible ids of Overdrive libraries, and for all those which are valid it finds, it scrapes the website. Do note that it takes nearly a full 24 hours to run as I included quite a few sleeps in the code to be kind to Overdrive's website, and to allow the javascript to load.

I did a brief analysis of a run in 2023 here. The results are also used to update the MobileRead Wiki's Libraries that offer cards online or by mail, to non-residents and shared on the MobileRead's Forum post on Non-Resident Library Cards. I highly encourage checking both out.

About

Miscellaneous Projects I'm working on, mostly scrapers

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published