MrKing2023 / buoy-cam-scraper Public

forked from mikewesthad/buoy-cam-scraper

Notifications You must be signed in to change notification settings
Fork 0
Star 0

A scraper to pull images from NBDC's network of buoys that have cam feeds

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
data		data
images		images
src		src
.gitignore		.gitignore
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json

Repository files navigation

Buoy Cam Scraper

A scraper to pull images from NBDC's network of buoys that have cam feeds (link), lonely sensors that constantly monitor our seas, air and atmosphere. The cameras are updated on an hourly basis, so the script is set to scrape the feeds every hour from the time it starts running.

Usage:

$ npm install
$ npm run scrape-cams

The data folder contains some scraped meta information:

buoycam-id-list.json - a list of buoy IDs that have cameras, hand collected on 8/1/17
buoycam-info.json - meta information about the buoy cams including: name, lat-long location and a base 64 image of the buoy. Scraped on 8/1/17 using npm run gather-station-info.

Notes:

Push version that uses Tesseract to parse the text out of the buoy cam images
Update the algo to scrape every 30 minutes and only save if the image is unique (because nyquist sampling)