Webscraping

Always find out what is allowed: using robots.txt

Question and answer css class names? wow! colly is free!?! did this get all of the tweets? anything else?

Procedure

go get -u github.com/gocolly/colly/...

Puerkitobio/goquery search library built on top of an xhtml package

git add .; git commit -m "Adding Reddit Scraper and counter"; git push; git status

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.vscode		.vscode
jjscraper		jjscraper
reditscraper		reditscraper
twitscraper		twitscraper
vendor		vendor
.DS_Store		.DS_Store
.gitignore		.gitignore
Readme.md		Readme.md
Scraping.code-workspace		Scraping.code-workspace
go.mod		go.mod
go.sum		go.sum
main.go		main.go