Skip to content

This program provides efficient web scraping services for Tor and non-Tor sites. The program has both a CLI and REST API.

Notifications You must be signed in to change notification settings

drahsansaleem/gotor

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

gotor

Status/Social Links

CircleCI Open Source Helpers

This is a HTTP REST API and command line program for webcrawling Tor (and non Tor) sites.

Flags

Configuration of Tor client

  • -h SOCKS5 proxy host, defaults to localhost
  • -p SOCKS5 proxy port, defaults to 9050

REST

  • -server Starts HTTP server that provides a REST API to the crawling mechanisms
  • Current crawling mechanisms include: building relationship tree of links and getting the IP of the current tor client

CLI

  • -d Searching for children nodes of links, defaults to 1
  • -o Output destination, defaults to 'terminal' (recently added support for excel files)

How it works

Crawling drawio

About

This program provides efficient web scraping services for Tor and non-Tor sites. The program has both a CLI and REST API.

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Go 100.0%