gospider

gospider is a fast, simple web spider written in Go.

Installation

go get -u github.com/theblackturtle/gospider
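
On Go 1.17 and newer, installing binaries with go get is deprecated; assuming the module path above is still current, the equivalent go install form is:

go install github.com/theblackturtle/gospider@latest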

Features

  • Fast web crawling
  • Brute force and parse sitemap.xml
  • Parse robots.txt
  • Parse and verify links from JavaScript files
  • Find AWS S3 buckets in response sources
  • Find subdomains in response sources
  • Get URLs from Wayback Machine, Common Crawl, VirusTotal
  • Greppable output format (see the example after this list)
  • Support Burp Suite raw request input
  • Crawl multiple sites in parallel
  • Random mobile/web User-Agent
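
Because every result is printed as a single prefixed line, the output can be filtered with standard tools. A minimal sketch, assuming results are prefixed with bracketed tags such as [subdomains] or [url]; adjust the pattern to the tags your version actually emits:

gospider -s "https://example.com/" -d 1 | grep "\[subdomains\]"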

Usage

A Simple Web Spider - v1.0.4 by @theblackturtle

Usage:
  gospider [flags]

Flags:
  -s, --site string          Site to crawl
  -S, --sites string         Site list to crawl
  -p, --proxy string         Proxy (Ex: http://127.0.0.1:8080)
  -o, --output string        Output folder
  -u, --user-agent string    User Agent to use
                                web: random web user-agent
                                mobi: random mobile user-agent
                                or you can set your special user-agent (default "web")
      --cookie string        Cookie to use (testA=a; testB=b)
  -H, --header stringArray   Header to use (Use multiple flags to set multiple headers)
      --burp string          Load headers and cookies from a Burp raw HTTP request
      --blacklist string     Blacklist URL Regex
  -t, --threads int          Number of threads (Run sites in parallel) (default 1)
  -c, --concurrent int       Maximum number of concurrent requests allowed to the matching domains (default 5)
  -d, --depth int            MaxDepth limits the recursion depth of visited URLs. (Set it to 0 for infinite recursion) (default 1)
  -k, --delay int            Delay is the duration to wait before creating a new request to the matching domains (seconds)
  -K, --random-delay int     RandomDelay is the extra randomized duration to wait added to Delay before creating a new request (seconds)
      --sitemap              Try to crawl sitemap.xml
      --robots               Try to crawl robots.txt (default true)
  -a, --other-source         Find URLs from 3rd party (Archive.org, CommonCrawl.org, VirusTotal.com)
  -w, --include-subs         Include subdomains crawled from 3rd party. Default is main domain
      --debug                Turn on debug mode
  -v, --verbose              Turn on verbose
      --no-redirect          Disable redirect
      --version              Check version
  -h, --help                 help for gospider

Example commands

Run with single site

gospider -s "https://google.com/" -o output -c 10 -d 1

Run with site list

gospider -S sites.txt -o output -c 10 -d 1

Run 20 sites at the same time with 10 bots per site

gospider -S sites.txt -o output -c 10 -d 1 -t 20

Also get URLs from 3rd party (Archive.org, CommonCrawl.org, VirusTotal.com)

gospider -s "https://google.com/" -o output -c 10 -d 1 --other-source

Also get URLs from 3rd party (Archive.org, CommonCrawl.org, VirusTotal.com) and include subdomains

gospider -s "https://google.com/" -o output -c 10 -d 1 --other-source --include-subs

Use custom header/cookies

gospider -s "https://google.com/" -o output -c 10 -d 1 --other-source -H "Accept: */*" -H "Test: test" --cookie "testA=a; testB=b"

gospider -s "https://google.com/" -o output -c 10 -d 1 --other-source --burp burp_req.txt
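
Here, burp_req.txt is a raw HTTP request copied out of Burp Suite; gospider reads the headers and Cookie line from it and applies them to its crawl requests. A minimal sketch of such a file, with placeholder values:

GET / HTTP/1.1
Host: google.com
User-Agent: Mozilla/5.0
Cookie: testA=a; testB=b
Accept: */*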

Blacklist URLs / file extensions.

Note: gospider blacklists .(jpg|jpeg|gif|css|tif|tiff|png|ttf|woff|woff2|ico) by default

gospider -s "https://google.com/" -o output -c 10 -d 1 --blacklist ".(woff|pdf)"
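
Throttle a deeper crawl

The depth and delay flags from the usage section can be combined to slow the crawler down; an illustrative run with arbitrary values:

gospider -s "https://google.com/" -o output -c 5 -d 3 -k 1 -K 2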
