GitHub - rix4uni/tldinfo: Accurately separates a URL’s subdomain, domain, and public suffix, using the Public Suffix List (PSL).

tldinfo

Accurately separates a URL’s subdomain, domain, and public suffix, using the Public Suffix List (PSL).

Installation

git clone https://github.com/rix4uni/tldinfo.git
cd tldinfo
python3 setup.py install

pipx

Quick setup in isolated python environment using pipx

pipx install --force git+https://github.com/rix4uni/tldinfo.git

Usage

usage: tldinfo [-h] [-e EXTRACT] [-r] [-f] [-j] [-s] [-v]

Accurately separates a URL’s subdomain, domain, and public suffix, using the Public Suffix List (PSL).

options:
  -h, --help            show this help message and exit
  -e EXTRACT, --extract EXTRACT
                        Comma-separated list of parts to extract (subdomain, domain, suffix)
  -r, --registered_domain
                        Get the registered domain
  -f, --fqdn            Get the fqdn
  -j, --json            Output result in JSON format
  -s, --silent          Run without printing the banner
  -v, --version         Show current version of tldinfo

Example usages

Single Domains:

▶ echo "http://forums.news.cnn.com/" | tldinfo --silent --extract subdomain
forums.news

▶ echo "http://forums.news.cnn.com/" | tldinfo --silent --extract domain
cnn

▶ echo "http://forums.news.cnn.com/" | tldinfo --silent --extract suffix
com

▶ echo "http://forums.news.cnn.com/" | tldinfo --silent --extract subdomain,domain,suffix
forums.news.cnn.com

▶ echo "http://forums.news.cnn.com/" | tldinfo --silent --extract subdomain,domain,suffix --json
{"input": "http://forums.news.cnn.com/", "subdomain": "forums.news", "domain": "cnn", "suffix": "com"}

▶ echo "http://forums.news.cnn.com/" | tldinfo --silent --registered_domain
cnn.com

▶ echo "http://forums.news.cnn.com/" | tldinfo --silent --fqdn
forums.news.cnn.com

Multiple Domains:

▶ cat targets.txt
forums.news.cnn.com
forums.bbc.co.uk
www.worldbank.org.kg

▶ cat targets.txt | tldinfo --silent --extract subdomain
forums.news
forums
www

Thanks 🙏

tldextract

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
tldinfo		tldinfo
.gitattributes		.gitattributes
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py
subs.txt		subs.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

tldinfo

Installation

pipx

Usage

Example usages

Thanks 🙏

About

Releases 1

Packages

Languages

rix4uni/tldinfo

Folders and files

Latest commit

History

Repository files navigation

tldinfo

Installation

pipx

Usage

Example usages

Thanks 🙏

About

Topics

Resources

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages