Modern bencode

A simple library for decoding/encoding bencoded data.

There are other bencode related libraries out there, but most of them:

are created a long time ago and don't have active maintainers anymore
don't have any docs at all or have very limited docs
don't have any tests at all or have only partial test coverage
use C extensions to make them faster, which can:
- create problems when compiling on different platforms
- make debugging more complicated (if you get a "Segmentation fault" error)

So, here is one more bencode library. Pluses:

it has an active maintainer
it has docs (both here and for all code) and human readable error messages
it has type annotations (so you can see what each function takes and what it returns)
it has 100% test coverage (+uses black, isort, flake8, mypy, pylint)
it is written in pure Python and has no dependencies

Minuses:

it requires Python >= 3.7
it is slower than libraries, which use C extensions (still, decoding
a regular torrent file takes up to a few milliseconds on a regular VPS)

Installation

pip install modern-bencode

The library requires Python >= 3.7

Usage

from bencode import decode, decode_torrent, encode

assert decode(b"li123e3:abce") == [123, b"abc"]
assert encode([123, b"abc"]) == b"li123e3:abce"

with open("my-torrent-file.torrent", "rb") as source_file:
    torrent_data = source_file.read()
    print(decode(torrent_data))
    print(decode_torrent(torrent_data, encoding="utf8"))

Notes

bencode.decode gets bytes and:

either returns a Python object (bytes, dict, int or list)
or raises a ValueError when decoding is not possible. The error message will
contain a human readable explanation why exactly it is not possible, for example:

Cannot decode an integer, reached the end of the bencoded 
string before the end marker was found. Most likely the 
bencoded string is incomplete or incorrect.

bencode.decode_torrent gets torrent file (as bytes), an optional encoding and:

either returns a Python dict, where all keys and most values are strings.
Fields, which are not supposed to be human readable ("ed2k", "filehash",
"pieces"), are decoded as hex. Fields, which are supposed to be human
readable, are decoded:
- using utf8 (if the key ends with ".utf-8" suffix, like "path.utf-8")
- using the provided encoding (in all other cases)
or raises UnicodeDecodeError/ValueError when decoding is not possible.

bencode.encode gets a Python object (bytes, dict, int or list) and:

either returns a bencoded string (as bytes)
or raises a ValueError when encoding is not possible. The error message will
contain a human readable explanation why exactly it is not possible, for example:

Cannot encode data: objects of type <class 'set'> are not supported.

bencode.encode_torrent is not implemented yet. Let me know if you need it.

Bugs

Feel free to create an issue here if you find a bug or some error message is not clear enough.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
bencode		bencode
tests		tests
.gitignore		.gitignore
README.md		README.md
build_and_upload.sh		build_and_upload.sh
requirements-test.txt		requirements-test.txt
requirements.txt		requirements.txt
run_tests.sh		run_tests.sh
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Modern bencode

Installation

Usage

Notes

Bugs

About

Releases

Packages

Languages

browneyedape/modern-bencode

Folders and files

Latest commit

History

Repository files navigation

Modern bencode

Installation

Usage

Notes

Bugs

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages