Skip to content

browneyedape/modern-bencode

 
 

Repository files navigation

Modern bencode

A simple library for decoding/encoding bencoded data.

There are other bencode related libraries out there, but most of them:

  • are created a long time ago and don't have active maintainers anymore
  • don't have any docs at all or have very limited docs
  • don't have any tests at all or have only partial test coverage
  • use C extensions to make them faster, which can:
    • create problems when compiling on different platforms
    • make debugging more complicated (if you get a "Segmentation fault" error)

So, here is one more bencode library. Pluses:

  • it has an active maintainer
  • it has docs (both here and for all code) and human readable error messages
  • it has type annotations (so you can see what each function takes and what it returns)
  • it has 100% test coverage (+uses black, isort, flake8, mypy, pylint)
  • it is written in pure Python and has no dependencies

Minuses:

  • it requires Python >= 3.7
  • it is slower than libraries, which use C extensions (still, decoding
    a regular torrent file takes up to a few milliseconds on a regular VPS)

Installation

pip install modern-bencode

The library requires Python >= 3.7

Usage

from bencode import decode, decode_torrent, encode

assert decode(b"li123e3:abce") == [123, b"abc"]
assert encode([123, b"abc"]) == b"li123e3:abce"

with open("my-torrent-file.torrent", "rb") as source_file:
    torrent_data = source_file.read()
    print(decode(torrent_data))
    print(decode_torrent(torrent_data, encoding="utf8"))

Notes

bencode.decode gets bytes and:

  • either returns a Python object (bytes, dict, int or list)
  • or raises a ValueError when decoding is not possible. The error message will
    contain a human readable explanation why exactly it is not possible, for example:
Cannot decode an integer, reached the end of the bencoded 
string before the end marker was found. Most likely the 
bencoded string is incomplete or incorrect.

bencode.decode_torrent gets torrent file (as bytes), an optional encoding and:

  • either returns a Python dict, where all keys and most values are strings.
    Fields, which are not supposed to be human readable ("ed2k", "filehash",
    "pieces"), are decoded as hex. Fields, which are supposed to be human
    readable, are decoded:
    • using utf8 (if the key ends with ".utf-8" suffix, like "path.utf-8")
    • using the provided encoding (in all other cases)
  • or raises UnicodeDecodeError/ValueError when decoding is not possible.

bencode.encode gets a Python object (bytes, dict, int or list) and:

  • either returns a bencoded string (as bytes)
  • or raises a ValueError when encoding is not possible. The error message will
    contain a human readable explanation why exactly it is not possible, for example:
Cannot encode data: objects of type <class 'set'> are not supported.

bencode.encode_torrent is not implemented yet. Let me know if you need it.

Bugs

Feel free to create an issue here if you find a bug or some error message is not clear enough.

About

A simple bencode library

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 96.2%
  • Shell 3.8%