Binary Codec for JSON

The goal of the Binson project is to produce a binary codec for JSON (json.org).

The priorities for Binson encoding are (in rough order of priority):

Small encoding size (primarily intended for wire transfer)
Fast parsing
Easy to use & implement
JSON spec compatibility

Binson encodes using a tag, len, value structure.

Currently Binson is smaller than JSON encoding for all practical encoding cases and significantly faster to encode/decode.

A reference implementation can be found under the org.boris.binson folder in this repository.

Types

Object: tag, length (number of pairs), length pairs (string, value)
Array, tag, length (number of values), length values
Values
- Object
- Array
- String: tag, length, UTF-8 encoded string
- Number
  - variable length signed int32
  - int64 (signed, 8 bytes)
  - double (IEEE 754-1985 double precision - 8 bytes)
- Raw byte array: tag, length, bytes
- False
- True
- Null

As per JSON spec, object pairs are explicitly unordered.

Encoding

The tag is 1 byte in length. It contains the type of the field as well as the number of bytes used to represent the length of the field (for variable length length fields)

The size of the length field is encoded in the two most significant bits of the tag byte.

Bits 5 and 6 must always be zero.

Bits 7 and 8 must be zero for fixed length structures (null, int64, double, false, true)

		  MSN      LSN 	
		+--------------------------------------------------+
null	| 0 0 0 0  0 0 0 0 | N/A               | N/A       |
true	| 0 0 0 0  0 0 0 1 | N/A               | N/A       |
false	| 0 0 0 0  0 0 1 0 | N/A               | N/A       |
int64	| 0 0 0 0  0 0 1 1 | value (8 bytes)   | N/A       |
double	| 0 0 0 0  0 1 0 0 | value (8 bytes)   | N/A       |
object	| x x 0 0  1 0 0 0 | len (num pairs)   | len pairs |
array	| x x 0 0  1 0 0 1 | len (num elems)   | len elems |
string	| x x 0 0  1 0 1 0 | len (num bytes)   | len bytes |
int32	| x x 0 0  1 0 1 1 | value (len bytes) | N/A       |
raw		| x x 0 0  1 1 0 0 | len (num bytes)   | len bytes |
		+--------------------------------------------------+

For variable length fields bits 7 and 8 encode 4 possible length values:

0 0 = 1 byte
0 1 = 2 bytes
1 0 = 3 bytes
1 1 = 4 bytes

In hex the high nibble should be 0x0, 0x4, 0x8, 0xc respectively.

For compatibility with existing languages (see priorities 2 & 3) the corresponding length field is a signed 32 bit integer.

Where applicable, endianness is little. This is includes variable length structures.

Note that the object type is encoded as size pairs of (key,value) where key is a string encoded as per above table (ie. tag,len,UTF-8) and value is any type in the table above (including object).

Examples

{ "hello" : "world" }

0x08                            : object type with 1 byte length field 
0x01                            : 1 pair of string/value 
0x0a                            : string with 1 byte length field 
0x05                            : 5 bytes for UTF-8 encoding of "hello"
0x68 0x65 0x6c 0x6c 0x6f        : the UTF-8 encoding of "hello"
0x0a                            : string with 1 byte length field
0x05                            : 5 bytes for UTF-8 encoding of "world"
0x77 0x6f 0x72 0x6c 0x64        : the UTF-8 encoding of "world"

Encoding Size Comparison

The following shows an encoding size (bytes) comparison for example datasets at: http://json.org/example.html

JSON Snippet	JSON (ws)	JSON (no ws)	BSON	Binson
glossary	603	360	395	335
menu	251	183	219	165
widget	627	389	406	346
webapp	3554	2710	2791	2543
menu2	898	613	764	534

Using jackson and bson4jackson libraries for the comparison.

Interoperability with JSON

Binson aims to be compatible with JSON where possible/practical. This is achieved except for the inclusion of a RAW byte array type, which is a natural inclusion on a binary spec.

Name		Name	Last commit message	Last commit date
Latest commit History 102 Commits
bitbot		bitbot
boris		boris
databoris		databoris
gobot		gobot
morphosis		morphosis
org.boris.binson		org.boris.binson
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Binary Codec for JSON

Types

Encoding

Examples

Encoding Size Comparison

Interoperability with JSON

About

Releases

Packages

Languages

poidasmith/scratch

Folders and files

Latest commit

History

Repository files navigation

Binary Codec for JSON

Types

Encoding

Examples

Encoding Size Comparison

Interoperability with JSON

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages