This project provides the API (JavaDocs)
for hashing any sequences of bytes in Java, including all kinds of
primitive arrays, buffers, CharSequence
s and more. Java 6+. Apache 2.0 licence.
The key design goal, distinguishing this project from, for example, Guava hashing:
this API ease implementing hashing algorithms which don't do a single allocation
during hash computation for any input, and without using ThreadLocal
.
Also, the API attemps to be agile enough in byte order treatment, favoring native access, but allowing the hash function implementation be platform-endianness-agnostic. On the other hand, it allows to "fool" the existing implementation, even sealed for one byte order, feeding data in different byte order and obtain consistent results, only moderately compromising performance.
Currently only long
-valued hash function interface is defined, with a single shipped
implementation:
-
CityHash, version 1.1 (latest; 1.1.1 is C++ language-specific fixing release). This implementation is thought to be independent from native byte order.
It is thoroughly, I believe, tested with JDK 6, 7 and 8, but only on little-endian platform.
On my developer machine, it performs on the speed of 5-6 bytes/ns with 11 ns bootstrap for sequences of any length.
To sum up,
- You need to hash plain byte sequences, memory blocks or "flat" objects.
- You like zero-allocation and pretty good performance (at Java scale).
- You need hashing to be agile in questions related to byte ordering.
-
You need to hash POJOs whose actual data is scattered in memory between managed objects. There is no simple way to hash, for example, instances of such class:
class Person { String givenName, surName; int salary; }
using the API provided by this project.
-
You need to hash byte sequences of beforehand unknown length, for the simpliest example,
Iterator<Byte>
. -
You need to transform the byte sequence (e. g. encode or decode it with a specific coding), and hash the resulting byte sequence on the way without dumping it to memory.
Gradle:
dependencies {
compile 'net.openhft:zero-allocation-hashing:0.1'
}
Or Maven:
<dependency>
<groupId>net.openhft</groupId>
<artifactId>zero-allocation-hashing</artifactId>
<version>0.1</version>
</dependency>
In Java:
long hash = LongHashFunction.city_1_1().hashChars("hello");
See JavaDocs for more information.
See the list of open issues.