Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
.github/workflows		.github/workflows
.vscode		.vscode
data		data
docs		docs
scripts		scripts
src		src
.dockerignore		.dockerignore
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
Dockerfile		Dockerfile
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
build.rs		build.rs
docker-compose.yml		docker-compose.yml
package.json		package.json
rustfmt.toml		rustfmt.toml

Repository files navigation

Lightweight Vector Database

A tiny vector database for storing and querying embeddings in pure Rust. Read the explanation of embeddings at OpenAI.

Runs in memory for great speed.
Serialises embeddings to one binary file for simplicity.
Little code for sustainable maintenance.
Versatile REST API for various scenarios.
Easy to integrate to prototypes or small products.

This is a fork of tinyvector with the following goals:

Allow filtering by metadata in addition to the vector similarity search.
Offer other algorithms for computing the vector similarity in addition to cosine, dot and euclidean.
Focus only on embeddings. Do not integrate LLMs to split and vectorise text.
Support CORS and other network features.

There's a demo example included.

Getting Started

Using Docker is easier than running the services built from the scratch. But building is easy. You need just the Rust compiler.

Using Docker

For example, run a container for testing purposes exposing the port 8000 which will be deleted on exit:

docker run -p 8000:8000 --rm -it ghcr.io/prantlf/litevec

For example, run a container named litevec in the background, persisting the data in ./litevec-storage via the volume /litevec/storage:

docker run -p 8000:8000 -v $PWD/litevec-storage:/litevec/storage \
  -dt --name litevec ghcr.io/prantlf/litevec

And the same task as above, only using Docker Compose (place docker-compose.yml to the current directory) to make it easier:

docker-compose up -d

Building from Scratch

Make sure that you have Rust installed before you continue. Clone this repository, build the binary executable and run it:

git clone https://github.com/prantlf/litevec.git
cd litevec
cargo build --release
target/release/litevec

The storage directory will be created in the current directory as needed.

Configuration

Runtime parameters of the service can be customised using the process environment variables below:

Name	Default	Description
LITEVEC_COMPRESSION_LIMIT	1024	minimum response size to get compressed [b]
LITEVEC_CORS_MAXAGE	86400	how long stays CORS preflighting valid [s]
LITEVEC_HOST	0.0.0.0	IP address to bind the server to
LITEVEC_PORT	8000	port address to bind the server to
LITEVEC_PAYLOAD_LIMIT	1073741824	maximum size of request payload [b]
LITEVEC_TIMEOUT	30	maximum duration of a request [s]
RUST_LOG	info	log level (`info`, `debug`, `trace`)

API

See the summary of the endpoints below, API details on a separate page. Run litevec and open http://localhost:8000/docs to inspect and try the available REST API endpoints live.

System endpoints:

Method	Path	Description
GET	/	obtain API metadata
POST	/shutdown	shut the service down (sending SIGTERM or SIGINT works too)

Documentation endpoints:

Method	Path	Description
GET	/docs	web page with the API documentation
GET	/openapi.json	API description according to the OpenAPI schema

Endpoints for embedding collections and similarity search:

Method	Path	Description
GET	/collections	list collection names
POST	/collections/:collection_name	search the collection for similar vectors while filtering with metadata too (1)
PUT	/collections/:collection_name	create a collection (2)
GET	/collections/:collection_name	get information about a collection (3)
DELETE	/collections/:collection_name	delete a collection

Endpoints for embeddings:

Method	Path	Description
GET	/collections/:collection_name/embeddings	list embedding identifiers
POST	/collections/:collection_name/embeddings	filter embeddings with metadata (4)
DELETE	/collections/:collection_name/embeddings	delete embeddings by metadata
PUT	/collections/:collection_name/embeddings/:embedding_id	create an embedding (5)
GET	/collections/:collection_name/embeddings/:embedding_id	get information about an embedding (4)
DELETE	/collections/:collection_name/embeddings/:embedding_id	delete an embedding

License

Licensed under the MIT license.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Lightweight Vector Database

Getting Started

Using Docker

Building from Scratch

Configuration

API

License

About

Releases 10

Packages

Languages

License

prantlf/litevec

Folders and files

Latest commit

History

Repository files navigation

Lightweight Vector Database

Getting Started

Using Docker

Building from Scratch

Configuration

API

License

About

Resources

License

Stars

Watchers

Forks

Releases 10

Packages 0

Languages

Packages