Name		Name	Last commit message	Last commit date
Latest commit History 301 Commits
.github/workflows		.github/workflows
app		app
bin		bin
config		config
db		db
deps		deps
lib		lib
log		log
public		public
spec		spec
storage		storage
test		test
tmp		tmp
vendor		vendor
.dockerignore		.dockerignore
.gitattributes		.gitattributes
.gitignore		.gitignore
.rspec		.rspec
.rubocop.yml		.rubocop.yml
.ruby-version		.ruby-version
Dockerfile		Dockerfile
Gemfile		Gemfile
Gemfile.lock		Gemfile.lock
LICENSE		LICENSE
Procfile.dev		Procfile.dev
README.md		README.md
Rakefile		Rakefile
bounce		bounce
chroma_query_response.json		chroma_query_response.json
compose.yaml		compose.yaml
config.ru		config.ru
dotenv_template		dotenv_template
layout.css		layout.css
layout.html		layout.html
ops.yml		ops.yml
package.json		package.json
reloader.sh		reloader.sh
yarn.lock		yarn.lock

Repository files navigation

README

Archyve is a web app that makes pretrained LLMs aware of a user's documents, while keeping those documents on the user's own devices and infrastructure.

Overview

Archyve enables Retrieval-Augmented Generation (RAG) by providing an API to query the user's docs for relevant context. The client provides the prompt the user gave, and Archyve will return relevant text chunks.

Archyve provides:

a document upload and indexing UI, where the user can upload documents and test similarity searches against them
a built-in LLM chat UI, so the user can test the effectiveness of their documents with an LLM
an API, so the user can provide Archyve search results in dedicated LLM chat UIs

Getting started

Dependencies

On a Mac ensure you have brew installed
Make sure you have podman or docker setup and a "machine" configured and ready to pull and run container images.
Ensure you have ops installed

Develop

To start working / developing with Archyve locally, assuming dependencies are good:

Install Ollama and make sure you're running ollama serve and that you have the minimum models installed (see section on Ollama further below).
Clone this repo
ops up
ops rails db:setup
ops rails server
Go to http://127.0.0.1:3300/ and you can login using [email protected] and password to get started.

Build

To run Archyve, use docker compose or podman compose.

Clone this repo
cp dotenv_template local.env
Run openssl rand -hex 64 and put the value in the SECRET_KEY_BASE variable in your local.env file
Run the container

docker compose up --build

If you see "✘ archyve-worker Error", don't worry about it. Docker will build the image and run it.

get a shell in the Archyve container with docker compose exec archyve bash
run bin/rails db:encryption:init from within the container:

$ rails db:encryption:init
Running `bin/rails db:encryption:init` in environment 'dev'...
Add this entry to the credentials of the target environment:

active_record_encryption:
  primary_key: PqxwHUF2E3MnPUW3qmOHUikIWJxhvY90
  deterministic_key: wJi0qI8KftvGhqkNh42SaG2oh64ZKIGZ
  key_derivation_salt: sE2nd5xn1rq2YdkDHHxQOuDhcOMfV5jr

put the values from the output into your local.env file

...
ACTIVE_RECORD_ENCRYPTION="{
  \"primary_key\": \"PqxwHUF2E3MnPUW3qmOHUikIWJxhvY90\",
  \"deterministic_key\": \"wJi0qI8KftvGhqkNh42SaG2oh64ZKIGZ\",
  \"key_derivation_salt\": \"sE2nd5xn1rq2YdkDHHxQOuDhcOMfV5jr\"
}"

Restart the containers
Browse to http://127.0.0.1:3300 and log in with [email protected] / password (you can change these values by setting USERNAME and PASSWORD in your local.env file and restarting the container)

API

Authentication

Archyve provides a ReST API. To use it, you must have:

a Client ID (goes in the X-Client-Id header in all API requests)
an API key (goes in the Authorization header after Bearer )

Ensure you have set up ACTIVE_RECORD_ENCRYPTION as described above!

TODO: add this to the UI

If you are running the app on your host, you can set the DEFAULT_API_KEY and DEFAULT_CLIENT_ID environment variables. On startup, Archyve will ensure that a client with these credentials exists.

DEFAULT_API_KEY must be a 48-byte value encoded in base64. Generate a key with openssl rand -base64 48.
DEFAULT_CLIENT_ID can be any string, but it should be unique to your app. A UUID is recommended.

If you are running the app via docker compose or podman compose, set the above two environment variables in your local.env file and restart the containers.

If you are running the app on your host, set the two above environment variables and run rails db:seed.

Sending authenticated requests

You should be able to send API requests like this:

curl -v localhost:3300/v1/collections \
  -H "Accept: application/json" \
  -H "Authorization: Bearer <YOUR_API_KEY>" \
  -H "X-Client-Id: <YOUR_CLIENT_ID>"

See archyve.io for more information on the API.

See the next section for setting up Ollama for use by Archyve or document uploads and chat will fail.

Dependencies

Ollama

You can run a dedicated instance of Ollama in a container by adding it to the compose.yaml file, but it takes a while to pull a chat model, so the default here is to assume you already have an Ollama instance.

Archyve will use a local instance of Ollama by default. Ensure you have Ollama installed and running (with ollama serve) and then run the following commands to set up your Ollama instance for Archyve:

fast embedding model: ollama pull all-minilm
better embedding model: ollama pull nomic-embed-text
chat model: ollama pull mistral:instruct
alternative chat model: ollama pull gemma:7b (if you intend to use Gemma)

Embedding models

You can select an embedding model separately for each Collection you create inside Archyve.

To make an embedding model available for use in Archyve, go to the ModelConfig page in the admin UI, create a new ModelConfig, and set embedding to true. The new embedding model should be an option when creating a Collection, or viewing a Collection which has no Documents in it.

NOTE The default seeds setup nomic-embed-text as default embedding model.

Make sure you pull the model in Ollama.

Summarization model

You can change summarization model by changing SUMMARIZATION_ENDPOINT and SUMMARIZATION_MODEL in your local.env file and restarting the server. If you change these values, make sure the new models are present in Ollama.

NOTE The default seeds setup mistral:instruct as default embedding model.

Admin UI

There is an admin UI running at http://127.0.0.1:3300/admin. There, you can view and change ModelConfigs and ModelServers if you are logged in as an admin.

There is a link to it in the bottom of the side bar.

Jobs

Archyve uses a jobs framework called Sidekiq. It has a web UI that you can access at http://127.0.0.1/sidekiq if you are logged in as an admin.

TurboStream design

In general:

use a separate channel for each group of things that need to be independently authorized
- e.g. a user can see their own conversations but not the conversations of other users
- therefore, we need to use a separate conversations channel for each user
- using user-specific dom_ids is not enough: that would prevent User 1 from seeing User 2's updates in their browser, but User 2's data would still be sent to User 1, and viewable in dev tools
- all Users can see all Collections for now, so we don't need user-specific Collection-related channels
use helpers to generate channel IDs and dom IDs in case we need to change how it works at some point
- models: ApplicationRecord#channel_id, ApplicationRecord#dom_id
- views: ApplicationHelper#channel_id, ApplicationHelper#dom_id
- controllers: ApplcationController#channel_id, ApplicationController#dom_id

Streams

Current state:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

README

Overview

Getting started

Dependencies

Develop

Build

API

Authentication

Sending authenticated requests

Dependencies

Ollama

Embedding models

Summarization model

Admin UI

Jobs

TurboStream design

Streams

About

Releases

Packages

Languages

License

ayb/archyve

Folders and files

Latest commit

History

Repository files navigation

README

Overview

Getting started

Dependencies

Develop

Build

API

Authentication

Sending authenticated requests

Dependencies

Ollama

Embedding models

Summarization model

Admin UI

Jobs

TurboStream design

Streams

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages