Init adr 001
John authored and John committed Oct 3, 2023
1 parent cfb6e03 commit 7257fe9
Showing 2 changed files with 47 additions and 0 deletions.
47 changes: 47 additions & 0 deletions adr/adr-001-jan-deployable-cloud-native.md
@@ -0,0 +1,47 @@
# ADR #001: Jan deployable cloud-native

## Changelog

- 23.10.03: Initial unfinished draft

## Authors

- @nam-john-ho

## Context

### Status Quo

- A user does not have a local GPU machine but wants to run Jan on a rented server.
- A user wants a quick way to experiment with Jan on a rented GPU.
- Related issue: https://github.com/janhq/jan/issues/255

## Decision

This ADR outlines the design decisions for deploying Jan quickly and simply in cloud-native environments such as Runpod, AWS, Azure, and GCP.

- The current codebase should require only minimal changes.
- The current plugins should be reusable across environments (desktop and cloud-native).
- Simple authentication (username/password) should be supported.
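
The ADR does not yet say how the username/password check would be implemented. As one possible illustration only, the sketch below assumes HTTP Basic authentication in front of a cloud-hosted Jan instance. The function names `parse_basic_auth` and `is_authorized` are hypothetical and are not part of Jan's codebase.

```python
import base64
import hmac
from typing import Optional, Tuple


def parse_basic_auth(header: Optional[str]) -> Optional[Tuple[str, str]]:
    """Return (username, password) from a Basic Authorization header, or None."""
    if not header:
        return None
    scheme, _, encoded = header.partition(" ")
    if scheme.lower() != "basic" or not encoded:
        return None
    try:
        decoded = base64.b64decode(encoded.strip(), validate=True).decode("utf-8")
    except ValueError:
        # Covers invalid base64 and invalid UTF-8 (both subclass ValueError).
        return None
    username, sep, password = decoded.partition(":")
    if not sep:
        return None
    return username, password


def is_authorized(header: Optional[str], expected_user: str, expected_password: str) -> bool:
    """Check a request header against a single configured username/password pair."""
    creds = parse_basic_auth(header)
    if creds is None:
        return False
    user, password = creds
    # compare_digest avoids leaking where the strings differ through timing.
    user_ok = hmac.compare_digest(user.encode(), expected_user.encode())
    pass_ok = hmac.compare_digest(password.encode(), expected_password.encode())
    return user_ok and pass_ok
```

In a real deployment the expected credentials would likely come from environment variables or a secret store set when the rented instance starts, and the endpoint would need TLS, since Basic credentials are only base64-encoded, not encrypted.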


### Key Design Decisions
![alt text](images/adr-001-01.png "Title")


### Detailed Design



## Alternative Approaches



## Considerations



- https://www.runpod.io/console/templates
- https://repost.aws/articles/ARQ0Tz9eorSL6EAus7XPMG-Q/how-to-install-textgen-webui-on-aws
- https://www.youtube.com/watch?v=_59AsSyMERQ
- https://gpus.llm-utils.org/running-llama-2-on-runpod-with-oobaboogas-text-generation-webui/
- https://medium.com/@jarimh1984/installing-oobabooga-and-oobabooga-api-to-runpod-cloud-step-by-step-tutorial-47457974dfa5
Binary file added adr/images/adr-001-01.png