Skip to content

choonho/llama_server

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Llama server

Rest API Server for Llama 2

Prerequisite

Before run this server, you have to download Llama 2 model first.

See https://dev.to/choonho/llama-2-in-apple-silicon-macbook-13-54h

Move ggsm model file to models/7B/ggml-model-q4_0.bin (default MODEL_PATH)

Installation

pip3 install llama-cpp-python langchain
pip3 install fastapi uvicorn

Run Server

python3 server.py

About

Rest API Server for Llama 2

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages