pip install transformers
pip install torch torchvision
pip install accelerate bitsandbytes
pip install "accelerate[torch]"
Edit starcoder.py to pass load_in_8bit=True when loading the model.
python starcoder.py
- downloads ~60 GB of model weights on first run
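If starcoder.py isn't written yet, a minimal sketch could look like this (the checkpoint name and prompt are assumptions, not from these notes):

```python
# starcoder.py — minimal sketch, assuming the bigcode/starcoder checkpoint
# and the 8-bit load (load_in_8bit=True) mentioned above.
from transformers import AutoModelForCausalLM, AutoTokenizer

checkpoint = "bigcode/starcoder"  # assumption: adjust to the model you want

tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(
    checkpoint,
    device_map="auto",   # needs accelerate
    load_in_8bit=True,   # needs bitsandbytes; roughly halves memory vs fp16
)

inputs = tokenizer("def fibonacci(n):", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0]))
```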
wget -c https://huggingface.co/TheBloke/Mistral-7B-v0.1-GGUF/resolve/main/mistral-7b-v0.1.Q8_0.gguf
wget -c https://huggingface.co/TheBloke/Wizard-Vicuna-7B-Uncensored-GGUF/resolve/main/Wizard-Vicuna-7B-Uncensored.Q8_0.gguf
- Extract the llama.cpp release zip into the bin/ directory
./bin/main.exe -m models/llama-2-7b-chat.Q8_0.gguf
pip install -U vllm
python -u -m vllm.entrypoints.openai.api_server --host 0.0.0.0 --model mistralai/Mistral-7B-v0.1
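Quick smoke test against the server (port 8000 and the /v1/completions path are vLLM's defaults, not set above):

```python
# Minimal sketch: query the vLLM OpenAI-compatible server started above.
import requests

resp = requests.post(
    "http://localhost:8000/v1/completions",
    json={
        "model": "mistralai/Mistral-7B-v0.1",
        "prompt": "The capital of France is",
        "max_tokens": 32,
    },
)
print(resp.json()["choices"][0]["text"])
```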
pip install -U fschat
python -m fastchat.serve.openai_api_server --host localhost --port 8000
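Note: the API server above expects a FastChat controller and at least one model worker to already be running, e.g. (the vicuna model path is just an example):
python -m fastchat.serve.controller
python -m fastchat.serve.model_worker --model-path lmsys/vicuna-7b-v1.5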
pip install -U leptonai
RAM Required:
| Model Size | RAM Required |
|---|---|
| 3B | 8 GB |
| 7B | 16 GB |
| 13B | 32 GB |
pip install pyautogen
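Minimal AutoGen sketch, pointed at a local OpenAI-compatible endpoint (here the vLLM server from above); the config key names depend on the pyautogen version, so treat this as an assumption:

```python
# Two-agent AutoGen sketch against a local OpenAI-compatible server.
from autogen import AssistantAgent, UserProxyAgent

config_list = [{
    "model": "mistralai/Mistral-7B-v0.1",
    "base_url": "http://localhost:8000/v1",  # older pyautogen uses "api_base"
    "api_key": "not-needed",                 # local server ignores the key
}]

assistant = AssistantAgent("assistant", llm_config={"config_list": config_list})
user_proxy = UserProxyAgent("user", human_input_mode="NEVER",
                            code_execution_config=False)

user_proxy.initiate_chat(assistant, message="Write a haiku about local LLMs.")
```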
pip install openplayground
openplayground run
ollama run mistral
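Ollama also serves a local REST API on port 11434; a quick check (model name assumes the `mistral` pull above):

```python
# Query Ollama's local REST API.
import requests

resp = requests.post(
    "http://localhost:11434/api/generate",
    json={"model": "mistral", "prompt": "Why is the sky blue?", "stream": False},
)
print(resp.json()["response"])
```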
pip install -U jina
pip install "ray[serve]"
https://github.com/FlowiseAI/Flowise
wget https://gpt4all.io/models/ggml-gpt4all-j.bin -O models/ggml-gpt4all-j.bin
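Sketch with the GPT4All Python bindings (assumes `pip install gpt4all`; note that newer gpt4all releases expect GGUF model files rather than this older GGML one):

```python
# Load the downloaded model file with the GPT4All Python bindings.
from gpt4all import GPT4All

model = GPT4All("ggml-gpt4all-j.bin", model_path="models")
print(model.generate("Name three uses for a local LLM.", max_tokens=100))
```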
https://github.com/go-skynet/LocalAI
docker pull quay.io/go-skynet/local-ai:latest
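Once the container is running with a models directory mounted, LocalAI exposes an OpenAI-compatible API (default port 8080); a rough example request (the model name is a placeholder for whatever is in your models directory):

```python
# Chat request against a running LocalAI container.
import requests

resp = requests.post(
    "http://localhost:8080/v1/chat/completions",
    json={
        "model": "ggml-gpt4all-j",  # placeholder: must match a model LocalAI has loaded
        "messages": [{"role": "user", "content": "Hello from LocalAI"}],
    },
)
print(resp.json()["choices"][0]["message"]["content"])
```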
NLP Cloud:
curl "https://api.nlpcloud.io/v1/<model_name>/entities" \
-H "Authorization: Token <token>" \
-H "Content-Type: application/json" \
-X POST \
-d '{"text":"John Doe has been working for Microsoft in Seattle since 1999."}'
https://github.com/microsoft/semantic-kernel
https://github.com/microsoft/guidance
https://skypilot.readthedocs.io/
Later:
https://github.com/Arize-ai/phoenix
https://github.com/explodinggradients/ragas
https://github.com/trypromptly/LLMStack