AI-Playground

Use StarCoder

pip install transformers
pip install torch torchvision
pip install accelerate bitsandbytes
pip install accelerate[torch]

Edit starcoder.py to load the model in 8-bit:

load_in_8bit=True

python starcoder.py

The first run downloads about 60 GB of model weights.
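The contents of starcoder.py are not shown here, so the following is only a minimal sketch of what 8-bit loading with transformers might look like. The checkpoint name `bigcode/starcoder`, the helper names `load_kwargs` and `generate`, and the `device_map="auto"` setting are assumptions for illustration. Newer transformers releases prefer passing a `BitsAndBytesConfig` instead of the bare `load_in_8bit` flag.

```python
# Assumption: the script loads the bigcode/starcoder checkpoint from Hugging Face.
MODEL_ID = "bigcode/starcoder"


def load_kwargs(load_in_8bit: bool = True) -> dict:
    """Build keyword arguments for from_pretrained (hypothetical helper)."""
    kwargs = {"device_map": "auto"}  # let accelerate place layers on available devices
    if load_in_8bit:
        kwargs["load_in_8bit"] = True  # needs bitsandbytes and a CUDA GPU
    return kwargs


def generate(prompt: str, max_new_tokens: int = 64) -> str:
    """Download the model on first use, then complete the prompt."""
    # Imported lazily so the module can be inspected without the heavy dependencies.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID, **load_kwargs())
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    outputs = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(outputs[0], skip_special_tokens=True)
```

Calling `generate("def fibonacci(n):")` would trigger the download and print a completion, provided the checkpoint is accessible from your Hugging Face account.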

Models

Llama 2 - https://huggingface.co/TheBloke/Llama-2-7b-Chat-GGUF
Llama 3 Instruct - https://huggingface.co/lmstudio-community/Meta-Llama-3-8B-Instruct-GGUF/tree/main

wget -c https://huggingface.co/TheBloke/Mistral-7B-v0.1-GGUF/resolve/main/mistral-7b-v0.1.Q8_0.gguf
wget -c https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.1-GGUF/resolve/main/mistral-7b-instruct-v0.1.Q8_0.gguf
wget -c https://huggingface.co/TheBloke/Mistral-7B-OpenOrca-GGUF/resolve/main/mistral-7b-openorca.Q8_0.gguf

wget -c https://huggingface.co/TheBloke/Wizard-Vicuna-7B-Uncensored-GGUF/resolve/main/Wizard-Vicuna-7B-Uncensored.Q8_0.gguf

wget -c https://huggingface.co/TheBloke/CodeLlama-7B-Instruct-GGUF/resolve/main/codellama-7b-instruct.Q8_0.gguf
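The wget URLs above all follow the same `resolve/main` pattern, so they can be generated from a repository id and a file name. The sketch below shows one way to do that in Python. The helper names `resolve_url` and `download` and the `models/` target directory are assumptions, and unlike `wget -c` this version does not resume partial downloads.

```python
import os
import urllib.request

HF_BASE = "https://huggingface.co"

# Repository id -> GGUF file, taken from the wget commands above.
MODELS = {
    "TheBloke/Mistral-7B-v0.1-GGUF": "mistral-7b-v0.1.Q8_0.gguf",
    "TheBloke/Mistral-7B-Instruct-v0.1-GGUF": "mistral-7b-instruct-v0.1.Q8_0.gguf",
    "TheBloke/Mistral-7B-OpenOrca-GGUF": "mistral-7b-openorca.Q8_0.gguf",
    "TheBloke/Wizard-Vicuna-7B-Uncensored-GGUF": "Wizard-Vicuna-7B-Uncensored.Q8_0.gguf",
    "TheBloke/CodeLlama-7B-Instruct-GGUF": "codellama-7b-instruct.Q8_0.gguf",
}


def resolve_url(repo_id: str, filename: str, revision: str = "main") -> str:
    """Build the direct-download URL for a file in a Hugging Face repository."""
    return f"{HF_BASE}/{repo_id}/resolve/{revision}/{filename}"


def download(repo_id: str, filename: str, target_dir: str = "models") -> str:
    """Fetch one model file into target_dir, skipping files that already exist."""
    os.makedirs(target_dir, exist_ok=True)
    path = os.path.join(target_dir, filename)
    if not os.path.exists(path):
        urllib.request.urlretrieve(resolve_url(repo_id, filename), path)
    return path
```

Looping over `MODELS.items()` and calling `download(repo_id, filename)` would fetch every file listed above.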

Try with LLaMA.cpp

Extract the LLaMA.cpp zip into the bin/ directory, then run:

./bin/main.exe -m models/llama-2-7b-chat.Q8_0.gguf

Try with vLLM

pip install -U vllm

python -u -m vllm.entrypoints.openai.api_server --host 0.0.0.0 --model mistralai/Mistral-7B-v0.1
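Once the server is up, it can be queried like any OpenAI-style completions endpoint. The following sketch uses only the standard library. It assumes the server listens on vLLM's default port 8000 on the local machine, and the function names `build_completion_request` and `complete` are illustrative.

```python
import json
import urllib.request

# Assumption: the server started above is reachable locally on the default port.
BASE_URL = "http://localhost:8000/v1"
MODEL = "mistralai/Mistral-7B-v0.1"  # must match the --model argument


def build_completion_request(prompt: str, max_tokens: int = 64) -> urllib.request.Request:
    """Prepare a POST request for the /v1/completions endpoint."""
    payload = {"model": MODEL, "prompt": prompt, "max_tokens": max_tokens}
    return urllib.request.Request(
        f"{BASE_URL}/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def complete(prompt: str, max_tokens: int = 64) -> str:
    """Send the prompt to the running server and return the generated text."""
    with urllib.request.urlopen(build_completion_request(prompt, max_tokens)) as resp:
        body = json.load(resp)
    return body["choices"][0]["text"]
```

With the server running, `complete("The capital of France is")` should return the model's continuation.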

Try with FastChat

pip install -U fschat

python -m fastchat.serve.openai_api_server --host localhost --port 8000

Try with LeptonAI

pip install -U leptonai

Try with ollama

echo "FROM ./models/llama-2-13b-chat.Q5_K_M.gguf" > llama-2-13b-chat.Modelfile

ollama create llama2-13b-chat -f ./llama-2-13b-chat.Modelfile

ollama run llama2-13b-chat
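Besides the interactive prompt, ollama also exposes a local REST API. The sketch below calls it from Python for the model created above. It assumes the ollama service is running on its default port 11434, and the helper names are illustrative.

```python
import json
import urllib.request

# Assumption: the ollama service is listening on its default local port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_generate_request(prompt: str, model: str = "llama2-13b-chat") -> urllib.request.Request:
    """Prepare a non-streaming generate request for the given model."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt: str, model: str = "llama2-13b-chat") -> str:
    """Return the full response text for a single prompt."""
    with urllib.request.urlopen(build_generate_request(prompt, model)) as resp:
        return json.load(resp)["response"]
```

Setting `"stream": False` returns one JSON object instead of a stream of partial responses, which keeps the client short.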

Specs

RAM Required:

Model Size	RAM Required
3B	8 GB
7B	16 GB
13B	32 GB
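As a rough cross-check of the table, the size of the weights alone is approximately the parameter count times the bits per weight, divided by 8. The sketch below encodes the table as a lookup and adds that back-of-the-envelope estimate. The function names are illustrative, and the estimate ignores context cache and runtime overhead, which is one reason the table's figures are higher.

```python
# Table above: model size in billions of parameters -> RAM required in GB.
RAM_REQUIRED_GB = {3: 8, 7: 16, 13: 32}


def ram_required_gb(params_billion: float) -> int:
    """Return the RAM figure of the smallest table row that covers the model."""
    for size in sorted(RAM_REQUIRED_GB):
        if params_billion <= size:
            return RAM_REQUIRED_GB[size]
    raise ValueError(f"no table entry for a {params_billion}B model")


def weights_size_gb(params_billion: float, bits_per_weight: float) -> float:
    """Estimate the size of the weights alone, in GB (10^9 bytes)."""
    return params_billion * bits_per_weight / 8
```

For example, `weights_size_gb(7, 8)` gives 7.0 GB for a 7B model at 8 bits per weight, comfortably inside the 16 GB that `ram_required_gb(7)` returns from the table.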

Other Tools

https://github.com/outlines-dev/outlines

Development Notes

pip install pyautogen

pip install openplayground
openplayground run

ollama run mistral

pip install -U jina

Ray Serve
pip install "ray[serve]"
https://github.com/ray-project/ray-llm

txtai

MLC AI - https://mlc.ai/package/
pip install --pre --force-reinstall mlc-ai-nightly mlc-chat-nightly -f https://mlc.ai/wheels
python -m mlc_chat.rest 

OpenLLM


https://github.com/FlowiseAI/Flowise


wget https://gpt4all.io/models/ggml-gpt4all-j.bin -O models/ggml-gpt4all-j

https://github.com/go-skynet/LocalAI
docker pull quay.io/go-skynet/local-ai:latest

nlpcloud

curl "https://api.nlpcloud.io/v1/<model_name>/entities" \
  -H "Authorization: Token <token>" \
  -H "Content-Type: application/json" \
  -X POST \
  -d '{"text":"John Doe has been working for Microsoft in Seattle since 1999."}'
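The same request can be made from Python with the standard library. This is a direct translation of the curl call above, not the official nlpcloud client. The function names are illustrative, and `model_name` and `token` remain placeholders that you must supply.

```python
import json
import urllib.request


def build_entities_request(text: str, model_name: str, token: str) -> urllib.request.Request:
    """Prepare the POST request for the entities endpoint of a given model."""
    return urllib.request.Request(
        f"https://api.nlpcloud.io/v1/{model_name}/entities",
        data=json.dumps({"text": text}).encode("utf-8"),
        headers={
            "Authorization": f"Token {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def extract_entities(text: str, model_name: str, token: str) -> dict:
    """Send the text to the API and return the decoded JSON response."""
    with urllib.request.urlopen(build_entities_request(text, model_name, token)) as resp:
        return json.load(resp)
```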


https://github.com/microsoft/semantic-kernel
https://github.com/microsoft/guidance


https://skypilot.readthedocs.io/

Later:
https://github.com/Arize-ai/phoenix
https://github.com/explodinggradients/ragas
https://github.com/trypromptly/LLMStack


Q5_K_M


LangServe

pip install -U "langserve[all]"
pip install -U langchain-cli


langflow run


flowise

promptflow
pip install promptflow promptflow-tools


# DSPy
pip install dspy-ai
