Name		Name	Last commit message	Last commit date
Latest commit History 230 Commits
api		api
app		app
cmd		cmd
docs		docs
llama		llama
server		server
web		web
.dockerignore		.dockerignore
.gitignore		.gitignore
.prettierrc.json		.prettierrc.json
Dockerfile		Dockerfile
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
go.mod		go.mod
go.sum		go.sum
main.go		main.go
models.json		models.json

Repository files navigation

Ollama

Run large language models with llama.cpp.

Note: certain models that can be run with this project are intended for research and/or non-commercial use only.

Features

Download and run popular large language models
Switch between multiple models on the fly
Hardware acceleration where available (Metal, CUDA)
Fast inference server written in Go, powered by llama.cpp
REST API to use with your application (python, typescript SDKs coming soon)

Install

Download for macOS
Download for Windows (coming soon)
Docker: docker run -p 11434:11434 ollama/ollama

You can also build the binary from source.

Quickstart

Run the model that started it all.

ollama run llama

Example models

💬 Chat

Have a conversation.

ollama run vicuna "Why is the sky blue?"

🗺️ Instructions

Ask questions. Get answers.

ollama run orca "Write an email to my boss."

👩‍💻 Code completion

Sometimes you just need a little help writing code.

ollama run replit "Give me react code to render a button"

📖 Storytelling

Venture into the unknown.

ollama run nous-hermes "Once upon a time"

Advanced usage

Run a local model

ollama run ~/Downloads/vicuna-7b-v1.3.ggmlv3.q4_1.bin

Building

make

To run it start the server:

./ollama server &

Finally, run a model!

./ollama run ~/Downloads/vicuna-7b-v1.3.ggmlv3.q4_1.bin

API Reference

`POST /api/pull`

Download a model

curl -X POST http://localhost:11343/api/pull -d '{"model": "orca"}'

`POST /api/generate`

Complete a prompt

curl -X POST http://localhost:11434/api/generate -d '{"model": "orca", "prompt": "hello!", "stream": true}'

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Ollama

Features

Install

Quickstart

Example models

💬 Chat

🗺️ Instructions

👩‍💻 Code completion

📖 Storytelling

Advanced usage

Run a local model

Building

API Reference

`POST /api/pull`

`POST /api/generate`

About

Releases

Packages

Languages

License

gaoguangjun/ollama

Folders and files

Latest commit

History

Repository files navigation

Ollama

Features

Install

Quickstart

Example models

💬 Chat

🗺️ Instructions

👩‍💻 Code completion

📖 Storytelling

Advanced usage

Run a local model

Building

API Reference

POST /api/pull

POST /api/generate

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

`POST /api/pull`

`POST /api/generate`

Packages