Stars
Building Large Language Model Applications, Published by Packt
LlamaIndex is a data framework for your LLM applications
A Streamlit user interface for local LLM implementation on Ollama. With just three python apps you can have a localized LLM to chat with. I'm running Ollama Windows (just updated) and DuckDuckGo br…
AI chat assistant for a specific codebase by retreiving relevant details using rag and chroma db
A simple streamlit app with ollama.
oneAPI Level Zero Specification Headers and Loader
Pre-trained Deep Learning models and demos (high quality and extremely fast)
Neural Network Compression Framework for enhanced OpenVINO™ inference
📚 Jupyter notebook tutorials for OpenVINO™
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…
Intel® NPU Acceleration Library
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU su…
A Python package for extending the official PyTorch that can easily obtain performance on Intel platform
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
High-level bindings for wasi-nn system calls
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Examples for using ONNX Runtime for machine learning inferencing.
TensorFlow Handbook for JavaScript/TypeScript
Python libraries for Google Colaboratory
Pretrained models for TensorFlow.js
WebGPU Tutorial: Step-by-step graphics programming with WebGPU - the next-generation graphics API for the web.
React Three Fiber, Threejs, Nextjs starter
😎 Curated list of awesome things around WebGPU ecosystem.