Stars
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Implementation of Nougat Neural Optical Understanding for Academic Documents
The Virtual Feature Store. Turn your existing data infrastructure into a feature store.
Repository for mRNA Paper and CodonBERT publication.
An expressive DSL and framework for Kubernetes configuration without YAML
Starlark in Go: the Starlark configuration language, implemented in Go
M3 monorepo - Distributed TSDB, Aggregator and Query Engine, Prometheus Sidecar, Graphite Compatible, Metrics Platform
🤖 Build voice-based LLM agents. Modular + open source.
A modern replacement for Redis and Memcached
Language-agnostic persistent background job server
BuildFlow, is an open source framework for building large scale systems using Python. All you need to do is describe where your input is coming from and where your output should be written, and Bui…
Neon: Serverless Postgres. We separated storage and compute to offer autoscaling, code-like database branching, and scale to zero.
A Gradio web UI for Large Language Models with support for multiple inference backends.
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
Running large language models on a single GPU for throughput-oriented scenarios.
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch…
🏬 modelstore is a Python library that allows you to version, export, and save a machine learning model to your filesystem or a cloud storage provider.
UnionML: the easiest way to build and deploy machine learning microservices
The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️
An open-source ML pipeline development platform
A docker-powered PaaS that helps you build and manage the lifecycle of applications
Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training