Stars
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes.
Large language model fine-tuning capabilities based on cloud native and distributed computing.
Large World Model -- Modeling Text and Video with Millions Context
PyTorch code and models for V-JEPA self-supervised learning from video.
Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""
Lustre Monitoring System based on Collectd, Grafana and Influxdb
This repository is established to store personal notes and annotated papers during daily research.
Dragonfly is an open source P2P-based file distribution and image acceleration system. It is hosted by the Cloud Native Computing Foundation (CNCF) as an Incubating Level Project.
Pretrain, finetune and serve LLMs on Intel platforms with Ray
🏭
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
OceanBase is an enterprise distributed relational database with high availability, high performance, horizontal scalability, and compatibility with SQL standards.
Training and serving large-scale neural networks with auto parallelization.
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…
A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology
A Virtual Machine Monitor for modern Cloud workloads. Features include CPU, memory and device hotplug, support for running Windows and Linux guests, device offload with vhost-user and a minimal com…
Generate Java types from JSON or JSON Schema and annotate those types for data-binding with Jackson, Gson, etc