ML
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Label Studio is a multi-type data labeling and annotation tool with a standardized output format
Deep universal probabilistic programming with Python and PyTorch
Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. Works with PyTorch and TensorFlow.
☁️ Build multimodal AI applications with cloud-native stack
The open source, end-to-end computer vision platform. Label, build, train, tune, deploy and automate in a unified platform that runs on any cloud and on-premises.
Demonstrating how Bodywork can be used to deploy a simulation of the lifecycle of a train-and-serve ML pipeline, responding to new data undergoing concept drift.
Convert Machine Learning Code Between Frameworks
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
The purpose of the catalog is to help data science teams collect all the requirements to consider while building an ML model and productionizing it.
A machine learning plugin in Open Distro for real-time anomaly detection on streaming data.
Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.
TensorFlow Similarity is a Python package focused on making similarity learning quick and easy.
A next-generation curated knowledge sharing platform for data scientists and other technical professionals.
OntoMerger is an ontology alignment library for deduplicating knowledge graph nodes that represent the same domain.
CodeGen is a family of open-source models for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.
🛠 All-in-one web-based IDE specialized for machine learning and data science.
Data Science Sandbox for Snowflake
FouCluster computes distances among songs in the frequency domain and operates on the resulting clusters
High-Performance Serverless event and data processing platform
MLRun is an open source MLOps platform for quickly building and managing continuous ML applications across their lifecycle. MLRun integrates into your development and CI/CD environment and automate…
The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️
🪴 Nebari - your open source data science platform
♾️ CML - Continuous Machine Learning | CI/CD for ML