Fluid is an open source, Kubernetes-native distributed dataset orchestrator and accelerator for data-intensive applications, such as big data and AI applications.
- Native Support for Dataset Abstraction
  Provides the capabilities that data-intensive applications need as natively supported functions, enabling efficient data access and reducing the cost of multidimensional management.
- Cloud Data Warm-up and Access Acceleration
  Fluid equips the distributed cache (backed by Alluxio) in Kubernetes with observability, portability, and horizontal scalability.
- Co-Orchestration for Data and Application
  When scheduling applications and placing data in the cloud, Fluid takes both the application's characteristics and the data location into consideration to improve performance.
- Support for Multiple Namespace Management
  Users can create and manage datasets in multiple namespaces.
- Support for Heterogeneous Data Source Management
  Unifies data access for OSS, HDFS, Ceph, and other underlying storage systems.
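
To make the co-orchestration idea concrete, the following is a minimal illustrative sketch, not Fluid's actual scheduler. It assumes a hypothetical `Node` record, a hypothetical `pick_node` helper, and an arbitrary `locality_weight`; none of these names come from Fluid. It only shows how an application's resource request and the location of cached data could be combined into a single placement score.

```python
from dataclasses import dataclass


@dataclass
class Node:
    """Hypothetical view of a cluster node (not a Fluid or Kubernetes type)."""
    name: str
    free_cpu: float          # CPU cores still available on the node
    cached_fraction: float   # share of the dataset cached on the node, 0.0 to 1.0


def pick_node(nodes, cpu_request, locality_weight=0.7):
    """Return the feasible node with the best combined score, or None.

    The score mixes data locality with leftover CPU headroom. The weight
    is an assumption chosen for illustration only.
    """
    feasible = [n for n in nodes if n.free_cpu >= cpu_request]
    if not feasible:
        return None

    def score(node):
        # Fraction of the node's free CPU that remains after placement.
        if node.free_cpu > 0:
            headroom = (node.free_cpu - cpu_request) / node.free_cpu
        else:
            headroom = 0.0
        return locality_weight * node.cached_fraction + (1 - locality_weight) * headroom

    return max(feasible, key=score)


nodes = [
    Node("node-a", free_cpu=8.0, cached_fraction=0.0),
    Node("node-b", free_cpu=4.0, cached_fraction=0.9),
    Node("node-c", free_cpu=1.0, cached_fraction=1.0),
]
chosen = pick_node(nodes, cpu_request=2.0)
print(chosen.name)
```

In this toy setup, `node-c` holds the whole dataset but cannot fit the request, so the choice falls to the node that balances cached data against spare capacity.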
- Kubernetes version > 1.14, with CSI support
- Golang 1.12+
- Helm 3
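
The version requirements above can be checked mechanically. The sketch below is a hedged example rather than an official tool: `parse_version` and `meets_prerequisites` are hypothetical helpers, the version strings are supplied by hand, and "> 1.14" is read literally (so `1.14.3` passes while a bare `1.14` does not). CSI support cannot be inferred from a version string and is not checked.

```python
import re


def parse_version(text):
    """Extract the first dotted number, e.g. 'go1.12.5' -> (1, 12, 5)."""
    match = re.search(r"\d+(?:\.\d+)*", text)
    if match is None:
        raise ValueError(f"unrecognized version string: {text!r}")
    return tuple(int(part) for part in match.group(0).split("."))


def meets_prerequisites(kubernetes, golang, helm):
    """Report, per component, whether the given version string qualifies."""
    return {
        # Literal reading of "> 1.14": strictly greater as a version tuple.
        "kubernetes": parse_version(kubernetes) > (1, 14),
        # "1.12+" means 1.12 or any later release.
        "golang": parse_version(golang)[:2] >= (1, 12),
        # "Helm 3" means major version 3.
        "helm": parse_version(helm)[0] == 3,
    }


ok = meets_prerequisites("v1.18.3", "go1.12.5", "v3.2.1")
too_old = meets_prerequisites("v1.14", "go1.11", "v2.16.1")
print(ok)
print(too_old)
```

The input strings would typically be copied from the output of each tool's own version command.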
You can follow our Get Started guide to quickly set up a test Kubernetes cluster.
See our documentation at docs for more in-depth installation and production instructions.
Feel free to reach out if you have any questions. The maintainers of this project are reachable via:
DingTalk:
Contributions are welcome and greatly appreciated. See CONTRIBUTING.md for details on submitting patches and the contribution workflow.
Fluid is under the Apache 2.0 license. See the LICENSE file for details.
This project is co-founded by Dr. Rong Gu from Nanjing University, Yang Che from Alibaba Group, Inc., and Dr. Bin Fan from Alluxio, Inc.