Fluid

📅 Community Meeting
The Fluid project holds bi-weekly community online meeting. To join or watch previous meeting notes and recordings, please see meeting schedule and meeting minutes.

What is Fluid?

Fluid is an open source Kubernetes-native Distributed Dataset Orchestrator and Accelerator for data-intensive applications, such as big data and AI applications. It is hosted by the Cloud Native Computing Foundation (CNCF) as a sandbox project.

For more information, please refer to our paper: Rong Gu, et al. Fluid: Dataset Abstraction and Elastic Acceleration for Cloud-native Deep Learning Training Jobs. IEEE ICDE. pp. 2183-2196, May. 2022.

Fluid

English | 简体中文

What is NEW!
Sep. 03th, 2022. Fluid v0.8.0 is RELEASED! It provides various new features, such as Lifecycle management of Serverless Job with fluid sidecar support, Enable runtime controller on demand, Automatic CRD upgrader, Restrict pod scheduling to dataset cache nodes, Arm64 support with JuicefsRuntime, GCS support for Alluxio Runtime, and so on. Please check the CHANGELOG for details.
Mar. 02th, 2022. Fluid v0.7.0 is RELEASED! It provides various new features, such as Fuse sidecar auto injection for all the runtimes (suitable for serverless environment), Fuse auto recovery and upgrade, lazy fuse mount mode, support JuiceFS cache runtime and so on. Please check the CHANGELOG for details.
Aug. 11th, 2021. Fluid v0.6.0 is RELEASED! It provides various new features, such as dataset cache autoscaling and cronscaling, dataset cache aware Pod scheduling, HA support for cache Runtime. Please check the CHANGELOG for details.
Apr. 27th, 2021. Fluid accpeted by CNCF! Fluid project was accepted as an official CNCF Sandbox Project by CNCF Technical Oversight Committee (TOC) with a majority vote after the review process. New beginning for Fluid! .

Features

Dataset Abstraction

Implement the unified abstraction for datasets from multiple storage sources, with observability features to help users evaluate the need for scaling the cache system.
Scalable Data Engine

Offers a unified access interface for data operations with different runtimes, enabling access to third-party storage systems.
Automated Data Operations

Provides various automated data operation modes to facilitate integration with automated operations systems.
Elasticity and Scheduling

By combining data caching technology with elastic scaling, portability, observability, and data affinity scheduling capabilities, Fluid enhances data access performance.
Runtime Platform Agnostic

Supports a variety of environments and can run different storage clients based on the environment, including native, edge, Serverless Kubernetes clusters, and Kubernetes multi-cluster environments.

Key Concepts

Dataset: A DataSet is a set of data logically related that can be used by computing engines, such as Spark for big data analytics and TensorFlow for AI applications. Intelligently leveraging data often creates core industry values. Managing DataSets may require features in different dimensions, such as security, version management and data acceleration. We hope to start with data acceleration to support the management of datasets.

Runtime: The execution engine that enforces dataset security, provides version management and data acceleration capabilities. The Runtime defines a set of interfaces to manage DataSets in their life cycle, so the management and acceleration of datasets can be implemented behind these interfaces.

Prerequisites

Kubernetes version > 1.16, and support CSI
Golang 1.18+
Helm 3

Quick Start

You can follow our Get Started guide to quickly start a testing Kubernetes cluster.

Documentation

You can see our documentation at docs for more in-depth installation and instructions for production:

You can also visit Fluid Homepage to get relevant documents.

Quick Demo

Demo 1: Accelerate Remote File Accessing with Fluid

Demo 2: Machine Learning with Fluid

Demo 3: Accelerate PVC with Fluid

Demo 4: Preload dataset with Fluid

Demo 5: On-the-fly dataset cache scaling

Roadmap

See ROADMAP.md for the roadmap details. It may be updated from time to time.

Community

Feel free to reach out if you have any questions. The maintainers of this project are reachable via:

DingTalk:

WeChat Official Account:

Slack:

Join in the CNCF Slack and navigate to the #fluid channel for discussion.

Contributing

Contributions are highly welcomed and greatly appreciated. See CONTRIBUTING.md for details on submitting patches and the contribution workflow.

Adopters

If you are interested in Fluid and would like to share your experiences with others, you are warmly welcome to add your information on ADOPTERS.md page. We will continuously discuss new requirements and feature design with you in advance.

Open Source License

Fluid is under the Apache 2.0 license. See the LICENSE file for details. It is vendor-neutral.

Report Vulnerability

Security is a first priority thing for us at Fluid. If you come across a related issue, please send email to [email protected] .

Code of Conduct

Fluid adopts CNCF Code of Conduct.

Name		Name	Last commit message	Last commit date
Latest commit History 2,000 Commits
.circleci		.circleci
.github		.github
addons		addons
api/v1alpha1		api/v1alpha1
charts		charts
cmd		cmd
config		config
csi		csi
docker		docker
docs-new		docs-new
docs		docs
hack		hack
integration		integration
pkg		pkg
samples		samples
static		static
test		test
tools		tools
vendor		vendor
.gitignore		.gitignore
.travis.yml		.travis.yml
ADOPTERS.md		ADOPTERS.md
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
GOVERNANCE.md		GOVERNANCE.md
LICENSE		LICENSE
MAINTAINERS_COMMITTERS.md		MAINTAINERS_COMMITTERS.md
Makefile		Makefile
OWNERS		OWNERS
PROJECT		PROJECT
README-zh_CN.md		README-zh_CN.md
README.md		README.md
RELEASE.md		RELEASE.md
ROADMAP.md		ROADMAP.md
SECURITY_CONTACTS		SECURITY_CONTACTS
codecov.yml		codecov.yml
coverage.txt		coverage.txt
go.mod		go.mod
go.sum		go.sum
k8s-mod.sh		k8s-mod.sh
sonar-project.properties		sonar-project.properties
version.go		version.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

What is Fluid?

Fluid

Features

Key Concepts

Prerequisites

Quick Start

Documentation

Quick Demo

Roadmap

Community

Contributing

Adopters

Open Source License

Report Vulnerability

Code of Conduct

About

Releases

Packages

Languages

License

shiwenwenya/fluid

Folders and files

Latest commit

History

Repository files navigation

What is Fluid?

Fluid

Features

Key Concepts

Prerequisites

Quick Start

Documentation

Quick Demo

Roadmap

Community

Contributing

Adopters

Open Source License

Report Vulnerability

Code of Conduct

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages