Starred repositories
NoSQL data store using the seastar framework, compatible with Apache Cassandra
JanusGraph: an open-source, distributed graph database
SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC ac…
Curve is a sandbox project hosted by the CNCF Foundation. It's cloud-native, high-performance, and easy to operate. Curve is an open-source distributed storage system for block and shared file stor…
Scalable, reliable, distributed storage system optimized for data analytics and object store workloads.
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualiz…
Tools for monitoring NVIDIA GPUs on Linux
A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin
WeDataSphere is a financial grade, one-stop big data platform suite.
🐶 Kubernetes CLI To Manage Your Clusters In Style!
[EOL] External storage plugins, provisioners, and helper libraries
Official Java client library for kubernetes
Standardized Serverless ML Inference Platform on Kubernetes
Easy automated syncing between your computers and your MEGA Cloud Drive
Native Kubernetes container management platform supporting multi-tenant and multi-cluster