- chengdu
- csunny.github.io
- @magic10616810
Lists (1)
Sort Name ascending (A-Z)
Starred repositories
No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
DSPy: The framework for programming—not prompting—language models
MinIO is a high-performance, S3 compatible object store, open sourced under GNU AGPLv3 license.
Awesome-RAG: Collect typical RAG papers and systems.
vsag is a vector indexing library used for similarity search.
This is a repo with links to everything you'd ever want to learn about data engineering
Let your Claude able to think
ETL, Analytics, Versioning for Unstructured Data
Lyric: A Rust-powered secure runtime for AI-Agent.
Secure open source cloud runtime for AI apps & AI agents
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification
A Notebook Web Client with Flexible Customization and Easy Integration.
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
☘️ A visualization grammar based on G2 for streamlit.
SuperSonic is the next-generation AI+BI platform that unifies Chat BI (powered by LLM) and Headless BI (powered by semantic layer) paradigms.
agentUniverse is a LLM multi-agent framework that allows developers to easily build multi-agent applications.
Making data higher-quality, juicier, and more digestible for foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!
Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!
Cloud Native DataOps & AIOps Platform | 云原生数智运维平台
rockset / rocksdb-cloud
Forked from facebook/rocksdbA library that provides an embeddable, persistent key-value store for fast storage optimized for AWS
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs