- Chengdu
- csunny.github.io
- @magic10616810
Starred repositories
No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
DSPy: The framework for programming—not prompting—language models
MinIO is a high-performance, S3-compatible object store, open-sourced under the GNU AGPLv3 license.
Awesome-RAG: Collect typical RAG papers and systems.
vsag is a vector indexing library used for similarity search.
This is a repo with links to everything you'd ever want to learn about data engineering
Let your Claude be able to think
ETL, Analytics, Versioning for Unstructured Data
Lyric: A Rust-powered secure runtime for AI-Agent.
Secure open source cloud runtime for AI apps & AI agents
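Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing, semantic role labeling, coreference resolution, style transfer, semantic similarity, new word discovery, keyword and phrase extraction, automatic summarization, text classification and clustering, pinyin and simplified/traditional Chinese conversion, natural language processing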
A Notebook Web Client with Flexible Customization and Easy Integration.
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
☘️ A visualization grammar based on G2 for streamlit.
SuperSonic is the next-generation AI+BI platform that unifies Chat BI (powered by LLM) and Headless BI (powered by semantic layer) paradigms.
agentUniverse is an LLM multi-agent framework that allows developers to easily build multi-agent applications.
Making data higher-quality, juicier, and more digestible for foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!
Cloud Native DataOps & AIOps Platform
rockset / rocksdb-cloud
Forked from facebook/rocksdb
A library that provides an embeddable, persistent key-value store for fast storage optimized for AWS
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs