Skip to content
View zhenyu's full-sized avatar

Block or report zhenyu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A portable accelerated data query and LLM-inference engine, written in Rust, for data-grounded AI apps and agents.

Rust 2,053 90 Updated Feb 20, 2025

Fluid, elastic data abstraction and acceleration for BigData/AI applications in cloud. (Project under CNCF)

Go 1,714 958 Updated Feb 20, 2025

An open-source ML pipeline development platform

Python 980 62 Updated Jan 9, 2025

Epsilla is a high performance Vector Database Management System

C++ 837 40 Updated Dec 5, 2024

🧙 Build, run, and manage data pipelines for integrating and transforming data.

Python 8,149 817 Updated Feb 15, 2025

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Python 17,294 4,328 Updated Feb 20, 2025

C++/Wolfram Language package for exploring set and graph rewriting systems

Mathematica 222 47 Updated Feb 20, 2025

The open source Firebase alternative. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.

TypeScript 77,712 7,751 Updated Feb 20, 2025

LlamaIndex is the leading framework for building LLM-powered agents over your data.

Python 39,093 5,574 Updated Feb 20, 2025

Making data lake work for time series

Python 1,152 59 Updated Aug 21, 2024

《Machine Learning Systems: Design and Implementation》- Chinese Version

TeX 4,248 448 Updated Apr 13, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 36,869 4,250 Updated Feb 20, 2025

Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!

Python 5,649 395 Updated Feb 20, 2025

Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch…

Python 1,819 283 Updated Dec 2, 2023

[ICCV 2023] VAD: Vectorized Scene Representation for Efficient Autonomous Driving

Python 850 90 Updated Feb 19, 2025

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…

TypeScript 6,112 1,148 Updated Feb 20, 2025

A curated, but incomplete, list of data-centric AI resources.

1,081 76 Updated Jun 26, 2024

High-Performance Serverless event and data processing platform

Go 5,369 540 Updated Feb 20, 2025

Build data pipelines, the easy way 🛠️

TypeScript 4,111 263 Updated Jun 6, 2023

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activelo…

Python 8,398 643 Updated Feb 15, 2025

DuckDB is an analytical in-process SQL database management system

C++ 26,606 2,087 Updated Feb 20, 2025

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, Du…

Rust 4,189 256 Updated Feb 19, 2025

Prefect is a workflow orchestration framework for building resilient data pipelines in Python.

Python 18,317 1,710 Updated Feb 20, 2025

Evidently is ​​an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.

Jupyter Notebook 5,718 628 Updated Feb 20, 2025

Federated learning platform for edge computing, based on KubeEdge

Python 27 3 Updated Jul 31, 2021

WebRTC for the Curious: Go beyond the APIs

Python 2,026 203 Updated Feb 18, 2025

🪄 Master Modern C++(11/14/17/20) Templates: TMP, SFINAE, Concepts, CRTP, Variadic Magic, and Compile-Time Sorcery

C++ 1,624 283 Updated Jan 24, 2025

flink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例,还有 Flink 落地应用的大型项目案例(PVUV、日志存储、百亿数据实时去…

Java 14,655 3,927 Updated May 25, 2024

Curated List of Self-Driving Cars and Autonomous Vehicles Resources

2,229 584 Updated Mar 15, 2024

An awesome list of self-driving cars

717 171 Updated Sep 12, 2023
Next