janpychou
  • sugo.io
  • guangzhou


DuckDB is an analytical in-process SQL database management system

C++ 26,683 2,094 Updated Feb 22, 2025

A composable and fully extensible C++ execution engine library for data management systems.

C++ 3,618 1,211 Updated Feb 24, 2025

OCR & Document Extraction using vision models

TypeScript 9,865 644 Updated Feb 18, 2025

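This project shares the technical principles behind large language models and hands-on experience with them (LLM engineering and deploying LLM applications).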

HTML 14,467 1,672 Updated Feb 12, 2025

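Roadmaps, tutorials, and practical material on AI technologies: machine learning, deep learning, natural language processing, computer vision, and assorted algorithms. Notes cover: Machine Learning in Action, 剑指Offer (coding interview problems), cs231n, cs131, Andrew Ng's Machine Learning, cs224n, and Python natural language processing in practice.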

Python 574 144 Updated Nov 14, 2020

🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.

Python 13,340 1,141 Updated Feb 8, 2025

Front-end close-reading weekly. Helps you understand the most cutting-edge and practical technologies.

JavaScript 28,958 3,255 Updated Sep 9, 2024

🔥🔥🔥 AI-driven database tool and SQL client, the hottest GUI client, supporting MySQL, Oracle, PostgreSQL, DB2, SQL Server, SQLite, H2, ClickHouse, and more.

Java 19,202 2,133 Updated Feb 23, 2025

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.

Java 2,651 1,030 Updated Feb 24, 2025

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

Java 10,886 3,106 Updated Feb 23, 2025

Cross-platform Excel table export tool (Excel => protobuf/msgpack/lua/javascript/json/xml)

Java 280 71 Updated Dec 4, 2024

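Chinese and English sensitive words, language detection, domestic and international mobile/landline number location and carrier lookup, gender inference from names, mobile number extraction, ID card number extraction, email extraction, Chinese and Japanese personal name databases, Chinese abbreviation database, character decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, Traditional/Simplified Chinese conversion, English imitation of Chinese pronunciation, Wang Feng lyrics generator, job title lexicon, synonym lexicon, antonym lexicon, negation word lexicon, car brand lexicon, car parts lexicon, continuous English text segmentation, various Chinese word vectors, company name list, classical poetry corpus, IT lexicon, finance lexicon, idiom lexicon, place name lexicon, …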

Python 71,137 14,694 Updated May 10, 2024

Front-end low-code framework that generates all kinds of pages from JSON configuration.

TypeScript 17,800 2,575 Updated Feb 21, 2025

Table and column lineage project for Doris

Java 78 30 Updated Apr 30, 2024

Lineage collection via the kettle plugin mechanism, capturing SQL at execution time and more.

Java 9 2 Updated Jan 28, 2021

Parses SQL to obtain column-level and table-level lineage, converts it into a lineage model, and presents it in the graph database neo4j.

Java 171 81 Updated Nov 17, 2020

Video generator in the style of 半佛 (Banfo)

Python 188 35 Updated Oct 24, 2021

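Quickly and efficiently generate videos for Douyin, Kuaishou, Huoshan, and Xigua; batch-produce short videos such as news and jokes; video style transfer; dynamic ranking videos; batch video upload and publishing.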

Python 623 187 Updated May 17, 2021

Apache Doris is an easy-to-use, high performance and unified analytics database.

Java 13,200 3,379 Updated Feb 24, 2025

DataSphereStudio is a one-stop data application development & management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualiz…

Java 3,124 1,013 Updated Nov 29, 2024

Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.

Java 3,339 1,167 Updated Feb 6, 2025

Scriptis is for interactive data analysis with script development(SQL, Pyspark, HiveQL), task submission(Spark, Hive), UDF, function, resource management and intelligent diagnosis.

Vue 810 267 Updated Dec 11, 2024

Tesseract Open Source OCR Engine (main repository)

C++ 64,800 9,706 Updated Feb 12, 2025

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance …

Java 9,547 1,907 Updated Feb 24, 2025

Awesome pre-trained models toolkit based on PaddlePaddle. (400+ models including Image, Text, Audio, Video and Cross-Modal with Easy Inference & Serving)

Python 12,801 2,072 Updated Aug 7, 2024

LunarBase engine (in C and Java) aims to be a real-time, free-style database engine, managing 2 billion records in one table, where each record has a size limit of up to 32k bytes, and it can…

Java 36 10 Updated Jun 10, 2017

C++ Parallel Computing and Asynchronous Networking Framework

C++ 13,519 2,458 Updated Feb 19, 2025

Submarine is a Cloud Native Machine Learning Platform.

Java 700 254 Updated Apr 3, 2024

pandas tutorial in Chinese

Jupyter Notebook 4,756 1,895 Updated Apr 24, 2024

Detailed explanations of the formulas in "Machine Learning" (the "Watermelon Book")

24,457 4,777 Updated Feb 18, 2025