Skip to content
View lidong1665's full-sized avatar

Block or report lidong1665

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Integrate the DeepSeek API into popular softwares

23,840 2,538 Updated Feb 28, 2025

Embed Python in Java

C 1,372 150 Updated Feb 24, 2025

AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…

Python 19,510 2,069 Updated Feb 28, 2025

Start building LLM-empowered multi-agent applications in an easier way.

Python 6,382 375 Updated Feb 24, 2025

Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI

TypeScript 20,249 2,014 Updated Mar 1, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 41,166 5,537 Updated Feb 28, 2025

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …

Python 6,707 553 Updated Feb 28, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 42,591 5,203 Updated Feb 28, 2025

Intelligent data apps and assets with LLMs

Python 133 48 Updated Feb 24, 2025

A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour

Python 40,465 5,997 Updated Mar 1, 2025

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 75,715 11,044 Updated Mar 1, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 37,139 4,272 Updated Mar 1, 2025

省市区县乡镇三级或四级城市数据,带拼音标注、坐标、行政区域边界范围;2025年01月14日最新采集,提供csv格式文件,支持在线转成多级联动js代码、通用json格式,提供软件转成shp、geojson、sql、导入数据库;带浏览器里面运行的js采集源码,综合了中华人民共和国民政部、国家统计局、高德地图、腾讯地图行政区划数据

JavaScript 5,842 929 Updated Jan 14, 2025

使用Bert,ERNIE,进行中文文本分类

Python 4,158 906 Updated Jun 28, 2024

基于 BERT 模型的中文文本分类工具

Python 62 15 Updated Apr 5, 2022

Use SQL to query Elasticsearch

Java 7,009 1,542 Updated Feb 4, 2025

The Memory layer for AI Agents

Python 24,972 2,323 Updated Mar 1, 2025

Easy to use open source fast database for search | Good alternative to Elasticsearch now | Drop-in replacement for E in the ELK soon

C++ 9,731 538 Updated Mar 1, 2025

更加简洁友好的接口,封装elasticsearch,lucene等索引工具的细节,提供通用搜索服务

JavaScript 31 14 Updated Jan 12, 2019

Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.

Java 3,342 1,168 Updated Feb 6, 2025

A MediaWiki bot framework in Java

Java 67 59 Updated Jan 4, 2025

A Pythonic wrapper for the Wikipedia API

Python 2,924 521 Updated May 12, 2024

FlinkSQL数据脱敏和行级权限解决方案及源码,支持面向用户级别的数据脱敏和行级数据访问控制,即特定用户只能访问到脱敏后的数据或授权过的行。此方案是实时领域Flink的解决方案,类似于离线数仓Hive Ranger中的Row-level Filter和Column Masking方案。

Java 132 48 Updated Oct 12, 2023

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 10,639 1,200 Updated Mar 1, 2025

RedisShake is a Redis data processing and migration tool.

Go 3,960 715 Updated Feb 24, 2025

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…

TypeScript 6,165 1,164 Updated Mar 1, 2025

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

3,743 262 Updated Mar 1, 2025

一款支持标准化schema定义、自动化部署产品包的软件,旨在对产品包下每个服务进行部署、升级、卸载、配置等操作,解放人工运维成本。

Go 203 68 Updated Aug 13, 2023

The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data 📊

Clojure 41,025 5,396 Updated Mar 1, 2025

An Open Standard for lineage metadata collection

Java 1,855 326 Updated Feb 28, 2025
Next