Stars
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
An orchestration platform for the development, production, and observation of data assets.
Run any open-source LLMs, such as Llama, Mistral, as OpenAI compatible API endpoint in the cloud.
An Industrial Grade Federated Learning Framework
CKAN is an open-source DMS (data management system) for powering data hubs and data portals. CKAN makes it easy to publish, share and use data. It powers catalog.data.gov, open.canada.ca/data, data…
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
ModelScope-Agent: An agent framework connecting models in ModelScope with the world
A unified framework for privacy-preserving data analysis and machine learning
HanLP作者的新书《自然语言处理入门》详细笔记!业界良心之作,书中不是枯燥无味的公式罗列,而是用白话阐述的通俗易懂的算法模型。从基本概念出发,逐步介绍中文分词、词性标注、命名实体识别、信息抽取、文本聚类、文本分类、句法分析这几个热门问题的算法原理与工程实现。
🚀 RocketQA, dense retrieval for information retrieval and question answering, including both Chinese and English state-of-the-art models.
Image Recognition captcha without image segmentation 无需图片分割的验证码识别
FALCON: experimental PacBio diploid assembler -- Out-of-date -- Please use a binary release: https://github.com/PacificBiosciences/FALCON_unzip/wiki/Binaries
AI Flow is an open source framework that bridges big data and artificial intelligence.
Data Pipeline Clientlib provides an interface to tail and publish to data pipeline topics.
BobbySun / autolabel
Forked from refuel-ai/autolabelLabel, clean and enrich text datasets with LLMs.