Kaiyinzhou kyzhouhzau

🎯

Focusing

Interested in machine learning, deep learning and knowledge graph. Familiar with basic machine learning algorithms, especially variational inference.

182 followers · 20 following

BUPT
Beijin Province, China

Achievements

Highlights

Lists (1)

Sort

预训练语言模型

Stars

bionlp-hzau / BLAH8-LLM-for-Rice-GARE

Forked from YaoXinZhi/BLAH8-LLM-for-Rice-GARE

Bioregulatory Event Extraction using Large Language Models: A Case Study of Rice Literature

Python 2 Updated Feb 27, 2024

AI-in-Health / MedLLMsPracticalGuide

A curated list of practical guide resources of Medical LLMs (Medical LLMs Tree, Tables, and Papers)

1,211 109 Updated Dec 3, 2024

PKU-YuanGroup / Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 11,736 1,035 Updated Dec 16, 2024

PhoebusSi / Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts…

Jupyter Notebook 2,641 248 Updated Dec 12, 2023

zjunlp / KnowLM

An Open-sourced Knowledgable Large Language Model Framework.

Python 1,253 127 Updated Jun 26, 2024

lm-sys / FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 37,277 4,574 Updated Dec 10, 2024

Lightning-AI / lit-llama

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Python 6,012 517 Updated Sep 6, 2024

esbatmop / MNBVC

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化，也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

3,587 249 Updated Dec 17, 2024