kyzhouhzau
🎯 Focusing
  • BUPT
  • Beijing, China

107 results for source starred repositories

A curated list of practical guide resources of Medical LLMs (Medical LLMs Tree, Tables, and Papers)

1,213 109 Updated Dec 3, 2024

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Python 11,737 1,035 Updated Dec 16, 2024

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs, and parameter-efficient methods (e.g., LoRA, P-tuning) for easy use. We welcome open-source enthusiasts…

Jupyter Notebook 2,641 248 Updated Dec 12, 2023

An Open-sourced Knowledgeable Large Language Model Framework.

Python 1,253 127 Updated Jun 26, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 37,279 4,574 Updated Dec 10, 2024

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Python 6,012 517 Updated Sep 6, 2024

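MNBVC (Massive Never-ending BT Vast Chinese corpus): a very large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" (火星文) data. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wiki, classical poetry, lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.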

3,587 249 Updated Dec 17, 2024

Example models using DeepSpeed

Python 6,162 1,052 Updated Dec 14, 2024

BELLE: Be Everyone's Large Language model Engine (an open-source Chinese conversational large language model)

HTML 7,985 762 Updated Oct 16, 2024

FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use, and extensible toolkit for large-scale models.

Python 3,841 417 Updated Nov 19, 2024

Inference code for Llama models

Python 56,853 9,614 Updated Aug 18, 2024
Python 1,514 131 Updated Apr 27, 2023

⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.

Jupyter Notebook 2,187 386 Updated Sep 29, 2023

Making large AI models cheaper, faster and more accessible

Python 38,918 4,349 Updated Dec 17, 2024

Introduction to CPM

163 22 Updated Sep 26, 2021

Chinese Pre-Trained Language Models (CPM-LM) Version-I

Python 1,588 211 Updated Mar 18, 2023

stable diffusion webui colab

Jupyter Notebook 15,677 2,617 Updated Oct 15, 2024

Code repository of the paper "CKConv: Continuous Kernel Convolution For Sequential Data" published at ICLR 2022. https://arxiv.org/abs/2102.02611

Python 118 15 Updated Nov 29, 2022

Code for "Locate and Label: A Two-stage Identifier for Nested Named Entity Recognition", accepted at ACL 2021.

Python 102 18 Updated Mar 18, 2022

BERT-for-BioNLP-OST2019-AGAC-Task2

Python 3 3 Updated Jan 23, 2024
Python 61 27 Updated Jun 13, 2019

Named Entity Recognition as Dependency Parsing

Python 350 39 Updated Aug 16, 2023

The most detailed proxy ("ladder", 梯子) tutorial ever: CentOS 7.4 + Alibaba Cloud ECS

110 31 Updated Sep 22, 2019

A free and unlimited Python API for Google Translate.

Python 396 170 Updated Oct 6, 2022

PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538

Python 1,009 105 Updated Apr 19, 2024

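A collection of public resources for Chinese medical NLP: terminologies/corpora/word embeddings/pre-trained models/knowledge graphs/named entity recognition/QA/information extraction/models/papers/etc.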

2,222 376 Updated Jan 17, 2024

Code for the ACL 2020 paper "FLAT: Chinese NER Using Flat-Lattice Transformer"

Python 1,004 175 Updated May 10, 2022

Repository for benchmarking graph neural networks

Jupyter Notebook 2,537 455 Updated Jun 22, 2023