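First-place solution on the online leaderboard of the Tianchi competition on judging similar sentence pairs in epidemic-related questions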

Python 432 76 Updated Oct 17, 2020

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

JavaScript 1,175 60 Updated Dec 3, 2024

Several simple examples for popular neural network toolkits calling custom CUDA operators.

Python 1,372 191 Updated Apr 29, 2021

torch-optimizer -- collection of optimizers for PyTorch

Python 3,058 298 Updated Mar 22, 2024

China's large models

5,656 469 Updated Nov 30, 2024

Chinese NLP solutions (large models, data, models, training, inference)

Jupyter Notebook 3,093 375 Updated Dec 17, 2024

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts…

Jupyter Notebook 2,644 248 Updated Dec 12, 2023

Chinese medical consultation model based on ChatGLM-6B

Python 788 87 Updated Oct 19, 2023

MNBVC General Cleaning Script for the Q&A Dataset of Foreign Ministry Journalists

Python 6 1 Updated Jul 2, 2023

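A curated collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and trained at low cost. It covers base models, vertical-domain fine-tuning and applications, datasets, and tutorials.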

16,913 1,600 Updated Sep 19, 2024

Execute Megatron-DeepSpeed using Slurm for multi-node distributed training

Shell 6 1 Updated May 4, 2022

Chinese llama2, from pre-training to reinforcement learning

Python 95 14 Updated Oct 19, 2023

A collection of open-source SFT datasets, updated continually

465 38 Updated Jun 2, 2023

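This project aims to share the technical principles behind large models along with hands-on experience (large-model engineering and deploying large-model applications)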

HTML 12,053 1,273 Updated Dec 17, 2024

Deep Reinforcement Learning

3,423 592 Updated Dec 10, 2022

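dialogbot provides search-based, task-based, and generative dialogue models. A dialogue bot built on question-answering, task-oriented, and chit-chat dialogue models; it supports web-retrieval QA, domain-knowledge QA, task-guided QA, and chit-chat QA, and works out of the box.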

Python 329 61 Updated Apr 23, 2024

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 36,551 4,503 Updated Dec 21, 2024

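The Panda project is an open-source overseas Chinese large language model project launched in May 2023. It is dedicated to exploring the entire technology stack in the era of large models and aims to promote innovation and collaboration in Chinese natural language processing.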

Python 1,069 90 Updated Oct 19, 2023

PyTorch implementation of unsupervised SimCSE for Chinese

Python 133 31 Updated Jul 8, 2021

Simple experiments with SimCSE on Chinese tasks

Python 594 83 Updated Aug 7, 2023

bert_avg, bert_whitening, sbert, consert, simcse, esimcse: Chinese sentence-vector representations

Python 16 2 Updated Apr 7, 2022
Python 277 79 Updated Apr 26, 2022

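Mainly a collection of good articles I have come across in my daily reading, kept for my own convenience and shared with everyone. For articles I have read, I give a brief commentary; those I have not read yet are listed first, and the commentary is added once I have read them. Topics cover search, recommendation, and natural language processing.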

1,742 328 Updated Jun 3, 2021

NLP sentence encoding, sentence embedding, and semantic similarity: BERT_avg, BERT_whitening, SBERT, SimCSE

Python 174 36 Updated Dec 29, 2021

Chinese natural language inference dataset (a large-scale Chinese natural language inference and semantic similarity calculation dataset)

427 45 Updated Feb 10, 2020

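Natural language processing (NLP): Xiaojiang bot (retrieval-based chit-chat chatbot), BERT sentence vectors and similarity (Sentence Similarity), XLNet sentence vectors and similarity (text xlnet embedding), text classification, entity extraction (NER, bert+bilstm+crf), data augmentation (text augment, data enhance), synonymous sentence and synonym generation, sentence…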

Python 1,527 395 Updated Sep 23, 2021

Collections of resources from Joint Laboratory of HIT and iFLYTEK Research (HFL)

Markdown 367 41 Updated Mar 9, 2023

Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm model series)

Python 9,751 1,390 Updated Jul 31, 2023

Search all Chinese NLP datasets, with commonly used English NLP datasets included

Python 4,194 614 Updated Nov 21, 2022