NeilHUI

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

Python 6,294 775 Updated Jan 3, 2025

💬 Ready-to-use, flexible RAG Chatbot. A knowledge-base question-answering system built on large language models and RAG.

Python 12,465 1,626 Updated Jan 8, 2025

Train transformer language models with reinforcement learning.

Python 10,548 1,364 Updated Jan 8, 2025

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 37,537 4,630 Updated Jan 8, 2025

Source code for "Train No Evil: Selective Masking for Task-Guided Pre-Training"

Python 70 17 Updated Nov 25, 2022

Search all Chinese NLP datasets, with commonly used English NLP datasets included.

Python 4,211 616 Updated Nov 21, 2022

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

Python 19,173 4,665 Updated Jan 8, 2025

InternEvo is an open-source, lightweight training framework that aims to support model pre-training without the need for extensive dependencies.

Python 324 54 Updated Jan 6, 2025

[ICML'24] The official implementation of “Rethinking Optimization and Architecture for Tiny Language Models”

Python 119 6 Updated Jul 8, 2024

Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.

Python 1,715 386 Updated Jan 6, 2025

Clash for openwrt [Luci-app-clash] https://github.com/frainzy1477/luci-app-clash

Makefile 357 75 Updated Dec 28, 2019

unified embedding model

Python 845 66 Updated Sep 1, 2023

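A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are cheaper to train, including base models, vertical-domain fine-tuning and applications, datasets, and tutorials.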

17,346 1,663 Updated Sep 19, 2024

Chinese lexicons/dictionaries for use in NLP projects, word segmentation, and similar scenarios.

46 19 Updated Jun 15, 2022

THUOCL (THU Open Chinese Lexicon), a Chinese lexicon.

884 197 Updated Apr 3, 2023

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,747 2,222 Updated Jul 29, 2024

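This project aims to share the technical principles behind large models and hands-on experience (large-model engineering and deploying large-model applications).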

HTML 12,669 1,382 Updated Jan 4, 2025

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,148 826 Updated Jun 10, 2024

Awesome-LLM: a curated list of Large Language Models

20,436 1,667 Updated Dec 31, 2024

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 20,767 2,674 Updated Aug 15, 2024

MixText: Linguistically-Informed Interpolation of Hidden Space for Semi-Supervised Text Classification

Jupyter Notebook 355 62 Updated Jun 5, 2020

Records of programming practice for 剑指offer and the corresponding LeetCode problems, aimed at algorithm-engineer positions, in both Python and Java.

23 6 Updated Aug 27, 2018

Chinese text classification with TextCNN, TextRNN, FastText, TextRCNN, BiLSTM_Attention, DPCNN, and Transformer; based on PyTorch and ready to use out of the box.

Python 5,429 1,238 Updated Sep 23, 2020

Hierarchy-aware Label Semantics Matching Network for Hierarchical Text Classification

Python 46 9 Updated Feb 21, 2023

Hierarchy-Aware Global Model for Hierarchical Text Classification

Python 210 43 Updated Nov 28, 2022

This repository implements a contrastive learning model for hierarchical text classification. This work has been accepted as the long paper "Incorporating Hierarchy into Text Encoder: a Contrastive…

Python 132 30 Updated May 14, 2024
Jupyter Notebook 323 91 Updated May 24, 2019

Code for Label Semantics for Few Shot Named Entity Recognition

Python 55 7 Updated Apr 27, 2023
Python 1 Updated May 19, 2023
Python 1 Updated Apr 16, 2022