terminator123

Follow

terminator123

Follow

2 followers · 13 following

Stars

liu-xiao-guo / semantic_search_es

Jupyter Notebook 9 3 Updated Nov 5, 2024

yangjianxin1 / Firefly

Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 5,957 532 Updated Oct 24, 2024

FB208 / clash-linux-amd64-v1.2.0

6 4 Updated Mar 7, 2023

RUC-NLPIR / LLM4IR-Survey

This is the repo for the survey of LLM4IR.

448 37 Updated Sep 5, 2024

mli / paper-reading

深度学习经典、新论文逐段精读

27,539 2,468 Updated Nov 17, 2024

shuxueslpi / chatGLM-6B-QLoRA

使用peft库，对chatGLM-6B/chatGLM2-6B实现4bit的QLoRA高效微调，并做lora model和base model的merge及4bit的量化（quantize）。

Python 354 46 Updated Aug 22, 2023

liucongg / ChatGLM-Finetuning

基于ChatGLM-6B、ChatGLM2-6B、ChatGLM3-6B模型，进行下游具体任务微调，涉及Freeze、Lora、P-tuning、全参微调等

Python 2,683 301 Updated Dec 12, 2023

necrophagists / ChatGLM2_Lora

This repo was a simple way to implement Lora to fine-tuning ChatGLM2.这个项目是用LORA微调chatglm2的简单实现。

Python 8 1 Updated Aug 21, 2023

DUTIR-Emotion-Group / CCL2019-Chinese-Humor-Computation

CCL2019，“小牛杯”中文幽默计算任务的数据集及baseline

Jupyter Notebook 23 4 Updated Aug 27, 2024

esbatmop / MNBVC

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化，也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

3,589 249 Updated Dec 17, 2024

THUDM / WebGLM

WebGLM: An Efficient Web-enhanced Question Answering System (KDD 2023)

Python 1,578 136 Updated Dec 13, 2024

kamalkraj / BERT-NER

Pytorch-Named-Entity-Recognition-with-BERT

Python 1,220 277 Updated May 6, 2021

mlfoundations / open_clip

An open source implementation of CLIP.

Python 10,596 1,000 Updated Dec 4, 2024

brightmart / nlp_chinese_corpus

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

9,537 1,548 Updated May 23, 2024

Werneror / Poetry

非常全的古诗词数据，收录了从先秦到现代的共计85万余首古诗词。

Python 1,571 387 Updated Aug 8, 2023

chinese-poetry / chinese-poetry

The most comprehensive database of Chinese poetry 🧶最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人，21050首词。

JavaScript 48,403 9,717 Updated Aug 10, 2024

liu-hz18 / Lyric-GPT2

GPT2 finetuned on Chinese Lyric Dataset.

Python 5 Updated Jan 20, 2023

clue-ai / PromptCLUE

PromptCLUE, 全中文任务支持零样本学习模型

Jupyter Notebook 656 66 Updated Jun 16, 2023

MiracleTanC / Neo4j-KGBuilder

Neo4j+springboot+vue+d3.js知识图谱构建和可视化

Vue 1,196 415 Updated Sep 17, 2023

jiangnanboy / movie_knowledge_graph_app

电影知识图谱，主要包括实体识别、实体查询、关系查询以及智能问答等。movie knowledge graph(Entity identification, graph display, and intelligent question and answer)

JavaScript 124 30 Updated Sep 11, 2022

husthuke / awesome-knowledge-graph

整理知识图谱相关学习资料

4,688 936 Updated Mar 11, 2021

kitaharatomoyo / JDDC2020-3rd-SourceCode

2020智源-京东多模态对话（JDDC2020）第三名解决方案分享

Python 41 13 Updated Nov 9, 2020

Tuzki2333 / 2022-WeChat-Big-Data-Challenge

Python 4 1 Updated Sep 12, 2022

CYang828 / datasetstation

快速下载中文数据集，处理数据集，数据分析、可视化分析，一站式解决数据问题

Python 66 5 Updated Nov 15, 2022

sahil-rajput / Candice-YourPersonalChatBot

Your personal ChatBot

HTML 64 59 Updated Jun 23, 2021

zengbin93 / jddc_solution_4th

2018-JDDC大赛第4名的解决方案

Jupyter Notebook 238 79 Updated Oct 22, 2018

EndlessLethe / jddc2019-3th-retrieve-model

JDDC 2019 并列亚军（第三名）“网数ICT小分队”的检索模型部分

Python 44 11 Updated Mar 24, 2023

jd-aig / nlp_baai

NLP models and codes for BAAI-JD joint project.

Python 254 56 Updated Nov 22, 2022

chatopera / chatbot.catalog.customer-service

💊 智能客服、聊天机器人的应用算法

279 75 Updated Dec 23, 2020

wangshusen / RecommenderSystem

2,613 386 Updated Feb 7, 2024