Skip to content
View terminator123's full-sized avatar

Block or report terminator123

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Jupyter Notebook 9 3 Updated Nov 5, 2024

Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 5,957 532 Updated Oct 24, 2024

This is the repo for the survey of LLM4IR.

448 37 Updated Sep 5, 2024

深度学习经典、新论文逐段精读

27,539 2,468 Updated Nov 17, 2024

使用peft库,对chatGLM-6B/chatGLM2-6B实现4bit的QLoRA高效微调,并做lora model和base model的merge及4bit的量化(quantize)。

Python 354 46 Updated Aug 22, 2023

基于ChatGLM-6B、ChatGLM2-6B、ChatGLM3-6B模型,进行下游具体任务微调,涉及Freeze、Lora、P-tuning、全参微调等

Python 2,683 301 Updated Dec 12, 2023

This repo was a simple way to implement Lora to fine-tuning ChatGLM2.这个项目是用LORA微调chatglm2的简单实现。

Python 8 1 Updated Aug 21, 2023

CCL2019,“小牛杯”中文幽默计算任务的数据集及baseline

Jupyter Notebook 23 4 Updated Aug 27, 2024

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

3,589 249 Updated Dec 17, 2024

WebGLM: An Efficient Web-enhanced Question Answering System (KDD 2023)

Python 1,578 136 Updated Dec 13, 2024

Pytorch-Named-Entity-Recognition-with-BERT

Python 1,220 277 Updated May 6, 2021

An open source implementation of CLIP.

Python 10,596 1,000 Updated Dec 4, 2024

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

9,537 1,548 Updated May 23, 2024

非常全的古诗词数据,收录了从先秦到现代的共计85万余首古诗词。

Python 1,571 387 Updated Aug 8, 2023

The most comprehensive database of Chinese poetry 🧶最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人,21050首词。

JavaScript 48,403 9,717 Updated Aug 10, 2024

GPT2 finetuned on Chinese Lyric Dataset.

Python 5 Updated Jan 20, 2023

PromptCLUE, 全中文任务支持零样本学习模型

Jupyter Notebook 656 66 Updated Jun 16, 2023

Neo4j+springboot+vue+d3.js知识图谱构建和可视化

Vue 1,196 415 Updated Sep 17, 2023

电影知识图谱,主要包括实体识别、实体查询、关系查询以及智能问答等。movie knowledge graph(Entity identification, graph display, and intelligent question and answer)

JavaScript 124 30 Updated Sep 11, 2022

整理知识图谱相关学习资料

4,688 936 Updated Mar 11, 2021

2020智源-京东多模态对话(JDDC2020)第三名解决方案分享

Python 41 13 Updated Nov 9, 2020

快速下载中文数据集,处理数据集,数据分析、可视化分析,一站式解决数据问题

Python 66 5 Updated Nov 15, 2022

Your personal ChatBot

HTML 64 59 Updated Jun 23, 2021

2018-JDDC大赛第4名的解决方案

Jupyter Notebook 238 79 Updated Oct 22, 2018

JDDC 2019 并列亚军(第三名)“网数ICT小分队”的检索模型部分

Python 44 11 Updated Mar 24, 2023

NLP models and codes for BAAI-JD joint project.

Python 254 56 Updated Nov 22, 2022

💊 智能客服、聊天机器人的应用算法

279 75 Updated Dec 23, 2020
Next