Skip to content
View Kelsey2018's full-sized avatar

Block or report Kelsey2018

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 3,153 449 Updated Apr 24, 2024

Writing AI Conference Papers: A Handbook for Beginners

1,852 66 Updated Dec 23, 2024

[ICLR 2025] Pytorch implementation of the paper "Once-for-All: Controllable Generative Image Compression with Dynamic Granularity Adaption".

Python 47 1 Updated Jan 22, 2025

General technology for enabling AI capabilities w/ LLMs and MLLMs

Python 3,821 294 Updated Jan 11, 2025

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 10,540 1,188 Updated Feb 1, 2025

📋 A list of open LLMs available for commercial use.

11,593 794 Updated Feb 3, 2025

FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.

Jupyter Notebook 14,804 2,057 Updated Dec 26, 2024

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

3,663 255 Updated Jan 24, 2025

A quick guide (especially) for trending instruction finetuning datasets

2,810 181 Updated Nov 28, 2023

Daily updated LLM papers. 每日更新 LLM 相关的论文,欢迎订阅 👏 喜欢的话动动你的小手 🌟 一个

1,058 45 Updated Jul 31, 2024

中文医学NLP公开资源整理:术语集/语料库/词向量/预训练模型/知识图谱/命名实体识别/QA/信息抽取/模型/论文/etc

2,248 379 Updated Jan 17, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 39,225 4,813 Updated Feb 1, 2025

Llama3、Llama3.1 中文仓库(随书籍撰写中... 各种网友及厂商微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档)

Python 4,120 340 Updated Sep 16, 2024

The official Meta Llama 3 GitHub site

Python 28,164 3,250 Updated Jan 26, 2025

Finetune Llama 3.3, DeepSeek-R1, Mistral, Phi-4 & Gemma 2 LLMs 2-5x faster with 70% less memory

Python 22,974 1,584 Updated Feb 3, 2025
Jupyter Notebook 596 67 Updated Dec 10, 2024

Utilities intended for use with Llama models.

Python 5,730 960 Updated Feb 2, 2025

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 33,046 4,832 Updated Jan 31, 2025

Easy to use extractive text summarization with BERT

Python 1,417 309 Updated Jun 12, 2023

Refine high-quality datasets and visual AI models

Python 9,127 592 Updated Feb 3, 2025

Train transformer language models with reinforcement learning.

Python 10,997 1,464 Updated Feb 3, 2025

Reference implementation for DPO (Direct Preference Optimization)

Python 2,346 195 Updated Aug 11, 2024

The implementation of DeBERTa

Python 2,033 232 Updated Sep 29, 2023

Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation

Python 73 1 Updated Nov 13, 2024

[TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.

2,027 130 Updated Feb 3, 2025

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 1,919 235 Updated Jan 20, 2025

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Python 4,658 503 Updated Jan 21, 2025
Python 1,492 158 Updated Oct 25, 2024

结巴中文分词

Python 33,625 6,725 Updated Aug 21, 2024
Next