A Comprehensive Benchmark for Document Parsing and Evaluation

Python 136 13 Updated Dec 11, 2024

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Python 1,960 115 Updated Sep 28, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 3,098 291 Updated Dec 16, 2024
Jupyter Notebook 243 24 Updated Dec 6, 2024

Train transformer language models with reinforcement learning.

Python 10,323 1,324 Updated Dec 16, 2024

Get your documents ready for gen AI

Python 14,989 765 Updated Dec 16, 2024

HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance

Python 1,548 129 Updated Oct 29, 2024

A collection of large question answering datasets

344 36 Updated Jul 1, 2024

Chinese safety prompts for evaluating and improving the safety of LLMs.

880 81 Updated Feb 27, 2024

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 2,165 147 Updated Sep 3, 2024

Chinese natural language processing datasets, material for everyday experiments. Additions and pull requests are welcome.

Python 4,331 789 Updated Nov 21, 2023

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 10,921 675 Updated Dec 4, 2024

DeepSeek Coder: Let the Code Write Itself

Python 6,987 484 Updated May 21, 2024

A relatively powerful web mind map.

JavaScript 6,976 981 Updated Dec 14, 2024

Multithreaded Python without the GIL

Python 2,907 106 Updated Jul 10, 2024
Python 589 52 Updated Jul 31, 2024

kaldi-asr/kaldi is the official location of the Kaldi project.

Shell 14,363 5,326 Updated Nov 29, 2024

A family of lightweight multimodal models.

Python 955 70 Updated Nov 18, 2024

Collection of datasets used for Optical Music Recognition

Python 317 41 Updated Apr 1, 2024

FMA: A Dataset For Music Analysis

Jupyter Notebook 2,267 444 Updated Jan 5, 2023

Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pretrained models, large models, multimodal models, and large language models

Python 4,981 481 Updated Dec 16, 2024

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,229 281 Updated May 4, 2024

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 4,069 320 Updated Dec 16, 2024

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Python 23,802 1,980 Updated Sep 26, 2024

NL2SQL competition dataset

188 45 Updated Jul 19, 2023

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 7,205 461 Updated Nov 6, 2024

A collection of prompt and instruction datasets (awesome-prompt-datasets, awesome-instruction-dataset) for training ChatLLM models such as ChatGPT.

564 30 Updated Apr 7, 2024

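Use ChatGPT to summarize arXiv papers. Speeds up the whole research workflow by using ChatGPT for full-paper summarization, professional translation, polishing, reviewing, and review responses.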

Python 18,577 1,940 Updated Apr 4, 2024

An Efficient Lexical Analyzer for Chinese

Python 2,032 336 Updated Jan 31, 2022