LeeyiMing

LeeyiMing

3 followers · 2 following

Achievements

Stars

opendatalab / OmniDocBench

A Comprehensive Benchmark for Document Parsing and Evaluation

Python 136 13 Updated Dec 11, 2024

X-PLUG / mPLUG-DocOwl

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Python 1,960 115 Updated Sep 28, 2024

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 3,098 291 Updated Dec 16, 2024

zhangfaen / finetune-Qwen2-VL

Jupyter Notebook 243 24 Updated Dec 6, 2024

huggingface / trl

Train transformer language models with reinforcement learning.

Python 10,323 1,324 Updated Dec 16, 2024

DS4SD / docling

Get your documents ready for gen AI

Python 14,989 765 Updated Dec 16, 2024

InternLM / HuixiangDou

HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance

Python 1,548 129 Updated Oct 29, 2024

ad-freiburg / large-qa-datasets

A collection of large question answering datasets

344 36 Updated Jul 1, 2024

thu-coai / Safety-Prompts

Chinese safety prompts for evaluating and improving the safety of LLMs. 中文安全prompts，用于评估和提升大模型的安全性。

880 81 Updated Feb 27, 2024

THUDM / CogVLM2

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 2,165 147 Updated Sep 3, 2024

InsaneLife / ChineseNLPCorpus

中文自然语言处理数据集，平时做做实验的材料。欢迎补充提交合并。

Python 4,331 789 Updated Nov 21, 2023

QwenLM / Qwen2.5

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 10,921 675 Updated Dec 4, 2024

deepseek-ai / DeepSeek-Coder

DeepSeek Coder: Let the Code Write Itself

Python 6,987 484 Updated May 21, 2024

wanglin2 / mind-map

一个还算强大的Web思维导图。A relatively powerful web mind map.

JavaScript 6,976 981 Updated Dec 14, 2024

colesbury / nogil

Multithreaded Python without the GIL

Python 2,907 106 Updated Jul 10, 2024

Tlntin / Qwen-TensorRT-LLM

Python 589 52 Updated Jul 31, 2024

kaldi-asr / kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Shell 14,363 5,326 Updated Nov 29, 2024

BAAI-DCAI / Bunny

A family of lightweight multimodal models.

Python 955 70 Updated Nov 18, 2024

google-research / deduplicate-text-datasets

Rust 1,144 111 Updated Jul 30, 2024

apacha / OMR-Datasets

Collection of datasets used for Optical Music Recognition

Python 317 41 Updated Apr 1, 2024

mdeff / fma

FMA: A Dataset For Music Analysis

Jupyter Notebook 2,267 444 Updated Jan 5, 2023

lonePatient / awesome-pretrained-chinese-nlp-models

Awesome Pretrained Chinese NLP Models，高质量中文预训练模型&大模型&多模态模型&大语言模型集合

Python 4,981 481 Updated Dec 16, 2024

dvlab-research / MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,229 281 Updated May 4, 2024

InternLM / xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 4,069 320 Updated Dec 16, 2024

microsoft / JARVIS

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Python 23,802 1,980 Updated Sep 26, 2024

ZhuiyiTechnology / TableQA

NL2SQL competition dataset

188 45 Updated Jul 19, 2023

OpenBMB / MiniCPM

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 7,205 461 Updated Nov 6, 2024

jianzhnie / awesome-instruction-datasets

A collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as chatgpt 收录各种各样的指令数据集, 用于训练 ChatLLM 模型。

564 30 Updated Apr 7, 2024

kaixindelele / ChatPaper

Use ChatGPT to summarize the arXiv papers. 全流程加速科研，利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复

Python 18,577 1,940 Updated Apr 4, 2024

thunlp / THULAC-Python

An Efficient Lexical Analyzer for Chinese

Python 2,032 336 Updated Jan 31, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LeeyiMing

Achievements

Achievements

Block or report LeeyiMing

Stars

opendatalab / OmniDocBench

X-PLUG / mPLUG-DocOwl

OpenRLHF / OpenRLHF

zhangfaen / finetune-Qwen2-VL

huggingface / trl

DS4SD / docling

InternLM / HuixiangDou

ad-freiburg / large-qa-datasets

thu-coai / Safety-Prompts

THUDM / CogVLM2

InsaneLife / ChineseNLPCorpus

QwenLM / Qwen2.5

deepseek-ai / DeepSeek-Coder

wanglin2 / mind-map

colesbury / nogil

Tlntin / Qwen-TensorRT-LLM

kaldi-asr / kaldi

BAAI-DCAI / Bunny

google-research / deduplicate-text-datasets

apacha / OMR-Datasets

mdeff / fma

lonePatient / awesome-pretrained-chinese-nlp-models

dvlab-research / MGM

InternLM / xtuner

microsoft / JARVIS

ZhuiyiTechnology / TableQA

OpenBMB / MiniCPM

jianzhnie / awesome-instruction-datasets

kaixindelele / ChatPaper

thunlp / THULAC-Python