Skip to content
View heihei2015's full-sized avatar

Organizations

@RmachineLearning

Block or report heihei2015

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Convert ebooks to audiobooks with chapters and metadata using dynamic AI models and voice cloning. Supports 1,107+ languages!

Python 7,824 542 Updated Jan 25, 2025

Finetune Llama 3.3, Mistral, Phi-4, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory

Python 21,689 1,523 Updated Jan 26, 2025

Datasets and Evaluation Scripts for CompHRDoc

Python 32 4 Updated Mar 28, 2024

Dataset and scripts for HRDoc

Python 35 4 Updated Jun 21, 2023

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 30,328 2,838 Updated Jan 28, 2025

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Python 6,309 563 Updated Jan 28, 2025

LLM101n: Let's build a Storyteller

31,155 1,706 Updated Aug 1, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 38,884 4,772 Updated Jan 28, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 4,119 399 Updated Jan 28, 2025

Ultralytics YOLO11 🚀

Python 35,868 6,909 Updated Jan 28, 2025

text embedding

Python 144 7 Updated Sep 18, 2023

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 45,821 5,468 Updated Dec 18, 2024

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Python 4,827 481 Updated Aug 6, 2024

Fast and memory-efficient exact attention

Python 15,210 1,436 Updated Jan 18, 2025

ReFT: Representation Finetuning for Language Models

Python 1,388 121 Updated Jan 1, 2025

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 6,891 528 Updated Dec 25, 2024

This is the official repository of the revised datasets FUNSD-r and CORD-r, introduced in EMNLP 2023 paper Reading Order Matters: Information Extraction from Visually-rich Documents by Token Path P…

Python 16 1 Updated Mar 20, 2024

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Python 7,611 650 Updated Aug 13, 2024

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.

Go 112,982 8,981 Updated Jan 28, 2025

全网最全Stable Diffusion全套教程,从入门到进阶,耗时三个月制作

1,450 133 Updated May 26, 2023

自动跳过APP开屏广告

Kotlin 2,770 152 Updated Jan 4, 2025

《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version ongoing

Java 107,972 13,472 Updated Jan 25, 2025

《剑指 Offer》 Python, Java, C++ 解题代码,LeetBook《图解算法数据结构》配套代码仓

Java 6,854 799 Updated May 11, 2024

计图大模型推理库,具有高性能、配置要求低、中文支持好、可移植等特点

Python 2,400 185 Updated Jan 6, 2024

[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning

Jupyter Notebook 406 40 Updated Oct 20, 2024

SOTA Open Source TTS

Python 18,686 1,414 Updated Jan 26, 2025

A batched offline inference oriented version of segment-anything

Python 1,212 71 Updated Sep 13, 2024

中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)

Python 7,143 579 Updated Sep 23, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 35,245 5,357 Updated Jan 28, 2025

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 37,204 4,627 Updated Aug 16, 2024
Next