Skip to content
View chizhu's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report chizhu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Python3 package for Chinese/English OCR, with paddleocr-v4 onnx model(~14MB). 基于ppocr-v4-onnx模型推理,可实现 CPU 上毫秒级的 OCR 精准预测,通用场景中英文OCR达到开源SOTA。

Python 39 4 Updated Dec 22, 2024

A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.

Python 156 3 Updated Dec 19, 2024

基于知识图谱的《红楼梦》人物关系可视化及问答系统

HTML 1,200 308 Updated Apr 23, 2019

An Open Large Reasoning Model for Real-World Solutions

Python 1,237 62 Updated Nov 28, 2024

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

Jupyter Notebook 3,200 257 Updated Dec 14, 2024

Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, ...) or 100+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, Inter…

Python 4,716 414 Updated Dec 21, 2024

「大模型」3小时完全从0训练26M的小参数GPT,个人显卡即可推理训练!

Python 3,061 379 Updated Dec 13, 2024

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,669 185 Updated Nov 14, 2024

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。

Python 21,720 1,558 Updated Dec 20, 2024

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 6,146 395 Updated Dec 10, 2024

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

Python 3,750 266 Updated Dec 11, 2024

Java version of LangChain, while empowering LLM for Big Data.

Java 551 109 Updated Mar 8, 2024

TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios

Python 163 16 Updated Sep 24, 2024

High-quality datasets, tools, and concepts for LLM fine-tuning.

2,144 186 Updated Dec 13, 2024

sqldf for pandas

Python 1,345 185 Updated Jul 24, 2024

The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM".

Python 234 9 Updated Nov 25, 2024

🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

JavaScript 5,528 556 Updated Dec 5, 2024

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 1,373 103 Updated Oct 8, 2024

Robust recipes to align language models with human and AI preferences

Python 4,809 419 Updated Nov 21, 2024

[ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation

477 17 Updated Oct 31, 2024

LLM101n: Let's build a Storyteller

30,601 1,674 Updated Aug 1, 2024

[ICLR2022] official implementation of UniFormer

Python 833 111 Updated Mar 29, 2024

TrustRAG:The RAG Framework within Reliable input,Trusted output

Python 586 52 Updated Dec 19, 2024

中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3

Python 1,777 153 Updated Sep 23, 2024

A Strong FuxiCTR Baseline for News CTR Challenge at RecSys 2024

Python 15 4 Updated Jul 13, 2024

Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 5,960 532 Updated Oct 24, 2024

Grok open release

Python 49,743 8,346 Updated Aug 30, 2024

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python 1,929 175 Updated Nov 20, 2024
Next