Skip to content
View Junzh821's full-sized avatar

Block or report Junzh821

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。

Python 21,866 1,572 Updated Dec 24, 2024

Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!

Python 537 56 Updated Dec 20, 2024

Tools to download and cleanup Common Crawl data

Python 976 143 Updated Apr 25, 2023

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Python 5,591 467 Updated Dec 15, 2024

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 35,889 4,475 Updated Nov 18, 2024

Convert PDF to markdown + JSON quickly with high accuracy

Python 18,763 1,088 Updated Dec 20, 2024

为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…

Python 66,510 8,143 Updated Dec 23, 2024

YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]

Python 10,169 1,013 Updated Sep 26, 2024

一些 LLM 方面的从零复现笔记

Jupyter Notebook 146 22 Updated Sep 20, 2024

An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowe…

Jupyter Notebook 2,068 194 Updated Dec 17, 2024

This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open sourced AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SL…

Jupyter Notebook 2,605 285 Updated Dec 12, 2024

Ollama Python library

Python 5,457 463 Updated Dec 23, 2024

LLM inference in C/C++

C++ 69,665 10,051 Updated Dec 24, 2024
Python 2,177 247 Updated Dec 20, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 13,910 1,135 Updated May 23, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 6,504 505 Updated Dec 24, 2024

llama3.np is a pure NumPy implementation for Llama 3 model.

Python 976 78 Updated Jun 2, 2024

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Python 66,056 8,025 Updated Dec 20, 2024

Easily deployable 🚀 API to convert PDF to markdown quickly with high accuracy.

Python 783 79 Updated Oct 15, 2024

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 36,662 4,511 Updated Dec 24, 2024

[ICML 2024] Selecting High-Quality Data for Training Language Models

Python 147 10 Updated Jun 20, 2024

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)

Python 897 47 Updated Dec 6, 2024

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

16,975 1,609 Updated Sep 19, 2024

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 26,080 2,509 Updated Dec 24, 2024

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

3,795 173 Updated Sep 25, 2024

A fork of Dragnet that also extract author, headline, date, keywords from context, as well as built in metadata extraction all in one package

HTML 251 23 Updated Dec 25, 2023

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2…

Python 14,003 1,420 Updated Dec 23, 2024
Jupyter Notebook 9,411 648 Updated Jul 29, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 22,781 2,237 Updated Dec 23, 2024
Next