text embedding

Python 141 6 Updated Sep 18, 2023

A quick guide (especially) for trending instruction finetuning datasets

2,715 175 Updated Nov 28, 2023

Official inference framework for 1-bit LLMs

C++ 12,367 867 Updated Dec 18, 2024

PDF to Markdown with vision models

Python 7,033 396 Updated Dec 18, 2024

Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.

Go 103,464 8,246 Updated Dec 19, 2024

"LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 11,835 1,559 Updated Dec 19, 2024

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 36,404 4,482 Updated Dec 19, 2024

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 7,218 461 Updated Nov 6, 2024

One-click Chinese data augmentation package; NLP data augmentation, BERT data augmentation, EDA: pip install nlpcda

Python 1,789 169 Updated Apr 15, 2024

Benchmark for Multi-Scenario-Recommendation.

Python 27 1 Updated Oct 2, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 3,211 300 Updated Dec 19, 2024

I have surveyed the techniques and papers on CTR prediction and recommender systems, in both industry and academia, and implemented 25 commonly used models in PyTorch for reuse (2023).

Jupyter Notebook 42 1 Updated Sep 25, 2023

A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.

Jupyter Notebook 7,326 913 Updated Nov 13, 2024

Universal cross-platform tokenizers binding to HF and sentencepiece

C++ 286 65 Updated Nov 15, 2024

The Memory layer for your AI apps

Python 23,414 2,159 Updated Dec 19, 2024

GLM-4 series: Open Multilingual Multimodal Chat LMs

Python 5,548 462 Updated Dec 15, 2024

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust 9,161 816 Updated Nov 27, 2024

A conda-forge distribution.

Shell 6,760 340 Updated Dec 14, 2024

MLGB is a library of 50+ deep models for CTR prediction and recommender systems, written in TensorFlow and PyTorch.

Python 593 26 Updated Aug 13, 2024

A configurable, tunable, and reproducible library for CTR prediction https://fuxictr.github.io

Python 974 164 Updated Dec 17, 2024

A simple local web interface that uses ChatTTS to synthesize text into speech, with support for exposing an external API.

Python 6,378 763 Updated Dec 9, 2024

A generative speech model for daily dialogue.

Python 33,049 3,589 Updated Dec 3, 2024

Parsing gigabytes of JSON per second: used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks

C++ 19,470 1,028 Updated Dec 17, 2024

📚 极客时间 (Geek Time) e-books

11,389 3,845 Updated Jan 26, 2023

code for piccolo embedding model from SenseTime

Python 116 6 Updated May 21, 2024

PDF Reader in JavaScript

JavaScript 49,039 10,073 Updated Dec 18, 2024

Curated list of ChatGPT prompts from the top-rated GPTs in the GPTs Store. Prompt engineering, prompt attacks, and prompt protection. Advanced prompt engineering papers.

5,434 503 Updated Sep 23, 2024

A framework for prompt tuning using Intent-based Prompt Calibration

Python 2,262 197 Updated Nov 23, 2024

Computational geometry and spatial indexing on the sphere

C++ 2,357 311 Updated Dec 6, 2024

Practice reimplementations of classic recommender-system models in PyTorch, such as MF, FM, DeepConn, MMOE, PLE, DeepFM, NFM, DCN, AFM, AutoInt, ONN, FiBiNET, DCN-v2, AFN, and DCAP.

Python 557 122 Updated Mar 14, 2022