Lists (1)
Sort Name ascending (A-Z)
Stars
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
Curated list of datasets and tools for post-training.
A collection of architectural patterns leveraging Large Language Models (LLMs) for efficient Text-to-SQL generation.
EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language Models (LLMs).
RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.
Simple, unified interface to multiple Generative AI providers
Agno is a lightweight library for building multi-modal Agents
Modeling, training, eval, and inference code for OLMo
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
从0到1构建一个MiniLLM (pretrain+sft+dpo实践中)
A debugging and profiling tool that can trace and visualize python code execution
Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
A reading list on LLM based Synthetic Data Generation 🔥
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
🧑🚀 全世界最好的LLM资料总结(数据处理、模型训练、模型部署、o1 模型、小语言模型、视觉语言模型) | Summary of the world's best LLM resources.
WGS-note / LLaMA-Factory
Forked from hiyouga/LLaMA-FactorySupport channel loss, forked from LLaMA-Factory
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
搜索、推荐、广告、用增等工业界实践文章收集(来源:知乎、Datafuntalk、技术公众号)
AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
Official repository for ICLR 2025 paper "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!
Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.