Skip to content
View waderwu's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Organizations

@SJTU-SCS @0ops @BytecodeDL

Block or report waderwu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

llm

collections of llm
25 repositories

LLM training in simple, raw C/CUDA

Cuda 24,724 2,801 Updated Oct 2, 2024

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 6,183 419 Updated May 29, 2024

🚀 KIMI AI 长文本大模型逆向API【特长:长文本解读整理】,支持高速流式输出、智能体对话、联网搜索、长文档解读、图像解析、多轮对话,零配置部署,多路token支持,自动清理会话痕迹,仅供测试,如需商用请前往官方开放平台。

TypeScript 3,924 639 Updated Dec 13, 2024

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 8,057 989 Updated Dec 13, 2024

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.

Python 4,884 430 Updated Nov 18, 2024

Training LLMs with QLoRA + FSDP

Jupyter Notebook 1,430 189 Updated Nov 9, 2024

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 10,924 675 Updated Dec 4, 2024

《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases

Jupyter Notebook 14,046 2,927 Updated Dec 16, 2024

Browse the web with GPT-4V and Vimium

Python 2,657 199 Updated Sep 25, 2024

AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI

JavaScript 1,000 92 Updated Dec 9, 2024

Set-of-Mark Prompting for GPT-4V and LMMs

Python 1,215 98 Updated Aug 19, 2024

[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web"

Jupyter Notebook 738 104 Updated Jul 30, 2024

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 36,177 4,451 Updated Dec 14, 2024

Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.

Go 102,995 8,209 Updated Dec 16, 2024

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

Python 5,246 567 Updated Aug 8, 2024

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Python 11,578 1,012 Updated Dec 14, 2024

LLM inference in C/C++

C++ 69,309 9,977 Updated Dec 16, 2024

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 45,939 5,458 Updated Dec 9, 2024

Llama-3 agents that can browse the web by following instructions and talking to you

Python 1,372 103 Updated Dec 10, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 13,887 1,132 Updated May 23, 2024

Agentless🐱: an agentless approach to automatically solve software development problems

Python 898 95 Updated Dec 9, 2024

Code for the paper 🌳 Tree Search for Language Model Agents

Python 143 19 Updated Jul 25, 2024

TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.

Python 1,912 164 Updated Dec 15, 2024

Large Action Model framework to develop AI Web Agents

Python 5,729 519 Updated Nov 17, 2024

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 25,459 2,453 Updated Dec 16, 2024