-
SJTU
- shanghai china
- https://blog.sometimenaive.com
Highlights
- Pro
llm
a state-of-the-art-level open visual language model | 多模态预训练模型
🚀 KIMI AI 长文本大模型逆向API【特长:长文本解读整理】,支持高速流式输出、智能体对话、联网搜索、长文档解读、图像解析、多轮对话,零配置部署,多路token支持,自动清理会话痕迹,仅供测试,如需商用请前往官方开放平台。
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
Training LLMs with QLoRA + FSDP
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases
AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI
[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web"
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Llama-3 agents that can browse the web by following instructions and talking to you
llama3 implementation one matrix multiplication at a time
Agentless🐱: an agentless approach to automatically solve software development problems
Code for the paper 🌳 Tree Search for Language Model Agents
TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.
Large Action Model framework to develop AI Web Agents
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.