Skip to content
View minpeter's full-sized avatar

Sponsoring

@teknium1

Organizations

@codingpot @hansei-hsoc @hansei-nsb @HJ-404-girfriend @tempfiles-team @hanowwl

Block or report minpeter

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results
Python 834 44 Updated Mar 2, 2025

A full attention mechanism and transformer in pure go.

Go 314 5 Updated Feb 13, 2025

Code and results accompanying the paper "Refusal in Language Models Is Mediated by a Single Direction".

Python 188 43 Updated Oct 1, 2024

AI Agent Builder in Python

TypeScript 3,242 198 Updated Mar 6, 2025

This is a Python package to add tool calling capabilities to newly released LLMs on LangChain's ChatOpenAI library ahead of time before LangChain and LangGraph supports it!

Jupyter Notebook 64 9 Updated Mar 1, 2025

LLM-as-SERP

TypeScript 50 6 Updated Mar 3, 2025
Python 371 29 Updated Jan 14, 2025

MultiWOZ 2.4: A Multi-Domain Task-Oriented Dialogue Dataset

Python 62 7 Updated Nov 9, 2022

the official code for "ToolAlpaca: Generalized Tool Learning for Language Models with 3000 Simulated Cases"

Python 879 39 Updated Oct 26, 2024

The source code and dataset mentioned in the paper Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmark.

Python 45 5 Updated Nov 5, 2024

Hammer: Robust Function-Calling for On-Device Language Models via Function Masking

Python 62 4 Updated Feb 18, 2025

Benchmark in Korean Context

128 4 Updated Sep 26, 2023

Kanana: Compute-efficient Bilingual Language Models

201 4 Updated Mar 4, 2025

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 5,799 280 Updated Mar 6, 2025

Stateful control of Large Language Models

Python 114 5 Updated Mar 6, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,031 600 Updated Mar 6, 2025

FlashMLA: Efficient MLA decoding kernels

C++ 11,167 774 Updated Mar 1, 2025
Go 11 Updated Jan 2, 2025

Scaling inference-time compute for LLM-as-a-judge, automated evaluations, guardrails, and reinforcement learning.

Jupyter Notebook 162 6 Updated Feb 26, 2025

Flags SDK by Vercel

TypeScript 297 12 Updated Mar 5, 2025

Making large AI models cheaper, faster and more accessible

Python 40,548 4,478 Updated Mar 6, 2025

LLM-powered multiagent persona simulation for imagination enhancement and business insights.

Python 6,047 483 Updated Feb 28, 2025

LLM Reasoning and Generation Benchmark. Evaluate LLMs in complex scenarios systematically.

TypeScript 154 14 Updated Jan 21, 2025

Grok open release

Python 50,214 8,369 Updated Aug 30, 2024

Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.

Python 4,597 367 Updated Mar 3, 2025

The official Meta Llama 3 GitHub site

Python 28,454 3,305 Updated Jan 26, 2025

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 19,336 1,549 Updated Feb 23, 2025

Complex Function Calling Benchmark.

Python 78 5 Updated Jan 20, 2025
Next