Skip to content
View HYZ17's full-sized avatar

Block or report HYZ17

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Reproducible, flexible LLM evaluations

Python 122 8 Updated Dec 9, 2024

veRL: Volcano Engine Reinforcement Learning for LLM

Python 725 57 Updated Jan 21, 2025

Scalable RL solution for advanced reasoning of language models

Python 934 60 Updated Jan 17, 2025

A high performance general purpose code execution engine.

JavaScript 2,011 265 Updated Oct 11, 2024

Sandboxed code execution for AI agents, locally or on the cloud.

Python 51 8 Updated Jan 21, 2025

A multi-language code evaluation tool.

Python 20 8 Updated Jan 26, 2024
Python 178 9 Updated Jan 16, 2025

Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.

Python 5,384 452 Updated Jan 11, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,290 346 Updated Jan 22, 2025

B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

Python 66 11 Updated Jan 3, 2025

Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym

Jupyter Notebook 210 8 Updated Jan 13, 2025

Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.

Python 3,943 275 Updated Jan 17, 2025

🙌 OpenHands: Code Less, Make More

Python 44,222 4,899 Updated Jan 22, 2025

[ICLR 2024] SWE-bench: Can Language Models Resolve Real-world Github Issues?

Python 2,310 398 Updated Jan 22, 2025

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 1,833 107 Updated Jun 1, 2023

👨‍💻 An awesome and curated list of best code-LLM for research.

1,094 63 Updated Dec 10, 2024

Recipes to train reward model for RLHF.

Python 1,103 78 Updated Dec 12, 2024

A series of math-specific large language models of our Qwen2 series.

Python 712 73 Updated Jan 11, 2025

Recipes to scale inference-time compute of open models

Python 956 86 Updated Jan 16, 2025

《labuladong的算法小抄》顺序阅读版

816 284 Updated Aug 8, 2022

Code and datasets for "What’s “up” with vision-language models? Investigating their struggle with spatial reasoning".

Python 38 2 Updated Feb 28, 2024

Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought and OpenAI o1 🍓

2,298 129 Updated Jan 22, 2025

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。

Python 24,819 1,865 Updated Jan 21, 2025

Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.

Python 13,013 3,090 Updated Jan 21, 2025

A curated list of awesome data labeling tools

3,889 443 Updated Jun 17, 2024

To speedup and simplify image labeling/ annotation process with multiple supported formats.

HTML 996 616 Updated Jan 1, 2024

MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities (ICML 2024)

Python 279 11 Updated Jan 20, 2025

Tips for Writing a Research Paper using LaTeX

TeX 3,325 381 Updated May 4, 2023
Next