Skip to content
View zhongwanjun's full-sized avatar

Block or report zhongwanjun

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Zotero chat PDF with DeepSeek, GPT 4.5, ChatGPT, Claude, Gemini

JavaScript 1,272 43 Updated Mar 1, 2025

Integrate the DeepSeek API into popular softwares

24,125 2,579 Updated Feb 28, 2025

Collection of papers and repos for multimodal chain-of-thought

59 3 Updated Nov 6, 2024

This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 3,017 224 Updated Feb 19, 2025

A lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.

Python 178 10 Updated Feb 6, 2025

Recipes to train reward model for RLHF.

Python 1,206 88 Updated Feb 9, 2025
Python 1,338 50 Updated Nov 21, 2024

g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains

Python 4,188 378 Updated Jan 27, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,523 364 Updated Feb 26, 2025

中文版学术主页

SCSS 120 176 Updated Mar 2, 2025

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 9,117 1,329 Updated Feb 7, 2025

Control Any Computer Using LLMs.

Python 1,832 179 Updated Feb 18, 2025

An open source implementation of CLIP.

Python 11,117 1,050 Updated Mar 1, 2025

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

Python 951 68 Updated Jan 31, 2025

RuLES: a benchmark for evaluating rule-following in language models

Python 219 15 Updated Feb 24, 2025

RuleR: Improving LLM Controllability by Rule-based Data Recycling

Python 12 1 Updated Feb 11, 2025

DAMO-ConvAI: The official repository which contains the codebase for Alibaba DAMO Conversational AI.

Python 1,334 202 Updated Jan 13, 2025

The official Meta Llama 3 GitHub site

Python 28,421 3,295 Updated Jan 26, 2025

A quick guide (especially) for trending instruction finetuning datasets

2,896 190 Updated Nov 28, 2023

[ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios

Python 53 2 Updated Mar 27, 2024

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 7,236 468 Updated Nov 6, 2024

Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation

Python 197 27 Updated Feb 10, 2024

Collection of training data management explorations for large language models

311 30 Updated Aug 2, 2024

[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition

Python 615 37 Updated Jul 22, 2024

EMNLP 2023 Papers: Explore cutting-edge research from EMNLP 2023, the premier conference for advancing empirical methods in natural language processing. Stay updated on the latest in machine learni…

Python 105 7 Updated May 18, 2024

MAD: The first work to explore Multi-Agent Debate with Large Language Models :D

Python 335 36 Updated Jan 14, 2025

Must-read Papers on LLM Agents.

2,162 126 Updated Feb 19, 2025

papers related to LLM-agent that published on top conferences

311 15 Updated Feb 7, 2024
Next