-
Sun Yat-Sen University
- Guangzhou, Beijing
- zhongwanjun.github.io
Stars
Zotero chat PDF with DeepSeek, GPT 4.5, ChatGPT, Claude, Gemini
Integrate the DeepSeek API into popular softwares
Collection of papers and repos for multimodal chain-of-thought
This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data
A lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.
Recipes to train reward model for RLHF.
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
An open source implementation of CLIP.
This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
RuLES: a benchmark for evaluating rule-following in language models
RuleR: Improving LLM Controllability by Rule-based Data Recycling
DAMO-ConvAI: The official repository which contains the codebase for Alibaba DAMO Conversational AI.
A quick guide (especially) for trending instruction finetuning datasets
[ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation
Collection of training data management explorations for large language models
[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition
EMNLP 2023 Papers: Explore cutting-edge research from EMNLP 2023, the premier conference for advancing empirical methods in natural language processing. Stay updated on the latest in machine learni…
MAD: The first work to explore Multi-Agent Debate with Large Language Models :D
papers related to LLM-agent that published on top conferences