-
LightZero Public
Forked from opendilab/LightZero
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios
Python Apache License 2.0 Updated May 24, 2024 -
muzero-general Public
Forked from werner-duvaud/muzero-general
MuZero
Python MIT License Updated Feb 29, 2024 -
Stochastic-muzero Public
Forked from DHDev0/Stochastic-muzero
Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variat…
Python GNU General Public License v3.0 Updated Feb 27, 2024 -
ChatGLM3 Public
Forked from THUDM/ChatGLM3
ChatGLM3 series: Open Bilingual Chat LLMs
Python Other Updated Nov 29, 2023 -
chatGLM-6B-QLoRA Public
Forked from shuxueslpi/chatGLM-6B-QLoRA -
localGPT Public
Forked from PromtEngineer/localGPT
Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.
Jupyter Notebook Apache License 2.0 Updated Oct 15, 2023 -
fastllm Public
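Forked from ztxz16/fastllm
A pure C++ cross-platform LLM acceleration library with Python bindings. ChatGLM-6B-class models can reach 10000+ tokens/s on a single GPU. Supports GLM, LLaMA, and MOSS base models and runs smoothly on mobile devices.
C++ Apache License 2.0 Updated Oct 10, 2023 -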
MedicalGPT Public
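Forked from shibing624/MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains a medical large language model, covering continued pretraining, supervised fine-tuning, reward modeling, and reinforcement learning training.
Python Apache License 2.0 Updated Sep 30, 2023 -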
langchain_0831 Public
Forked from langchain-ai/langchain
⚡ Building applications with LLMs through composability ⚡
Python MIT License Updated Sep 1, 2023 -
LLaMA-Efficient-Tuning Public
Forked from hiyouga/LLaMA-Factory
Easy-to-use fine-tuning framework using PEFT (PT+SFT+RLHF with QLoRA)
Python Apache License 2.0 Updated Aug 27, 2023 -
ChatGLM2-6B Public
Forked from THUDM/ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM
Python Other Updated Aug 18, 2023 -
ChatGLM-RLHF-LoRA-RM-PPO Public
Forked from Pillars-Creation/ChatGLM-RLHF-LoRA-RM-PPO
Adds an RLHF implementation to ChatGLM-6B, with line-by-line walkthroughs of some of the core code. The example generates short news headlines.
Python Apache License 2.0 Updated Aug 13, 2023 -
ChatGLM-LoRA-RLHF-PyTorch Public
Forked from jackaduma/ChatGLM-LoRA-RLHF-PyTorch
A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically Ch…
Python MIT License Updated Aug 7, 2023 -
ChatGLM-Efficient-Tuning Public
Forked from hiyouga/ChatGLM-Efficient-Tuning
Fine-tuning ChatGLM-6B with PEFT
Python Apache License 2.0 Updated Jul 25, 2023 -
Firefly Public
Forked from yangjianxin1/Firefly
Firefly (流萤): a Chinese conversational large language model (full-parameter fine-tuning + QLoRA)
Jupyter Notebook Updated Jul 23, 2023 -
ChatGLM-6B-api-update Public
Forked from aleimu/ChatGLM-6B
ChatGLM-6B: api.py optimization
Python Apache License 2.0 Updated Jul 18, 2023 -
Goat-Math-Chinese Public
Forked from supinyu/Goat-Math-Chinese
Goat: a large language model for Chinese arithmetic
Python Apache License 2.0 Updated Jul 17, 2023 -
LLM-Tuning Public
Forked from beyondguo/LLM-Tuning
Tuning LLMs with no tears💦, sharing LLM-tools with love❤️.
Python Updated Jul 6, 2023 -
lora_bnb_int8 Public
A demo of fine-tuning chatYuan-large-v2 with LoRA and bnb_int8
-
Fine-tuning_ChatYuan-largeV2_测试alpaca格式数据集_LoRA+bitsandbytes_int8微调_去掉全量finetune_保留int8模型加载测试_0415.ipynb
Updated Apr 15, 2023 -
winutils Public
Forked from cdarlint/winutils
winutils.exe hadoop.dll and hdfs.dll binaries for hadoop windows
Shell Updated Oct 9, 2019 -
hierarchical-clustering-java Public
Forked from lbehnke/hierarchical-clustering-java
Implementation of an agglomerative hierarchical clustering algorithm in Java. Different linkage approaches are supported.