Skip to content
View WynMew's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report WynMew

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

3,764 263 Updated Mar 8, 2025

Simple next-token-prediction for RLHF

Python 222 17 Updated Sep 30, 2023

交易模块

Python 5,870 1,327 Updated May 13, 2024

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,604 2,926 Updated Sep 2, 2024

[A toolbox for fun.] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.

Python 803 55 Updated Apr 28, 2023

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

HTML 8,068 766 Updated Oct 16, 2024

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,876 4,056 Updated Jul 17, 2024

Aligning pretrained language models with instruction data generated by themselves.

Python 4,302 503 Updated Mar 27, 2023

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,872 2,604 Updated Mar 4, 2025

🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".

Jupyter Notebook 478 36 Updated Oct 30, 2023

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Python 5,827 379 Updated Mar 14, 2024

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,736 1,890 Updated Apr 30, 2024

Making large AI models cheaper, faster and more accessible

Python 40,567 4,478 Updated Mar 10, 2025

Train transformer language models with reinforcement learning.

Python 12,367 1,667 Updated Mar 7, 2025

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

Python 9,464 705 Updated Jan 28, 2025

An elegant PyTorch deep reinforcement learning library.

Python 8,266 1,135 Updated Mar 9, 2025

Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. Works with PyTorch and TensorFlow.

Go 3,112 364 Updated Mar 7, 2025

python wrapper for rubberband

Python 177 23 Updated Sep 30, 2024

Official PyTorch implementation of U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation

Python 2,529 476 Updated Mar 16, 2023

Sublinear memory optimization for deep learning. https://arxiv.org/abs/1604.06174

Python 595 54 Updated Dec 27, 2019

A PyTorch implementation of EfficientNet

Python 8,039 1,532 Updated Apr 8, 2022

Collaging on Internal Representations: An Intuitive Approach for Semantic Transfiguration

Jupyter Notebook 564 89 Updated Apr 24, 2019

Repo for counting stars and contributing. Press F to pay respect to glorious developers.

270,400 21,115 Updated Oct 3, 2024

TOMM2020 Dual-Path Convolutional Image-Text Embedding with Instance Loss 🐾 https://arxiv.org/abs/1711.05535

MATLAB 289 73 Updated Jan 12, 2025

A PyTorch Implementation of Focal Loss.

Python 1 Updated Jan 18, 2018