Stars
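MNBVC (Massive Never-ending BT Vast Chinese corpus): an ultra-large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures, and even "Martian script" (火星文) data. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wiki, classical poetry, lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.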
Simple next-token-prediction for RLHF
Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
[A toolbox for fun.] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.
BELLE: Be Everyone's Large Language model Engine (an open-source Chinese conversational large language model)
Code and documentation to train Stanford's Alpaca models, and generate the data.
Aligning pretrained language models with instruction data generated by themselves.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment
Making large AI models cheaper, faster and more accessible
Train transformer language models with reinforcement learning.
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
An elegant PyTorch deep reinforcement learning library.
Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. Works with PyTorch and TensorFlow.
Official PyTorch implementation of U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation
Sublinear memory optimization for deep learning. https://arxiv.org/abs/1604.06174
A PyTorch implementation of EfficientNet
Collaging on Internal Representations: An Intuitive Approach for Semantic Transfiguration
Repo for counting stars and contributing. Press F to pay respect to glorious developers.
TOMM2020 Dual-Path Convolutional Image-Text Embedding with Instance Loss 🐾 https://arxiv.org/abs/1711.05535
A PyTorch Implementation of Focal Loss.