Stars
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
A series of large language models developed by Baichuan Intelligent Technology
EVA Series: Visual Representation Fantasies from BAAI
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
A method to increase the speed and lower the memory footprint of existing vision transformers.
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
Example models using DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Source code for Twitter's Recommendation Algorithm
Source code for Twitter's Recommendation Algorithm
ChatGPT 中文调教指南。各种场景使用指南。学习怎么让它听你的话。
程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).
The collection of the research works about Automatic Reinforcement Learning in Microsoft Research Asia.
A curated list of Diffusion Model in RL resources (continually updated)
Open source code for paper "Denoised MDPs: Learning World Models Better Than the World Itself"
The repository is for safe reinforcement learning baselines.
润学全球官方指定GITHUB,整理润学宗旨、纲领、理论和各类润之实例;解决为什么润,润去哪里,怎么润三大问题; 并成为新中国人的核心宗教,核心信念。
强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/
This repository is for an open-source environment for multi-agent active voltage control on power distribution networks (MAPDN).