Skip to content

Latest commit

 

History

History
26 lines (21 loc) · 656 Bytes

index.rst

File metadata and controls

26 lines (21 loc) · 656 Bytes

Welcome to trlX's documentation!

trlX is a library made for training large language models using reinforcement learning. It currently supports training using PPO or ILQL for models up to 20B using Accelerate.

.. toctree::
   :maxdepth: 2
   :caption: Contents:

   data
   models
   configs
   pipeline
   examples

Indices and tables