Name		Name	Last commit message	Last commit date
parent directory ..
examples		examples
src/rllib_a3c/a3c		src/rllib_a3c/a3c
tests		tests
tuned_examples		tuned_examples
BUILD		BUILD
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

README.md

A3C (Asynchronous Advantage Actor-Critic)

A3C is the asynchronous version of A2C, where gradients are computed on the workers directly after trajectory rollouts, and only then shipped to a central learner to accumulate these gradients on the central model. After the central model update, parameters are broadcast back to all workers. Similar to A2C, A3C scales to 16-32+ worker processes depending on the environment.

Installation

conda create -n rllib-a3c python=3.10
conda activate rllib-a3c
pip install -r requirements.txt
pip install -e '.[development]'

Usage

A3C Example

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

a3c

a3c

README.md

A3C (Asynchronous Advantage Actor-Critic)

Installation

Usage

Files

a3c

Directory actions

More options

Directory actions

More options

Latest commit

History

a3c

Folders and files

parent directory

README.md

A3C (Asynchronous Advantage Actor-Critic)

Installation

Usage