Skip to content

Codebase of ηψ-Learning algorithm that learns a non-Markovian maximum state entropy exploration policy by combining predecessor and successor representation to estimate the state visitation distribution of a trajectory of finite length.

License

Notifications You must be signed in to change notification settings

arnavkj1995/Eta_Psi_Learning

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 

About

Codebase of ηψ-Learning algorithm that learns a non-Markovian maximum state entropy exploration policy by combining predecessor and successor representation to estimate the state visitation distribution of a trajectory of finite length.

Topics

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published