Skip to content

Latest commit

 

History

History
 
 

caql

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

CAQL, Continuous Action Q-Learning, is a class of algorithms for continuous-action Q-learning that can use several plug-and-play optimizers for the max-Q problem.

NOTE: MIP optimizer is not included in this initial version because the MIP optimizer is a Google-internal library and has not been open-sourced yet. However, it has a plan for open-source, and we will update this repository as soon as it is released.

For technical details of CAQL, refer to ICLR 2020 paper (https://openreview.net/forum?id=BkxXe0Etwr)