* refactoring global async to use a much simpler update procedure with a single global lock
Signed-off-by: Bam4d <chrisbam4d@gmail.com>
* simplification of async learning algorithms, stabilization + better hyperparameters
Signed-off-by: Bam4d <chrisbam4d@gmail.com>
* started to play with using mockito for tests
Signed-off-by: Bam4d <chrisbam4d@gmail.com>
* Working on refactoring tests for async classes and trying to make async simpler
Signed-off-by: Bam4d <chrisbam4d@gmail.com>
* more work on mockito tests and making some tests much less complex and more explicit in what they are testing
Signed-off-by: Bam4d <chrisbam4d@gmail.com>
* some fixes from merging
* do not allow copying of the current network to worker threads, fixing debug line
Signed-off-by: Bam4d <chrisbam4d@gmail.com>
* adding some more tests around PR review
Signed-off-by: Bam4d <chrisbam4d@gmail.com>
* Adding more tests after review comments
Signed-off-by: Bam4d <chrisbam4d@gmail.com>
* few more tests and fixes from PR review
* remove rename of maxEpochStep to maxStepsPerEpisode as we agreed to review this in a seperate PR
* 2019 instead of 2018 on copyright header
* adding konduit copyright to files
* some more copyright headers
Signed-off-by: Bam4d <chrisbam4d@gmail.com>
Co-authored-by: Alexandre Boulanger <aboulang2002@yahoo.com>
* Starting to switch configs of RL algorithms to use more fluent builder patterns. Many parameter choices in different algorithms default to SOTA and only be changed in specific cases
Signed-off-by: Bam4d <chris.bam4d@gmail.com>
* remove personal gpu-build file
Signed-off-by: Bam4d <chrisbam4d@gmail.com>
* refactored out configurations so they are heirarchical and re-usable, this is a step towards having a plug-and-play framework for different algorithms
* backwardly compatible configurations
* adding documentation to new configuration classes
Signed-off-by: Bam4d <chrisbam4d@gmail.com>
* private access modifiers are better suited here
Signed-off-by: Bam4d <chrisbam4d@gmail.com>
* RL4j does not compile without java 8 due to previous updates
fixing null pointers when listener arrays are empty
Signed-off-by: Bam4d <chrisbam4d@gmail.com>
* fixing copyright headers
Signed-off-by: Bam4d <chrisbam4d@gmail.com>
* uncomment logging line
Signed-off-by: Bam4d <chrisbam4d@gmail.com>
* fixing default value for learningUpdateFrequency
fixing test failure due to #352
Signed-off-by: Bam4d <chrisbam4d@gmail.com>
Co-authored-by: Bam4d <chris.bam4d@gmail.com>