* working on ALE image pipelines that appear to lose data
* transformation pipeline for ALE has been broken for a while and needed some cleanup to make sure that openCV tooling for scene transforms was actually working.
* allowing history length to be set and passed through to history merge transforms
Signed-off-by: Bam4d <chrisbam4d@gmail.com>
* native image loader is not thread-safe so should not be static
Signed-off-by: Bam4d <chrisbam4d@gmail.com>
* make sure the transformer for encoding observations that are not pixels converts corectly
Signed-off-by: Bam4d <chrisbam4d@gmail.com>
* Test fixes for ALE pixel observation shape
Signed-off-by: Bam4d <chrisbam4d@gmail.com>
* Fix compilation errors
Signed-off-by: Samuel Audet <samuel.audet@gmail.com>
* fixing some post-merge issues, and comments from PR
Signed-off-by: Bam4d <chrisbam4d@gmail.com>
Co-authored-by: Samuel Audet <samuel.audet@gmail.com>
* refactoring global async to use a much simpler update procedure with a single global lock
Signed-off-by: Bam4d <chrisbam4d@gmail.com>
* simplification of async learning algorithms, stabilization + better hyperparameters
Signed-off-by: Bam4d <chrisbam4d@gmail.com>
* started to play with using mockito for tests
Signed-off-by: Bam4d <chrisbam4d@gmail.com>
* Working on refactoring tests for async classes and trying to make async simpler
Signed-off-by: Bam4d <chrisbam4d@gmail.com>
* more work on mockito tests and making some tests much less complex and more explicit in what they are testing
Signed-off-by: Bam4d <chrisbam4d@gmail.com>
* some fixes from merging
* do not allow copying of the current network to worker threads, fixing debug line
Signed-off-by: Bam4d <chrisbam4d@gmail.com>
* adding some more tests around PR review
Signed-off-by: Bam4d <chrisbam4d@gmail.com>
* Adding more tests after review comments
Signed-off-by: Bam4d <chrisbam4d@gmail.com>
* few more tests and fixes from PR review
* remove rename of maxEpochStep to maxStepsPerEpisode as we agreed to review this in a seperate PR
* 2019 instead of 2018 on copyright header
* adding konduit copyright to files
* some more copyright headers
Signed-off-by: Bam4d <chrisbam4d@gmail.com>
Co-authored-by: Alexandre Boulanger <aboulang2002@yahoo.com>
* Update Japanese translation for Deeplearning4J UI (#8525)
Signed-off-by: k-tamura <ktamura.biz.80@gmail.com>
* RL4J: Remove processing done on observations in Policy & Async (#8471)
* Removed processing from Policy.play() and fixed missing resets
Signed-off-by: unknown <aboulang2002@yahoo.com>
* Adjusted unit test to check if DQNs have been reset
Signed-off-by: unknown <aboulang2002@yahoo.com>
* Fixed a couple of problems, added and updated unit tests
Signed-off-by: unknown <aboulang2002@yahoo.com>
* Removed processing from AsyncThreadDiscrete
Signed-off-by: unknown <aboulang2002@yahoo.com>
* Fixed a few problems
Signed-off-by: unknown <aboulang2002@yahoo.com>
* python version bump
* increase
* RL4J: Replace gym-java-client with JavaCPP (#8595)
* RL4J: Replace gym-java-client with JavaCPP
Signed-off-by: Samuel Audet <samuel.audet@gmail.com>
Co-authored-by: Kohei Tamura <ktamura.biz.80@gmail.com>
Co-authored-by: Alexandre Boulanger <44292157+aboulang2002@users.noreply.github.com>
Co-authored-by: Max Pumperla <max.pumperla@googlemail.com>
Co-authored-by: Samuel Audet <samuel.audet@gmail.com>