cavis

Author	SHA1	Message	Date
Chris Bamford	032b97912e	RL4J: Sanitize Observation (#404) * working on ALE image pipelines that appear to lose data * transformation pipeline for ALE has been broken for a while and needed some cleanup to make sure that openCV tooling for scene transforms was actually working. * allowing history length to be set and passed through to history merge transforms Signed-off-by: Bam4d <chrisbam4d@gmail.com> * native image loader is not thread-safe so should not be static Signed-off-by: Bam4d <chrisbam4d@gmail.com> * make sure the transformer for encoding observations that are not pixels converts correctly Signed-off-by: Bam4d <chrisbam4d@gmail.com> * Test fixes for ALE pixel observation shape Signed-off-by: Bam4d <chrisbam4d@gmail.com> * Fix compilation errors Signed-off-by: Samuel Audet <samuel.audet@gmail.com> * fixing some post-merge issues, and comments from PR Signed-off-by: Bam4d <chrisbam4d@gmail.com> Co-authored-by: Samuel Audet <samuel.audet@gmail.com>	2020-04-23 10:47:26 +09:00
Chris Bamford	74420bca31	RL4J: Sanitize async learner (#327) * refactoring global async to use a much simpler update procedure with a single global lock Signed-off-by: Bam4d <chrisbam4d@gmail.com> * simplification of async learning algorithms, stabilization + better hyperparameters Signed-off-by: Bam4d <chrisbam4d@gmail.com> * started to play with using mockito for tests Signed-off-by: Bam4d <chrisbam4d@gmail.com> * Working on refactoring tests for async classes and trying to make async simpler Signed-off-by: Bam4d <chrisbam4d@gmail.com> * more work on mockito tests and making some tests much less complex and more explicit in what they are testing Signed-off-by: Bam4d <chrisbam4d@gmail.com> * some fixes from merging * do not allow copying of the current network to worker threads, fixing debug line Signed-off-by: Bam4d <chrisbam4d@gmail.com> * adding some more tests around PR review Signed-off-by: Bam4d <chrisbam4d@gmail.com> * Adding more tests after review comments Signed-off-by: Bam4d <chrisbam4d@gmail.com> * few more tests and fixes from PR review * remove rename of maxEpochStep to maxStepsPerEpisode as we agreed to review this in a separate PR * 2019 instead of 2018 on copyright header * adding konduit copyright to files * some more copyright headers Signed-off-by: Bam4d <chrisbam4d@gmail.com> Co-authored-by: Alexandre Boulanger <aboulang2002@yahoo.com>	2020-04-20 11:21:01 +09:00
Alex Black	95db34e389	Eclipse -> Konduit update (#188) * Update Japanese translation for Deeplearning4J UI (#8525) Signed-off-by: k-tamura <ktamura.biz.80@gmail.com> * RL4J: Remove processing done on observations in Policy & Async (#8471) * Removed processing from Policy.play() and fixed missing resets Signed-off-by: unknown <aboulang2002@yahoo.com> * Adjusted unit test to check if DQNs have been reset Signed-off-by: unknown <aboulang2002@yahoo.com> * Fixed a couple of problems, added and updated unit tests Signed-off-by: unknown <aboulang2002@yahoo.com> * Removed processing from AsyncThreadDiscrete Signed-off-by: unknown <aboulang2002@yahoo.com> * Fixed a few problems Signed-off-by: unknown <aboulang2002@yahoo.com> * python version bump * increase * RL4J: Replace gym-java-client with JavaCPP (#8595) * RL4J: Replace gym-java-client with JavaCPP Signed-off-by: Samuel Audet <samuel.audet@gmail.com> Co-authored-by: Kohei Tamura <ktamura.biz.80@gmail.com> Co-authored-by: Alexandre Boulanger <44292157+aboulang2002@users.noreply.github.com> Co-authored-by: Max Pumperla <max.pumperla@googlemail.com> Co-authored-by: Samuel Audet <samuel.audet@gmail.com>	2020-01-27 16:03:00 +11:00
skymindops	b5f0ec072f	Eclipse Migration Initial Commit	2019-06-06 15:21:15 +03:00

4 Commits