* Update Japanese translation for Deeplearning4J UI (#8525)
Signed-off-by: k-tamura <ktamura.biz.80@gmail.com>
* RL4J: Remove processing done on observations in Policy & Async (#8471)
* Removed processing from Policy.play() and fixed missing resets
Signed-off-by: unknown <aboulang2002@yahoo.com>
* Adjusted unit test to check if DQNs have been reset
Signed-off-by: unknown <aboulang2002@yahoo.com>
* Fixed a couple of problems, added and updated unit tests
Signed-off-by: unknown <aboulang2002@yahoo.com>
* Removed processing from AsyncThreadDiscrete
Signed-off-by: unknown <aboulang2002@yahoo.com>
* Fixed a few problems
Signed-off-by: unknown <aboulang2002@yahoo.com>
* python version bump
* increase
* RL4J: Replace gym-java-client with JavaCPP (#8595)
* RL4J: Replace gym-java-client with JavaCPP
Signed-off-by: Samuel Audet <samuel.audet@gmail.com>
Co-authored-by: Kohei Tamura <ktamura.biz.80@gmail.com>
Co-authored-by: Alexandre Boulanger <44292157+aboulang2002@users.noreply.github.com>
Co-authored-by: Max Pumperla <max.pumperla@googlemail.com>
Co-authored-by: Samuel Audet <samuel.audet@gmail.com>
* A few fixes
Signed-off-by: unknown <aboulang2002@yahoo.com>
* Reverted move of ObservationSpace, ActionSpace and others
Signed-off-by: unknown <aboulang2002@yahoo.com>
* Added unit tests
Signed-off-by: unknown <aboulang2002@yahoo.com>
* Changed ActionSpace of gym-java-client to use Nd4j's Random
Signed-off-by: Alexandre Boulanger <aboulang2002@yahoo.com>
* Changed to use Nd4j Random instead of java.util.Random
Signed-off-by: unknown <aboulang2002@yahoo.com>
* Changed to use Nd4j.getRandom() instead of the factory
Signed-off-by: Alexandre Boulanger <aboulang2002@yahoo.com>
* Small build fixes
Signed-off-by: Alex Black <blacka101@gmail.com>
* Fix RL4J
Signed-off-by: Alex Black <blacka101@gmail.com>
* Test fixes
Signed-off-by: Alex Black <blacka101@gmail.com>
* Another fix
Signed-off-by: Alex Black <blacka101@gmail.com>
* Added a unit test to help refac QLearningDiscrete.trainStep()
Signed-off-by: unknown <aboulang2002@yahoo.com>
* Changed expReplay setter to package private
Signed-off-by: Alexandre Boulanger <aboulang2002@yahoo.com>
* Added observation classes and tests
Signed-off-by: unknown <aboulang2002@yahoo.com>
* Now uses DataSetPreProcessors
Signed-off-by: Alexandre Boulanger <aboulang2002@yahoo.com>
* CompositeDataSetPreProcessor can now stop processing on empty dataset; Some DataSetPreProcessors moving from RL4J to ND4J
Signed-off-by: Alexandre Boulanger <aboulang2002@yahoo.com>
* Did requested minor changes
Signed-off-by: Alexandre Boulanger <Alexandre.Boulanger@ia.ca>
Signed-off-by: Alexandre Boulanger <aboulang2002@yahoo.com>