cavis/rl4j/README.md

# RL4J: Reinforcement Learning for Java

RL4J is a reinforcement learning framework integrated with deeplearning4j and released under an Apache 2.0 open-source license. By contributing code to this repository, you agree to make your contribution available under an Apache 2.0 license.

* DQN (Deep Q Learning with double DQN)
* Async RL (A3C, Async NStepQlearning)

Both for Low-Dimensional (array of info) and high-dimensional (pixels) input.


![DOOM](docs/images/doom.gif)


![Cartpole](docs/images/cartpole.gif)


Here is a useful blog post I wrote to introduce you to reinforcement learning, DQN and Async RL:


[Blog post](https://rubenfiszel.github.io/posts/rl4j/2016-08-24-Reinforcement-Learning-and-DQN.html)

[Examples](https://github.com/eclipse/deeplearning4j-examples/tree/master/rl4j-examples)

[Cartpole example](https://github.com/eclipse/deeplearning4j-examples/blob/master/rl4j-examples/src/main/java/org/deeplearning4j/examples/rl4j/Cartpole.java)

# Disclaimer

This is a tech preview and distributed as is.
Comments are welcome on our gitter channel:
[gitter](https://gitter.im/deeplearning4j/deeplearning4j)


# Quickstart

* mvn install

# Visualisation

[webapp-rl4j](https://github.com/rubenfiszel/webapp-rl4j)

# Quicktry cartpole:

* run with this [main](https://github.com/eclipse/deeplearning4j-examples/blob/master/rl4j-examples/src/main/java/org/deeplearning4j/examples/rl4j/Cartpole.java)

# Doom

Doom is not ready yet but you can make it work if you feel adventurous with some additional steps:

* You will need vizdoom, compile the native lib and move it into the root of your project in a folder
* export MAVEN_OPTS=-Djava.library.path=THEFOLDEROFTHELIB
* mvn compile exec:java -Dexec.mainClass="YOURMAINCLASS"

# Malmo (Minecraft)

![Malmo](docs/images/malmo.gif)

* Download and unzip Malmo from [here](https://github.com/Microsoft/malmo/releases)
* export MALMO_HOME=YOURMALMO_FOLDER
* export MALMO_XSD_PATH=$MALMO_HOME/Schemas
* launch malmo per [instructions](https://github.com/Microsoft/malmo#launching-minecraft-with-our-mod)
* run with this [main](https://github.com/eclipse/deeplearning4j-examples/blob/master/rl4j-examples/src/main/java/org/deeplearning4j/examples/rl4j/MalmoPixels.java)


# WIP

* Documentation
* Serialization/Deserialization (load save)
* Compression of pixels in order to store 1M state in a reasonnable amount of memory
* Async learning: A3C and nstep learning (requires some missing features from dl4j (calc and apply gradients)).

# Author

[Ruben Fiszel](http://rubenfiszel.github.io/)

# Proposed contribution area:

* Continuous control
* Policy Gradient
* Update rl4j-gym to make it compatible with pixels environments to play with Pong, Doom, etc ...
Eclipse Migration Initial Commit 2019-06-06 14:21:15 +02:00			`# RL4J: Reinforcement Learning for Java`

			`RL4J is a reinforcement learning framework integrated with deeplearning4j and released under an Apache 2.0 open-source license. By contributing code to this repository, you agree to make your contribution available under an Apache 2.0 license.`

			`* DQN (Deep Q Learning with double DQN)`
			`* Async RL (A3C, Async NStepQlearning)`

			`Both for Low-Dimensional (array of info) and high-dimensional (pixels) input.`


Fix formatting, remove obsolete files (#439) * Update/remove obsolete files * Fix nd4j-parameter-server-parent folder and module name * Fix formatting for libnd4j pom * Remove LICENSE file check for libnd4j build * Temp revert removing encoding and version for nd4j-parameter-server-model, nd4j-parameter-server-node, nd4j-parameter-server-client 2020-05-29 10:01:02 +02:00			`![DOOM](docs/images/doom.gif)`
Eclipse Migration Initial Commit 2019-06-06 14:21:15 +02:00

Fix formatting, remove obsolete files (#439) * Update/remove obsolete files * Fix nd4j-parameter-server-parent folder and module name * Fix formatting for libnd4j pom * Remove LICENSE file check for libnd4j build * Temp revert removing encoding and version for nd4j-parameter-server-model, nd4j-parameter-server-node, nd4j-parameter-server-client 2020-05-29 10:01:02 +02:00			`![Cartpole](docs/images/cartpole.gif)`
Eclipse Migration Initial Commit 2019-06-06 14:21:15 +02:00

			`Here is a useful blog post I wrote to introduce you to reinforcement learning, DQN and Async RL:`


			`[Blog post](https://rubenfiszel.github.io/posts/rl4j/2016-08-24-Reinforcement-Learning-and-DQN.html)`

Update links to eclipse repos (#252) * Fix repo links and clean up old github templates Signed-off-by: AlexDBlack <blacka101@gmail.com> * More link updates Signed-off-by: AlexDBlack <blacka101@gmail.com> 2019-09-10 11:09:46 +02:00			`[Examples](https://github.com/eclipse/deeplearning4j-examples/tree/master/rl4j-examples)`
Eclipse Migration Initial Commit 2019-06-06 14:21:15 +02:00
Update links to eclipse repos (#252) * Fix repo links and clean up old github templates Signed-off-by: AlexDBlack <blacka101@gmail.com> * More link updates Signed-off-by: AlexDBlack <blacka101@gmail.com> 2019-09-10 11:09:46 +02:00			`[Cartpole example](https://github.com/eclipse/deeplearning4j-examples/blob/master/rl4j-examples/src/main/java/org/deeplearning4j/examples/rl4j/Cartpole.java)`
Eclipse Migration Initial Commit 2019-06-06 14:21:15 +02:00
			`# Disclaimer`

			`This is a tech preview and distributed as is.`
			`Comments are welcome on our gitter channel:`
			`[gitter](https://gitter.im/deeplearning4j/deeplearning4j)`


			`# Quickstart`

			`* mvn install`

			`# Visualisation`

			`[webapp-rl4j](https://github.com/rubenfiszel/webapp-rl4j)`

			`# Quicktry cartpole:`

RL4J: Replace gym-java-client with JavaCPP (#8595) * RL4J: Replace gym-java-client with JavaCPP Signed-off-by: Samuel Audet <samuel.audet@gmail.com> 2020-01-20 09:13:57 +01:00			`* run with this [main](https://github.com/eclipse/deeplearning4j-examples/blob/master/rl4j-examples/src/main/java/org/deeplearning4j/examples/rl4j/Cartpole.java)`
Eclipse Migration Initial Commit 2019-06-06 14:21:15 +02:00
			`# Doom`

			`Doom is not ready yet but you can make it work if you feel adventurous with some additional steps:`

			`* You will need vizdoom, compile the native lib and move it into the root of your project in a folder`
			`* export MAVEN_OPTS=-Djava.library.path=THEFOLDEROFTHELIB`
			`* mvn compile exec:java -Dexec.mainClass="YOURMAINCLASS"`

			`# Malmo (Minecraft)`

Fix formatting, remove obsolete files (#439) * Update/remove obsolete files * Fix nd4j-parameter-server-parent folder and module name * Fix formatting for libnd4j pom * Remove LICENSE file check for libnd4j build * Temp revert removing encoding and version for nd4j-parameter-server-model, nd4j-parameter-server-node, nd4j-parameter-server-client 2020-05-29 10:01:02 +02:00			`![Malmo](docs/images/malmo.gif)`
Eclipse Migration Initial Commit 2019-06-06 14:21:15 +02:00
			`* Download and unzip Malmo from [here](https://github.com/Microsoft/malmo/releases)`
			`* export MALMO_HOME=YOURMALMO_FOLDER`
			`* export MALMO_XSD_PATH=$MALMO_HOME/Schemas`
			`* launch malmo per [instructions](https://github.com/Microsoft/malmo#launching-minecraft-with-our-mod)`
Update links to eclipse repos (#252) * Fix repo links and clean up old github templates Signed-off-by: AlexDBlack <blacka101@gmail.com> * More link updates Signed-off-by: AlexDBlack <blacka101@gmail.com> 2019-09-10 11:09:46 +02:00			`* run with this [main](https://github.com/eclipse/deeplearning4j-examples/blob/master/rl4j-examples/src/main/java/org/deeplearning4j/examples/rl4j/MalmoPixels.java)`
Eclipse Migration Initial Commit 2019-06-06 14:21:15 +02:00


			`# WIP`

			`* Documentation`
			`* Serialization/Deserialization (load save)`
			`* Compression of pixels in order to store 1M state in a reasonnable amount of memory`
			`* Async learning: A3C and nstep learning (requires some missing features from dl4j (calc and apply gradients)).`

			`# Author`

			`[Ruben Fiszel](http://rubenfiszel.github.io/)`

			`# Proposed contribution area:`

			`* Continuous control`
			`* Policy Gradient`
Fix formatting, remove obsolete files (#439) * Update/remove obsolete files * Fix nd4j-parameter-server-parent folder and module name * Fix formatting for libnd4j pom * Remove LICENSE file check for libnd4j build * Temp revert removing encoding and version for nd4j-parameter-server-model, nd4j-parameter-server-node, nd4j-parameter-server-client 2020-05-29 10:01:02 +02:00			`* Update rl4j-gym to make it compatible with pixels environments to play with Pong, Doom, etc ...`