cavis/gym-java-client
Alexandre Boulanger a2b973d41b RL4J: Make a few fixes (#8303)
* A few fixes

Signed-off-by: unknown <aboulang2002@yahoo.com>

* Reverted move of ObservationSpace, ActionSpace and others

Signed-off-by: unknown <aboulang2002@yahoo.com>

* Added unit tests

Signed-off-by: unknown <aboulang2002@yahoo.com>

* Changed ActionSpace of gym-java-client to use Nd4j's Random

Signed-off-by: Alexandre Boulanger <aboulang2002@yahoo.com>
2019-10-31 13:41:52 +09:00
..
contrib Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
src RL4J: Make a few fixes (#8303) 2019-10-31 13:41:52 +09:00
.gitignore Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
LICENSE.txt Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
README.md Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
pom.xml Fix backend dependencies for tests (#189) 2019-08-29 12:54:48 +09:00

README.md

gym-java-client

A java http client for gym-http-api.

Note: If you are encountering errors as reported in issue #13, please execute the following command before launching python gym_http_server.py:

$ sudo sysctl -w net.ipv4.tcp_tw_recycle=1

Quickstart

To create a new Client, use the ClientFactory. If the url is not localhost:5000, provide it as a second argument

Client<Box, Integer, DiscreteSpace> client = ClientFactory.build("CartPole-v0");

"CartPole-v0" is the name of the gym environment.

The type parameters of a client are the Observation type, the Action type, the Observation Space type and the ActionSpace type.

It is a bit cumbersome to both declare an ActionSpace and an Action since an ActionSpace knows what type is an Action but unfortunately java does't support type member and path dependant types.

Here we use Box and BoxSpace for the environment and Integer and Discrete Space because it is how CartPole-v0 is specified.

The methods nomenclature follows closely the api interface of gym-http-api, O is Observation an A is Action:

//Static methods

/**
 * @param url url of the server
 * @return set of all environments running on the server at the url
 */
public static Set<String> listAll(String url);

/**
 * Shutdown the server at the url
 *
 * @param url url of the server
 */
public static void serverShutdown(String url);



//Methods accessible from a Client
/**
 * @return set of all environments running on the same server than this client
 */
public Set<String> listAll();

/**
 * Step the environment by one action
 *
 * @param action action to step the environment with
 * @return the StepReply containing the next observation, the reward, if it is a terminal state and optional information.
 */
public StepReply<O> step(A action);
/**
 * Reset the state of the environment and return an initial observation.
 *
 * @return initial observation
 */
public O reset();

/**
 * Start monitoring.
 *
 * @param directory path to directory in which store the monitoring file
 * @param force     clear out existing training data from this directory (by deleting every file prefixed with "openaigym.")
 * @param resume    retain the training data already in this directory, which will be merged with our new data
 */
public void monitorStart(String directory, boolean force, boolean resume);

/**
 * Flush all monitor data to disk
 */
public void monitorClose();

/**
 * Upload monitoring data to OpenAI servers.
 *
 * @param trainingDir directory that contains the monitoring data
 * @param apiKey      personal OpenAI API key
 * @param algorithmId an arbitrary string indicating the paricular version of the algorithm (including choices of parameters) you are running.
 **/
public void upload(String trainingDir, String apiKey, String algorithmId);

/**
 * Upload monitoring data to OpenAI servers.
 *
 * @param trainingDir directory that contains the monitoring data
 * @param apiKey      personal OpenAI API key
 */
public void upload(String trainingDir, String apiKey);


/**
 * Shutdown the server at the same url than this client
 */
public void serverShutdown()

TODO

  • Add all ObservationSpace and ActionSpace when they will be available.