* RL4J: Add generic update rule (#502)
Signed-off-by: Alexandre Boulanger <aboulang2002@yahoo.com>
* Shyrma reduce (#481)
* - start working on improving cpu legacy code for reduce ops
Signed-off-by: Yurii <iuriish@yahoo.com>
* - further work on improving legacy loops
Signed-off-by: Yurii <iuriish@yahoo.com>
* - still working on improving reduce ops
Signed-off-by: Yurii <iuriish@yahoo.com>
* - further work on improving reduce ops
Signed-off-by: Yurii <iuriish@yahoo.com>
* - testing speed run of new reduce op
Signed-off-by: Yurii <iuriish@yahoo.com>
* - working on improvement of default loop for reduce op
Signed-off-by: Yurii <iuriish@yahoo.com>
* - update signatures of stuff which calls reduce ops
Signed-off-by: Yurii <iuriish@yahoo.com>
* - make corrections in cuda reduce kernels
Signed-off-by: Yurii <iuriish@yahoo.com>
* - change loop for default case in broadcast legacy ops
Signed-off-by: Yurii <iuriish@yahoo.com>
* - comment some shape stuff
Signed-off-by: Yurii <iuriish@yahoo.com>
* - comment unnecessary prints in RNGtests
Signed-off-by: Yurii <iuriish@yahoo.com>
* - finish resolving conflicts after master has been merged
Signed-off-by: Yurii <iuriish@yahoo.com>
* - get rid of some compilation mistakes of cuda stuff
Signed-off-by: Yurii <iuriish@yahoo.com>
* - minor changes
Signed-off-by: Yurii <iuriish@yahoo.com>
* - further search for bug causing crash on java test
Signed-off-by: Yurii <iuriish@yahoo.com>
* - add scalar case in reduce_ ... exec stuff
Signed-off-by: Yurii <iuriish@yahoo.com>
* - minor corrections in NativeOps.cu
Signed-off-by: Yurii <iuriish@yahoo.com>
* - add switch to scalar case execReduceXD functions
Signed-off-by: Yurii <iuriish@yahoo.com>
* - add support for vectors old shape in ConstantShapeHelper::createShapeInfoWithNoUnitiesForReduce
Signed-off-by: Yurii <iuriish@yahoo.com>
* - correct cuda mirrorPad
Signed-off-by: Yurii <iuriish@yahoo.com>
* - add support for vectors old shape in cuda createShapeInfoWithNoUnitiesForReduce
Signed-off-by: Yurii <iuriish@yahoo.com>
Co-authored-by: raver119 <raver119@gmail.com>
* Add support for CUDA 11.0 (#492)
* Add support for CUDA 11.0
* libnd4j tweaks for CUDA 11
Signed-off-by: raver119@gmail.com <raver119@gmail.com>
* bindings update, again?
Signed-off-by: raver119@gmail.com <raver119@gmail.com>
* * Update versions of JavaCPP Presets for FFmpeg, OpenBLAS, and NumPy
* update API to match CUDA 8
Signed-off-by: raver119@gmail.com <raver119@gmail.com>
* * Update version of JavaCPP Presets for CPython
* C++ updated for cuDNN 8.0
Signed-off-by: raver119@gmail.com <raver119@gmail.com>
* one more test
Signed-off-by: raver119@gmail.com <raver119@gmail.com>
* one more test
Signed-off-by: raver119@gmail.com <raver119@gmail.com>
* one more test
Signed-off-by: raver119@gmail.com <raver119@gmail.com>
* 128-bit alignment for workspaces
Signed-off-by: raver119@gmail.com <raver119@gmail.com>
* change seed in 1 test
Signed-off-by: raver119@gmail.com <raver119@gmail.com>
* Fix dependency duplication in python4j-parent pom
* Fix group id in python4j-numpy
* few tests tweaked
Signed-off-by: raver119@gmail.com <raver119@gmail.com>
* Remove macosx-x86_64-gpu from nd4j-tests-tensorflow
* few minor tweaks for IndexReduce
Signed-off-by: raver119@gmail.com <raver119@gmail.com>
* one test removed
Signed-off-by: raver119@gmail.com <raver119@gmail.com>
Co-authored-by: raver119@gmail.com <raver119@gmail.com>
Co-authored-by: Serhii Shepel <9946053+sshepel@users.noreply.github.com>
* RL4J: Add SyncTrainer and AgentLearnerBuilder for a few algorithms (#504)
Signed-off-by: Alexandre Boulanger <aboulang2002@yahoo.com>
Co-authored-by: Alexandre Boulanger <44292157+aboulang2002@users.noreply.github.com>
Co-authored-by: Yurii Shyrma <iuriish@yahoo.com>
Co-authored-by: raver119 <raver119@gmail.com>
Co-authored-by: Serhii Shepel <9946053+sshepel@users.noreply.github.com>
This means that when a model is being loaded (e.g., from a Keras H5 file), the default CNN2DFormat can be set. Currently it always defaults to 'Channels First', which can cause problems for some models.
* shutdown hook for UI server
Signed-off-by: Tamás Fenyvesi <tamas.fenyvesi@doknet.hu>
* logging exception
Signed-off-by: Tamás Fenyvesi <tamas.fenyvesi@doknet.hu>
* - raspberry Pi build and ArmCompute library support
- initial ArmCompute platform implementations (Maxpool2d AvgPool2d for float32)
Signed-off-by: AbdelRauf <rauf@konduit.ai>
* - Build script for pi
- small changes
Signed-off-by: AbdelRauf <rauf@konduit.ai>
* org.eclipse->org.nd4j
* Fix parent groupid
Signed-off-by: Alex Black <blacka101@gmail.com>
* memview->bytes
* del test
Co-authored-by: Alex Black <blacka101@gmail.com>
* random_shuffle test for Yurii
Signed-off-by: raver119@gmail.com <raver119@gmail.com>
* - implementation and testing random_shuffle for vector case (cpu)
Signed-off-by: Yurii <iuriish@yahoo.com>
* - fix bug in random shuffle for cpu
Signed-off-by: Yurii <iuriish@yahoo.com>
* - correct tests for random shuffle and improve alg when inPlace is false
Signed-off-by: Yurii <iuriish@yahoo.com>
* - implementation of random shuffle algorithm for cuda
Signed-off-by: Yurii <iuriish@yahoo.com>
* - split cuda random shuffle alg into separate launches of 2 kernels
Signed-off-by: Yurii <iuriish@yahoo.com>
* - minor corrections in cuda concat kernel
Signed-off-by: Yurii <iuriish@yahoo.com>
Co-authored-by: raver119@gmail.com <raver119@gmail.com>
* Fix MergeVertex serialization for NHWC case
Signed-off-by: Alex Black <blacka101@gmail.com>
* #8999 Dropout JSON field ignore
Signed-off-by: Alex Black <blacka101@gmail.com>
* Added dtype formulation for poisson and gamma distributions.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Refactored gamma distribution generator and tests.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Added generator for gamma distribution when alpha (shape) between 0 and 1
Signed-off-by: shugeo <sgazeos@gmail.com>
* Implemented gamma distribution for shape param less than 1 and tests.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Implemented gamma distributed randoms for shape (alpha) parameter greater than 1.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Added cuda implementation for gamma distribution.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Refactored cuda and cpu implementation of gamma distribution.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Fixed crash with default beta param with gamma distribution.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Fixed pow for arm arch.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Gamma test fixed
* Cosmetic changes only.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Fixed random value retrieving
* Eliminated overflow attempts.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Modified random retrieving.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Lightened density of tests for Gamma distribution.
Signed-off-by: shugeo <sgazeos@gmail.com>
Co-authored-by: Alexander Stoyakin <alexander.stoyakin@gmail.com>
Co-authored-by: raver119 <raver119@gmail.com>
* get rid of culibos reference
Signed-off-by: raver119 <raver119@gmail.com>
* typo
Signed-off-by: raver119 <raver119@gmail.com>
* one less printf
Signed-off-by: raver119 <raver119@gmail.com>
* one disabled test
Signed-off-by: raver119 <raver119@gmail.com>
* Fix Yolo2OutputLayer for NHWC data format
Signed-off-by: Alex Black <blacka101@gmail.com>
* ObjectDetectionRecordReader NHWC support
Signed-off-by: Alex Black <blacka101@gmail.com>
* Eliminated error with resize implementation.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Refactored resize caller implementation.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Refactored image.resize op helper.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Added dummy implementations for missing resize methods.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Added resize_images op. Refactored image_resize op.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Refactored matrix_band_part op and test.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Refactored resize_images op to comply with preserve_aspect_ratio flag properly.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Refactored resize_images and tests for resizeArea method.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Refactored resize methods and test.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Added new methods for TF2 resize op.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Portion of resize algorithms from TF2
Signed-off-by: shugeo <sgazeos@gmail.com>
* Added routine to process resize with given algorithm.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Added new image resize via scale and translate process helper.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Cpu implementation for V2 image resize operation helpers.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Added implementation for lanczos5 algorithm of resize and test.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Added prints for span computing.
Signed-off-by: shugeo <sgazeos@gmail.com>
* The first working implementation and tests for lanczos5 resize.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Eliminated unneeded prints.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Refactored image_resize op and tests.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Lanczos3 resize implementation and tests.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Implemented bicubic resize algorithm and tests.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Added a couple of tests and cosmetic changes with image resize helper.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Added bilinear implementation for image resize.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Refactored bicubic algorithm and also implemented area and neighbor algorithms for image resize on cpu arch.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Added a couple of tests for nearest neighbor and area resize.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Cosmetic changes for cpu implementation and added cuda implementation for resize methods.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Separated cuda implementation of v2 image resize.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Added kernels for span calculation and span gathering with new image resize cuda implementation.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Refactored cuda implementation of image resize kernels.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Finished the first working implementation of image resize op and tests.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Fixed resize_images and image_resize ops.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Refactored shape construction and output validation.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Fixed test to initialize properly with float.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Added 3D input support for resize ops.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Fixed test for resize_images op.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Fixed test and call for resize_images op.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Refactored image_resize op output data type handling for nearest neighbors method and tests.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Fixed issue with wrong resize method.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Added check for wrong resize methods for resize ops.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Refactored resize methods and test.
Signed-off-by: shugeo <sgazeos@gmail.com>
* Added output data type validation for given resize method.
Signed-off-by: shugeo <sgazeos@gmail.com>
* - ResizeMethod rearranged in order to match C++ side
- minor test fix
Signed-off-by: raver119@gmail.com <raver119@gmail.com>
* Refactored resize_images op.
Signed-off-by: shugeo <sgazeos@gmail.com>
Co-authored-by: raver119@gmail.com <raver119@gmail.com>
* - disable mkldnn concat when number of input arrays > 3072
Signed-off-by: Yurii <iuriish@yahoo.com>
* - get rid of loop when calculating number of input arrays
Signed-off-by: Yurii <iuriish@yahoo.com>