029b84e2b7
* RL4J: Add generic update rule (#502) Signed-off-by: Alexandre Boulanger <aboulang2002@yahoo.com> * Shyrma reduce (#481) * - start working on improving of cpu legacy code for reduce ops Signed-off-by: Yurii <iuriish@yahoo.com> * - further work on improving legacy loops Signed-off-by: Yurii <iuriish@yahoo.com> * - still working on improving reduce ops Signed-off-by: Yurii <iuriish@yahoo.com> * - further work on improving reduce ops Signed-off-by: Yurii <iuriish@yahoo.com> * - testing speed run of new reduce op Signed-off-by: Yurii <iuriish@yahoo.com> * - working on improvement of default loop for reduce op Signed-off-by: Yurii <iuriish@yahoo.com> * - update signatures of stuff which calls reduce ops Signed-off-by: Yurii <iuriish@yahoo.com> * - make corrections in cuda reduce kernels Signed-off-by: Yurii <iuriish@yahoo.com> * - change loop for default case in broadcast legacy ops Signed-off-by: Yurii <iuriish@yahoo.com> * - comment some shape stuff Signed-off-by: Yurii <iuriish@yahoo.com> * - comment unnecessary prints in RNGtests Signed-off-by: Yurii <iuriish@yahoo.com> * - finish to resolve conflicts after master has been merged Signed-off-by: Yurii <iuriish@yahoo.com> * - get rid of some compilation mistakes of cuda stuff Signed-off-by: Yurii <iuriish@yahoo.com> * - minor changes Signed-off-by: Yurii <iuriish@yahoo.com> * - further search for bug causing crash on java test Signed-off-by: Yurii <iuriish@yahoo.com> * - add scalar case in reduce_ ... exec stuff Signed-off-by: Yurii <iuriish@yahoo.com> * - minor corrections in NAtiveOps.cu Signed-off-by: Yurii <iuriish@yahoo.com> * - add switch to scalar case execReduceXD functions Signed-off-by: Yurii <iuriish@yahoo.com> * - add support for vectors old shape in ConstantShapeHelper::createShapeInfoWithNoUnitiesForReduce Signed-off-by: Yurii <iuriish@yahoo.com> * - correct cuda mirrorPad Signed-off-by: Yurii <iuriish@yahoo.com> * - add support for vectors old shape in cuda createShapeInfoWithNoUnitiesForReduce Signed-off-by: Yurii <iuriish@yahoo.com> Co-authored-by: raver119 <raver119@gmail.com> * Add support for CUDA 11.0 (#492) * Add support for CUDA 11.0 * libnd4j tweaks for CUDA 11 Signed-off-by: raver119@gmail.com <raver119@gmail.com> * bindings update, again? Signed-off-by: raver119@gmail.com <raver119@gmail.com> * * Update versions of JavaCPP Presets for FFmpeg, OpenBLAS, and NumPy * update API to match CUDA 8 Signed-off-by: raver119@gmail.com <raver119@gmail.com> * * Update version of JavaCPP Presets for CPython * C++ updated for cuDNN 8.0 Signed-off-by: raver119@gmail.com <raver119@gmail.com> * one more test Signed-off-by: raver119@gmail.com <raver119@gmail.com> * one more test Signed-off-by: raver119@gmail.com <raver119@gmail.com> * one more test Signed-off-by: raver119@gmail.com <raver119@gmail.com> * 128-bit alignment for workspaces Signed-off-by: raver119@gmail.com <raver119@gmail.com> * change seed in 1 test Signed-off-by: raver119@gmail.com <raver119@gmail.com> * Fix dependecy duplication in python4j-parent pom * Fix group id for in python4j-numpy * few tests tweaked Signed-off-by: raver119@gmail.com <raver119@gmail.com> * Remove macosx-x86_64-gpu from nd4j-tests-tensorflow * few minor tweaks for IndexReduce Signed-off-by: raver119@gmail.com <raver119@gmail.com> * one test removed Signed-off-by: raver119@gmail.com <raver119@gmail.com> Co-authored-by: raver119@gmail.com <raver119@gmail.com> Co-authored-by: Serhii Shepel <9946053+sshepel@users.noreply.github.com> * RL4J: Add SyncTrainer and AgentLearnerBuilder for a few algorithms (#504) Signed-off-by: Alexandre Boulanger <aboulang2002@yahoo.com> Co-authored-by: Alexandre Boulanger <44292157+aboulang2002@users.noreply.github.com> Co-authored-by: Yurii Shyrma <iuriish@yahoo.com> Co-authored-by: raver119 <raver119@gmail.com> Co-authored-by: Serhii Shepel <9946053+sshepel@users.noreply.github.com> |
||
---|---|---|
.. | ||
compression | ||
legacy | ||
BarnesHutTsne.cu | ||
README.md | ||
activations.cu | ||
addBias.cu | ||
adjust_hue.cu | ||
adjust_saturation.cu | ||
axis.cu | ||
batched_gemm.cu | ||
batchnorm.cu | ||
betaInc.cu | ||
clip.cu | ||
col2im.cu | ||
compare_elem.cu | ||
concat.cu | ||
confusion.cu | ||
convolutions_col2vol.cu | ||
convolutions_conv2d.cu | ||
convolutions_conv2dBP.cu | ||
convolutions_depthwiseConv2d.cu | ||
convolutions_depthwiseConv2dBP.cu | ||
convolutions_pooling2d.cu | ||
convolutions_pooling2dBP.cu | ||
convolutions_pooling3d.cu | ||
convolutions_pooling3dBP.cu | ||
convolutions_sconv2d.cu | ||
convolutions_upsampling2d.cu | ||
convolutions_upsampling2dBP.cu | ||
convolutions_upsampling3d.cu | ||
convolutions_upsampling3dBP.cu | ||
convolutions_vol2col.cu | ||
cross.cu | ||
d_t_s.cu | ||
diGamma.cu | ||
diag.cu | ||
dilation2d.cu | ||
dropout.cu | ||
dynamic.cu | ||
extract_patches.cu | ||
fake_quantization.cu | ||
flatten.cu | ||
gather.cu | ||
gather_nd.cu | ||
gradient.cu | ||
hamming.cu | ||
hashcode.cu | ||
histogram.cu | ||
histogramFixedWidth.cu | ||
im2col.cu | ||
image_draw_bounding_boxes.cu | ||
image_resize.cu | ||
image_resize_v2.cu | ||
image_suppression.cu | ||
imagesHelpers.cu | ||
indexReductions.cu | ||
ismax.cu | ||
legacy_helper.cu | ||
lgamma.cu | ||
lrn.cu | ||
lstm.cu | ||
lstsq.cu | ||
lup.cu | ||
matrixSetDiag.cu | ||
matrix_band.cu | ||
matrix_diag_part.cu | ||
max_pooling.cu | ||
maximum.cu | ||
merge.cu | ||
meshgrid.cu | ||
minimum.cu | ||
nth_element.cu | ||
one_hot.cu | ||
pad.cu | ||
percentile.cu | ||
polyGamma.cu | ||
prefix.cu | ||
print_variable.cu | ||
qr.cu | ||
random.cu | ||
randomShuffle.cu | ||
random_crop.cu | ||
range.cu | ||
reverse.cu | ||
roll.cu | ||
s_t_b.cu | ||
s_t_d.cu | ||
scatter.cu | ||
scatter_simple.cu | ||
scatter_update.cu | ||
segment.cu | ||
segment_max.cu | ||
segment_mean.cu | ||
segment_min.cu | ||
segment_prod.cu | ||
segment_sqrtn.cu | ||
segment_sum.cu | ||
sequence_mask.cu | ||
sg_cb.cu | ||
shift.cu | ||
solve.cu | ||
split.cu | ||
sru.cu | ||
stack.cu | ||
svd.cu | ||
toggle_bits.cu | ||
top_k.cu | ||
transforms.cu | ||
triangular_solve.cu | ||
updaterAdaDelta.cu | ||
updaterAdaGrad.cu | ||
updaterAdaMax.cu | ||
updaterAdam.cu | ||
updaterAmsGrad.cu | ||
updaterNadam.cu | ||
updaterNesterovs.cu | ||
updaterRmsProp.cu | ||
weights.cu | ||
zeta.cu |
README.md
This folder contains CUDA-specific implementations for operations.