cavis

History

* Libnd4j: TensorMMul backprop op #8174, raw implementation

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* Libnd4j: TensorMMul backprop op #8174 merge master and some corrections

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* Libnd4j: TensorMMul backprop op #8174 algorithm update, need testing, sync with  master

* Libnd4j: TensorMMul backprop op #8174 fixed incorrect B axes calculation

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* Libnd4j: TensorMMul backprop op #8174 optimize axes identification and fix bug of indeces overlapping, added first test. need testing with different shapes

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* Libnd4j: TensorMMul backprop op #8174 some fixes and improvements need more testing

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* Libnd4j: TensorMMul backprop op #8174 fixed order of matrix multiply

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* Libnd4j: TensorMMul backprop op #8174 fixed issue of incorrect axes definition, add tests based on TF, need additional testing for case dLdC not equal 1

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* Libnd4j: TensorMMul backprop op #8174 fixed scalar case add test

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* Libnd4j: TensorMMul backprop op #8174 fixed bp algorithm, axes definition, need some mode testing with different orders combination f,c; c,f f,f and add some checks for inputs

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* Libnd4j: TensorMMul backprop op #8174 some checks and corrections added tests, exists the problem with different input orders support A-f B-c and A-f B-f

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* Libnd4j: TensorMMul backprop op #8174 sync master

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* - correct bug in MmulHelper::tensorDot(a, b, c, axes_a, axes_b,permutForC)

Signed-off-by: Yurii <iuriish@yahoo.com>

* Libnd4j: TensorMMul backprop op #8174 code clean up and refactoring

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* - add check for linspase ordered permutations in ShapeUtils::evalShapeForTensorDot

Signed-off-by: Yurii <iuriish@yahoo.com>

* - provide additional code in shape::reshape stuff in order to reduce amount of allocation/copy operations during reshaping procedure

Signed-off-by: Yurii <iuriish@yahoo.com>

* - further work on problem of wrong shape evaluation during permute/reshape procedures

Signed-off-by: Yurii <iuriish@yahoo.com>

* - still looking for bug reason in reshape/permute stuff

Signed-off-by: Yurii <iuriish@yahoo.com>

* - correct bug in transform cuda native ops

Signed-off-by: Yurii <iuriish@yahoo.com>

* - correct bug in NDArray::assign

Signed-off-by: Yurii <iuriish@yahoo.com>

* - remove old shape::reshape stuff

Signed-off-by: Yurii <iuriish@yahoo.com>

* - add possibility to disable copy of old buffer to new buffer during reshape operation in NDArray class

Signed-off-by: Yurii <iuriish@yahoo.com>

* - correct bug in tensorDot which had to do with wrong pointers assigments

Signed-off-by: Yurii <iuriish@yahoo.com>

Co-authored-by: Oleh <oleg.semeniv@gmail.com>

2020-02-13 20:33:54 +03:00

legacy

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

activations.cu

String changes (#3 )

2020-01-04 13:27:50 +03:00

addBias.cu

Shyrma deconv3 (#69 )

2019-11-21 21:17:30 +02:00

adjust_hue.cu

Fix for hsv and rgb ranges (#136 )

2019-12-20 08:48:30 +03:00

adjust_saturation.cu

[WIP] bunch of improvements (#257 )

2019-09-11 20:12:09 +03:00

axis.cu

Shugeo cuda docs1 (#249 )

2019-09-09 16:27:45 +03:00

BarnesHutTsne.cu

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

batched_gemm.cu

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

batchnorm.cu

- fix wrong calculation of elements offsets in batchnorm op when input arrays have unusual (#169 )

2020-01-11 00:14:20 +03:00

betaInc.cu

DNNL/MKLDNN dilated causal conv1d + betainc (#103 )

2019-12-04 14:50:17 +03:00

col2im.cu

Shyrma deconv3 (#69 )

2019-11-21 21:17:30 +02:00

compare_elem.cu

[WIP] bunch of improvements (#257 )

2019-09-11 20:12:09 +03:00

concat.cu

[WIP] CUDA concat tweak (#148 )

2019-12-24 17:01:03 +03:00

confusion.cu

[WIP] bunch of improvements (#257 )

2019-09-11 20:12:09 +03:00

convolutions.cu

Oleh tenzor mmul (#231 )

2020-02-13 20:33:54 +03:00

cross.cu

[WIP] bunch of improvements (#257 )

2019-09-11 20:12:09 +03:00

d_t_s.cu

Merge master to upstream (#7945 )

2019-06-27 18:37:04 +03:00

diag.cu

Shugeo cuda doc2 (#255 )

2019-09-11 21:04:43 +03:00

diGamma.cu

Shyrma adjust (#98 )

2019-12-03 09:40:45 +03:00

dilation2d.cu

[WIP] bunch of improvements (#257 )

2019-09-11 20:12:09 +03:00

dropout.cu

[WIP] bunch of improvements (#257 )

2019-09-11 20:12:09 +03:00

dynamic.cu

[WIP] bunch of improvements (#257 )

2019-09-11 20:12:09 +03:00

extract_patches.cu

Shugeo cuda doc2 (#255 )

2019-09-11 21:04:43 +03:00

fake_quantization.cu

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

flatten.cu

[WIP] bunch of improvements (#257 )

2019-09-11 20:12:09 +03:00

gather_nd.cu

Shyrma scatter (#84 )

2019-11-26 20:29:09 +03:00

gather.cu

Shyrma scatter (#84 )

2019-11-26 20:29:09 +03:00

gradient.cu

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

gru.cu

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

hamming.cu

[WIP] bunch of improvements (#257 )

2019-09-11 20:12:09 +03:00

hashcode.cu

syncthreads (#136 )

2019-08-20 18:28:43 +03:00

histogram.cu

[WIP] minor (#218 )

2019-09-02 11:25:48 +03:00

histogramFixedWidth.cu

[WIP] bunch of improvements (#257 )

2019-09-11 20:12:09 +03:00

im2col.cu

[WIP] bunch of improvements (#257 )

2019-09-11 20:12:09 +03:00

image_draw_bounding_boxes.cu

Shugeo release fix1 (#61 )

2019-11-20 13:37:48 +02:00

image_resize.cu

Shugeo resize area fix4 (#229 )

2020-02-12 19:02:42 +03:00

image_suppression.cu

Shugeo_release_fixes3 (#81 )

2019-11-28 21:08:51 +03:00

imagesHelpers.cu

[WIP] Oleh rgb yuv (#147 )

2019-12-24 18:30:54 +03:00

ismax.cu

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

legacy_helper.cu

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

lgamma.cu

cuDNN integration (#150 )

2020-01-20 21:32:46 +03:00

lrn.cu

[WIP] minor (#218 )

2019-09-02 11:25:48 +03:00

lstm.cu

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

lup.cu

Fixed lu for cuda platform and tests. (#158 )

2020-01-02 23:25:41 +03:00

matrix_band.cu

Shugeo cuda doc2 (#255 )

2019-09-11 21:04:43 +03:00

matrix_diag_part.cu

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

matrixSetDiag.cu

[WIP] bunch of improvements (#257 )

2019-09-11 20:12:09 +03:00

max_pooling.cu

[WIP] bunch of improvements (#257 )

2019-09-11 20:12:09 +03:00

maximum.cu

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

merge.cu

[WIP] bunch of improvements (#257 )

2019-09-11 20:12:09 +03:00

meshgrid.cu

[WIP] bunch of improvements (#257 )

2019-09-11 20:12:09 +03:00

minimum.cu

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

nth_element.cu

Shugeo cuda doc2 (#255 )

2019-09-11 21:04:43 +03:00

one_hot.cu

[WIP] bunch of improvements (#257 )

2019-09-11 20:12:09 +03:00

pad.cu

[WIP] bunch of improvements (#257 )

2019-09-11 20:12:09 +03:00

percentile.cu

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

polyGamma.cu

Shyrma adjust (#98 )

2019-12-03 09:40:45 +03:00

prefix.cu

[WIP] bunch of improvements (#257 )

2019-09-11 20:12:09 +03:00

print_variable.cu

String changes (#3 )

2020-01-04 13:27:50 +03:00

qr.cu

Shugeo qr (#153 )

2020-01-22 13:59:36 +03:00

random_crop.cu

Eclipse Migration Initial Commit

2019-06-06 15:21:15 +03:00

random.cu

Oleh multinomial (#163 )

2020-01-06 22:35:05 +03:00

range.cu

[WIP] minor (#218 )

2019-09-02 11:25:48 +03:00

README.md

Merge master to upstream (#7945 )

2019-06-27 18:37:04 +03:00

reverse.cu

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

roll.cu

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

s_t_b.cu

Oleh tenzor mmul (#231 )

2020-02-13 20:33:54 +03:00

s_t_d.cu

Merge master to upstream (#7945 )

2019-06-27 18:37:04 +03:00

scatter_simple.cu

[WIP] bunch of improvements (#257 )

2019-09-11 20:12:09 +03:00

scatter_update.cu

[WIP] bunch of improvements (#257 )

2019-09-11 20:12:09 +03:00

scatter.cu

Shyrma scatter (#84 )

2019-11-26 20:29:09 +03:00

segment_max.cu

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

segment_mean.cu

[WIP] bunch of improvements (#257 )

2019-09-11 20:12:09 +03:00

segment_min.cu

[WIP] bunch of improvements (#257 )

2019-09-11 20:12:09 +03:00

segment_prod.cu

Shugeo_release_fixes3 (#81 )

2019-11-28 21:08:51 +03:00

segment_sqrtn.cu

Shugeo_release_fixes3 (#81 )

2019-11-28 21:08:51 +03:00

segment_sum.cu

[WIP] bunch of improvements (#257 )

2019-09-11 20:12:09 +03:00

segment.cu

[WIP] multi-device support (#80 )

2019-08-14 16:52:34 +03:00

sequence_mask.cu

Shugeo sequence mask fix2 (#216 )

2020-02-06 21:06:50 +03:00

sg_cb.cu

Shugeo image resize bicubic (#56 )

2019-11-20 21:11:04 +02:00

shift.cu

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

solve.cu

Shugeo solve linear (#191 )

2020-02-04 08:59:11 +03:00

sru.cu

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

stack.cu

[WIP] bunch of improvements (#257 )

2019-09-11 20:12:09 +03:00

svd.cu

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

toggle_bits.cu

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

top_k.cu

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

transforms.cu

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

triangular_solve.cu

Shugeo solve linear (#191 )

2020-02-04 08:59:11 +03:00

weights.cu

[WIP] bunch of improvements (#257 )

2019-09-11 20:12:09 +03:00

zeta.cu

[WIP] bunch of improvements (#257 )

2019-09-11 20:12:09 +03:00

README.md

This folder contains CUDA-specific implementations for operations.