cavis

* Libnd4j: TensorMMul backprop op #8174, raw implementation

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* Libnd4j: TensorMMul backprop op #8174 merge master and some corrections

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* Libnd4j: TensorMMul backprop op #8174 algorithm update, need testing, sync with master

* Libnd4j: TensorMMul backprop op #8174 fixed incorrect B axes calculation

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* Libnd4j: TensorMMul backprop op #8174 optimize axes identification and fix bug of indices overlapping, added first test. need testing with different shapes

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* Libnd4j: TensorMMul backprop op #8174 some fixes and improvements need more testing

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* Libnd4j: TensorMMul backprop op #8174 fixed order of matrix multiply

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* Libnd4j: TensorMMul backprop op #8174 fixed issue of incorrect axes definition, add tests based on TF, need additional testing for case dLdC not equal 1

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* Libnd4j: TensorMMul backprop op #8174 fixed scalar case add test

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* Libnd4j: TensorMMul backprop op #8174 fixed bp algorithm, axes definition, need some more testing with different orders combination f,c; c,f f,f and add some checks for inputs

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* Libnd4j: TensorMMul backprop op #8174 some checks and corrections, added tests; a problem remains with support for different input orders A-f B-c and A-f B-f

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* Libnd4j: TensorMMul backprop op #8174 sync master

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* - correct bug in MmulHelper::tensorDot(a, b, c, axes_a, axes_b,permutForC)

Signed-off-by: Yurii <iuriish@yahoo.com>

* Libnd4j: TensorMMul backprop op #8174 code clean up and refactoring

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* - add check for linspace ordered permutations in ShapeUtils::evalShapeForTensorDot

Signed-off-by: Yurii <iuriish@yahoo.com>

* - provide additional code in shape::reshape stuff in order to reduce amount of allocation/copy operations during reshaping procedure

Signed-off-by: Yurii <iuriish@yahoo.com>

* - further work on problem of wrong shape evaluation during permute/reshape procedures

Signed-off-by: Yurii <iuriish@yahoo.com>

* - still looking for bug reason in reshape/permute stuff

Signed-off-by: Yurii <iuriish@yahoo.com>

* - correct bug in transform cuda native ops

Signed-off-by: Yurii <iuriish@yahoo.com>

* - correct bug in NDArray::assign

Signed-off-by: Yurii <iuriish@yahoo.com>

* - remove old shape::reshape stuff

Signed-off-by: Yurii <iuriish@yahoo.com>

* - add possibility to disable copy of old buffer to new buffer during reshape operation in NDArray class

Signed-off-by: Yurii <iuriish@yahoo.com>

* - correct bug in tensorDot which had to do with wrong pointer assignments

Signed-off-by: Yurii <iuriish@yahoo.com>

Co-authored-by: Oleh <oleg.semeniv@gmail.com>

2020-02-13 20:33:54 +03:00

activations.cpp

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

addBias.cpp

Initial performance improvement for Bias Add and etc #8556 (#217 )

2020-02-08 15:31:30 +03:00

adjust_hue.cpp

Fix for hsv and rgb ranges (#136 )

2019-12-20 08:48:30 +03:00

adjust_saturation.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

axis.cpp

[WIP] Roll rewritten (#128 )

2019-08-17 14:15:08 +03:00

BarnesHutTsne.cpp

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

batched_gemm.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

batchnorm.cpp

- fix wrong calculation of elements offsets in batchnorm op when input arrays have unusual (#169 )

2020-01-11 00:14:20 +03:00

betaInc.cpp

DNNL/MKLDNN dilated causal conv1d + betainc (#103 )

2019-12-04 14:50:17 +03:00

col2im.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

compare_elem.cpp

Various fixes (#143 )

2020-01-04 13:45:07 +11:00

confusion.cpp

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

convolutions.cpp

Oleh tenzor mmul (#231 )

2020-02-13 20:33:54 +03:00

cross.cpp

Oleh tenzor mmul (#231 )

2020-02-13 20:33:54 +03:00

d_t_s.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

diag.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

diGamma.cpp

Shyrma adjust (#98 )

2019-12-03 09:40:45 +03:00

dilation2d.cpp

Shyrma deconv3 (#69 )

2019-11-21 21:17:30 +02:00

dropout.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

dynamic.cpp

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

extract_patches.cpp

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

fake_quantization.cpp

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

flatten.cpp

[WIP] bunch of improvements (#257 )

2019-09-11 20:12:09 +03:00

gather.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

gradient.cpp

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

gru.cpp

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

hamming.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

hashcode.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

histogram.cpp

Minor fixes (#165 )

2020-01-04 15:27:16 +03:00

histogramFixedWidth.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

im2col.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

image_draw_bounding_boxes.cpp

Shugeo image resize bicubic (#56 )

2019-11-20 21:11:04 +02:00

image_resize.cpp

Shugeo resize area (#162 )

2020-01-22 10:46:33 +03:00

image_suppression.cpp

Shugeo_release_fixes3 (#81 )

2019-11-28 21:08:51 +03:00

imagesHelpers.cpp

[WIP] Oleh rgb yuv (#147 )

2019-12-24 18:30:54 +03:00

ismax.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

legacy_helper.cpp

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

lgamma.cpp

Shugeo lgamma (#170 )

2020-01-20 12:29:36 +03:00

lrn.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

lstm.cpp

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

lup.cpp

Shugeo solve linear (#191 )

2020-02-04 08:59:11 +03:00

matrix_band.cpp

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

matrix_diag_part.cpp

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

matrixSetDiag.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

max_pooling.cpp

[WIP] multi-device support (#80 )

2019-08-14 16:52:34 +03:00

meshgrid.cpp

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

minimax.cpp

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

nth_element.cpp

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

one_hot.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

percentile.cpp

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

polyGamma.cpp

Shyrma adjust (#98 )

2019-12-03 09:40:45 +03:00

prefix.cpp

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

print_variable.cpp

String changes (#3 )

2020-01-04 13:27:50 +03:00

qr.cpp

Shugeo qr (#153 )

2020-01-22 13:59:36 +03:00

random_crop.cpp

Eclipse Migration Initial Commit

2019-06-06 15:21:15 +03:00

random.cpp

Oleh multinomial (#163 )

2020-01-06 22:35:05 +03:00

range.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

README.md

Merge master to upstream (#7945 )

2019-06-27 18:37:04 +03:00

reverse.cpp

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

roll.cpp

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

s_t_b.cpp

Oleh tenzor mmul (#231 )

2020-02-13 20:33:54 +03:00

s_t_d.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

scatter.cpp

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

segment.cpp

[WIP] Memory limits (#167 )

2020-01-24 10:11:09 +03:00

sequence_mask.cpp

Shugeo sequence mask fix2 (#216 )

2020-02-06 21:06:50 +03:00

sg_cb.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

shift.cpp

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

solve.cpp

Shugeo sequence mask fix2 (#216 )

2020-02-06 21:06:50 +03:00

sru.cpp

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

stack.cpp

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

svd.cpp

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

toggle_bits.cpp

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

top_k.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

transforms.cpp

Shyrma temp (#131 )

2019-12-20 22:35:39 +03:00

triangular_solve.cpp

Shugeo sequence mask fix2 (#216 )

2020-02-06 21:06:50 +03:00

weights.cpp

Eclipse Migration Initial Commit

2019-06-06 15:21:15 +03:00

zeta.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

README.md

This folder contains OpenMP implementations of operation helpers. They are suited mainly to homogeneous x86-like platforms.