cavis/libnd4j/include/ops/declarable/helpers/cpu
Yurii Shyrma fe47f52896
Oleh tenzor mmul (#231)
* Libnd4j: TensorMMul backprop op #8174, raw implementation

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* Libnd4j: TensorMMul backprop op #8174 merge master and some corrections

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* Libnd4j: TensorMMul backprop op #8174 algorithm update, need testing, sync with  master

* Libnd4j: TensorMMul backprop op #8174 fixed incorrect B axes calculation

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* Libnd4j: TensorMMul backprop op #8174 optimize axes identification and fix bug of indeces overlapping, added first test. need testing with different shapes

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* Libnd4j: TensorMMul backprop op #8174 some fixes and improvements need more testing

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* Libnd4j: TensorMMul backprop op #8174 fixed order of matrix multiply

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* Libnd4j: TensorMMul backprop op #8174 fixed issue of incorrect axes definition, add tests based on TF, need additional testing for case dLdC not equal 1

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* Libnd4j: TensorMMul backprop op #8174 fixed scalar case add test

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* Libnd4j: TensorMMul backprop op #8174 fixed bp algorithm, axes definition, need some mode testing with different orders combination f,c; c,f f,f and add some checks for inputs

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* Libnd4j: TensorMMul backprop op #8174 some checks and corrections added tests, exists the problem with different input orders support A-f B-c and A-f B-f

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* Libnd4j: TensorMMul backprop op #8174 sync master

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* - correct bug in MmulHelper::tensorDot(a, b, c, axes_a, axes_b,permutForC)

Signed-off-by: Yurii <iuriish@yahoo.com>

* Libnd4j: TensorMMul backprop op #8174 code clean up and refactoring

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* - add check for linspase ordered permutations in ShapeUtils::evalShapeForTensorDot

Signed-off-by: Yurii <iuriish@yahoo.com>

* - provide additional code in shape::reshape stuff in order to reduce amount of allocation/copy operations during reshaping procedure

Signed-off-by: Yurii <iuriish@yahoo.com>

* - further work on problem of wrong shape evaluation during permute/reshape procedures

Signed-off-by: Yurii <iuriish@yahoo.com>

* - still looking for bug reason in reshape/permute stuff

Signed-off-by: Yurii <iuriish@yahoo.com>

* - correct bug in transform cuda native ops

Signed-off-by: Yurii <iuriish@yahoo.com>

* - correct bug in NDArray::assign

Signed-off-by: Yurii <iuriish@yahoo.com>

* - remove old shape::reshape stuff

Signed-off-by: Yurii <iuriish@yahoo.com>

* - add possibility to disable copy of old buffer to new buffer during reshape operation in NDArray class

Signed-off-by: Yurii <iuriish@yahoo.com>

* - correct bug in tensorDot which had to do with wrong pointers assigments

Signed-off-by: Yurii <iuriish@yahoo.com>

Co-authored-by: Oleh <oleg.semeniv@gmail.com>
2020-02-13 20:33:54 +03:00
..
BarnesHutTsne.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
README.md Merge master to upstream (#7945) 2019-06-27 18:37:04 +03:00
activations.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
addBias.cpp Initial performance improvement for Bias Add and etc #8556 (#217) 2020-02-08 15:31:30 +03:00
adjust_hue.cpp Fix for hsv and rgb ranges (#136) 2019-12-20 08:48:30 +03:00
adjust_saturation.cpp [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
axis.cpp [WIP] Roll rewritten (#128) 2019-08-17 14:15:08 +03:00
batched_gemm.cpp [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
batchnorm.cpp - fix wrong calculation of elements offsets in batchnorm op when input arrays have unusual (#169) 2020-01-11 00:14:20 +03:00
betaInc.cpp DNNL/MKLDNN dilated causal conv1d + betainc (#103) 2019-12-04 14:50:17 +03:00
col2im.cpp [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
compare_elem.cpp Various fixes (#143) 2020-01-04 13:45:07 +11:00
confusion.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
convolutions.cpp Oleh tenzor mmul (#231) 2020-02-13 20:33:54 +03:00
cross.cpp Oleh tenzor mmul (#231) 2020-02-13 20:33:54 +03:00
d_t_s.cpp [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
diGamma.cpp Shyrma adjust (#98) 2019-12-03 09:40:45 +03:00
diag.cpp [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
dilation2d.cpp Shyrma deconv3 (#69) 2019-11-21 21:17:30 +02:00
dropout.cpp [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
dynamic.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
extract_patches.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
fake_quantization.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
flatten.cpp [WIP] bunch of improvements (#257) 2019-09-11 20:12:09 +03:00
gather.cpp [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
gradient.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
gru.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
hamming.cpp [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
hashcode.cpp [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
histogram.cpp Minor fixes (#165) 2020-01-04 15:27:16 +03:00
histogramFixedWidth.cpp [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
im2col.cpp [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
image_draw_bounding_boxes.cpp Shugeo image resize bicubic (#56) 2019-11-20 21:11:04 +02:00
image_resize.cpp Shugeo resize area (#162) 2020-01-22 10:46:33 +03:00
image_suppression.cpp Shugeo_release_fixes3 (#81) 2019-11-28 21:08:51 +03:00
imagesHelpers.cpp [WIP] Oleh rgb yuv (#147) 2019-12-24 18:30:54 +03:00
ismax.cpp [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
legacy_helper.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
lgamma.cpp Shugeo lgamma (#170) 2020-01-20 12:29:36 +03:00
lrn.cpp [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
lstm.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
lup.cpp Shugeo solve linear (#191) 2020-02-04 08:59:11 +03:00
matrixSetDiag.cpp [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
matrix_band.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
matrix_diag_part.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
max_pooling.cpp [WIP] multi-device support (#80) 2019-08-14 16:52:34 +03:00
meshgrid.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
minimax.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
nth_element.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
one_hot.cpp [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
percentile.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
polyGamma.cpp Shyrma adjust (#98) 2019-12-03 09:40:45 +03:00
prefix.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
print_variable.cpp String changes (#3) 2020-01-04 13:27:50 +03:00
qr.cpp Shugeo qr (#153) 2020-01-22 13:59:36 +03:00
random.cpp Oleh multinomial (#163) 2020-01-06 22:35:05 +03:00
random_crop.cpp Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
range.cpp [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
reverse.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
roll.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
s_t_b.cpp Oleh tenzor mmul (#231) 2020-02-13 20:33:54 +03:00
s_t_d.cpp [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
scatter.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
segment.cpp [WIP] Memory limits (#167) 2020-01-24 10:11:09 +03:00
sequence_mask.cpp Shugeo sequence mask fix2 (#216) 2020-02-06 21:06:50 +03:00
sg_cb.cpp [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
shift.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
solve.cpp Shugeo sequence mask fix2 (#216) 2020-02-06 21:06:50 +03:00
sru.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
stack.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
svd.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
toggle_bits.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
top_k.cpp [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
transforms.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
triangular_solve.cpp Shugeo sequence mask fix2 (#216) 2020-02-06 21:06:50 +03:00
weights.cpp Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
zeta.cpp [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00

README.md

This folder contains OpenMP implementations for operations helpers. Basically suited for homogenous x86-like platforms.