cavis/libnd4j/include/ops/declarable/helpers/cpu
shugeo 330a69d4e2
Shugeo solve ls (#203)
* lstsq op. Initial commit.

Signed-off-by: shugeo <sgazeos@gmail.com>

* Least squares linear problem solve op (lstsq). Cpu draft implementation.

Signed-off-by: shugeo <sgazeos@gmail.com>

* Fixed shape routine and tests.

Signed-off-by: shugeo <sgazeos@gmail.com>

* Added test for lstsq op.

Signed-off-by: shugeo <sgazeos@gmail.com>

* Rectification for lstsq op implementation.

Signed-off-by: shugeo <sgazeos@gmail.com>

* Corrected test to avoid numerical inconsistensy.

Signed-off-by: shugeo <sgazeos@gmail.com>

* Added prints for check computing.

Signed-off-by: shugeo <sgazeos@gmail.com>

* Corrected tests to use evalueate facility instead.

Signed-off-by: shugeo <sgazeos@gmail.com>

* CPU implementation of MatrixSolveLs op and tests.

Signed-off-by: shugeo <sgazeos@gmail.com>

* Added cuda implementation for helpers with lstsq op.

Signed-off-by: shugeo <sgazeos@gmail.com>

* Refactored tests for lstsq op.

Signed-off-by: shugeo <sgazeos@gmail.com>

* Added processing for empty inputs.

Signed-off-by: shugeo <sgazeos@gmail.com>

* Merged tests.

Signed-off-by: shugeo <sgazeos@gmail.com>

* Refactored lstsq op for fast case.

Signed-off-by: shugeo <sgazeos@gmail.com>

* Fixed test.

Signed-off-by: shugeo <sgazeos@gmail.com>

* Refactored lstsq op.

Signed-off-by: shugeo <sgazeos@gmail.com>

* Fixed some issues with solve.

Signed-off-by: shugeo <sgazeos@gmail.com>

* Fixed lstsq op to avoid erros.

Signed-off-by: shugeo <sgazeos@gmail.com>

* Added kernel for giagonal factor

Signed-off-by: shugeo <sgazeos@gmail.com>

* lstsq wrapper and triangular_solve fixed

* Added proper processing empty inputs and test.

Signed-off-by: shugeo <sgazeos@gmail.com>

* SequenceMask test

* Build fixed

* Added proper processing of empty inputs with solve op.

Signed-off-by: shugeo <sgazeos@gmail.com>

* Mapping added

* Added check of input shapes with solve op.

Signed-off-by: shugeo <sgazeos@gmail.com>

* Added a couple of tests for lstsq op and minor changes with cuda helper for one.'

Signed-off-by: shugeo <sgazeos@gmail.com>

* Tests on

* Refactored test for lstsq op.

Signed-off-by: shugeo <sgazeos@gmail.com>

* Fixed test

* Added another approach for lstsq op aka solve_ls.

Signed-off-by: shugeo <sgazeos@gmail.com>

* Finished cpu part for solve_ls op helpers.

* Added helper for low triangular matrix inversion.

Signed-off-by: shugeo <sgazeos@gmail.com>

* Refactored alternate solve_ls cpu implementation.

Signed-off-by: shugeo <sgazeos@gmail.com>

* Removed alternate approach for solve_ls op. Added multithreading with matrix inversion.

Signed-off-by: shugeo <sgazeos@gmail.com>

* Assert fixed

* Refactored multithreading for inverse matricies.

Signed-off-by: shugeo <sgazeos@gmail.com>

Co-authored-by: Alexander Stoyakin <alexander.stoyakin@gmail.com>
2020-02-28 11:37:26 +03:00
..
compilation_units More compilation units (#246) 2020-02-17 10:23:05 +03:00
BarnesHutTsne.cpp Loops auto-vectorization problem fix (#274) 2020-02-26 21:12:19 +03:00
README.md Merge master to upstream (#7945) 2019-06-27 18:37:04 +03:00
activations.cpp Loops auto-vectorization problem fix (#274) 2020-02-26 21:12:19 +03:00
addBias.cpp Loops auto-vectorization problem fix (#274) 2020-02-26 21:12:19 +03:00
adjust_hue.cpp Minor improvements (#255) 2020-02-20 11:43:26 +03:00
adjust_saturation.cpp Minor improvements (#255) 2020-02-20 11:43:26 +03:00
axis.cpp Loops auto-vectorization problem fix (#274) 2020-02-26 21:12:19 +03:00
batched_gemm.cpp Minor improvements (#255) 2020-02-20 11:43:26 +03:00
batchnorm.cpp Loops auto-vectorization problem fix (#274) 2020-02-26 21:12:19 +03:00
betaInc.cpp Minor improvements (#255) 2020-02-20 11:43:26 +03:00
col2im.cpp Loops auto-vectorization problem fix (#274) 2020-02-26 21:12:19 +03:00
compare_elem.cpp Various fixes (#143) 2020-01-04 13:45:07 +11:00
concat.cpp Minor improvements (#255) 2020-02-20 11:43:26 +03:00
confusion.cpp Minor improvements (#255) 2020-02-20 11:43:26 +03:00
convolutions.cpp Oleh tenzor mmul (#231) 2020-02-13 20:33:54 +03:00
crop_and_resize.cpp More compilation units (#246) 2020-02-17 10:23:05 +03:00
crop_and_resize.hpp Minor improvements (#255) 2020-02-20 11:43:26 +03:00
cross.cpp Minor improvements (#255) 2020-02-20 11:43:26 +03:00
d_t_s.cpp Minor improvements (#255) 2020-02-20 11:43:26 +03:00
diGamma.cpp Minor improvements (#255) 2020-02-20 11:43:26 +03:00
diag.cpp [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
dilation2d.cpp Loops auto-vectorization problem fix (#274) 2020-02-26 21:12:19 +03:00
dropout.cpp Loops auto-vectorization problem fix (#274) 2020-02-26 21:12:19 +03:00
dynamic.cpp Loops auto-vectorization problem fix (#274) 2020-02-26 21:12:19 +03:00
extract_patches.cpp Minor improvements (#255) 2020-02-20 11:43:26 +03:00
fake_quantization.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
flatten.cpp Loops auto-vectorization problem fix (#274) 2020-02-26 21:12:19 +03:00
gather.cpp Minor improvements (#255) 2020-02-20 11:43:26 +03:00
gradient.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
gru.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
hamming.cpp Minor improvements (#255) 2020-02-20 11:43:26 +03:00
hashcode.cpp Loops auto-vectorization problem fix (#274) 2020-02-26 21:12:19 +03:00
histogram.cpp Loops auto-vectorization problem fix (#274) 2020-02-26 21:12:19 +03:00
histogramFixedWidth.cpp [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
im2col.cpp Loops auto-vectorization problem fix (#274) 2020-02-26 21:12:19 +03:00
image_draw_bounding_boxes.cpp Shugeo image resize bicubic (#56) 2019-11-20 21:11:04 +02:00
image_resize.cpp Loops auto-vectorization problem fix (#274) 2020-02-26 21:12:19 +03:00
image_suppression.cpp Shugeo_release_fixes3 (#81) 2019-11-28 21:08:51 +03:00
imagesHelpers.cpp Minor improvements (#255) 2020-02-20 11:43:26 +03:00
ismax.cpp Minor improvements (#255) 2020-02-20 11:43:26 +03:00
legacy_helper.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
lgamma.cpp Shugeo lgamma (#170) 2020-01-20 12:29:36 +03:00
lrn.cpp Loops auto-vectorization problem fix (#274) 2020-02-26 21:12:19 +03:00
lstm.cpp Loops auto-vectorization problem fix (#274) 2020-02-26 21:12:19 +03:00
lstsq.cpp Shugeo solve ls (#203) 2020-02-28 11:37:26 +03:00
lup.cpp Shugeo solve ls (#203) 2020-02-28 11:37:26 +03:00
matrixSetDiag.cpp [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
matrix_band.cpp Loops auto-vectorization problem fix (#274) 2020-02-26 21:12:19 +03:00
matrix_diag_part.cpp Minor improvements (#255) 2020-02-20 11:43:26 +03:00
max_pooling.cpp [WIP] multi-device support (#80) 2019-08-14 16:52:34 +03:00
meshgrid.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
minimax.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
nth_element.cpp Minor improvements (#255) 2020-02-20 11:43:26 +03:00
one_hot.cpp Minor improvements (#255) 2020-02-20 11:43:26 +03:00
percentile.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
polyGamma.cpp Minor improvements (#255) 2020-02-20 11:43:26 +03:00
prefix.cpp Loops auto-vectorization problem fix (#274) 2020-02-26 21:12:19 +03:00
print_variable.cpp String changes (#3) 2020-01-04 13:27:50 +03:00
qr.cpp Shugeo solve ls (#203) 2020-02-28 11:37:26 +03:00
random.cpp Loops auto-vectorization problem fix (#274) 2020-02-26 21:12:19 +03:00
random_crop.cpp Loops auto-vectorization problem fix (#274) 2020-02-26 21:12:19 +03:00
range.cpp Minor improvements (#255) 2020-02-20 11:43:26 +03:00
reverse.cpp Minor improvements (#255) 2020-02-20 11:43:26 +03:00
roll.cpp Loops auto-vectorization problem fix (#274) 2020-02-26 21:12:19 +03:00
s_t_b.cpp Loops auto-vectorization problem fix (#274) 2020-02-26 21:12:19 +03:00
s_t_d.cpp Minor improvements (#255) 2020-02-20 11:43:26 +03:00
scatter.cpp Minor improvements (#255) 2020-02-20 11:43:26 +03:00
segment.cpp Loops auto-vectorization problem fix (#274) 2020-02-26 21:12:19 +03:00
sequence_mask.cpp Shugeo sequence mask fix2 (#216) 2020-02-06 21:06:50 +03:00
sg_cb.cpp Loops auto-vectorization problem fix (#274) 2020-02-26 21:12:19 +03:00
shift.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
solve.cpp Loops auto-vectorization problem fix (#274) 2020-02-26 21:12:19 +03:00
split.cpp Loops auto-vectorization problem fix (#274) 2020-02-26 21:12:19 +03:00
sru.cpp Minor improvements (#255) 2020-02-20 11:43:26 +03:00
stack.cpp Minor improvements (#255) 2020-02-20 11:43:26 +03:00
svd.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
toggle_bits.cpp Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
top_k.cpp Loops auto-vectorization problem fix (#274) 2020-02-26 21:12:19 +03:00
transforms.cpp Loops auto-vectorization problem fix (#274) 2020-02-26 21:12:19 +03:00
triangular_solve.cpp Loops auto-vectorization problem fix (#274) 2020-02-26 21:12:19 +03:00
weights.cpp Loops auto-vectorization problem fix (#274) 2020-02-26 21:12:19 +03:00
zeta.cpp Minor improvements (#255) 2020-02-20 11:43:26 +03:00

README.md

This folder contains OpenMP implementations for operations helpers. Basically suited for homogenous x86-like platforms.