cavis/libnd4j/include/ops/declarable/helpers/cpu
raver119 98e2814879
Platform helpers (#8216)
* platform helpers draft

Signed-off-by: raver119 <raver119@gmail.com>

* typo

Signed-off-by: raver119 <raver119@gmail.com>

* disable platform cmake

Signed-off-by: raver119 <raver119@gmail.com>

* another draft

Signed-off-by: raver119 <raver119@gmail.com>

* mkldnn convolution refactored

Signed-off-by: raver119 <raver119@gmail.com>

* minor tweaks

Signed-off-by: raver119 <raver119@gmail.com>

* one more safety check

Signed-off-by: raver119 <raver119@gmail.com>

* prototype works

Signed-off-by: raver119 <raver119@gmail.com>

* meh

Signed-off-by: raver119 <raver119@gmail.com>

* force static library mode for mkldnn

Signed-off-by: raver119 <raver119@gmail.com>

* - ismax fix
- experimental arg fix
- don't enforce openblas on Apple hardware

Signed-off-by: raver119 <raver119@gmail.com>

* bunch of small fixes

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* declare concurrent

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* - MKLDNN version upgrade to 1.0.2
- avgpool2d/maxpool2d APIs update

Signed-off-by: raver119 <raver119@gmail.com>

* - avgpool2d_bp/maxpool2d_bp APIs update

Signed-off-by: raver119 <raver119@gmail.com>

* - conv2d/batchnorm APIs update

Signed-off-by: raver119 <raver119@gmail.com>

* - lrn/conv2d_bp/conv3d/conv3d_bp APIs update

Signed-off-by: raver119 <raver119@gmail.com>

* all ops converted to MKLDNN 1.x

Signed-off-by: raver119 <raver119@gmail.com>

* bunch of tweaks

Signed-off-by: raver119 <raver119@gmail.com>

* namespace for platform helpers

Signed-off-by: raver119 <raver119@gmail.com>

* make sure platform helpers aren't optimized out

Signed-off-by: raver119 <raver119@gmail.com>

* build cpu_features on x86 systems

Signed-off-by: raver119 <raver119@gmail.com>

* build cpu_features on x86 systems

Signed-off-by: raver119 <raver119@gmail.com>

* more of cpu_features

Signed-off-by: raver119 <raver119@gmail.com>

* - mkldnn removed from java
- cpu_features checks in CpuNDArrayFactory

Signed-off-by: raver119 <raver119@gmail.com>

* F16C definition renamed

Signed-off-by: raver119 <raver119@gmail.com>

* some mkldnn rearrangements

Signed-off-by: raver119 <raver119@gmail.com>

* check supported instructions before doing anything

Signed-off-by: raver119 <raver119@gmail.com>

* typo

Signed-off-by: raver119 <raver119@gmail.com>

* missed impl

Signed-off-by: raver119 <raver119@gmail.com>

* BUILD_PIC option

Signed-off-by: raver119 <raver119@gmail.com>

* conv2d fix

Signed-off-by: raver119 <raver119@gmail.com>

* avgpool3d fix

Signed-off-by: raver119 <raver119@gmail.com>

* avgpool3d_bp fix

Signed-off-by: raver119 <raver119@gmail.com>

* avgpool2d_bp leak fix

Signed-off-by: raver119 <raver119@gmail.com>

* avgpool3d_bp leak fix

Signed-off-by: raver119 <raver119@gmail.com>

* maxpool bp leaks fixed

Signed-off-by: raver119 <raver119@gmail.com>

* printf removed

Signed-off-by: raver119 <raver119@gmail.com>

* batchnorm fix

Signed-off-by: raver119 <raver119@gmail.com>

* AVX warning/error polishing

Signed-off-by: AlexDBlack <blacka101@gmail.com>

* Fix

Signed-off-by: AlexDBlack <blacka101@gmail.com>

* More polish

Signed-off-by: AlexDBlack <blacka101@gmail.com>

* Polish

Signed-off-by: AlexDBlack <blacka101@gmail.com>

* remove previous MKL-DNN support layer

Signed-off-by: raver119 <raver119@gmail.com>

* avx2 tweak

Signed-off-by: raver119 <raver119@gmail.com>

* allow static for apple

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* exclude mkldnn in one more place

Signed-off-by: raver119 <raver119@gmail.com>

* exclude mkldnn in one more place

Signed-off-by: raver119 <raver119@gmail.com>

* restore OPENBLAS_PATH use

Signed-off-by: raver119 <raver119@gmail.com>

* add runtime check for avx/avx2 support

Signed-off-by: raver119 <raver119@gmail.com>

* convolution_auto

Signed-off-by: raver119 <raver119@gmail.com>

* Add logic for helper argument

* minor test fix

Signed-off-by: raver119 <raver119@gmail.com>

* few tweaks

Signed-off-by: raver119 <raver119@gmail.com>

* few tweaks

Signed-off-by: raver119 <raver119@gmail.com>

* skip OpTracker props for non-x86 builds

Signed-off-by: raver119 <raver119@gmail.com>

* linux arm isn't x86 :)

Signed-off-by: raver119 <raver119@gmail.com>

* avx-512

Signed-off-by: raver119 <raver119@gmail.com>

* CUDA presets fix

Signed-off-by: raver119 <raver119@gmail.com>

* BUILD_PIC

Signed-off-by: raver119 <raver119@gmail.com>

* prefetchw for avx2

Signed-off-by: raver119 <raver119@gmail.com>

* BUILD_PIC again

Signed-off-by: raver119 <raver119@gmail.com>
2019-09-11 21:50:28 +03:00
BarnesHutTsne.cpp [WIP] More of CUDA (#95) 2019-08-05 11:27:05 +10:00
README.md Merge master to upstream (#7945) 2019-06-27 18:37:04 +03:00
activations.cpp [WIP] more CUDA stuff (#57) 2019-07-20 23:05:21 +10:00
addBias.cpp Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
adjust_hue.cpp [WIP] multi-device support (#80) 2019-08-14 16:52:34 +03:00
adjust_saturation.cpp [WIP] multi-device support (#80) 2019-08-14 16:52:34 +03:00
axis.cpp [WIP] Roll rewritten (#128) 2019-08-17 14:15:08 +03:00
batched_gemm.cpp Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
batchnorm.cpp [WIP] more CUDA stuff (#57) 2019-07-20 23:05:21 +10:00
betaInc.cpp Merge master to upstream (#7945) 2019-06-27 18:37:04 +03:00
col2im.cpp [WIP] multi-device support (#80) 2019-08-14 16:52:34 +03:00
compare_elem.cpp int -> long (#108) 2019-08-10 09:14:18 +03:00
confusion.cpp Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
convolutions.cpp Platform helpers (#8216) 2019-09-11 21:50:28 +03:00
cross.cpp int -> long (#108) 2019-08-10 09:14:18 +03:00
d_t_s.cpp Merge master to upstream (#7945) 2019-06-27 18:37:04 +03:00
diag.cpp Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
dilation2d.cpp [WIP] multi-device support (#80) 2019-08-14 16:52:34 +03:00
dropout.cpp [WIP] More of CUDA operations (#69) 2019-07-20 23:20:41 +10:00
dynamic.cpp Merge master to upstream (#7945) 2019-06-27 18:37:04 +03:00
extract_patches.cpp Shugeo cuda cuda (#105) 2019-08-07 15:29:17 +03:00
fake_quantization.cpp Merge master to upstream (#7945) 2019-06-27 18:37:04 +03:00
flatten.cpp [WIP] More of CUDA (#63) 2019-07-20 23:15:14 +10:00
gather.cpp Add libnd4j benchmarks (#3) 2019-07-20 22:54:44 +10:00
gradient.cpp [WIP] More of CUDA (#95) 2019-08-05 11:27:05 +10:00
gru.cpp [WIP] More of CUDA operations (#69) 2019-07-20 23:20:41 +10:00
hamming.cpp fixed args 2019-09-01 22:06:01 +03:00
hashcode.cpp [WIP] INDArray hashCode() impl (#50) 2019-07-20 22:22:11 +10:00
histogram.cpp [WIP] multi-device support (#80) 2019-08-14 16:52:34 +03:00
histogramFixedWidth.cpp [WIP] ops fixes (#168) 2019-08-26 19:37:05 +03:00
im2col.cpp [WIP] multi-device support (#80) 2019-08-14 16:52:34 +03:00
image_resize.cpp [WIP] multi-device support (#80) 2019-08-14 16:52:34 +03:00
image_suppression.cpp [WIP] More of CUDA operations (#69) 2019-07-20 23:20:41 +10:00
ismax.cpp - cpu isMax fix for multidim case + test 2019-08-23 18:44:37 +03:00
legacy_helper.cpp - add parameter alpha to elu and lrelu_bp (#213) 2019-08-31 20:57:39 +03:00
lrn.cpp Platform helpers (#8216) 2019-09-11 21:50:28 +03:00
lstm.cpp [WIP] More of CUDA (#95) 2019-08-05 11:27:05 +10:00
lup.cpp lup context fix (#164) 2019-08-24 16:57:48 +03:00
matrixSetDiag.cpp Snapshot update (#8194) 2019-09-03 22:02:02 +03:00
matrix_band.cpp Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
matrix_diag_part.cpp Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
max_pooling.cpp [WIP] multi-device support (#80) 2019-08-14 16:52:34 +03:00
meshgrid.cpp Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
minimax.cpp Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
nth_element.cpp Merge master to upstream (#7945) 2019-06-27 18:37:04 +03:00
one_hot.cpp Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
percentile.cpp Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
polyGamma.cpp Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
prefix.cpp Merge master to upstream (#7945) 2019-06-27 18:37:04 +03:00
random_crop.cpp Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
range.cpp Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
reverse.cpp Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
roll.cpp [WIP] Roll rewritten (#128) 2019-08-17 14:15:08 +03:00
s_t_b.cpp [WIP] stb/bts nd (#144) 2019-08-21 21:11:46 +03:00
s_t_d.cpp Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
scatter.cpp [WIP] ops fixes (#168) 2019-08-26 19:37:05 +03:00
segment.cpp [WIP] bitwise ops (#115) 2019-08-15 11:49:50 +03:00
sequence_mask.cpp [WIP] More of CUDA operations (#69) 2019-07-20 23:20:41 +10:00
sg_cb.cpp [WIP] more fixes (#159) 2019-08-23 19:20:50 +03:00
shift.cpp [WIP] right shift ops (#118) 2019-08-15 20:35:15 +03:00
sru.cpp [WIP] more CUDA stuff (#57) 2019-07-20 23:05:21 +10:00
stack.cpp [WIP] More of CUDA (#95) 2019-08-05 11:27:05 +10:00
svd.cpp Merge master to upstream (#7945) 2019-06-27 18:37:04 +03:00
toggle_bits.cpp [WIP] multi-device support (#80) 2019-08-14 16:52:34 +03:00
top_k.cpp Merge master to upstream (#7945) 2019-06-27 18:37:04 +03:00
transforms.cpp [WIP] repeat op (#143) 2019-08-21 21:10:29 +03:00
weights.cpp Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
zeta.cpp Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00

README.md

This folder contains the OpenMP implementations of the operation helpers. They are primarily suited to homogeneous x86-like platforms.
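
As a rough illustration of what an OpenMP-based helper can look like, here is a minimal sketch. It is not taken from this folder: the function name `addBiasSketch`, its signature, and the row-major layout are assumptions made for the example, and the real helpers operate on `NDArray` rather than `std::vector`.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical helper, not part of libnd4j: adds a per-column bias to a
// row-major [rows x cols] buffer. Name and signature are illustrative only.
void addBiasSketch(std::vector<float>& data, const std::vector<float>& bias,
                   std::size_t rows) {
    const std::size_t cols = bias.size();

    // Rows are independent of each other, so they can be split across
    // OpenMP threads. Without -fopenmp the pragma is ignored and the loop
    // simply runs serially, producing the same result.
    #pragma omp parallel for schedule(static)
    for (long long r = 0; r < static_cast<long long>(rows); ++r) {
        float* row = data.data() + static_cast<std::size_t>(r) * cols;
        for (std::size_t c = 0; c < cols; ++c)
            row[c] += bias[c];
    }
}
```

A static schedule hands each thread an equal share of the rows, which is a reasonable default when all cores are equally fast, as on the homogeneous platforms mentioned above.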