cavis/libnd4j/include/ops/declarable/helpers/cpu
Yurii Shyrma 78934c17ad
profiling of stack and unstack ops (#261)
* - profiling of stack and unstack ops

Signed-off-by: Yurii <iuriish@yahoo.com>

* - fix bug in cpu concat op

Signed-off-by: Yurii <iuriish@yahoo.com>

* - correction of cuda stack and unstack

Signed-off-by: Yurii <iuriish@yahoo.com>

* - change shape.h method which operates with unity dimensions strides

Signed-off-by: Yurii <iuriish@yahoo.com>

* - rearrange stack tests

Signed-off-by: Yurii <iuriish@yahoo.com>

* - correct evaluation of smallest stride for moving through contiguous axis

Signed-off-by: Yurii <iuriish@yahoo.com>

* - forgot to update signature of function strideOverContigAxis in cuda concat and split ops

Signed-off-by: Yurii <iuriish@yahoo.com>

* - remove ShapeUtils::shapeAsString method applied before input arrays validations

Signed-off-by: Yurii <iuriish@yahoo.com>

* -  further removing of ShapeUtils::shapeAsString

Signed-off-by: Yurii <iuriish@yahoo.com>

* - take sub-array shapeInfo/offset calculation out of NDArray class
- add possibility of contiguous memory copy in execTransformAny op if opNum == assign

Signed-off-by: Yurii <iuriish@yahoo.com>

* - correct test_empty_scatter_2 in EmptyTests.cpp

Signed-off-by: Yurii <iuriish@yahoo.com>

* - profiling of slice op

Signed-off-by: Yurii <iuriish@yahoo.com>

* - get rid of contiguous memcpy for some cases in concat and split ops

Signed-off-by: Yurii <iuriish@yahoo.com>

* - forgot to declare void nd4j::SpecialMethods<T>::splitCpuGeneric

Signed-off-by: Yurii <iuriish@yahoo.com>

* - correct typo in calculation of threads in cuda split op

Signed-off-by: Yurii <iuriish@yahoo.com>

* - forgot to correct another set of threads variables in split cuda ops

Signed-off-by: Yurii <iuriish@yahoo.com>

* - further conflicts resolving

Signed-off-by: Yurii <iuriish@yahoo.com>

Co-authored-by: raver119 <raver119@gmail.com>
2020-03-03 07:32:37 +03:00
compilation_units libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
BarnesHutTsne.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
README.md Merge master to upstream (#7945) 2019-06-27 18:37:04 +03:00
activations.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
addBias.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
adjust_hue.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
adjust_saturation.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
axis.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
batched_gemm.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
batchnorm.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
betaInc.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
col2im.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
compare_elem.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
concat.cpp profiling of stack and unstack ops (#261) 2020-03-03 07:32:37 +03:00
confusion.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
convolutions.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
crop_and_resize.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
crop_and_resize.hpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
cross.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
d_t_s.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
diGamma.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
diag.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
dilation2d.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
dropout.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
dynamic.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
extract_patches.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
fake_quantization.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
flatten.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
gather.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
gradient.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
gru.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
hamming.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
hashcode.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
histogram.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
histogramFixedWidth.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
im2col.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
image_draw_bounding_boxes.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
image_resize.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
image_suppression.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
imagesHelpers.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
ismax.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
legacy_helper.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
lgamma.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
lrn.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
lstm.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
lstsq.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
lup.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
matrixSetDiag.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
matrix_band.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
matrix_diag_part.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
max_pooling.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
meshgrid.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
minimax.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
nth_element.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
one_hot.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
percentile.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
polyGamma.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
prefix.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
print_variable.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
qr.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
random.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
random_crop.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
range.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
reverse.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
roll.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
s_t_b.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
s_t_d.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
scatter.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
segment.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
sequence_mask.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
sg_cb.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
shift.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
solve.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
split.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
sru.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
stack.cpp profiling of stack and unstack ops (#261) 2020-03-03 07:32:37 +03:00
svd.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
toggle_bits.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
top_k.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
transforms.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
triangular_solve.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
weights.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
zeta.cpp libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00

README.md

This folder contains OpenMP implementations of operation helpers. They are primarily suited to homogeneous x86-like platforms.
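
To illustrate what an OpenMP-based helper can look like, here is a minimal sketch. It is not taken from this folder: the function name `addBiasSketch`, its signature, and the flat row-major buffer layout are all assumptions made for the example, and the actual helpers here operate on the library's own array types.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical helper, not part of libnd4j: adds bias[c] to every row of a
// row-major [rows x cols] buffer. Rows are independent, so the outer loop can
// be split across threads. The pragma is ignored (plain serial loop) when the
// compiler is run without OpenMP support, e.g. without -fopenmp.
void addBiasSketch(std::vector<float>& data, const std::vector<float>& bias,
                   std::size_t rows, std::size_t cols) {
    const long long numRows = static_cast<long long>(rows);

    #pragma omp parallel for
    for (long long r = 0; r < numRows; ++r) {
        const std::size_t rowStart = static_cast<std::size_t>(r) * cols;
        for (std::size_t c = 0; c < cols; ++c) {
            data[rowStart + c] += bias[c];
        }
    }
}
```

Each thread writes to a disjoint set of rows, so no synchronization is needed beyond the implicit barrier at the end of the parallel loop. A signed loop index is used because older OpenMP versions require it.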