cavis/libnd4j/include/loops
raver119 0613485654
compression ops (#436)
* Added declarations for decode/encode_bitmap ops.

Signed-off-by: shugeo <sgazeos@gmail.com>

* Added implementation for bitmap encoding/decoding ops.

Signed-off-by: shugeo <sgazeos@gmail.com>

* Added helpers for encode/decode bitmap ops.

Signed-off-by: shugeo <sgazeos@gmail.com>

* Refactored encodingBitmap helper.

Signed-off-by: shugeo <sgazeos@gmail.com>

* threshold encode/decode skeleton

* helper skeleton

* minor import fix

* encoder shape fn & op impl

* thresholdEncode cpu impl

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* thresholdDecode cpu impl

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* Only cosmetical changes.

Signed-off-by: shugeo <sgazeos@gmail.com>

* placeholder

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* Added cuda implementation for bitmap decode helper.

Signed-off-by: shugeo <sgazeos@gmail.com>

* cuda thresholdEstimate

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* cuda thresholdDecode

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* next step

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* - nano cmakelist update (get rid of Clion section)
- fixed forgotten throw in AtomicTests

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* thesholdEncode cuda impl

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* Added tests for bitmap encoding/decoding ops.

Signed-off-by: shugeo <sgazeos@gmail.com>

* Fixed tests for encode/decode bitmaps.

Signed-off-by: shugeo <sgazeos@gmail.com>

* Refactored decode/encode helpers.

Signed-off-by: shugeo <sgazeos@gmail.com>

* Fixed crashes with bitmap decode/encode helpers.

Signed-off-by: shugeo <sgazeos@gmail.com>

* bitmap encode/decode CPU

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* bitmap encode/decode CUDA

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* C API removed for threshold/bitmap encode

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* EncodeBitmap/DecodeBitmap Java side

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* EncodeThreshold/DecodeThreshold Java side

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* EncodeThreshold/DecodeThreshold Java side

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* few more tests for threshold encoding

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* minor test tweak

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* two special tests

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* encodeBitmap CPU fix

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* parallel_long/parallel_double proper spans fix

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* encodeThreshold CUDA fix

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* nano fix

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* grid tweaks

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* RTX adaptation for thresholdEncode

Signed-off-by: raver119 <raver119@gmail.com>

* don't allow threshold encoding for length < 2

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* get rid of NDArrayCompressor in EncodingHandler

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* one more minor update of EncodingHandler

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* one more minor tweak of EncodingHandler

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* - matmul allows integer data types use
- EncodingHandler boundary default value
- few tests for integer matmul

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* minor fix of CUDA bitmap encode

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* boundary changed to integer everywhere

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* boundary changed to integer everywhere

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* re-enable CUDA deallocator

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* threshold encoder fix for systems without omp

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* - encode_threshold now requires non-negative boundary
- minor tweak in EncodingHandler

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* restore parallelism in decode_bitmap

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* fall back to omp for encode_bitmap cpu

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* single time casts

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

* - additional test for encode_threshold
- sync buffers to device before calling for shape function

Signed-off-by: raver119@gmail.com <raver119@gmail.com>

Co-authored-by: shugeo <sgazeos@gmail.com>
2020-05-08 20:59:39 +03:00
..
cpu correct output empty shapes deducing in split op (#311) 2020-03-12 18:25:54 +03:00
cuda compression ops (#436) 2020-05-08 20:59:39 +03:00
impl compression ops (#436) 2020-05-08 20:59:39 +03:00
BroadcastPairwiseConverter.h libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
BroadcastScalarConverter.h libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
ReduceType.h Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
broadcasting.h Shyrma broadcast (#302) 2020-03-10 16:29:09 +03:00
broadcasting_bool.h Shyrma broadcast (#302) 2020-03-10 16:29:09 +03:00
broadcasting_int.h Shyrma broadcast (#302) 2020-03-10 16:29:09 +03:00
grid_shaped.legacy Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
grid_strided.legacy Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
indexreduce.h libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
legacy_ops.h Shyrma lstm layer bp (#370) 2020-04-13 13:21:51 +03:00
pairwise_bool.h libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
pairwise_int.h libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
pairwise_transform.h libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
random.h [WIP] Weekly update of repo (#8390) 2019-11-13 17:15:18 +03:00
reduce3.h libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
reduce_bool.h libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
reduce_float.h libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
reduce_long.h libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
reduce_same.h libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
scalar.h libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
scalar_bool.h libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
scalar_int.h libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
special_kernels.h libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
summarystatsreduce.h libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
transform_any.h libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
transform_bool.h libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
transform_float.h libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
transform_same.h libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
transform_strict.h libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
type_conversions.h compression ops (#436) 2020-05-08 20:59:39 +03:00