cavis/libnd4j/blas/cuda
raver119 b71c993ded
[WIP] maxpool_bp cuda fix (#212)
* one test for alex

Signed-off-by: raver119 <raver119@gmail.com>

* fix

Signed-off-by: raver119 <raver119@gmail.com>

* get rid of safety offset in cpp

Signed-off-by: raver119 <raver119@gmail.com>

* bfloat16

Signed-off-by: raver119 <raver119@gmail.com>

* minor test rearrangement to fastpath launch

Signed-off-by: raver119 <raver119@gmail.com>

* - atomicAdd/Mul/Div fix for float16/bfloat16 misalignment
  - one special test for maxpoolbp java
  - safety offset of 8 bytes is back to libnd4j legacy

Signed-off-by: raver119 <raver119@gmail.com>
2019-08-31 20:57:05 +03:00
NDArray.cu              [WIP] repeat op (#143)              2019-08-21 21:10:29 +03:00
NDArrayLambda.hpp       [WIP] multi-device support (#80)    2019-08-14 16:52:34 +03:00
NativeOpExecutioner.cu  [WIP] Int broadcastables (#195)     2019-08-30 10:12:40 +03:00
NativeOps.cu            [WIP] maxpool_bp cuda fix (#212)    2019-08-31 20:57:05 +03:00