cavis/libnd4j/blas
raver119 b71c993ded
[WIP] maxpool_bp cuda fix (#212)
* one test for alex

Signed-off-by: raver119 <raver119@gmail.com>

* fix

Signed-off-by: raver119 <raver119@gmail.com>

* get rid of safety offset in cpp

Signed-off-by: raver119 <raver119@gmail.com>

* bfloat16

Signed-off-by: raver119 <raver119@gmail.com>

* minor test rearrangement to fastpath launch

Signed-off-by: raver119 <raver119@gmail.com>

* - atomicAdd/Mul/Div fix for float16/bfloat16 misalignment
- one special test for maxpoolbp java
- safety offset of 8 bytes is back to libnd4j legacy

Signed-off-by: raver119 <raver119@gmail.com>
2019-08-31 20:57:05 +03:00
cpu [WIP] Int broadcastables (#195) 2019-08-30 10:12:40 +03:00
cuda [WIP] maxpool_bp cuda fix (#212) 2019-08-31 20:57:05 +03:00
CMakeLists.txt cmake fix for windows debug build 2019-08-26 08:13:22 +03:00
Environment.cpp [WIP] More of CUDA (#95) 2019-08-05 11:27:05 +10:00
Environment.h Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
GraphExecutioner.h Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
NDArray.h [WIP] Int broadcastables (#195) 2019-08-30 10:12:40 +03:00
NDArray.hpp [WIP] Int broadcastables (#195) 2019-08-30 10:12:40 +03:00
NDArrayFactory.h Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
NativeOpExecutioner.h [WIP] Int broadcastables (#195) 2019-08-30 10:12:40 +03:00
NativeOps.h [WIP] Error handling (#169) 2019-08-26 19:57:51 +03:00