a90c7dd995
* mmul op instead of cublasSgemm Signed-off-by: raver119 <raver119@gmail.com> * transB Signed-off-by: raver119 <raver119@gmail.com> * jcpp handles Signed-off-by: raver119 <raver119@gmail.com> * bitwise and/or/xor Signed-off-by: raver119 <raver119@gmail.com> * bitwise and/or/xor mapping Signed-off-by: raver119 <raver119@gmail.com> * cuda/cublas version check Signed-off-by: raver119 <raver119@gmail.com> * add expected version Signed-off-by: raver119 <raver119@gmail.com> * cuda/cublas version check in java Signed-off-by: raver119 <raver119@gmail.com> * one more error check Signed-off-by: raver119 <raver119@gmail.com> * build fix Signed-off-by: raver119 <raver119@gmail.com> * build fix Signed-off-by: raver119 <raver119@gmail.com> * build fix Signed-off-by: raver119 <raver119@gmail.com> * one more fix Signed-off-by: raver119 <raver119@gmail.com> * skip CUDA version check for now Signed-off-by: raver119 <raver119@gmail.com> * better wording Signed-off-by: raver119 <raver119@gmail.com> * few more tweaks Signed-off-by: raver119 <raver119@gmail.com> * few more tweaks Signed-off-by: raver119 <raver119@gmail.com> |
||
---|---|---|
.. | ||
GraphExecutioner.cpp | ||
NDArray.cpp | ||
NDArray.macro | ||
NDArrayFactory.cpp | ||
NDArrayLambda.hpp | ||
NativeOpExecutioner.cpp | ||
NativeOps.cpp |