cavis/libnd4j/include/ops/declarable
Yurii Shyrma 66b84b38cf
Shyrma mmul (#58)
* - get rid of some copy procedures in mmulHelper ops

Signed-off-by: Yurii <iuriish@yahoo.com>

* - further work on embedding cuda api for batched gemm (cublasGemmBatchedEx) in our mmulHelper class

Signed-off-by: Yurii <iuriish@yahoo.com>

* - further work on cuda batched gemm api

Signed-off-by: Yurii <iuriish@yahoo.com>

* - write own cuda kernel performing batched gemm

Signed-off-by: Yurii <iuriish@yahoo.com>

* missing include in MmulHelper

Signed-off-by: raver119 <raver119@gmail.com>

* - forgot to keep the previous correct kernels for mmulNxN in the code, since the new one may fail for some reason in the future

Signed-off-by: Yurii <iuriish@yahoo.com>

* disable old tensordot

Signed-off-by: raver119 <raver119@gmail.com>

* - rewrite cuda kernels for usualGemm and usualGemv

Signed-off-by: Yurii <iuriish@yahoo.com>

* - profiling mmul helpers

Signed-off-by: Yurii <iuriish@yahoo.com>

* - prints to check shapes were added

Signed-off-by: Yurii <iuriish@yahoo.com>

* - correct type of output array C in mmulNxN

Signed-off-by: Yurii <iuriish@yahoo.com>

* - take into account possible nans in C array

Signed-off-by: Yurii <iuriish@yahoo.com>

* slightly change numThreads message

Signed-off-by: raver119 <raver119@gmail.com>

* - make corrections in accordance with the notes given in PR review

Signed-off-by: Yurii <iuriish@yahoo.com>
2019-11-19 15:39:36 +02:00
..
generic SameDiff TF import (#49) 2019-11-19 22:44:29 +11:00
headers SameDiff TF import (#49) 2019-11-19 22:44:29 +11:00
helpers Shyrma mmul (#58) 2019-11-19 15:39:36 +02:00
impl fake quant dtype validation fix (#60) 2019-11-19 12:53:52 +03:00
platform - make agreement between our and mkl api dilation/padding formulas (#47) 2019-11-14 20:21:22 +03:00
BooleanOp.h [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
BroadcastableOp.h [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
CustomOperations.h Merge pull request #7 from KonduitAI/asto_nd4s_10172019 2019-10-23 12:11:25 +03:00
DeclarableCustomOp.h [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
DeclarableListOp.h [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
DeclarableOp.h [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
DeclarableReductionOp.h [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
LegacyBroadcastBoolOp.h Platform helpers (#8216) 2019-09-11 21:50:28 +03:00
LegacyBroadcastOp.h Platform helpers (#8216) 2019-09-11 21:50:28 +03:00
LegacyIndexReduceOp.h Platform helpers (#8216) 2019-09-11 21:50:28 +03:00
LegacyOp.h [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
LegacyPairwiseTransformBoolOp.h Platform helpers (#8216) 2019-09-11 21:50:28 +03:00
LegacyPairwiseTransformOp.h Platform helpers (#8216) 2019-09-11 21:50:28 +03:00
LegacyRandomOp.h Platform helpers (#8216) 2019-09-11 21:50:28 +03:00
LegacyReduce3Op.h Platform helpers (#8216) 2019-09-11 21:50:28 +03:00
LegacyReduceBoolOp.h Platform helpers (#8216) 2019-09-11 21:50:28 +03:00
LegacyReduceFloatOp.h Platform helpers (#8216) 2019-09-11 21:50:28 +03:00
LegacyReduceLongOp.h Platform helpers (#8216) 2019-09-11 21:50:28 +03:00
LegacyReduceOp.h Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
LegacyReduceSameOp.h Platform helpers (#8216) 2019-09-11 21:50:28 +03:00
LegacyScalarBoolOp.h Platform helpers (#8216) 2019-09-11 21:50:28 +03:00
LegacyScalarOp.h Platform helpers (#8216) 2019-09-11 21:50:28 +03:00
LegacyStatsOp.h Platform helpers (#8216) 2019-09-11 21:50:28 +03:00
LegacyTransformAnyOp.h Platform helpers (#8216) 2019-09-11 21:50:28 +03:00
LegacyTransformBoolOp.h Platform helpers (#8216) 2019-09-11 21:50:28 +03:00
LegacyTransformFloatOp.h Platform helpers (#8216) 2019-09-11 21:50:28 +03:00
LegacyTransformOp.h Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
LegacyTransformSameOp.h Platform helpers (#8216) 2019-09-11 21:50:28 +03:00
LegacyTransformStrictOp.h Platform helpers (#8216) 2019-09-11 21:50:28 +03:00
LogicOp.h [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
OpDescriptor.h [WIP] size etc (#155) 2019-08-23 12:31:12 +03:00
OpRegistrator.h Platform helpers (#8216) 2019-09-11 21:50:28 +03:00
OpTuple.h [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
PlatformHelper.h Platform helpers (#8216) 2019-09-11 21:50:28 +03:00