Yurii Shyrma 66b84b38cf
Shyrma mmul (#58)
* - get rid of some copy procedures in mmulHelper ops

Signed-off-by: Yurii <iuriish@yahoo.com>

* - further work on embedding cuda api for batched gemm (cublasGemmBatchedEx) in our mmulHelper class

Signed-off-by: Yurii <iuriish@yahoo.com>

* - further work on cuda batched gamm api

Signed-off-by: Yurii <iuriish@yahoo.com>

* - write own cuda kernel performing batched gemm

Signed-off-by: Yurii <iuriish@yahoo.com>

* missing include in MmulHelper

Signed-off-by: raver119 <raver119@gmail.com>

* - forgot to keep in code previous correct kernels for mmulNxN, since it may happen that new onw will fail for some reason in future

Signed-off-by: Yurii <iuriish@yahoo.com>

* disable old tensordot

Signed-off-by: raver119 <raver119@gmail.com>

* - rewrite cuda kernels for usualGemm and usualGemv

Signed-off-by: Yurii <iuriish@yahoo.com>

* - profiling mmul helpers

Signed-off-by: Yurii <iuriish@yahoo.com>

* - prints to check shapes were added

Signed-off-by: Yurii <iuriish@yahoo.com>

* - correct type of output array Cin mmulNxN

Signed-off-by: Yurii <iuriish@yahoo.com>

* - take into account possible nans in C array

Signed-off-by: Yurii <iuriish@yahoo.com>

* slightly change numThreads message

Signed-off-by: raver119 <raver119@gmail.com>

* - make corrections in accordance to given notes in pr review

Signed-off-by: Yurii <iuriish@yahoo.com>
2019-11-19 15:39:36 +02:00
..
2019-11-19 22:44:29 +11:00
2019-11-19 15:39:36 +02:00
2019-11-13 17:04:59 +03:00
2019-06-06 15:21:15 +03:00
2019-09-11 20:12:09 +03:00
2019-06-06 15:21:15 +03:00
2019-06-06 15:21:15 +03:00
2019-06-27 18:37:04 +03:00
2019-06-06 15:21:15 +03:00
2019-06-06 15:21:15 +03:00
2019-06-06 15:21:15 +03:00
2019-11-13 17:04:59 +03:00
2019-06-06 15:21:15 +03:00
2019-06-06 15:21:15 +03:00
2019-07-20 23:05:21 +10:00
2019-06-06 15:21:15 +03:00
2019-06-06 15:21:15 +03:00
2019-09-11 20:12:09 +03:00
2019-06-06 15:21:15 +03:00
2019-08-05 11:27:05 +10:00
2019-07-20 23:20:41 +10:00
2019-08-28 18:20:44 +03:00
2019-07-20 23:20:41 +10:00
2019-11-13 17:04:59 +03:00
2019-06-06 15:21:15 +03:00
2019-11-19 22:44:29 +11:00
2019-06-06 15:21:15 +03:00
2019-06-06 15:21:15 +03:00
2019-06-27 18:37:04 +03:00
2019-08-05 11:27:05 +10:00
2019-06-06 15:21:15 +03:00
2019-07-20 23:05:21 +10:00
2019-11-13 17:04:59 +03:00
2019-06-06 15:21:15 +03:00
2019-09-02 16:25:58 +03:00
2019-06-06 15:21:15 +03:00
2019-06-06 15:21:15 +03:00
2019-06-06 15:21:15 +03:00
2019-06-06 15:21:15 +03:00
2019-06-27 18:37:04 +03:00
2019-06-06 15:21:15 +03:00
2019-06-06 15:21:15 +03:00
2019-06-06 15:21:15 +03:00
2019-06-27 18:37:04 +03:00
2019-06-06 15:21:15 +03:00
2019-11-06 12:49:27 +02:00
2019-06-06 15:21:15 +03:00
2019-06-06 15:21:15 +03:00
2019-06-06 15:21:15 +03:00
2019-08-17 14:15:08 +03:00
2019-08-21 21:11:46 +03:00
2019-06-06 15:21:15 +03:00
2019-07-20 23:05:21 +10:00
2019-06-06 15:21:15 +03:00
2019-06-06 15:21:15 +03:00
2019-08-15 20:35:15 +03:00
2019-06-06 15:21:15 +03:00
2019-08-05 11:27:05 +10:00
2019-07-20 23:05:21 +10:00
2019-06-06 15:21:15 +03:00
2019-06-27 18:37:04 +03:00
2019-08-21 21:10:29 +03:00
2019-06-06 15:21:15 +03:00
2019-06-06 15:21:15 +03:00
2019-06-06 15:21:15 +03:00
2019-06-06 15:21:15 +03:00