cavis/libnd4j/blas
Yurii Shyrma f7a9190407
profiling of concat op (both cuda and cpu) (#151)
* - profiling of concat op (both cuda and cpu)

Signed-off-by: Yurii <iuriish@yahoo.com>

* better comparison for large concat

Signed-off-by: raver119 <raver119@gmail.com>

* - further improving of concat op

Signed-off-by: Yurii <iuriish@yahoo.com>

* some logging

Signed-off-by: raver119 <raver119@gmail.com>

* - add the ability to check for trailing unities (dimensions of size 1) in a shape and set strides/ews accordingly
- restrict second simple case in concat op to c order only

Signed-off-by: Yurii <iuriish@yahoo.com>

* - move concat op to specials_single.cpp file

Signed-off-by: Yurii <iuriish@yahoo.com>

* - get rid of second concat op declaration in transforms.cpp file

Signed-off-by: Yurii <iuriish@yahoo.com>

Co-authored-by: raver119 <raver119@gmail.com>
2020-02-20 21:19:01 +03:00
cpu Minor improvements (#255) 2020-02-20 11:43:26 +03:00
cuda OpContext handling (#214) 2020-02-05 07:27:24 +03:00
BlasVersionHelper.h [WIP] Last set of changes (#234) 2019-09-04 14:41:08 +03:00
CMakeLists.txt C++ NPY (#233) 2020-02-12 12:38:10 +03:00
Environment.cpp [WIP] Memory limits (#167) 2020-01-24 10:11:09 +03:00
Environment.h [WIP] Memory limits (#167) 2020-01-24 10:11:09 +03:00
GraphExecutioner.h Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
NDArray.h Oleh tenzor mmul (#231) 2020-02-13 20:33:54 +03:00
NDArray.hpp profiling of concat op (both cuda and cpu) (#151) 2020-02-20 21:19:01 +03:00
NDArrayFactory.h C++ NPY (#233) 2020-02-12 12:38:10 +03:00
NativeOpExecutioner.h Few fixes (#66) 2019-11-21 15:43:03 +03:00
NativeOps.h OpContext handling (#214) 2020-02-05 07:27:24 +03:00