cavis/nd4j/nd4j-backends/nd4j-backend-impls
Yurii Shyrma f7a9190407
profiling of concat op (both cuda and cpu) (#151)
* - profiling of concat op (both cuda and cpu)

Signed-off-by: Yurii <iuriish@yahoo.com>

* better comparison for large concat

Signed-off-by: raver119 <raver119@gmail.com>

* - further improving of concat op

Signed-off-by: Yurii <iuriish@yahoo.com>

* some loggin

Signed-off-by: raver119 <raver119@gmail.com>

* - add possibility to verify presence of trailing unities in shape and set strides/ews correspondingly
- restrict second simple case in concat op to c order only

Signed-off-by: Yurii <iuriish@yahoo.com>

* - move concat op to specials_single.cpp file

Signed-off-by: Yurii <iuriish@yahoo.com>

* - get rid of second concat op declaration in transforms.cpp file

Signed-off-by: Yurii <iuriish@yahoo.com>

Co-authored-by: raver119 <raver119@gmail.com>
2020-02-20 21:19:01 +03:00
..
nd4j-cuda profiling of concat op (both cuda and cpu) (#151) 2020-02-20 21:19:01 +03:00
nd4j-cuda-platform Add support for CUDA 10.2 (#89) 2019-11-29 16:31:03 +11:00
nd4j-native Perf improvements (#242) 2020-02-14 16:20:31 +03:00
nd4j-native-platform few more mkldnn dependencies removed 2019-09-12 04:55:59 +03:00
pom.xml Fixes (#213) 2020-02-05 17:07:36 +11:00