cavis/libnd4j/include/ops
Yurii Shyrma f7a9190407
profiling of concat op (both cuda and cpu) (#151)
* - profiling of concat op (both cuda and cpu)

Signed-off-by: Yurii <iuriish@yahoo.com>

* better comparison for large concat

Signed-off-by: raver119 <raver119@gmail.com>

* - further improving of concat op

Signed-off-by: Yurii <iuriish@yahoo.com>

* some logging

Signed-off-by: raver119 <raver119@gmail.com>

* - add the ability to detect trailing unit dimensions in a shape and set strides/ews accordingly
- restrict second simple case in concat op to c order only

Signed-off-by: Yurii <iuriish@yahoo.com>

* - move concat op to specials_single.cpp file

Signed-off-by: Yurii <iuriish@yahoo.com>

* - get rid of second concat op declaration in transforms.cpp file

Signed-off-by: Yurii <iuriish@yahoo.com>

Co-authored-by: raver119 <raver119@gmail.com>
2020-02-20 21:19:01 +03:00
declarable profiling of concat op (both cuda and cpu) (#151) 2020-02-20 21:19:01 +03:00
impl profiling of concat op (both cuda and cpu) (#151) 2020-02-20 21:19:01 +03:00
BroadcastBoolOpsTuple.h [WIP] CUDA tests (#95) 2019-12-02 21:37:21 +03:00
BroadcastIntOpsTuple.h [WIP] CUDA tests (#95) 2019-12-02 21:37:21 +03:00
BroadcastOpsTuple.h Oleh powderev (#171) 2020-01-20 12:59:12 +03:00
InputType.h Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
gemm.h Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
meta_ops.h Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
ops.h Shyrma adjust (#98) 2019-12-03 09:40:45 +03:00
random_ops.h Gamma and Poisson distributions (#27) 2019-11-04 15:42:28 +02:00
special_random_ops.h Minor improvements (#255) 2020-02-20 11:43:26 +03:00
specials.h [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
specials_cuda.h Merge master to upstream (#7945) 2019-06-27 18:37:04 +03:00
specials_sparse.h Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00