cavis/libnd4j/include
Yurii Shyrma 78934c17ad
profiling of stack and unstack ops (#261)
* - profiling of stack and unstack ops

Signed-off-by: Yurii <iuriish@yahoo.com>

* - fix bug in cpu concat op

Signed-off-by: Yurii <iuriish@yahoo.com>

* - correction of cuda stack and unstack

Signed-off-by: Yurii <iuriish@yahoo.com>

* - change the shape.h method that operates on strides of unit dimensions

Signed-off-by: Yurii <iuriish@yahoo.com>

* - rearrange stack tests

Signed-off-by: Yurii <iuriish@yahoo.com>

* - correct evaluation of smallest stride for moving through contiguous axis

Signed-off-by: Yurii <iuriish@yahoo.com>

* - forgot to update signature of function strideOverContigAxis in cuda concat and split ops

Signed-off-by: Yurii <iuriish@yahoo.com>

* - remove ShapeUtils::shapeAsString calls made before input array validation

Signed-off-by: Yurii <iuriish@yahoo.com>

* - further removal of ShapeUtils::shapeAsString

Signed-off-by: Yurii <iuriish@yahoo.com>

* - take sub-array shapeInfo/offset calculation out of NDArray class
- add possibility of contiguous memory copy in execTransformAny op if opNum == assign

Signed-off-by: Yurii <iuriish@yahoo.com>

* - correct test_empty_scatter_2 in EmptyTests.cpp

Signed-off-by: Yurii <iuriish@yahoo.com>

* - profiling of slice op

Signed-off-by: Yurii <iuriish@yahoo.com>

* - get rid of contiguous memcpy for some cases in concat and split ops

Signed-off-by: Yurii <iuriish@yahoo.com>

* - forgot to declare void nd4j::SpecialMethods<T>::splitCpuGeneric

Signed-off-by: Yurii <iuriish@yahoo.com>

* - correct typo in calculation of threads in cuda split op

Signed-off-by: Yurii <iuriish@yahoo.com>

* - forgot to correct another set of threads variables in split cuda ops

Signed-off-by: Yurii <iuriish@yahoo.com>

* - further conflict resolution

Signed-off-by: Yurii <iuriish@yahoo.com>

Co-authored-by: raver119 <raver119@gmail.com>
2020-03-03 07:32:37 +03:00
..
array profiling of stack and unstack ops (#261) 2020-03-03 07:32:37 +03:00
cnpy libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
exceptions libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
execution libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
graph full bert graph (#282) 2020-03-02 18:14:32 +03:00
helpers profiling of stack and unstack ops (#261) 2020-03-03 07:32:37 +03:00
indexing libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
legacy profiling of stack and unstack ops (#261) 2020-03-03 07:32:37 +03:00
loops libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
math libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
memory libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
ops profiling of stack and unstack ops (#261) 2020-03-03 07:32:37 +03:00
performance/benchmarking libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
system libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
types libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
cblas.h libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
cblas_enum_conversion.h libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
config.h.in cuDNN integration (#150) 2020-01-20 21:32:46 +03:00
samediff.h libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00