cavis/libnd4j/include
Oleh 0748c7e7c2
Oleh broadcast4d (#257)
* libnd4j raw implementation of native broadcast for special cases

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* libnd4j fixed bugs in the special case of 4D loop broadcast, added some tests; needs more testing and discussion

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* libnd4j added support and tests for the 3D and 5D cases; needs testing with different orders

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* libnd4j corrected case selection for 3D, 4D and 5D broadcast loops, fixed several places for more stable behavior, cleanup

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* libnd4j minor corrections to avoid some risks in stride selection, added tests and renamed some variables

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* libnd4j optimized stride selection for all loops by moving it into a separate ShapeUtils method, copyCertainStridesFromShapeInfo; merged master

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* libnd4j removed several tests for 3D, 4D and 5D broadcast loops per request

Signed-off-by: Oleg <oleg.semeniv@gmail.com>

* libnd4j removed some local changes that had not been synced with the server playground, turned on usage of the new loops
2020-02-21 07:46:05 +03:00
..
array Minor improvements (#255) 2020-02-20 11:43:26 +03:00
cnpy [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
exceptions [WIP] Memory limits (#167) 2020-01-24 10:11:09 +03:00
execution Initial performance improvement for Bias Add and etc #8556 (#217) 2020-02-08 15:31:30 +03:00
graph Perf improvements (#242) 2020-02-14 16:20:31 +03:00
helpers Oleh broadcast4d (#257) 2020-02-21 07:46:05 +03:00
indexing Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
loops Oleh broadcast4d (#257) 2020-02-21 07:46:05 +03:00
memory [WIP] Memory limits (#167) 2020-01-24 10:11:09 +03:00
ops profiling of concat op (both cuda and cpu) (#151) 2020-02-20 21:19:01 +03:00
performance/benchmarking Shyrma temp (#131) 2019-12-20 22:35:39 +03:00
types More compilation units (#246) 2020-02-17 10:23:05 +03:00
Status.h Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
buffer.h [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
cblas.h Platform helpers (#8216) 2019-09-11 21:50:28 +03:00
cblas_enum_conversion.h Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
config.h.in cuDNN integration (#150) 2020-01-20 21:32:46 +03:00
dll.h [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
enum_boilerplate.h Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
msvc.h [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
nd4jmalloc.h Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
nd4jmemset.h Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
op_boilerplate.h DataTypes & FlatBuffers (#197) 2020-01-30 10:07:24 +03:00
op_enums.h [WIP] Int broadcastables (#195) 2019-08-30 10:12:40 +03:00
openmp_pragmas.h [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
optype.h Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
pairwise_util.h Shugeo random uniform int (#30) 2019-11-06 12:49:27 +02:00
platform_boilerplate.h cuDNN integration (#150) 2020-01-20 21:32:46 +03:00
platformmath.h Added missing bfloat16 (#252) 2020-02-18 17:01:43 +03:00
play.h cuDNN integration (#150) 2020-01-20 21:32:46 +03:00
pointercast.h [WIP] ThreadPool (#8) 2019-11-13 17:04:59 +03:00
templatemath.h Perf improvements (#242) 2020-02-14 16:20:31 +03:00
type_boilerplate.h String changes (#3) 2020-01-04 13:27:50 +03:00
util.h Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00