cavis/libnd4j/include/loops/cuda/specials
Yurii Shyrma 6aaca58506
Shyrma broadcast (#302)
* - profiling TrueBroadcastHelper

Signed-off-by: Yurii <iuriish@yahoo.com>

* - further improving of TrueBroadcastHelper

Signed-off-by: Yurii <iuriish@yahoo.com>

* - further profiling of broadcast op

Signed-off-by: Yurii <iuriish@yahoo.com>

* - implementation of broadcastShapeHelper which inserts unities in shapes of arrays to be broadcasted

Signed-off-by: Yurii <iuriish@yahoo.com>

* - provide additional method in ConstantShapeHelper class for deducing broadcast shapes with unities

Signed-off-by: Yurii <iuriish@yahoo.com>

* - provide new NativeOps helpers for usual and true broadcast methods

Signed-off-by: Yurii <iuriish@yahoo.com>

* enable bert profiler

Signed-off-by: raver119 <raver119@gmail.com>

* - delete unnessesary tests

Signed-off-by: Yurii <iuriish@yahoo.com>

Co-authored-by: raver119 <raver119@gmail.com>
2020-03-10 16:29:09 +03:00
..
accumulateKernel.cu libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
averagingKernel.cu libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
bitonicArbitraryStep.cu [WIP] bunch of improvements (#257) 2019-09-11 20:12:09 +03:00
bitonicSortStep.cu [WIP] bunch of improvements (#257) 2019-09-11 20:12:09 +03:00
concatKernel.cu libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
concatKernelHStack.cu libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
concatKernelScalar.cu libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
concatKernelVStack.cu libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
convertHalfs.cu libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
convertToHalf.cu libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
fillDimensionalIsMax.cu libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
fillIsMax.cu libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
flatten.cu libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
oesTad.cu [WIP] bunch of improvements (#257) 2019-09-11 20:12:09 +03:00
pullRowsKernel.cu libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
setDiagonalKernel.cu libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
shuffleKernel.cu libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
swapUnsafeKernel.cu libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
tearKernel.cu libnd4j polishing (#273) 2020-03-02 12:49:41 +03:00
tileKernel.cu Shyrma broadcast (#302) 2020-03-10 16:29:09 +03:00