raver119
|
9cf28ea6c9
|
[WIP] CUDA tweaks (#60)
* special cpu concat
Signed-off-by: raver119 <raver119@gmail.com>
* special concat fix
Signed-off-by: raver119 <raver119@gmail.com>
* OpProfiler tweak for absent host pointers
Signed-off-by: raver119 <raver119@gmail.com>
* minor test tweak to see orders
Signed-off-by: raver119 <raver119@gmail.com>
* CUDA broadcasting diff orders fix
Signed-off-by: raver119 <raver119@gmail.com>
* faster iterations
Signed-off-by: raver119 <raver119@gmail.com>
* OldSoftMax/OldLogSoftMax gone
Signed-off-by: raver119 <raver119@gmail.com>
* RandomLauncher tweaks
Signed-off-by: raver119 <raver119@gmail.com>
* additional check in randomtests
Signed-off-by: raver119 <raver119@gmail.com>
* skip prepare/register action for empty arrays
Signed-off-by: raver119 <raver119@gmail.com>
* npz float16 fix
Signed-off-by: raver119 <raver119@gmail.com>
* empty reduction cuda fixes
Signed-off-by: raver119 <raver119@gmail.com>
* ShapeBufferTests tweaks
Signed-off-by: raver119 <raver119@gmail.com>
|
2019-07-20 23:06:48 +10:00 |