cavis/libnd4j/include/ops/declarable/helpers/cuda
raver119 f03b0ee78f
[WIP] more fixes (#159)
* Added test for MatrixInverse with double input. Fixed matrixDeterminantKernel.

* Fixed kernels to avoid waste templating.

* Fixed logDeterminant kernel.

* Refactored type check for lup'

* - decrease blockDim value for zeta op

Signed-off-by: Yurii <yurii@skymind.io>

* Added print for compound matrix with CUDA.

* Refactored upper matrix invertion kernels.

* - provide move constructor and move assignment operator for OpArgsHoder class

Signed-off-by: Yurii <yurii@skymind.io>

* Refactored usage of launch context.

* - add test for mergemax

Signed-off-by: Yurii <yurii@skymind.io>

* get rid of AveragingArrayProxy

Signed-off-by: raver119 <raver119@gmail.com>

* Refactoring of LUP inversion.

* Added prints for invertion.

* - add OpArgsHolder copy constructor and assignment operator

Signed-off-by: Yurii <yurii@skymind.io>

* Added test for lower inversion

* - fix bug in upsampling2d/3d_bp op

Signed-off-by: Yurii <yurii@skymind.io>

* Added expensive printfs to kernel.

* Refactored expensive kernel prints.

* Refactored expensive printfs

* - remove nullify

Signed-off-by: Yurii <yurii@skymind.io>

* Eliminated waste prints with tests.

* upsampling2d_bp test

Signed-off-by: raver119 <raver119@gmail.com>

* test updated

Signed-off-by: raver119 <raver119@gmail.com>
2019-08-23 19:20:50 +03:00
..
legacy [WIP] multi-device support (#80) 2019-08-14 16:52:34 +03:00
BarnesHutTsne.cu [WIP] More of CUDA (#95) 2019-08-05 11:27:05 +10:00
README.md Merge master to upstream (#7945) 2019-06-27 18:37:04 +03:00
activations.cu syncthreads (#136) 2019-08-20 18:28:43 +03:00
adjust_hue.cu syncthreads (#136) 2019-08-20 18:28:43 +03:00
adjust_saturation.cu syncthreads (#136) 2019-08-20 18:28:43 +03:00
axis.cu [WIP] Roll rewritten (#128) 2019-08-17 14:15:08 +03:00
batched_gemm.cu Merge master to upstream (#7945) 2019-06-27 18:37:04 +03:00
batchnorm.cu syncthreads (#136) 2019-08-20 18:28:43 +03:00
bds.cu [WIP] multi-device support (#80) 2019-08-14 16:52:34 +03:00
betaInc.cu Merge master to upstream (#7945) 2019-06-27 18:37:04 +03:00
col2im.cppc Fix OpenMP by replacing pragmas with macros (#8026) 2019-07-18 20:13:56 +09:00
col2im.cu [WIP] stb/bts nd (#144) 2019-08-21 21:11:46 +03:00
compare_elem.cu Merge master to upstream (#7945) 2019-06-27 18:37:04 +03:00
concat.cu syncthreads (#136) 2019-08-20 18:28:43 +03:00
confusion.cu Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
convolutions.cu [WIP] more fixes (#159) 2019-08-23 19:20:50 +03:00
cross.cu syncthreads (#136) 2019-08-20 18:28:43 +03:00
d_t_s.cu Merge master to upstream (#7945) 2019-06-27 18:37:04 +03:00
diag.cu [WIP] multi-device support (#80) 2019-08-14 16:52:34 +03:00
dilation2d.cu syncthreads (#136) 2019-08-20 18:28:43 +03:00
dropout.cu [WIP] cross-device migrations (#134) 2019-08-20 18:52:41 +03:00
dynamic.cu [WIP] multi-device support (#80) 2019-08-14 16:52:34 +03:00
extract_patches.cu Shugeo cuda cuda (#105) 2019-08-07 15:29:17 +03:00
fake_quantization.cu Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
flatten.cu [WIP] More of CUDA (#63) 2019-07-20 23:15:14 +10:00
gather.cu syncthreads (#136) 2019-08-20 18:28:43 +03:00
gather_nd.cu syncthreads (#136) 2019-08-20 18:28:43 +03:00
gradient.cu build fix 2019-08-05 11:27:05 +10:00
gru.cu [WIP] More of CUDA operations (#69) 2019-07-20 23:20:41 +10:00
hashcode.cu syncthreads (#136) 2019-08-20 18:28:43 +03:00
histogram.cu Shugeo cuda tests (#116) 2019-08-15 13:54:47 +03:00
histogramFixedWidth.cu Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
im2col.cppc Fix OpenMP by replacing pragmas with macros (#8026) 2019-07-18 20:13:56 +09:00
im2col.cu syncthreads (#136) 2019-08-20 18:28:43 +03:00
image_resize.cu Merge master to upstream (#7945) 2019-06-27 18:37:04 +03:00
image_suppression.cu syncthreads (#136) 2019-08-20 18:28:43 +03:00
ismax.cu [WIP] Minor fixes (#140) 2019-08-21 15:05:47 +03:00
legacy_helper.cu [WIP] multi-device support (#80) 2019-08-14 16:52:34 +03:00
lrn.cu syncthreads (#136) 2019-08-20 18:28:43 +03:00
lstm.cu [WIP] More of CUDA (#95) 2019-08-05 11:27:05 +10:00
lup.cu [WIP] more fixes (#159) 2019-08-23 19:20:50 +03:00
matrixSetDiag.cu syncthreads (#136) 2019-08-20 18:28:43 +03:00
matrix_band.cu Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
matrix_diag.cu Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
matrix_diag_part.cu Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
max_pooling.cu [WIP] multi-device support (#80) 2019-08-14 16:52:34 +03:00
maximum.cu [WIP] multi-device support (#80) 2019-08-14 16:52:34 +03:00
merge.cu Shugeo cuda tests (#116) 2019-08-15 13:54:47 +03:00
meshgrid.cu [WIP] multi-device support (#80) 2019-08-14 16:52:34 +03:00
minimum.cu [WIP] multi-device support (#80) 2019-08-14 16:52:34 +03:00
nth_element.cu [WIP] multi-device support (#80) 2019-08-14 16:52:34 +03:00
one_hot.cu syncthreads (#136) 2019-08-20 18:28:43 +03:00
pad.cu syncthreads (#136) 2019-08-20 18:28:43 +03:00
percentile.cu syncthreads (#136) 2019-08-20 18:28:43 +03:00
polyGamma.cu syncthreads (#136) 2019-08-20 18:28:43 +03:00
prefix.cu syncthreads (#136) 2019-08-20 18:28:43 +03:00
random_crop.cu Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
range.cu [WIP] multi-device support (#80) 2019-08-14 16:52:34 +03:00
reverse.cu [WIP] More of CUDA operations (#69) 2019-07-20 23:20:41 +10:00
roll.cu [WIP] Roll rewritten (#128) 2019-08-17 14:15:08 +03:00
s_t_b.cu [WIP] stb/bts nd (#144) 2019-08-21 21:11:46 +03:00
s_t_d.cu Merge master to upstream (#7945) 2019-06-27 18:37:04 +03:00
scatter.cu [WIP] Minor fixes (#140) 2019-08-21 15:05:47 +03:00
scatter_simple.cu [WIP] multi-device support (#80) 2019-08-14 16:52:34 +03:00
scatter_update.cu syncthreads (#136) 2019-08-20 18:28:43 +03:00
segment.cu [WIP] multi-device support (#80) 2019-08-14 16:52:34 +03:00
segment_max.cu syncthreads (#136) 2019-08-20 18:28:43 +03:00
segment_mean.cu syncthreads (#136) 2019-08-20 18:28:43 +03:00
segment_min.cu syncthreads (#136) 2019-08-20 18:28:43 +03:00
segment_prod.cu syncthreads (#136) 2019-08-20 18:28:43 +03:00
segment_sqrtn.cu syncthreads (#136) 2019-08-20 18:28:43 +03:00
segment_sum.cu syncthreads (#136) 2019-08-20 18:28:43 +03:00
sequence_mask.cu syncthreads (#136) 2019-08-20 18:28:43 +03:00
sg_cb.cu syncthreads (#136) 2019-08-20 18:28:43 +03:00
shift.cu [WIP] right shift ops (#118) 2019-08-15 20:35:15 +03:00
sru.cu syncthreads (#136) 2019-08-20 18:28:43 +03:00
stack.cu temporary stack fix 2019-08-05 11:28:07 +10:00
svd.cu [WIP] more CUDA stuff (#57) 2019-07-20 23:05:21 +10:00
toggle_bits.cu Shugeo cuda tests (#116) 2019-08-15 13:54:47 +03:00
top_k.cu [WIP] multi-device support (#80) 2019-08-14 16:52:34 +03:00
transforms.cu syncthreads (#136) 2019-08-20 18:28:43 +03:00
weights.cu Eclipse Migration Initial Commit 2019-06-06 15:21:15 +03:00
zeta.cu [WIP] more fixes (#159) 2019-08-23 19:20:50 +03:00

README.md

This folder contains CUDA-specific implementations for operations.