* Added implementation for resize_area op. Initial commit.
* Added implementation of resize_area op. Initial revision.
* Corrected resizeArea functor call.
* Implementation of resize_area. Cpu platform helpers.
* Implementation for resize_area helpers. The first part revision.
* Added a set of tests for resize_area op.
* Cuda implementation for resize_area. Initial approach.
* Adding multithreading for resize_area algorithm.
* Cuda implementation of resize_area helpers. Shared memory approach.
* Refactored resizeAreaKernel with cuda implementation.
* Eliminated compilation errors.
* ResizeArea helpers for cuda platform. The first working revision.
* Added test for batched resize_area op testing.
* Implementation of resize_area for cuda platform and tests.
* Fixed multithreading with resize_area op helper.
* Corrected copyright marks with sources.
* Corrected copyright mark for resize_area op implementation.
* Corrected copyright mark for parity ops header.
* Corrected typo in strings and so on with image resize ops.
* Refactored resize_area helpers and multithreading.
* Added ResizeArea wrapper
* Added test with align_corners and fixed shape processing with only int args given for output size.
* Added test
* TF mapping for ResizeArea
* Fixed implementation issues with resize_area op for both platforms.
* Refactored image resizer struct to use flexible types for ints and floats.
* Improved multithreading with resizeAreaKernel launch.
* Use asynchronous memory copying with cuda platform image resize allocations.

Signed-off-by: shugeo <sgazeos@gmail.com>
Co-authored-by: Alexander Stoyakin <alexander.stoyakin@gmail.com>

Name | ||
---|---|---|
legacy | ||
BarnesHutTsne.cu | ||
README.md | ||
activations.cu | ||
addBias.cu | ||
adjust_hue.cu | ||
adjust_saturation.cu | ||
axis.cu | ||
batched_gemm.cu | ||
batchnorm.cu | ||
betaInc.cu | ||
col2im.cu | ||
compare_elem.cu | ||
concat.cu | ||
confusion.cu | ||
convolutions.cu | ||
cross.cu | ||
d_t_s.cu | ||
diGamma.cu | ||
diag.cu | ||
dilation2d.cu | ||
dropout.cu | ||
dynamic.cu | ||
extract_patches.cu | ||
fake_quantization.cu | ||
flatten.cu | ||
gather.cu | ||
gather_nd.cu | ||
gradient.cu | ||
gru.cu | ||
hamming.cu | ||
hashcode.cu | ||
histogram.cu | ||
histogramFixedWidth.cu | ||
im2col.cu | ||
image_draw_bounding_boxes.cu | ||
image_resize.cu | ||
image_suppression.cu | ||
imagesHelpers.cu | ||
ismax.cu | ||
legacy_helper.cu | ||
lgamma.cu | ||
lrn.cu | ||
lstm.cu | ||
lup.cu | ||
matrixSetDiag.cu | ||
matrix_band.cu | ||
matrix_diag_part.cu | ||
max_pooling.cu | ||
maximum.cu | ||
merge.cu | ||
meshgrid.cu | ||
minimum.cu | ||
nth_element.cu | ||
one_hot.cu | ||
pad.cu | ||
percentile.cu | ||
polyGamma.cu | ||
prefix.cu | ||
print_variable.cu | ||
random.cu | ||
random_crop.cu | ||
range.cu | ||
reverse.cu | ||
roll.cu | ||
s_t_b.cu | ||
s_t_d.cu | ||
scatter.cu | ||
scatter_simple.cu | ||
scatter_update.cu | ||
segment.cu | ||
segment_max.cu | ||
segment_mean.cu | ||
segment_min.cu | ||
segment_prod.cu | ||
segment_sqrtn.cu | ||
segment_sum.cu | ||
sequence_mask.cu | ||
sg_cb.cu | ||
shift.cu | ||
sru.cu | ||
stack.cu | ||
svd.cu | ||
toggle_bits.cu | ||
top_k.cu | ||
transforms.cu | ||
weights.cu | ||
zeta.cu |
README.md
This folder contains CUDA-specific implementations of operations.