cavis

History

shugeo 009007120b Shugeo_release_fixes3 (#81 )

* Implementation for non_max_suppression_v3 was added. Initial version

* Added check for overcome threshold.

* Added definition for V3 method.

* java remapping for NonMaxSuppressionV3

Signed-off-by: raver119 <raver119@gmail.com>

* Fixed proporly processing of an empty output and test.

* Refactored op to less threshold data to float.

* Implemented cuda-based helper for non_max_suppression_v3 op.

* Fixed fake_quant_with_min_max_vars op.

* Fixed tests with float numbers.

* - assert now stops execution
- sortByKey/sortByValue now have input validation

Signed-off-by: raver119 <raver119@gmail.com>

* missing var

Signed-off-by: raver119 <raver119@gmail.com>

* Fixed proper processing for zero max_size inputs.

* Refactored kernel callers.

* Fixed return statement for logdet op helper.

* Refactored unsorted segment SqrtN op.

* get back 8 tail bytes on CUDA

Signed-off-by: raver119 <raver119@gmail.com>

* Refactored segment prod ops and helpers for cuda and tests.

* Additional test.

* CudaWorkspace tests updated for 8 tail bytes

Signed-off-by: raver119 <raver119@gmail.com>

* special atomic test

Signed-off-by: raver119 <raver119@gmail.com>

* atomicMul/atomicDiv fix for 16bit values

Signed-off-by: raver119 <raver119@gmail.com>

* Eliminated waste prints.

2019-11-28 21:08:51 +03:00

activations.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

addBias.cpp

Shyrma deconv3 (#69 )

2019-11-21 21:17:30 +02:00

adjust_hue.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

adjust_saturation.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

axis.cpp

[WIP] Roll rewritten (#128 )

2019-08-17 14:15:08 +03:00

BarnesHutTsne.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

batched_gemm.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

batchnorm.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

betaInc.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

col2im.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

compare_elem.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

confusion.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

convolutions.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

cross.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

d_t_s.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

diag.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

dilation2d.cpp

Shyrma deconv3 (#69 )

2019-11-21 21:17:30 +02:00

dropout.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

dynamic.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

extract_patches.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

fake_quantization.cpp

Shugeo_release_fixes3 (#81 )

2019-11-28 21:08:51 +03:00

flatten.cpp

[WIP] bunch of improvements (#257 )

2019-09-11 20:12:09 +03:00

gather.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

gradient.cpp

[WIP] More of CUDA (#95 )

2019-08-05 11:27:05 +10:00

gru.cpp

Various fixes (#43 )

2019-11-14 19:38:20 +11:00

hamming.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

hashcode.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

histogram.cpp

[WIP] multi-device support (#80 )

2019-08-14 16:52:34 +03:00

histogramFixedWidth.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

im2col.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

image_draw_bounding_boxes.cpp

Shugeo image resize bicubic (#56 )

2019-11-20 21:11:04 +02:00

image_resize.cpp

Shugeo release fix2 (#70 )

2019-11-22 22:42:44 +03:00

image_suppression.cpp

Shugeo_release_fixes3 (#81 )

2019-11-28 21:08:51 +03:00

ismax.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

legacy_helper.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

lrn.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

lstm.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

lup.cpp

Shugeo cuda docs1 (#249 )

2019-09-09 16:27:45 +03:00

matrix_band.cpp

Eclipse Migration Initial Commit

2019-06-06 15:21:15 +03:00

matrix_diag_part.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

matrixSetDiag.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

max_pooling.cpp

[WIP] multi-device support (#80 )

2019-08-14 16:52:34 +03:00

meshgrid.cpp

Eclipse Migration Initial Commit

2019-06-06 15:21:15 +03:00

minimax.cpp

Eclipse Migration Initial Commit

2019-06-06 15:21:15 +03:00

nth_element.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

one_hot.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

percentile.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

polyGamma.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

prefix.cpp

[WIP] bunch of improvements (#257 )

2019-09-11 20:12:09 +03:00

random_crop.cpp

Eclipse Migration Initial Commit

2019-06-06 15:21:15 +03:00

random.cpp

- new NDArrayFactory scalar constructor

2019-11-08 08:49:41 +03:00

range.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

README.md

Merge master to upstream (#7945 )

2019-06-27 18:37:04 +03:00

reverse.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

roll.cpp

Shugeo cuda docs1 (#249 )

2019-09-09 16:27:45 +03:00

s_t_b.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

s_t_d.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

scatter.cpp

Shyrma scatter (#84 )

2019-11-26 20:29:09 +03:00

segment.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

sequence_mask.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

sg_cb.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

shift.cpp

[WIP] right shift ops (#118 )

2019-08-15 20:35:15 +03:00

sru.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

stack.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

svd.cpp

- add additional condition in svd helper to take into account rounding errors (#31 )

2019-11-05 17:16:17 +02:00

toggle_bits.cpp

[WIP] multi-device support (#80 )

2019-08-14 16:52:34 +03:00

top_k.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

transforms.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

weights.cpp

Eclipse Migration Initial Commit

2019-06-06 15:21:15 +03:00

zeta.cpp

[WIP] ThreadPool (#8 )

2019-11-13 17:04:59 +03:00

README.md

This folder contains OpenMP implementations for operations helpers. Basically suited for homogenous x86-like platforms.