raver119
|
57210b936c
|
Revert "OpenMP Threads execution (#297)" (#299)
This reverts commit dd2043ef48.
|
2020-03-09 08:22:49 +03:00 |
raver119
|
dd2043ef48
|
OpenMP Threads execution (#297)
* omp threads backported
Signed-off-by: raver119 <raver119@gmail.com>
* omp scalar reduce
Signed-off-by: raver119 <raver119@gmail.com>
* timing
Signed-off-by: raver119 <raver119@gmail.com>
* timing
Signed-off-by: raver119 <raver119@gmail.com>
* minor tweaks
Signed-off-by: raver119 <raver119@gmail.com>
* minor tweaks
Signed-off-by: raver119 <raver119@gmail.com>
* namespace change
Signed-off-by: raver119 <raver119@gmail.com>
* num_threads
Signed-off-by: raver119 <raver119@gmail.com>
* one minor fix
Signed-off-by: raver119 <raver119@gmail.com>
|
2020-03-09 08:21:44 +03:00 |
raver119
|
25b3cd9b80
|
[WIP] CUDA tests (#95)
* one more CI test
Signed-off-by: raver119 <raver119@gmail.com>
* export additional symbols
Signed-off-by: raver119 <raver119@gmail.com>
* few more tweaks
Signed-off-by: raver119 <raver119@gmail.com>
* one more tweak for linux
Signed-off-by: raver119 <raver119@gmail.com>
* fix dtype in few tests
Signed-off-by: raver119 <raver119@gmail.com>
* missing sync and memset in couple of tests
Signed-off-by: raver119 <raver119@gmail.com>
* copy step for libnd4j cuda
Signed-off-by: raver119 <raver119@gmail.com>
* no-op on empty for adjust hue/contrast/saturation
Signed-off-by: raver119 <raver119@gmail.com>
* CUDA_VERBOSE Off
Signed-off-by: raver119 <raver119@gmail.com>
* BroadcastBool fix + few tests
Signed-off-by: raver119 <raver119@gmail.com>
* trigger jenkins
Signed-off-by: raver119 <raver119@gmail.com>
* trigger jenkins
Signed-off-by: raver119 <raver119@gmail.com>
* - ignore couple of warnings
  - remove redundant compiler options
Signed-off-by: raver119 <raver119@gmail.com>
|
2019-12-02 21:37:21 +03:00 |
raver119
|
6de00bf75f
|
[WIP] Weekly update of repo (#8390)
* [WIP] Fix compilation after nd4j changes (#37)
* Fix compilation.
* Some tests fixed
* Disable tests temporarily.
* Restored test
* Tests restored.
* Test restored.
* [WIP] perf tests (#40)
* special maxpool test
Signed-off-by: raver119 <raver119@gmail.com>
* special maxpool test
Signed-off-by: raver119 <raver119@gmail.com>
* Shyrma bnorm bp (#41)
Batchnorm backprop mkldnn
* Add SameDiff memory reuse memory manager (array cache) (#39)
* Attention op comments
Signed-off-by: AlexDBlack <blacka101@gmail.com>
* ArrayCacheMemoryMgr - first pass
Signed-off-by: AlexDBlack <blacka101@gmail.com>
* Tweak array cache for use with SameDiff identity arrays
Signed-off-by: AlexDBlack <blacka101@gmail.com>
* ArrayCacheMemoryMgr javadoc and properly get max memory
Signed-off-by: AlexDBlack <blacka101@gmail.com>
* LRU cache policy + add tests
Signed-off-by: AlexDBlack <blacka101@gmail.com>
* Fixes
Signed-off-by: AlexDBlack <blacka101@gmail.com>
* Resize arrays internally if required for ArrayCacheMemoryMgr
Signed-off-by: AlexDBlack <blacka101@gmail.com>
* Test improvement
Signed-off-by: AlexDBlack <blacka101@gmail.com>
* Small polish
Signed-off-by: AlexDBlack <blacka101@gmail.com>
* SameDiff op runtime benchmarking listener (#42)
Signed-off-by: AlexDBlack <blacka101@gmail.com>
* INLINE_LOOPS for windows
Signed-off-by: raver119 <raver119@gmail.com>
* [WIP] ThreadPool (#8)
This PR removes OpenMP use in 95% of cases
|
2019-11-13 17:15:18 +03:00 |