cavis

Author	SHA1	Message	Date
Alex Black	c84307a6fe	Small SameDiff execution fix (#168 ) * SameDiff exec: Fix for switch op when predicate is constant, and op is inside loop Signed-off-by: AlexDBlack <blacka101@gmail.com> * Update ignores for failing zoo models Signed-off-by: AlexDBlack <blacka101@gmail.com>	2020-01-08 23:57:23 +11:00
Oleh	2404be5fe0	Oleh multinomial (#163 ) * libnd4j: Multinomial op #8570 first raw step of multinomial random data generator implementation Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j: Multinomial op #8570 next step of multinomial random categories generator implementation on both cpu and cuda, need corrections and code clean up before review and testing * libnd4j: Multinomial op #8570 code clean up and fixed issues data selecting, moved from coords to tads * libnd4j: Multinomial op #8570 fixed cuda build add reference for math materials that was used for implementation * libnd4j: Multinomial op #8570 fixed several bugs, added several tests and improved cuda version. current implementation works, need testing of reproduction with the same seed * libnd4j: Multinomial op #8570 fixes and optimization after discussion in both cuda and cpu * libnd4j: Multinomial op #8570 add corrections after review, removed tads, replace 2D parallel loop by 3D Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j: Multinomial op fixed declaration and add tests need discussion * libnd4j: Multinomial op fix in test * libnd4j: Multinomial op corrected behavior to get reproducible results, fixed issue in uniform value getting, tests added, need cuda review and cuda testing Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j: Multinomial op fixed indexing on uniform calculation Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j: Multinomial op some corrections in max min declaration Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j: Multinomial op fixed index calculation, added rewind, corrected input declaration, added stats tests, both cuda and cpu. cuda need testing * libnd4j: Multinomial op fixed bugs on cuda nad cpu. 
need review Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j: Multinomial op corrected tests to handle different orders Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j: Multinomial op some improvements after code review Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j: Multinomial op more corrections after review Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j: Multinomial op fixed seed usage, update tests, fixed cuda based on comments, fixed bug of rewind, removed one behavior, minor corrections. Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j: Multinomial op minor corrections Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j: Multinomial op rise the bound of fluctuation for random cases Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j: Multinomial op modified operation inputs and update implementation and tests on both cpu and cuda * libnd4j: Multinomial op corrected data types according ops.proto Co-authored-by: raver119 <raver119@gmail.com>	2020-01-06 22:35:05 +03:00
raver119@gmail.com	bb86bbc255	few tests tweaked Signed-off-by: raver119@gmail.com <raver119@gmail.com>	2020-01-06 15:55:16 +03:00
raver119	d9ef5e2467	Minor fixes (#165 ) * ios-arm excluded Signed-off-by: raver119 <raver119@gmail.com> * histogram single threaded Signed-off-by: raver119 <raver119@gmail.com>	2020-01-04 15:27:16 +03:00
raver119	29e8e09db6	String changes (#3 ) * initial commit * additional data types & tensor type Signed-off-by: raver119 <raver119@gmail.com> * next step Signed-off-by: raver119 <raver119@gmail.com> * missing include * sparse_to_dense Signed-off-by: raver119 <raver119@gmail.com> * few more tests files Signed-off-by: raver119 <raver119@gmail.com> * draft Signed-off-by: raver119 <raver119@gmail.com> * numeric sparse_to_dense Signed-off-by: raver119 <raver119@gmail.com> * comment Signed-off-by: raver119 <raver119@gmail.com> * string sparse_to_dense version Signed-off-by: raver119 <raver119@gmail.com> * CUDA DataBuffer expand Signed-off-by: raver119 <raver119@gmail.com> * few tweaks for CUDA build Signed-off-by: raver119 <raver119@gmail.com> * shape fn for string_split Signed-off-by: raver119 <raver119@gmail.com> * one more comment Signed-off-by: raver119 <raver119@gmail.com> * string_split indices Signed-off-by: raver119 <raver119@gmail.com> * next step Signed-off-by: raver119 <raver119@gmail.com> * test passes Signed-off-by: raver119 <raver119@gmail.com> * few rearrangements for databuffer implementations Signed-off-by: raver119 <raver119@gmail.com> * DataBuffer: move inline methods to common implementations Signed-off-by: raver119 <raver119@gmail.com> * add native DataBuffer to Nd4j presets Signed-off-by: raver119 <raver119@gmail.com> * DataBuffer creation Signed-off-by: raver119 <raver119@gmail.com> * use DataBuffer for allocation Signed-off-by: raver119 <raver119@gmail.com> * cpu databuffer as deallocatable Signed-off-by: raver119 <raver119@gmail.com> * DataBuffer setters for bufers Signed-off-by: raver119 <raver119@gmail.com> * couple of wrappers Signed-off-by: raver119 <raver119@gmail.com> * DataBuffers being passed around Signed-off-by: raver119 <raver119@gmail.com> * Bunch of ByteBuffer-related signatures gone Signed-off-by: raver119 <raver119@gmail.com> * - few more Nd4j signatures removed - minor fix for bfloat16 Signed-off-by: raver119 <raver119@gmail.com> * 
nullptr pointer is still a pointer, but 0 as address :) Signed-off-by: raver119 <raver119@gmail.com> * one special test Signed-off-by: raver119 <raver119@gmail.com> * empty string array init Signed-off-by: raver119 <raver119@gmail.com> * one more test in cpp Signed-off-by: raver119 <raver119@gmail.com> * memcpy instead of databuffer swap Signed-off-by: raver119 <raver119@gmail.com> * special InteropDataBuffer for front-end languages Signed-off-by: raver119 <raver119@gmail.com> * few tweaks for java Signed-off-by: raver119 <raver119@gmail.com> * pointer/indexer actualization Signed-off-by: raver119 <raver119@gmail.com> * CustomOp returns list for inputArumgents and outputArguments instead of array Signed-off-by: raver119 <raver119@gmail.com> * redundant call Signed-off-by: raver119 <raver119@gmail.com> * print_variable op Signed-off-by: raver119 <raver119@gmail.com> * - view handling (but wrong one) - print_variable java wrapper Signed-off-by: raver119 <raver119@gmail.com> * one more test Signed-off-by: raver119 <raver119@gmail.com> * - empty arrays handling Signed-off-by: raver119 <raver119@gmail.com> * - deserialization works now Signed-off-by: raver119 <raver119@gmail.com> * minor fix Signed-off-by: raver119 <raver119@gmail.com> * meh Signed-off-by: raver119 <raver119@gmail.com> * one more fix Signed-off-by: raver119 <raver119@gmail.com> * initial cuda commit Signed-off-by: raver119 <raver119@gmail.com> * print_variable message validation Signed-off-by: raver119 <raver119@gmail.com> * CUDA views Signed-off-by: raver119 <raver119@gmail.com> * CUDA special buffer size Signed-off-by: raver119 <raver119@gmail.com> * minor update to match master changes Signed-off-by: raver119 <raver119@gmail.com> * - consider arrays always actual on device for CUDA - additional PrintVariable constructor - CudaUtf8Buffer now allocates host buffer by default Signed-off-by: raver119 <raver119@gmail.com> * meh Signed-off-by: raver119 <raver119@gmail.com> * - print_variable now allows 
print from device Signed-off-by: raver119 <raver119@gmail.com> * InteropDataBuffer data type fix Signed-off-by: raver119 <raver119@gmail.com> * ... Signed-off-by: raver119 <raver119@gmail.com> * disable some debug messages Signed-off-by: raver119 <raver119@gmail.com> * master pulled in Signed-off-by: raver119 <raver119@gmail.com> * couple of new methods for DataBuffer interop Signed-off-by: raver119 <raver119@gmail.com> * java side Signed-off-by: raver119 <raver119@gmail.com> * offsetted constructor Signed-off-by: raver119 <raver119@gmail.com> * new CUDA deallocator Signed-off-by: raver119 <raver119@gmail.com> * CUDA backend torn apart Signed-off-by: raver119 <raver119@gmail.com> * CUDA backend torn apart 2 Signed-off-by: raver119 <raver119@gmail.com> * CUDA backend torn apart 3 Signed-off-by: raver119 <raver119@gmail.com> * - few new tests - few new methods for DataBuffer management Signed-off-by: raver119 <raver119@gmail.com> * few more tests + few more tweaks Signed-off-by: raver119 <raver119@gmail.com> * two failing tests Signed-off-by: raver119 <raver119@gmail.com> * one more test Signed-off-by: raver119 <raver119@gmail.com> * two failing tests pass Signed-off-by: raver119 <raver119@gmail.com> * now we pass DataBuffer to legacy ops too Signed-off-by: raver119 <raver119@gmail.com> * Native DataBuffer for legacy ops, Java side Signed-off-by: raver119 <raver119@gmail.com> * CPU java side update Signed-off-by: raver119 <raver119@gmail.com> * CUDA java side update Signed-off-by: raver119 <raver119@gmail.com> * no more prepare/register action on java side Signed-off-by: raver119 <raver119@gmail.com> * NDArray::prepare/register use now accepts vectors Signed-off-by: raver119 <raver119@gmail.com> * InteropDataBuffer now has few more convenience methods Signed-off-by: raver119 <raver119@gmail.com> * java bindings update Signed-off-by: raver119 <raver119@gmail.com> * tick device in NativeOps Signed-off-by: raver119 <raver119@gmail.com> * Corrected usage of OpaqueBuffer 
for tests. * Corrected usage of OpaqueBuffer for java tests. * NativeOpsTests fixes. * print_variable now returns scalar Signed-off-by: raver119 <raver119@gmail.com> * one more test Signed-off-by: raver119 <raver119@gmail.com> * compat_string_split fix for CUDA Signed-off-by: raver119 <raver119@gmail.com> * - CUDA execScalar fix - CUDA lazyAllocateHostPointer now checks java indexer/pointer instead of native pointer Signed-off-by: raver119 <raver119@gmail.com> * legacy ops DataBuffer migration prototype Signed-off-by: raver119 <raver119@gmail.com> * ignore device shapeinfo coming from java Signed-off-by: raver119 <raver119@gmail.com> * minor fix Signed-off-by: raver119 <raver119@gmail.com> * minor transformAny fix Signed-off-by: raver119 <raver119@gmail.com> * minor tweak for lazy host allocation Signed-off-by: raver119 <raver119@gmail.com> * - DataBuffer::memcpy method - bitcast now uses memcpy Signed-off-by: raver119 <raver119@gmail.com> * - IndexReduce CUDA dimension buffer fix Signed-off-by: raver119 <raver119@gmail.com> * views for CPU and CUDA Signed-off-by: raver119 <raver119@gmail.com> * less spam Signed-off-by: raver119 <raver119@gmail.com> * optional memory init Signed-off-by: raver119 <raver119@gmail.com> * async memset Signed-off-by: raver119 <raver119@gmail.com> * - SummaryStats CUDA fix - DataBuffer.sameUnderlyingData() impl - execBroadcast fix Signed-off-by: raver119 <raver119@gmail.com> * - reduce3All fix switch to CUDA 10 temporarily Signed-off-by: raver119 <raver119@gmail.com> * CUDA version Signed-off-by: raver119 <raver119@gmail.com> * proper memory deallocator registration Signed-off-by: raver119 <raver119@gmail.com> * HOST_ONLY workspace allocation Signed-off-by: raver119 <raver119@gmail.com> * temp commit Signed-off-by: raver119 <raver119@gmail.com> * few conflicts resolved Signed-off-by: raver119 <raver119@gmail.com> * few minor fixes Signed-off-by: raver119 <raver119@gmail.com> * one more minor fix Signed-off-by: raver119 
<raver119@gmail.com> * NDArray permute should operate on JVM primitives Signed-off-by: raver119 <raver119@gmail.com> * - create InteropDataBuffer for shapes as well - update pointers after view creation in Java Signed-off-by: raver119 <raver119@gmail.com> * - addressPointer temporary moved to C++ Signed-off-by: raver119 <raver119@gmail.com> * CUDA: don't account offset twice Signed-off-by: raver119 <raver119@gmail.com> * CUDA: DataBuffer pointer constructor updated Signed-off-by: raver119 <raver119@gmail.com> * CUDA NDArray.unsafeDuplication() simplified Signed-off-by: raver119 <raver119@gmail.com> * CUDA minor workspace-related fixes Signed-off-by: raver119 <raver119@gmail.com> * CPU DataBuffer.reallocate() Signed-off-by: raver119 <raver119@gmail.com> * print_affinity op Signed-off-by: raver119 <raver119@gmail.com> * print_affinity java side Signed-off-by: raver119 <raver119@gmail.com> * CUDA more tweaks for data locality Signed-off-by: raver119 <raver119@gmail.com> * - compat_string_split tweak - CudaUtf8Buffer update Signed-off-by: raver119 <raver119@gmail.com> * INDArray.close() mechanic restored Signed-off-by: raver119 <raver119@gmail.com> * one more test fixed Signed-off-by: raver119 <raver119@gmail.com> * - CUDA DataBuffer.reallocate() updated - cudaMemcpy (synchronous) restored Signed-off-by: raver119 <raver119@gmail.com> * one last fix Signed-off-by: raver119 <raver119@gmail.com> * bad import removed Signed-off-by: raver119 <raver119@gmail.com> * another small fix Signed-off-by: raver119 <raver119@gmail.com> * one special test Signed-off-by: raver119 <raver119@gmail.com> * fix bad databuffer size Signed-off-by: raver119 <raver119@gmail.com> * release primaryBuffer on replace Signed-off-by: raver119 <raver119@gmail.com> * higher timeout Signed-off-by: raver119 <raver119@gmail.com> * disable timeouts Signed-off-by: raver119 <raver119@gmail.com> * dbCreateView now validates offset and length of a view Signed-off-by: raver119 <raver119@gmail.com> * additional 
validation for dbExpand Signed-off-by: raver119 <raver119@gmail.com> * restore timeout back again Signed-off-by: raver119 <raver119@gmail.com> * smaller distribution for rng test to prevent timeouts Signed-off-by: raver119 <raver119@gmail.com> * CUDA DataBuffer::memcpy now copies to device all the time Signed-off-by: raver119 <raver119@gmail.com> * OpaqueDataBuffer now contains all required methods for interop Signed-off-by: raver119 <raver119@gmail.com> * some javadoc Signed-off-by: raver119 <raver119@gmail.com> * GC on failed allocations Signed-off-by: raver119 <raver119@gmail.com> * minoe memcpu tweak Signed-off-by: raver119 <raver119@gmail.com> * one more bitcast test Signed-off-by: raver119 <raver119@gmail.com> * - NDArray::deviceId() propagation - special multi-threaded test for data locality checks Signed-off-by: raver119 <raver119@gmail.com> * DataBuffer additional syncStream Signed-off-by: raver119 <raver119@gmail.com> * DataBuffer additional syncStream Signed-off-by: raver119 <raver119@gmail.com> * one ignored test Signed-off-by: raver119 <raver119@gmail.com> * skip host alloc for empty arrays Signed-off-by: raver119 <raver119@gmail.com> * ByteBuffer support is back Signed-off-by: raver119 <raver119@gmail.com> * DataBuffer::memcpy minor fix Signed-off-by: raver119 <raver119@gmail.com> * few minor prelu/bp tweaks Signed-off-by: raver119 <raver119@gmail.com> * nullify-related fixes Signed-off-by: raver119 <raver119@gmail.com> * PReLU fixes (#157) Signed-off-by: Alex Black <blacka101@gmail.com> * Build fixed * Fix tests * one more ByteBuffer signature restored Signed-off-by: raver119 <raver119@gmail.com> * nd4j-jdbc-hsql profiles fix Signed-off-by: raver119 <raver119@gmail.com> * nd4j-jdbc-hsql profiles fix Signed-off-by: raver119 <raver119@gmail.com> * PReLU weight init fix Signed-off-by: Alex Black <blacka101@gmail.com> * Small PReLU fix Signed-off-by: Alex Black <blacka101@gmail.com> * - INDArray.migrate() reactivated - DataBuffer::setDeviceId(...) 
added - InteropDataBuffer Z syncToDevice added for views Signed-off-by: raver119 <raver119@gmail.com> * missed file Signed-off-by: raver119 <raver119@gmail.com> * Small tweak Signed-off-by: Alex Black <blacka101@gmail.com> * cuda 10.2 Signed-off-by: raver119 <raver119@gmail.com> * minor fix Signed-off-by: raver119 <raver119@gmail.com> Co-authored-by: shugeo <sgazeos@gmail.com> Co-authored-by: Alex Black <blacka101@gmail.com> Co-authored-by: Alexander Stoyakin <alexander.stoyakin@gmail.com>	2020-01-04 13:27:50 +03:00
raver119	451d9d57fd	shape function override (#161 ) Signed-off-by: raver119 <raver119@gmail.com>	2020-01-04 09:06:44 +03:00
Robert Altena	53d3bd1269	shallow delete of assign from SDBase. (#164 ) Signed-off-by: Robert Altena <Rob@Ra-ai.com>	2020-01-04 15:26:39 +11:00
Alex Black	29104083cc	Various fixes (#143 ) * #8568 ArrayUtil optimization Signed-off-by: AlexDBlack <blacka101@gmail.com> * #6171 Keras ReLU and ELU support Signed-off-by: AlexDBlack <blacka101@gmail.com> * Keras softmax layer import Signed-off-by: AlexDBlack <blacka101@gmail.com> * #8549 Webjars dependency management Signed-off-by: AlexDBlack <blacka101@gmail.com> * Fix for TF import names ':0' suffix issue / NPE Signed-off-by: AlexDBlack <blacka101@gmail.com> * BiasAdd: fix default data format for TF import Signed-off-by: AlexDBlack <blacka101@gmail.com> * Update zoo test ignores Signed-off-by: AlexDBlack <blacka101@gmail.com> * #8509 SameDiff Listener API - provide frame + iteration Signed-off-by: AlexDBlack <blacka101@gmail.com> * #8520 ND4J Environment Signed-off-by: AlexDBlack <blacka101@gmail.com> * Deconv3d Signed-off-by: AlexDBlack <blacka101@gmail.com> * Deconv3d fixes + gradient check Signed-off-by: AlexDBlack <blacka101@gmail.com> * Conv3d fixes + deconv3d DType test Signed-off-by: AlexDBlack <blacka101@gmail.com> * Fix issue with deconv3d gradinet check weight init Signed-off-by: AlexDBlack <blacka101@gmail.com> * #8579 Fix BaseCudaDataBuffer constructor fix for UINT16 Signed-off-by: AlexDBlack <blacka101@gmail.com> * DataType.isNumerical() returns false for BOOL type Signed-off-by: AlexDBlack <blacka101@gmail.com> * #8504 Reduce Spark log spam for tests Signed-off-by: AlexDBlack <blacka101@gmail.com> * Clean up DL4J gradient check test spam Signed-off-by: AlexDBlack <blacka101@gmail.com> * More Gradient check spam reduction Signed-off-by: AlexDBlack <blacka101@gmail.com> * SameDiff test spam reduction Signed-off-by: AlexDBlack <blacka101@gmail.com> * Fixes for FlatBuffers mapping Signed-off-by: AlexDBlack <blacka101@gmail.com> * SameDiff log spam cleanup Signed-off-by: AlexDBlack <blacka101@gmail.com> * Tests should extend BaseNd4jTest Signed-off-by: AlexDBlack <blacka101@gmail.com> * Remove debug line in c++ op Signed-off-by: AlexDBlack 
<blacka101@gmail.com> * ND4J test spam cleanup Signed-off-by: AlexDBlack <blacka101@gmail.com> * DL4J test spam reduction Signed-off-by: AlexDBlack <blacka101@gmail.com> * More Dl4J and datavec test spam cleanup Signed-off-by: AlexDBlack <blacka101@gmail.com> * Fix for bad conv3d test Signed-off-by: AlexDBlack <blacka101@gmail.com> * Additional test Signed-off-by: AlexDBlack <blacka101@gmail.com> * Embedding layers: don't inherit global default activation function Signed-off-by: AlexDBlack <blacka101@gmail.com> * Trigger CI Signed-off-by: AlexDBlack <blacka101@gmail.com> * Consolidate all BaseDL4JTest classes to single class used everywhere; make timeout configurable per class Signed-off-by: AlexDBlack <blacka101@gmail.com> * Test fixes and timeout increases Signed-off-by: AlexDBlack <blacka101@gmail.com> * Timeouts and PReLU fixes Signed-off-by: AlexDBlack <blacka101@gmail.com> * Restore libnd4j build threads arg for CUDA build Signed-off-by: AlexDBlack <blacka101@gmail.com> * Increase timeouts on a few tests to avoid spurious failures on some CI machines Signed-off-by: AlexDBlack <blacka101@gmail.com> * More timeout fixes Signed-off-by: AlexDBlack <blacka101@gmail.com> * More test timeout fixes Signed-off-by: AlexDBlack <blacka101@gmail.com> * Tweak timeout for one more test Signed-off-by: AlexDBlack <blacka101@gmail.com> * Final tweaks Signed-off-by: AlexDBlack <blacka101@gmail.com> * One more ignore Signed-off-by: AlexDBlack <blacka101@gmail.com>	2020-01-04 13:45:07 +11:00
raver119	ac0d249f07	build fix for clang (#159 ) Signed-off-by: raver119@gmail.com <raver119@gmail.com>	2020-01-03 15:17:06 +03:00
shugeo	fbf7c9d38b	Fixed lu for cuda platform and tests. (#158 ) Signed-off-by: shugeo <sgazeos@gmail.com>	2020-01-02 23:25:41 +03:00
Susan Eraly	c32acb2ec7	fix if dir does not exist (#129 ) * fix if dir does not exist Signed-off-by: eraly <susan.eraly@gmail.com> * added simple test Signed-off-by: eraly <susan.eraly@gmail.com>	2019-12-30 19:48:57 -05:00
Alexander Stoyakin	010744ef9c	Lu wrapper and tests fixes (#144 ) * Tests fixed * Lu added * Test fixed * Default timeout * Tests timeouts fixed. * TF import fix * Timeouts added * Timeout fixed. * Test corrected * rgb and yiq conversion ops added * Converter ops added * Header * Yuv converters * API added * Empty test for matmul * Explanation * skip gemm/gemv on empty inputs Signed-off-by: raver119 <raver119@gmail.com> * Test added * Correct test * one more empty pass-through for mmul Signed-off-by: raver119 <raver119@gmail.com> * Cleanup * Test added * Test fixed * Added missing mapping * Added missing mapping Co-authored-by: raver119 <raver119@gmail.com>	2019-12-30 15:06:12 +03:00
raver119	9b329d2601	[WIP] bias_add NHWC loop (#149 ) * one more test Signed-off-by: raver119 <raver119@gmail.com> * one more test Signed-off-by: raver119 <raver119@gmail.com> * one more test Signed-off-by: raver119 <raver119@gmail.com> * bias_add nhwc 4D Signed-off-by: raver119 <raver119@gmail.com> * bias_add nhwc 4D Signed-off-by: raver119 <raver119@gmail.com> * bias_add nhwc 4D Signed-off-by: raver119 <raver119@gmail.com> * bias_add nhwc 4D Signed-off-by: raver119 <raver119@gmail.com> * disable test Signed-off-by: raver119 <raver119@gmail.com>	2019-12-24 20:56:49 +03:00
raver119	fc760de348	RgbToYuv & YuvToRgb skip empty arrays Signed-off-by: raver119 <raver119@gmail.com>	2019-12-24 18:45:54 +03:00
Oleh	75123b0a4c	[WIP] Oleh rgb yuv (#147 ) * libnd4j: RgbToYuv and YuvToRgb, both implementations for both cpu and cuda. Need adding tests and review Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j: RgbToYuv and YuvToRgb, replace coords method on Tad in both cpu and cuda, add tests, fixed bugs Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j: RgbToYuv and YuvToRgb minor corrections Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j: RgbToYuv and YuvToRgb corrections to use operations in-place	2019-12-24 18:30:54 +03:00
raver119	d1e5e79c10	[WIP] CUDA concat tweak (#148 ) * one special test Signed-off-by: raver119 <raver119@gmail.com> * one special test Signed-off-by: raver119 <raver119@gmail.com> * local memory for concat Signed-off-by: raver119 <raver119@gmail.com> * fixed grid size for concat Signed-off-by: raver119 <raver119@gmail.com> * fixed grid size for concat Signed-off-by: raver119 <raver119@gmail.com> * test commented out Signed-off-by: raver119 <raver119@gmail.com>	2019-12-24 17:01:03 +03:00
Abdelrauf	39d43ca170	RgbToYiq and YiqToRgb operations (#142 ) * RgbToYiq and YiqToRgb Signed-off-by: Abdelrauf <rauf@konduit.ai> * CUDA impl for RgbToYiq and YiqToRgb Signed-off-by: raver119 <raver119@gmail.com> * remove print Signed-off-by: raver119 <raver119@gmail.com> * allow inplace for hsv,rgb,yiq ops Signed-off-by: Abdelrauf <rauf@konduit.ai> Co-authored-by: raver119 <raver119@gmail.com>	2019-12-24 15:20:35 +03:00
raver119	62f93ac211	negative handling for empty arrays (#146 ) Signed-off-by: raver119 <raver119@gmail.com>	2019-12-24 13:23:25 +03:00
Alex Black	1f9e1b6022	SameDiff profiler analysis improvements (#141 ) * #8555 SameDiff profiler analysis improvements Signed-off-by: Alex Black <blacka101@gmail.com> * Fix TF sub-op aggregation Signed-off-by: Alex Black <blacka101@gmail.com> * Small filtering tweak Signed-off-by: Alex Black <blacka101@gmail.com> * Copyright headers Signed-off-by: Alex Black <blacka101@gmail.com>	2019-12-23 15:24:20 +11:00
Alex Black	ce02b6fae7	Small fixes (#140 ) * Allow scalar op result array auto allocation Signed-off-by: AlexDBlack <blacka101@gmail.com> * Don't swallow underlying exception for calculateOutputShape execution failures Signed-off-by: AlexDBlack <blacka101@gmail.com> * Ignore for known keras failure Signed-off-by: AlexDBlack <blacka101@gmail.com>	2019-12-21 17:00:46 +11:00
raver119	495256c827	minor build fix (#139 ) Signed-off-by: raver119 <raver119@gmail.com>	2019-12-21 08:07:13 +03:00
Yurii Shyrma	5d9b2a16e5	Shyrma temp (#131 ) * - specifying template instantiation for certain types in float16 and bloat16 Signed-off-by: Yurii <iuriish@yahoo.com> * - polishing bfloat16 and float16 member functions template specialization Signed-off-by: Yurii <iuriish@yahoo.com> * - rewrite and overload array +-/ scalar and scalar +-/ arr in NDAray class Signed-off-by: Yurii <iuriish@yahoo.com> * - make corrections which have to do with and rvalue lvalue conversions Signed-off-by: Yurii <iuriish@yahoo.com> * - provide move semantic in NDArray operators array +-/* array Signed-off-by: Yurii <iuriish@yahoo.com> * float16/bfloat16 tweaks Signed-off-by: raver119 <raver119@gmail.com> * one more tweak Signed-off-by: raver119 <raver119@gmail.com> * - make float16 and bfloat16 to compile successfully on cuda Signed-off-by: Yurii <iuriish@yahoo.com> * - do not use resources of view-like arrays when move semantics is applied Signed-off-by: Yurii <iuriish@yahoo.com> * - get rid of pointers in signatures NDArray methods 1 Signed-off-by: Yurii <iuriish@yahoo.com> * - correction of signature of NDArray::dup method Signed-off-by: Yurii <iuriish@yahoo.com> * - correction of signature of NDArray::reduceAlongDimension method Signed-off-by: Yurii <iuriish@yahoo.com> * - signature correction of NDArray::applyIndexReduce and applyTrueBroadcast methods Signed-off-by: Yurii <iuriish@yahoo.com> * - signature correction of NDArray::applyReduce3 and varianceAlongDimension methods Signed-off-by: Yurii <iuriish@yahoo.com> * - signature correction of NDArray::tensorsAlongDimension and diagonal methods Signed-off-by: Yurii <iuriish@yahoo.com> * - signature correction of NDArray::allTensorsAlongDimension Signed-off-by: Yurii <iuriish@yahoo.com> * - signature correction of NDArray::reduceAlongDimension 2 Signed-off-by: Yurii <iuriish@yahoo.com> * - signature correction of NDArray::applyTransform 2 Signed-off-by: Yurii <iuriish@yahoo.com> * - signature correction of 
NDArray::applyPairwiseTransform 2 Signed-off-by: Yurii <iuriish@yahoo.com> * - signature correction of NDArray::applyBroadcast 2 Signed-off-by: Yurii <iuriish@yahoo.com> * - signature correction of NDArray::applyTrueBroadcast 2 Signed-off-by: Yurii <iuriish@yahoo.com> * - signature correction of NDArray::applyScalar and applyScalarArr Signed-off-by: Yurii <iuriish@yahoo.com> * - signature correction of NDArray::lambda methods Signed-off-by: Yurii <iuriish@yahoo.com> * - signature correction of NDArray::reduce3 methods 2 Signed-off-by: Yurii <iuriish@yahoo.com> * - signature correction of following NDArray methods: add/sub/mul/div row/column and fillAsTriangular Signed-off-by: Yurii <iuriish@yahoo.com> * - signature correction of NDArray::tileToShape methods Signed-off-by: Yurii <iuriish@yahoo.com> * - signature correction of NDArray::isShapeSameStrict method Signed-off-by: Yurii <iuriish@yahoo.com> * minor corrections in tests Signed-off-by: Yurii <iuriish@yahoo.com> * - replace reduce op in batchnorm mkldnn Signed-off-by: Yurii <iuriish@yahoo.com> * - add explicit templates instantiations for operator+(NDArray&&. 
const scalar) Signed-off-by: Yurii <iuriish@yahoo.com> * - corrections of casts in float16/bfloat16 Signed-off-by: Yurii <iuriish@yahoo.com> * - provide move semantics in following NDArray methods: transform, applyTrueBroadcast, transpose, reshape, permute Signed-off-by: Yurii <iuriish@yahoo.com> * - get rid of input array A duplicate in svd cuda op Signed-off-by: Yurii <iuriish@yahoo.com> * - avoid available bug in svd cuda API Signed-off-by: Yurii <iuriish@yahoo.com> * - add temporary global memory buffer in svd cuda when calcUV = false and m != n Signed-off-by: Yurii <iuriish@yahoo.com> * - remove test with blfoat16 type for betainC Signed-off-by: Yurii <iuriish@yahoo.com> * - resolve conflicts after master has been merged in Signed-off-by: Yurii <iuriish@yahoo.com> * - changed type of affected input array in fused_batch_norm Signed-off-by: Yurii <iuriish@yahoo.com> * - add several explicit type castings Signed-off-by: Yurii <iuriish@yahoo.com> * - add ND4J_EXPORT to operators Signed-off-by: Yurii <iuriish@yahoo.com> * - add explicit template types in instantiations of template arithm operators of NDArray class Signed-off-by: Yurii <iuriish@yahoo.com> * - one more test fix Signed-off-by: Yurii <iuriish@yahoo.com> Co-authored-by: raver119 <raver119@gmail.com>	2019-12-20 22:35:39 +03:00
raver119	3e0afadea1	minor build fix Signed-off-by: raver119 <raver119@gmail.com>	2019-12-20 21:11:18 +03:00
Oleh	211c0df76f	Oleh rgb to gray scale (#138 ) * libnd4j: RgbToGrayscale op #8536 - raw implementation in user branch, need checks for integration and adding other orders Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j: RgbToGrayscale op #8536 next step of merging images Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j: RgbToGrayscale op #8536, Revert merge of hsv_to_rgb and rgb_to_hsv as cause conflicts in naming need refactoring before merge, implementation of rbg_to_grs added * libnd4j: RgbToGrayscale op #8536 imlementation and conflict resolve * libnd4j: RgbToGrayscale op #8536 merged operations with images into image, renamed methods and files * libnd4j: RgbToGrayscale op #8536 added test for rgbToGrayScale, need clarification and fixed tests case run Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j: RgbToGrayscale op #8536 bug fixing and need review * libnd4j: RgbToGrayscale op #8536 some additional corrections after review Signed-off-by: Oleg <oleg.semeniv@gmail.com> * - minor corrections in rgbToGrs test1 Signed-off-by: Yurii <iuriish@yahoo.com> * libnd4j: RgbToGrayscale op #8536, corrected tests and rbf_to_grs, fixed problems, refactoring, need review * libnd4j: RgbToGrayscale op #8536 fix for 'f' order in rgbToGrs * libnd4j: RgbToGrayscale op #8536 fixed several bugs with dimC, test case refactoring and improve Signed-off-by: Oleg <oleg.semeniv@gmail.com> * - add cuda kernel for rgbToGrs op Signed-off-by: Yurii <iuriish@yahoo.com> * - fix linkage errors Signed-off-by: Yurii <iuriish@yahoo.com> Co-authored-by: Yurii Shyrma <iuriish@yahoo.com>	2019-12-20 20:59:29 +03:00
shugeo	67d8199165	[WIP] Shugeo lup (#126 ) * Added infrastructure for implementation op lu for both cuda and cpu platforms. * Added implementation of helpers with lu op. Signed-off-by: shugeo <sgazeos@gmail.com> * Refactored LU decomposition to use vector of permutations instead. * Refactored helpers for lu op. * Fixed crash with determinant op. * Refactored cpu LU op helper. * Added implementation for lu op. Signed-off-by: shugeo <sgazeos@gmail.com> * Fixed issue with argmax on column. * Added multithreaded behaviour for lu op helper. * Fixed multithreaded cpu implementation helpers for lu op. * Added cuda implementation for lu op helper. * Finished lu helper implementation for cuda platform. * Eliminated waste prints and comments. * Fixed race condition and multithreading issues. * Fixed memory leak with shape construction. * Corrected test for lu op to avoid near zero elements on the main diagonal. Signed-off-by: shugeo <sgazeos@gmail.com> * Improved test for adjust_contrast op. Signed-off-by: shugeo <sgazeos@gmail.com> * Fixed issues with cuda implementation of resize_bicubic helpers. Signed-off-by: shugeo <sgazeos@gmail.com>	2019-12-20 17:56:28 +03:00
Alexander Stoyakin	6d8a063c9b	nd4j-tests cleanup (#137 ) * Fixed tests * Invalid test removed	2019-12-20 16:38:33 +03:00
Abdelrauf	3c9a2a5cd9	Fix for hsv and rgb ranges (#136 ) Signed-off-by: Abdelrauf <rauf@konduit.ai>	2019-12-20 08:48:30 +03:00
raver119	43e118de1e	maxThreads -> maxMasterThreads Signed-off-by: raver119 <raver119@gmail.com>	2019-12-19 18:41:27 +03:00
raver119	8b877a8ddf	- 3d loops parallelism fix (#135 ) - additional check for maxMasterThreads <= maxThreads Signed-off-by: raver119 <raver119@gmail.com>	2019-12-19 16:50:08 +03:00
Alex Black	3d8f6d50a1	SameDiff profiler / tracing and profile analysis/comparison (#133 ) * Profiler Signed-off-by: Alex Black <blacka101@gmail.com> * Next steps, polishing, and loading SD/TF format JSON Signed-off-by: AlexDBlack <blacka101@gmail.com> * Next steps Signed-off-by: AlexDBlack <blacka101@gmail.com> * Profile comparison method Signed-off-by: AlexDBlack <blacka101@gmail.com> * Make profiling result writing async to reduce main thread overhead Signed-off-by: AlexDBlack <blacka101@gmail.com> * Profiling polishing Signed-off-by: AlexDBlack <blacka101@gmail.com> * Fix Signed-off-by: AlexDBlack <blacka101@gmail.com> * Profile analyzer fixes Signed-off-by: Alex Black <blacka101@gmail.com> * Polish Signed-off-by: Alex Black <blacka101@gmail.com> * Cleanup Signed-off-by: Alex Black <blacka101@gmail.com> * Small formatting improvement Signed-off-by: Alex Black <blacka101@gmail.com> * Formatting tweak Signed-off-by: Alex Black <blacka101@gmail.com> * License headers Signed-off-by: Alex Black <blacka101@gmail.com>	2019-12-19 23:43:58 +11:00
shugeo	e303c06042	Shugeo pad fix3 (#132 ) * Expanding allowed paddings type to 64bit ints also. * Extended to int64 paddings data types for mirror_pad op. Signed-off-by: shugeo <sgazeos@gmail.com>	2019-12-19 13:14:02 +03:00
shugeo	de3c0afdce	Shugeo bicubic fix3 (#130 ) * Added test for resize_bicubic. * Eliminated wasted in test of bicubic resize. * Switched off multithreading explicit. * Eliminated waste comments and conform proper float constants. Signed-off-by: shugeo <sgazeos@gmail.com> * Fixed multithreading with resize_bicubic helper for cpu platform. Signed-off-by: shugeo <sgazeos@gmail.com>	2019-12-19 13:12:35 +03:00
shugeo	fc7c6d4e82	Shugeo roll fix3 (#127 ) * Added tests for roll with scalar shift and axis. * Fixed problem with roll on 1D input with scalar axis and test. * Only cosmetic changes.	2019-12-19 13:10:06 +03:00
Alexander Stoyakin	f5068f3980	Added missing Java ops wrappers (#122 ) * Timeouts added * Added some ops * Ops added * Fixed tests * Minor fix * Some fixes * Digamma added * Small fixes * Timeouts added * Added some ops * Ops added * Fixed tests * Minor fix * Some fixes * Digamma added * Small fixes * Fused batch norm fixes- Signed-off-by: AlexDBlack <blacka101@gmail.com> * Tests switched off. * Added test for resize_bicubic. * Eliminated wasted in test of bicubic resize. * Switched off multithreading explicit. * HsvToRgb and RgbToHsv added * Eliminated waste comments and conform proper float constants. Signed-off-by: shugeo <sgazeos@gmail.com> * Fixed multithreading with resize_bicubic helper for cpu platform. Signed-off-by: shugeo <sgazeos@gmail.com> * ResizeBicubic was fixed. * Some fixes * Fix op name * Validation fixed. * Clarifications for tests * Wrappers and small fixes for new ops.	2019-12-19 20:15:48 +11:00
Alex Black	49955eaafc	Merge pull request #8548 from KonduitAI/mp_update_python_packages python version bump	2019-12-19 10:25:19 +11:00
Max Pumperla	4ffef95a2c	increase	2019-12-18 16:00:30 +01:00
Max Pumperla	e718cc659b	python version bump	2019-12-18 15:56:03 +01:00
Alexandre Boulanger	de3975f088	RL4J: Remove processing done on observations in Policy & Async (#8471 ) * Removed processing from Policy.play() and fixed missing resets Signed-off-by: unknown <aboulang2002@yahoo.com> * Adjusted unit test to check if DQNs have been reset Signed-off-by: unknown <aboulang2002@yahoo.com> * Fixed a couple of problems, added and updated unit tests Signed-off-by: unknown <aboulang2002@yahoo.com> * Removed processing from AsyncThreadDiscrete Signed-off-by: unknown <aboulang2002@yahoo.com> * Fixed a few problems Signed-off-by: unknown <aboulang2002@yahoo.com>	2019-12-18 16:27:05 +09:00
Abdelrauf	e0a9cb6c08	[WIP] HSV,RGB color model conversions (#125 ) * CUDA implementation for hsv_to_rgb and rgb_to_hsv Signed-off-by: raver119 <raver119@gmail.com> * hsv_to_rgb and rgb_to_hsv operations Test coverage: c order 1d, 2d, 3d array Signed-off-by: Abdelrauf <rauf@konduit.ai> * Index check Signed-off-by: Abdelrauf <rauf@konduit.ai> * Suppress Msvc floating point errors Signed-off-by: Abdelrauf <rauf@konduit.ai> * Added Index Check for adjust_saturation and adjust_hue Signed-off-by: Abdelrauf <rauf@konduit.ai> * minor fix Signed-off-by: raver119 <raver119@gmail.com> * Fixes missed Msvc floating narrowing errors Signed-off-by: Abdelrauf <rauf@konduit.ai>	2019-12-17 09:42:09 +03:00
Kohei Tamura	65ef0622ff	Update Japanese translation for Deeplearning4J UI (#8525 ) Signed-off-by: k-tamura <ktamura.biz.80@gmail.com>	2019-12-17 12:37:07 +09:00
Alex Black	bfd9e3692a	Add op counting to TensorFlowImportValidator (#128 ) * Add op counting to TensorFlowImportValidator Signed-off-by: AlexDBlack <blacka101@gmail.com> * Test tweak Signed-off-by: AlexDBlack <blacka101@gmail.com>	2019-12-17 10:23:37 +11:00
raver119	bd4f77c652	Compilation units (#124 ) * IndexReduce and Reduce3 split into few units Signed-off-by: raver119 <raver119@gmail.com> * IndexReductionLoops split as well Signed-off-by: raver119 <raver119@gmail.com> * reduce_float split as well Signed-off-by: raver119 <raver119@gmail.com>	2019-12-14 21:59:37 +02:00
raver119	fdda0221ed	[WIP] CUDA build (#109 ) * working prototype of new CUDA build with cmake Signed-off-by: raver119 <raver119@gmail.com> * get rid of older stuff Signed-off-by: raver119 <raver119@gmail.com> * remove legacy CUDA debug section Signed-off-by: raver119 <raver119@gmail.com> * fPIC for GCC Signed-off-by: raver119 <raver119@gmail.com> * - switch to /MD - make MSVC runtime lib configurable from 1 place Signed-off-by: raver119 <raver119@gmail.com> * few last tweaks Signed-off-by: raver119 <raver119@gmail.com> * make static library optional Signed-off-by: raver119 <raver119@gmail.com> * typo fixed Signed-off-by: raver119 <raver119@gmail.com>	2019-12-14 14:38:17 +03:00
raver119	f78a638c9a	Merge pull request #8532 from eclipse/release/1.0.0-beta6 Release 1.0.0-beta6	2019-12-14 10:58:08 +03:00
Serhii Shepel	c98390a27a	Update to version 1.0.0-SNAPSHOT Signed-off-by: Serhii Shepel <9946053+sshepel@users.noreply.github.com>	2019-12-13 17:26:41 +02:00
Serhii Shepel	9cf197c243	Update to version 1.0.0-beta6 Signed-off-by: Serhii Shepel <9946053+sshepel@users.noreply.github.com>	2019-12-13 17:26:18 +02:00
Alex Black	4bcc1eee11	Merge pull request #8529 from KonduitAI/ab_jackson nd4j-jackson: exclude java.xml.stream.XML*Factory from service loader	2019-12-13 22:13:47 +11:00
AlexDBlack	0ab39a2274	nd4j-jackson: exclude java.xml.stream.XML*Factory from service loader to avoid clashes with other non-shaded jackson etc on classpath Signed-off-by: AlexDBlack <blacka101@gmail.com>	2019-12-13 21:41:28 +11:00
Alex Black	228d06fbb0	Merge pull request #8526 from KonduitAI/master Merge	2019-12-12 18:20:40 +11:00
Fariz Rahman	709295281e	suppress err (#123 )	2019-12-12 09:40:29 +03:00

710 Commits