cavis

Author	SHA1	Message	Date
Samuel Audet	1c89512ec0	Add Maven profiles for ARM builds to pom.xml files (#265 ) * Add Maven profiles for ARM builds to pom.xml files Signed-off-by: Samuel Audet <samuel.audet@gmail.com> * Remove mkl from dependencies when running on non intel/amd platforms * Downgrade openblas for now * Change back to 0.3.8 Co-authored-by: Adam Gibson <1144306+agibsonccc@users.noreply.github.com>	2020-03-04 11:11:01 +03:00
raver119	f990b2486d	simplified addBias2D for CUDA (#285 ) Signed-off-by: raver119 <raver119@gmail.com>	2020-03-04 09:50:55 +03:00
raver119	11d148a5eb	get back to single-byte reads Signed-off-by: raver119 <raver119@gmail.com>	2020-03-04 08:14:39 +03:00
Fariz Rahman	fec620fafa	TensorflowConversion Data Types (#284 ) * dtypes * bf16 and bool * tests	2020-03-04 11:46:32 +11:00
raver119	d9cfa8073f	bigger reads Signed-off-by: raver119 <raver119@gmail.com>	2020-03-03 14:19:55 +03:00
raver119	ebee7687e8	mkldnn version upgrade Signed-off-by: raver119 <raver119@gmail.com>	2020-03-03 08:57:02 +03:00
Yurii Shyrma	78934c17ad	profiling of stack and unstack ops (#261 ) * - profiling of stack and unstack ops Signed-off-by: Yurii <iuriish@yahoo.com> * - fix bug in cpu concat op Signed-off-by: Yurii <iuriish@yahoo.com> * - correction of cuda stack and unstack Signed-off-by: Yurii <iuriish@yahoo.com> * - change shape.h method which operates with unity dimensions strides Signed-off-by: Yurii <iuriish@yahoo.com> * - rearrange stack tests Signed-off-by: Yurii <iuriish@yahoo.com> * - correct evaluation of smallest stride for moving through contiguous axis Signed-off-by: Yurii <iuriish@yahoo.com> * - forgot to update signature of function strideOverContigAxis in cuda concat and split ops Signed-off-by: Yurii <iuriish@yahoo.com> * - remove ShapeUtils::shapeAsString method applied before input arrays validations Signed-off-by: Yurii <iuriish@yahoo.com> * - further removing of ShapeUtils::shapeAsString Signed-off-by: Yurii <iuriish@yahoo.com> * - take sub-array shapeIndo/offset calculation out of NDArray class - add possibility of contiguous memory copy in execTransformAny op if opNum == assign Signed-off-by: Yurii <iuriish@yahoo.com> * - correct test_empty_scatter_2 in EmptyTests.cpp Signed-off-by: Yurii <iuriish@yahoo.com> * - profiling of slice op Signed-off-by: Yurii <iuriish@yahoo.com> * - get rid of contiguous memcpy for some cases in concat and split ops Signed-off-by: Yurii <iuriish@yahoo.com> * - forgot to declare oid nd4j::SpecialMethods<T>::splitCpuGeneric Signed-off-by: Yurii <iuriish@yahoo.com> * - correct typo in calculation of threads in cuda split op Signed-off-by: Yurii <iuriish@yahoo.com> * - forgot to correct another set of threads variables in split cuda ops Signed-off-by: Yurii <iuriish@yahoo.com> * - further conflicts resolving Signed-off-by: Yurii <iuriish@yahoo.com> Co-authored-by: raver119 <raver119@gmail.com>	2020-03-03 07:32:37 +03:00
raver119	0f581e74e3	one small test rearrangement Signed-off-by: raver119 <raver119@gmail.com>	2020-03-02 19:52:11 +03:00
raver119	c54cdaab75	full bert graph (#282 ) Signed-off-by: raver119 <raver119@gmail.com>	2020-03-02 18:14:32 +03:00
raver119	63fa3c2ef3	libnd4j polishing (#273 ) * initial set of include changes Signed-off-by: raver119 <raver119@gmail.com> * one more tweak Signed-off-by: raver119 <raver119@gmail.com> * few more rearrangements Signed-off-by: raver119 <raver119@gmail.com> * few more rearrangements Signed-off-by: raver119 <raver119@gmail.com> * few more rearrangements Signed-off-by: raver119 <raver119@gmail.com> * cuda includes rearrangements Signed-off-by: raver119 <raver119@gmail.com> * java update Signed-off-by: raver119 <raver119@gmail.com> * = namespace changed to sd - few CMake variables renamed with SD_ prefix Signed-off-by: raver119 <raver119@gmail.com> * java update Signed-off-by: raver119 <raver119@gmail.com> * LoopKind minor fix Signed-off-by: raver119 <raver119@gmail.com> * few more changes Signed-off-by: raver119 <raver119@gmail.com> * few more changes Signed-off-by: raver119 <raver119@gmail.com> * few more changes Signed-off-by: raver119 <raver119@gmail.com> * sanitizer is optional now Signed-off-by: raver119 <raver119@gmail.com> * dev tests updated Signed-off-by: raver119 <raver119@gmail.com> * few more changes Signed-off-by: raver119 <raver119@gmail.com> * last update Signed-off-by: raver119 <raver119@gmail.com> * java update Signed-off-by: raver119 <raver119@gmail.com>	2020-03-02 12:49:41 +03:00
Alex Black	483c3d7b8c	Assorted SameDiff/DL4J fixes (#279 ) * #8565 Normalizer toString/hashcode Signed-off-by: Alex Black <blacka101@gmail.com> * #8731 ImagePreProcessingScaler lables/segmentation fix Signed-off-by: Alex Black <blacka101@gmail.com> * #8691 Fix SameDiffLayer/Vertx finetuning and parameter setting support Signed-off-by: Alex Black <blacka101@gmail.com> * #8663 DL4J embedding layer weight init - don't depend on vocab size Signed-off-by: Alex Black <blacka101@gmail.com> * EmbeddingLayer test tweak Signed-off-by: Alex Black <blacka101@gmail.com>	2020-03-02 16:15:49 +11:00
Oleh	f116f53d61	Loops auto-vectorization problem fix (#277 ) * libnd4j cast loop types Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j more type castination added to loops Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j sync casting types of iterated variable in loops Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j more loops reviewed for vectorization problem fix Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j fixed several typos Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j several more files reviewed to fix auto-vectorization problem in loops Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j merge master and reviewed more files to fix auto-vectorization problem in loops Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j several type casting added in broadcasting that were missed, fixed mac builds Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j double check all files and fix several more places in loops Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j fixed builds Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j revert changes for lup.cpp Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j more files reviewed for auto-vectorization problem fix Signed-off-by: Oleg <oleg.semeniv@gmail.com>	2020-02-28 17:04:45 +03:00
raver119	5332ace32b	better inplace exec with FastPath (#280 ) Signed-off-by: raver119 <raver119@gmail.com>	2020-02-28 12:06:30 +03:00
shugeo	330a69d4e2	Shugeo solve ls (#203 ) * lstsq op. Initial commit. Signed-off-by: shugeo <sgazeos@gmail.com> * Least squares linear problem solve op (lstsq). Cpu draft implementation. Signed-off-by: shugeo <sgazeos@gmail.com> * Fixed shape routine and tests. Signed-off-by: shugeo <sgazeos@gmail.com> * Added test for lstsq op. Signed-off-by: shugeo <sgazeos@gmail.com> * Rectification for lstsq op implementation. Signed-off-by: shugeo <sgazeos@gmail.com> * Corrected test to avoid numerical inconsistensy. Signed-off-by: shugeo <sgazeos@gmail.com> * Added prints for check computing. Signed-off-by: shugeo <sgazeos@gmail.com> * Corrected tests to use evalueate facility instead. Signed-off-by: shugeo <sgazeos@gmail.com> * CPU implementation of MatrixSolveLs op and tests. Signed-off-by: shugeo <sgazeos@gmail.com> * Added cuda implementation for helpers with lstsq op. Signed-off-by: shugeo <sgazeos@gmail.com> * Refactored tests for lstsq op. Signed-off-by: shugeo <sgazeos@gmail.com> * Added processing for empty inputs. Signed-off-by: shugeo <sgazeos@gmail.com> * Merged tests. Signed-off-by: shugeo <sgazeos@gmail.com> * Refactored lstsq op for fast case. Signed-off-by: shugeo <sgazeos@gmail.com> * Fixed test. Signed-off-by: shugeo <sgazeos@gmail.com> * Refactored lstsq op. Signed-off-by: shugeo <sgazeos@gmail.com> * Fixed some issues with solve. Signed-off-by: shugeo <sgazeos@gmail.com> * Fixed lstsq op to avoid erros. Signed-off-by: shugeo <sgazeos@gmail.com> * Added kernel for giagonal factor Signed-off-by: shugeo <sgazeos@gmail.com> * lstsq wrapper and triangular_solve fixed * Added proper processing empty inputs and test. Signed-off-by: shugeo <sgazeos@gmail.com> * SequenceMask test * Build fixed * Added proper processing of empty inputs with solve op. Signed-off-by: shugeo <sgazeos@gmail.com> * Mapping added * Added check of input shapes with solve op. Signed-off-by: shugeo <sgazeos@gmail.com> * Added a couple of tests for lstsq op and minor changes with cuda helper for one.' Signed-off-by: shugeo <sgazeos@gmail.com> * Tests on * Refactored test for lstsq op. Signed-off-by: shugeo <sgazeos@gmail.com> * Fixed test * Added another approach for lstsq op aka solve_ls. Signed-off-by: shugeo <sgazeos@gmail.com> * Finished cpu part for solve_ls op helpers. * Added helper for low triangular matrix inversion. Signed-off-by: shugeo <sgazeos@gmail.com> * Refactored alternate solve_ls cpu implementation. Signed-off-by: shugeo <sgazeos@gmail.com> * Removed alternate approach for solve_ls op. Added multithreading with matrix inversion. Signed-off-by: shugeo <sgazeos@gmail.com> * Assert fixed * Refactored multithreading for inverse matricies. Signed-off-by: shugeo <sgazeos@gmail.com> Co-authored-by: Alexander Stoyakin <alexander.stoyakin@gmail.com>	2020-02-28 11:37:26 +03:00
raver119	358c650b62	one micro fix Signed-off-by: raver119 <raver119@gmail.com>	2020-02-27 19:28:26 +03:00
raver119	31e3a2f7a5	transparent conversion to FastPath execution within Graph (#278 ) Signed-off-by: raver119 <raver119@gmail.com>	2020-02-27 16:10:38 +03:00
Alexander Stoyakin	353f901c7c	[WIP] Handle missing functionality for binary models (#269 ) * Handle missing functionality for binary models * Exception text fixed	2020-02-27 12:00:15 +11:00
Oleh	b4575d11e9	Loops auto-vectorization problem fix (#274 ) * libnd4j cast loop types Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j more type castination added to loops Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j sync casting types of iterated variable in loops Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j more loops reviewed for vectorization problem fix Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j fixed several typos Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j several more files reviewed to fix auto-vectorization problem in loops Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j merge master and reviewed more files to fix auto-vectorization problem in loops Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j several type casting added in broadcasting that were missed, fixed mac builds Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j double check all files and fix several more places in loops Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j fixed builds Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j revert changes for lup.cpp Signed-off-by: Oleg <oleg.semeniv@gmail.com>	2020-02-26 21:12:19 +03:00
raver119	5c806d2fb5	reshape tweak (#275 ) * - expand dims tweak - reshape memcpy Signed-off-by: raver119 <raver119@gmail.com> * validation fix Signed-off-by: raver119 <raver119@gmail.com>	2020-02-26 14:05:32 +03:00
Oleh	b686368b82	Refactoring split operation (#266 ) * libnd4j moved split operation implementation to helpers before special case adding Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j minor fixes for general split operation move, merge master Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libndj4 split cpu implementation Signed-off-by: Oleg <oleg.semeniv@gmail.com> * - provide cuda helper for split op Signed-off-by: Yurii <iuriish@yahoo.com> * - minor correction Signed-off-by: Yurii <iuriish@yahoo.com> * - minor correction 2 Signed-off-by: Yurii <iuriish@yahoo.com> * libnd4j moved split implementation from specials to split.cpp Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j update loopkind selections for 3D, 4D and 5D cases Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j removed unnecessary BUILD_SINGLE_TEMPLATE Signed-off-by: Oleg <oleg.semeniv@gmail.com> Co-authored-by: Yurii Shyrma <iuriish@yahoo.com>	2020-02-26 10:20:39 +03:00
raver119	cf67c7165a	nano fix Signed-off-by: raver119 <raver119@gmail.com>	2020-02-25 15:20:51 +03:00
raver119	f6442b6724	few minor tweaks (#272 ) Signed-off-by: raver119 <raver119@gmail.com>	2020-02-25 11:13:23 +03:00
raver119	241ed05c64	VariableSpace uses unordered maps as well (#270 ) Signed-off-by: raver119 <raver119@gmail.com>	2020-02-24 21:58:23 +03:00
Oleh	f0706b21aa	Split operation improvement (#262 ) * libnd4j moved split operation implementation to helpers before special case adding Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j minor fixes for general split operation move, merge master Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libndj4 split cpu implementation Signed-off-by: Oleg <oleg.semeniv@gmail.com> * - provide cuda helper for split op Signed-off-by: Yurii <iuriish@yahoo.com> * - minor correction Signed-off-by: Yurii <iuriish@yahoo.com> * - minor correction 2 Signed-off-by: Yurii <iuriish@yahoo.com> Co-authored-by: Yurii Shyrma <iuriish@yahoo.com> Co-authored-by: raver119 <raver119@gmail.com>	2020-02-24 08:22:41 +03:00
shugeo	1bb3ae4b03	Shugeo unordered map (#256 ) * Refactored usage of std::map to std::unordered_map instead. Signed-off-by: shugeo <sgazeos@gmail.com> * Eliminated crash with wrong ShapeDescriptor hash. Signed-off-by: shugeo <sgazeos@gmail.com> * Eliminated crash with TadDescriptor hash. Signed-off-by: shugeo <sgazeos@gmail.com> * Refactored Stash hash. Signed-off-by: shugeo <sgazeos@gmail.com> * Refactored hashes. Signed-off-by: shugeo <sgazeos@gmail.com> * Refactored TadDescriptor hash and top_k mapping. * Refactored hashes for ShapeDescriptor and TadDescriptor classes. Signed-off-by: shugeo <sgazeos@gmail.com> * Refactored hash for ConstantDescriptor and ShapeDescriptor classes. Signed-off-by: shugeo <sgazeos@gmail.com> * Fixed map using with cuda platform. Signed-off-by: shugeo <sgazeos@gmail.com> * - few rearrangements for hash functions - shared openblas allowed Signed-off-by: raver119 <raver119@gmail.com> * exports Signed-off-by: raver119 <raver119@gmail.com> * exports Signed-off-by: raver119 <raver119@gmail.com> * Stash reverted to std::map Signed-off-by: raver119@gmail.com <raver119@gmail.com> * Added additional test. Signed-off-by: shugeo <sgazeos@gmail.com> * different maps for different compilers Signed-off-by: raver119 <raver119@gmail.com> * missing include Signed-off-by: raver119 <raver119@gmail.com> * fix the leak Signed-off-by: raver119 <raver119@gmail.com> Co-authored-by: raver119 <raver119@gmail.com>	2020-02-24 07:51:01 +03:00
Alex Black	a0ed5487ca	Upgrade openblas version to 0.3.8 (#264 ) Signed-off-by: Alex Black <blacka101@gmail.com>	2020-02-22 23:42:52 +11:00
raver119	e78be14cc1	arm fix (#260 ) * range check for scalar_int Signed-off-by: raver119 <raver119@gmail.com> * no simd Signed-off-by: raver119 <raver119@gmail.com> * no ops Signed-off-by: raver119 <raver119@gmail.com> * cyclic shift? Signed-off-by: raver119 <raver119@gmail.com> * left split Signed-off-by: raver119 <raver119@gmail.com> * left split Signed-off-by: raver119 <raver119@gmail.com> * rot ops unrolled templates Signed-off-by: raver119 <raver119@gmail.com> * no rotl/rotr for uint64 Signed-off-by: raver119 <raver119@gmail.com> * no rotl/rotr for uint64 2 Signed-off-by: raver119 <raver119@gmail.com> * no rotl/rotr for uint64 3 Signed-off-by: raver119 <raver119@gmail.com> * ARM_BUILD declared Signed-off-by: raver119 <raver119@gmail.com>	2020-02-21 14:31:00 +03:00
Alex Black	e4ddf109c3	Merge pull request #8723 from KonduitAI/master Merge recent development updates	2020-02-21 20:00:35 +11:00
Alex Black	d19dbb955c	Merge remote-tracking branch 'eclipse/master'	2020-02-21 19:57:07 +11:00
Oleh	0748c7e7c2	Oleh broadcast4d (#257 ) * libnd4j raw implementation of native broadcast for special cases Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j fixed bugs for special case of 4D loop broadcast, add some tests, need more testing and discussion Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j added 3D and 5D cases support and tests, need testing with different orders Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j correctd case selection for broadcast 3,4,5D loops, fixed several places for more stable behavior, clean up Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j minor corrections to avoid some risks in strides selection, added tests and rename some variables Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j optimize usage the stride selection for all loops in separate ShapeUtils method copyCertainStridesFromShapeInfo, merge master Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j remove per request several tests for 3D, 4D and 5D broadcast loops Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j removed some loac changes that had not been sync with serve playground, turn on new loops usage	2020-02-21 07:46:05 +03:00
Yurii Shyrma	f7a9190407	profiling of concat op (both cuda and cpu) (#151 ) * - profiling of concat op (both cuda and cpu) Signed-off-by: Yurii <iuriish@yahoo.com> * better comparison for large concat Signed-off-by: raver119 <raver119@gmail.com> * - further improving of concat op Signed-off-by: Yurii <iuriish@yahoo.com> * some loggin Signed-off-by: raver119 <raver119@gmail.com> * - add possibility to verify presence of trailing unities in shape and set strides/ews correspondingly - restrict second simple case in concat op to c order only Signed-off-by: Yurii <iuriish@yahoo.com> * - move concat op to specials_single.cpp file Signed-off-by: Yurii <iuriish@yahoo.com> * - get rid of second concat op declaration in transforms.cpp file Signed-off-by: Yurii <iuriish@yahoo.com> Co-authored-by: raver119 <raver119@gmail.com>	2020-02-20 21:19:01 +03:00
raver119	215641ea9e	Minor improvements (#255 ) * static increments in loops Signed-off-by: raver119 <raver119@gmail.com> * specials and concat split into separate units Signed-off-by: raver119 <raver119@gmail.com>	2020-02-20 11:43:26 +03:00
Samuel Audet	b119631e60	Update version of the JavaCPP Presets for Gym Signed-off-by: Samuel Audet <samuel.audet@gmail.com>	2020-02-20 15:48:37 +09:00
Serhii Shepel	d9058b469a	Add classifier property for dl4j-test-resources (#249 )	2020-02-19 15:31:21 +02:00
Yurii Shyrma	c5193ecb81	Shyrma gather (#254 ) * - profiling gather op for aurora Signed-off-by: Yurii <iuriish@yahoo.com> * - include contiguous memcpy in gather op Signed-off-by: Yurii <iuriish@yahoo.com>	2020-02-19 09:35:52 +03:00
Samuel Audet	1efe2dc846	RL4J: Use Py_AddPath() instead of Py_SetPath() in GymEnv (issue #8688 ) Also update versions of JavaCPP Presets for OpenBLAS and Gym Signed-off-by: Samuel Audet <samuel.audet@gmail.com>	2020-02-19 00:31:12 +09:00
Abdelrauf	72f9cda019	Added missing bfloat16 (#252 ) Signed-off-by: AbdelRauf <rauf@konduit.ai>	2020-02-18 17:01:43 +03:00
raver119	da39a63c9b	one more bert-like test Signed-off-by: raver119 <raver119@gmail.com>	2020-02-18 11:20:38 +03:00
Yurii Shyrma	22c7aa9acf	Shyrma mkl matmul (#250 ) * - provide matmul code based on mkl api Signed-off-by: Yurii <iuriish@yahoo.com> * - correct typo in mkl matmul op Signed-off-by: Yurii <iuriish@yahoo.com> * - take into account empty arrays in mkl matmul op Signed-off-by: Yurii <iuriish@yahoo.com> * - fix bug in mkl matmul and group all matmul tests in one file Signed-off-by: Yurii <iuriish@yahoo.com>	2020-02-18 08:58:01 +03:00
Alex Black	c8882cbfa5	Test fixes + cleanup (#245 ) * Test spam reduction Signed-off-by: Alex Black <blacka101@gmail.com> * Arbiter bad import fixes Signed-off-by: Alex Black <blacka101@gmail.com> * Small spark test tweak Signed-off-by: AlexDBlack <blacka101@gmail.com> * Arbiter test log spam reduction Signed-off-by: Alex Black <blacka101@gmail.com> * More test spam reduction Signed-off-by: Alex Black <blacka101@gmail.com>	2020-02-18 10:29:06 +11:00
raver119	2698fbf541	Broadcast perf improvements (#248 ) * broadcast as scalar edge case Signed-off-by: raver119 <raver119@gmail.com> * missing return Signed-off-by: raver119 <raver119@gmail.com> * few fixes Signed-off-by: raver119 <raver119@gmail.com> * one more fix Signed-off-by: raver119 <raver119@gmail.com> * no need for lambdas Signed-off-by: raver119 <raver119@gmail.com>	2020-02-17 16:25:09 +03:00
raver119	f9d51b7278	More compilation units (#246 ) * weird edge case Signed-off-by: raver119 <raver119@gmail.com> * weird edge case Signed-off-by: raver119 <raver119@gmail.com> * get rid of it Signed-off-by: raver119 <raver119@gmail.com> * crop and resize reorganized Signed-off-by: raver119 <raver119@gmail.com> * restore test Signed-off-by: raver119 <raver119@gmail.com> * remove unwanted unit refs in cmale Signed-off-by: raver119 <raver119@gmail.com>	2020-02-17 10:23:05 +03:00
Yurii Shyrma	011c272fde	Shyrma transpose (#244 ) * - provide contiguous strides for ouput in transpose op Signed-off-by: Yurii <iuriish@yahoo.com> * - provide contiguous strides for output in permute op Signed-off-by: Yurii <iuriish@yahoo.com> * - take into account empty shapes properly in transpose/permute op Signed-off-by: Yurii <iuriish@yahoo.com>	2020-02-17 08:04:28 +03:00
raver119	9e3c1b02b1	Perf improvements (#242 ) * initial commit Signed-off-by: raver119 <raver119@gmail.com> * meh Signed-off-by: raver119 <raver119@gmail.com> * better ExpandDims impl Signed-off-by: raver119 <raver119@gmail.com> * better Squeeze impl Signed-off-by: raver119 <raver119@gmail.com> * better Softmax impl Signed-off-by: raver119 <raver119@gmail.com> * one test disabled Signed-off-by: raver119 <raver119@gmail.com> * more accurate impl Signed-off-by: raver119 <raver119@gmail.com> * - GraphProfiler now prints full shapeInfo instead of shape - softmax typo fix Signed-off-by: raver119 <raver119@gmail.com>	2020-02-14 16:20:31 +03:00
Oleh	6e6289b6b9	Oleh bert multiply true broad cast (#239 ) * libnd4j trueBroadcast rank 3 row implementation of special case Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j rule clarify for second special case for all tests pass * libnd4j parallel_tad loop switch on in special case * libnd4j more general case for special case 2, need additional testing Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j more general case for trueBroadcast special cases added * libnd4j minor corrections and clean up * libnd4j one more minor fix Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j fixed check point to support all Y common vector representations in first special case for trueBroadcast Signed-off-by: Oleg <oleg.semeniv@gmail.com> Co-authored-by: raver119 <raver119@gmail.com>	2020-02-14 12:04:38 +03:00
Alexander Stoyakin	4206171b70	Ignored tests (#243 )	2020-02-14 09:27:46 +03:00
Shams Ul Azeem	f165160edb	MultiLayerConfiguration should be ComputationGraphConfiguration for validating a cg model conf (#240 )	2020-02-14 12:06:00 +11:00
Andrii T	c34790f932	Copied and pasted RegressionTest100b4.java to RegressionTest100b6.jav… (#215 ) * Copied and pasted RegressionTest100b4.java to RegressionTest100b6.java with renamed b4->b6 * assertEquals > assertTrue for half dtype Signed-off-by: atuzhykov <andrewtuzhukov@gmail.com>	2020-02-14 11:53:35 +11:00
raver119	3de3cd8277	R119 tests (#238 ) * one small test Signed-off-by: raver119 <raver119@gmail.com> * one small test Signed-off-by: raver119 <raver119@gmail.com> * bert test Signed-off-by: raver119 <raver119@gmail.com> * Graph FlowPath fix Signed-off-by: raver119 <raver119@gmail.com> * - GraphProfiler tweaks - NodeProfile now includes shapes Signed-off-by: raver119 <raver119@gmail.com> * RELU_layer inplace tweak Signed-off-by: raver119 <raver119@gmail.com> * meh Signed-off-by: raver119 <raver119@gmail.com> * identity tweaks Signed-off-by: raver119 <raver119@gmail.com> * bert result validation Signed-off-by: raver119 <raver119@gmail.com> * - bunch of Shape ops have inplace exec forbidden now - Legacy ops have inplace exec disabled by default now Signed-off-by: raver119 <raver119@gmail.com> * ffast-math enabled Signed-off-by: raver119 <raver119@gmail.com> * ffast-math enabled Signed-off-by: raver119 <raver119@gmail.com> * allow some legacy ops to be inplace Signed-off-by: raver119 <raver119@gmail.com> * disable -fast_math Signed-off-by: raver119 <raver119@gmail.com> * disable expensive test for cuda Signed-off-by: raver119 <raver119@gmail.com>	2020-02-13 20:59:35 +03:00
Yurii Shyrma	fe47f52896	Oleh tenzor mmul (#231 ) * Libnd4j: TensorMMul backprop op #8174, raw implementation Signed-off-by: Oleg <oleg.semeniv@gmail.com> * Libnd4j: TensorMMul backprop op #8174 merge master and some corrections Signed-off-by: Oleg <oleg.semeniv@gmail.com> * Libnd4j: TensorMMul backprop op #8174 algorithm update, need testing, sync with master * Libnd4j: TensorMMul backprop op #8174 fixed incorrect B axes calculation Signed-off-by: Oleg <oleg.semeniv@gmail.com> * Libnd4j: TensorMMul backprop op #8174 optimize axes identification and fix bug of indeces overlapping, added first test. need testing with different shapes Signed-off-by: Oleg <oleg.semeniv@gmail.com> * Libnd4j: TensorMMul backprop op #8174 some fixes and improvements need more testing Signed-off-by: Oleg <oleg.semeniv@gmail.com> * Libnd4j: TensorMMul backprop op #8174 fixed order of matrix multiply Signed-off-by: Oleg <oleg.semeniv@gmail.com> * Libnd4j: TensorMMul backprop op #8174 fixed issue of incorrect axes definition, add tests based on TF, need additional testing for case dLdC not equal 1 Signed-off-by: Oleg <oleg.semeniv@gmail.com> * Libnd4j: TensorMMul backprop op #8174 fixed scalar case add test Signed-off-by: Oleg <oleg.semeniv@gmail.com> * Libnd4j: TensorMMul backprop op #8174 fixed bp algorithm, axes definition, need some mode testing with different orders combination f,c; c,f f,f and add some checks for inputs Signed-off-by: Oleg <oleg.semeniv@gmail.com> * Libnd4j: TensorMMul backprop op #8174 some checks and corrections added tests, exists the problem with different input orders support A-f B-c and A-f B-f Signed-off-by: Oleg <oleg.semeniv@gmail.com> * Libnd4j: TensorMMul backprop op #8174 sync master Signed-off-by: Oleg <oleg.semeniv@gmail.com> * - correct bug in MmulHelper::tensorDot(a, b, c, axes_a, axes_b,permutForC) Signed-off-by: Yurii <iuriish@yahoo.com> * Libnd4j: TensorMMul backprop op #8174 code clean up and refactoring Signed-off-by: Oleg <oleg.semeniv@gmail.com> * - add check for linspase ordered permutations in ShapeUtils::evalShapeForTensorDot Signed-off-by: Yurii <iuriish@yahoo.com> * - provide additional code in shape::reshape stuff in order to reduce amount of allocation/copy operations during reshaping procedure Signed-off-by: Yurii <iuriish@yahoo.com> * - further work on problem of wrong shape evaluation during permute/reshape procedures Signed-off-by: Yurii <iuriish@yahoo.com> * - still looking for bug reason in reshape/permute stuff Signed-off-by: Yurii <iuriish@yahoo.com> * - correct bug in transform cuda native ops Signed-off-by: Yurii <iuriish@yahoo.com> * - correct bug in NDArray::assign Signed-off-by: Yurii <iuriish@yahoo.com> * - remove old shape::reshape stuff Signed-off-by: Yurii <iuriish@yahoo.com> * - add possibility to disable copy of old buffer to new buffer during reshape operation in NDArray class Signed-off-by: Yurii <iuriish@yahoo.com> * - correct bug in tensorDot which had to do with wrong pointers assigments Signed-off-by: Yurii <iuriish@yahoo.com> Co-authored-by: Oleh <oleg.semeniv@gmail.com>	2020-02-13 20:33:54 +03:00

780 Commits