cavis

Author	SHA1	Message	Date
raver119	b23ebee432	- MKL-DNN version upgrade - deviceMutex replaced for CPU Signed-off-by: raver119 <raver119@gmail.com>	2020-03-20 12:42:29 +03:00
Yurii Shyrma	e700b59f80	Shyrma weights format (#329 ) * - start to introduce additional weights formats into conv2d ops Signed-off-by: Yurii <iuriish@yahoo.com> * - provide weights format variety in backprop conv2d and deconv2d ops, testing and fixing bugs Signed-off-by: Yurii <iuriish@yahoo.com> * - forgot to recover kernels sizes in deconv2d_bp test Signed-off-by: Yurii <iuriish@yahoo.com> * - built in weights format in depthwise conv 2d op Signed-off-by: Yurii <iuriish@yahoo.com> * - provide new weights formats in mkl dnn conv ops Signed-off-by: Yurii <iuriish@yahoo.com> * - provide new weights formats in cuda conv helpers Signed-off-by: Yurii <iuriish@yahoo.com> * - working with new weights format in cudnn conv api Signed-off-by: Yurii <iuriish@yahoo.com> * - take into account order of arrays in cudnn tensor descriptions Signed-off-by: Yurii <iuriish@yahoo.com> * - provide new weights formats in cpu conv3d (ff/bp) Signed-off-by: Yurii <iuriish@yahoo.com> * - provide new weights formats in cpu deconv3d (ff/bp) Signed-off-by: Yurii <iuriish@yahoo.com> * - provide new weights formats in conv3d ops (ff/bp) based on mkl api Signed-off-by: Yurii <iuriish@yahoo.com> * - provide new weights formats in conv3d ops (ff/bp) based on cudnn api Signed-off-by: Yurii <iuriish@yahoo.com> * - resolve conflicts 2 Signed-off-by: Yurii <iuriish@yahoo.com> Co-authored-by: raver119 <raver119@gmail.com>	2020-03-20 12:11:27 +03:00
shugeo	5dae4069cf	Shugeo random expo fix2 (#295 ) * Refactored exponential distribution implementation. Signed-off-by: shugeo <sgazeos@gmail.com> * Refactored exponential distribution and tests. Signed-off-by: shugeo <sgazeos@gmail.com> * Refactored test to new result sets. Signed-off-by: shugeo <sgazeos@gmail.com>	2020-03-20 11:33:20 +03:00
Alex Black	2497290cb0	AdaGrad validation test (#334 ) Signed-off-by: Alex Black <blacka101@gmail.com>	2020-03-20 17:25:46 +11:00
raver119	7a2ac800dd	Nullify (#304 ) * initial commit Signed-off-by: raver119 <raver119@gmail.com> * bunch of tweaks Signed-off-by: raver119 <raver119@gmail.com> * hamming distance nullification Signed-off-by: raver119 <raver119@gmail.com> * Add output array value assignment for testing/debugging Signed-off-by: Alex Black <blacka101@gmail.com> * don't assign empty arrays Signed-off-by: raver119 <raver119@gmail.com> * conv2d/conv3d/depthwise2d nullified Signed-off-by: raver119 <raver119@gmail.com> * conv2d/conv3d/depthwise2d nullified Signed-off-by: raver119 <raver119@gmail.com> * conv2d/conv3d/depthwise2d nullified Signed-off-by: raver119 <raver119@gmail.com> * few more fixes Signed-off-by: raver119 <raver119@gmail.com> * im2col Signed-off-by: raver119 <raver119@gmail.com> * pooling? Signed-off-by: raver119 <raver119@gmail.com> * more nullified Signed-off-by: raver119 <raver119@gmail.com> * ismax nullified Signed-off-by: raver119 <raver119@gmail.com> * rollback ismax nullification Signed-off-by: raver119 <raver119@gmail.com> * synchronized cublas handle use on per-device basis Signed-off-by: raver119 <raver119@gmail.com> * hiding method from jcpp Signed-off-by: raver119 <raver119@gmail.com> * get rid of test assigns in DeclarableOp Signed-off-by: raver119 <raver119@gmail.com> * get rid of assigns Signed-off-by: raver119 <raver119@gmail.com> * proper deviceId is back Signed-off-by: raver119 <raver119@gmail.com> * include fixed Signed-off-by: raver119 <raver119@gmail.com> Co-authored-by: Alex Black <blacka101@gmail.com>	2020-03-20 08:49:28 +03:00
Adam Gibson	30a28fae45	Windows fix (#333 ) * Fix cmake detection in msys * Revert windows change * Update to unix line endings	2020-03-20 12:14:03 +09:00
Adam Gibson	0cf4a45573	Fixes #8763 (#310 ) * Fix cmake detection in msys * Fix toolchain file on windows * Make android 64 bit work * Fix libnd4j build script on msys * Update build script for windows/linux * Encoding issue for ci * Update pom.xml * Update pom.xml * Update pom.xml * Remove mingw * Ensure android x86 builds are inline with arm builds * Update toolchains and env variables for x86 * Move profile for build program up to parent * Fix blas vendor and add comment * Update cuda presets version * Set default value and move properties back to child pom * Change program from hard coded to use the script as the program * Update pom.xml * Update pom.xml * Static lib fix * Update static lib output * Get rid of old comments * Update static for buiding	2020-03-19 14:53:21 +09:00
Shams Ul Azeem	9c77bfa85f	Support for more numpy datatypes (#241 ) * Adding more datatypes support in datavec-python * Using numpy C API for creating numpy arrays * Adding parameterized tests * Adding support for BFLOAT16 (by converting it to FLOAT) * Cleanup * Using casting instead of creating an array * Giving out a warning while casting array from BFLOAT16 to FLOAT * Add syncToPrimary and syncToSpecial methods to BaseDataBuffer Signed-off-by: Alex Black <blacka101@gmail.com> * Python exec: sync to host before passing pointers Signed-off-by: Alex Black <blacka101@gmail.com> * Added copyright header * use np api (#267) * python exec / numpy - check object type before cast (#268) * use np api * verify object before cast * fix cong * cuda fix * inplace test + tiny fix * more test * fix double alloc * rem tags * fix cuda check * Fix implicit CUDA dependency in datavec-python tests; remove new method, add test Signed-off-by: Alex Black <blacka101@gmail.com> Co-authored-by: Alex Black <blacka101@gmail.com> Co-authored-by: Fariz Rahman <farizrahman4u@gmail.com>	2020-03-19 00:48:37 +11:00
Samuel Audet	5cd143611e	Merge remote-tracking branch 'eclipse/master' Signed-off-by: Samuel Audet <samuel.audet@gmail.com>	2020-03-18 16:17:14 +09:00
raver119	77244f5496	avg/max pooling3d bp fixed (#323 ) Signed-off-by: raver119 <raver119@gmail.com>	2020-03-16 18:17:42 +03:00
raver119	bac130bd78	CUDA host pointer fix (#322 ) * CUDA fix: host pointer propagation Signed-off-by: raver119 <raver119@gmail.com> * disable logging Signed-off-by: raver119 <raver119@gmail.com>	2020-03-16 12:07:34 +03:00
Alex Black	2cd4522f94	Add updater tests/validation (#319 ) Signed-off-by: Alex Black <blacka101@gmail.com>	2020-03-16 10:35:15 +03:00
raver119	4cf2afad2b	benchmarks fixes (#321 ) * bunch of small fixes Signed-off-by: raver119 <raver119@gmail.com> * validation for legacy random op Signed-off-by: raver119 <raver119@gmail.com> * get rid of test Signed-off-by: raver119 <raver119@gmail.com>	2020-03-16 10:31:06 +03:00
Oleh	e7a995e959	Tanh backpropagation mkldnn implementation (#308 ) * libnd4j first step of tanh_bp operation implementation on mkldnn Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j optimize several places and added test case for tanh_bp Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j minor corrections and renaming, added one more test case Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j missed mkldnn data format definition Signed-off-by: Oleg <oleg.semeniv@gmail.com>	2020-03-13 19:01:00 +03:00
Yurii Shyrma	e42b4e96c3	correct output empty shapes deducing in split op (#311 ) * - correct output empty shapes deducing in split op Signed-off-by: Yurii <iuriish@yahoo.com> * java test fixed Signed-off-by: raver119 <raver119@gmail.com> * - split broadcast::exec function on individual functions corresponding to switch arg Signed-off-by: Yurii <iuriish@yahoo.com> * - split broadcast::exec _int and _bool function on individual functions corresponding to switch arg Signed-off-by: Yurii <iuriish@yahoo.com> Co-authored-by: raver119 <raver119@gmail.com>	2020-03-12 18:25:54 +03:00
Oleh	41bde8f885	Softmax BP mkldnn implementation (#301 ) * libnd4j mkldnn softmax_bp operation implementation and integration, 2 tests added, need some refactoring and code clean up and more testing with different input shapes Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j softmax_bp update, code refactoring, etc Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j merge master, fixed typos, minor tweaks, code clean up Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j integrate mkldnnUtils helpers in other mkldnn operations Signed-off-by: Oleg <oleg.semeniv@gmail.com>	2020-03-12 18:25:29 +03:00
raver119	88f39fad67	pairwise compilation units updated Signed-off-by: raver119 <raver119@gmail.com>	2020-03-12 12:50:09 +03:00
Alex Black	3d7adcf25a	Merge pull request #8717 from eclipse/sa_gym RL4J: Use Py_AddPath() instead of Py_SetPath() in GymEnv (issue #8688)	2020-03-12 13:17:22 +11:00
Samuel Audet	9a5e21d617	Uupdate versions of JavaCPP Presets for HDF5, OpenBLAS, and Gym Signed-off-by: Samuel Audet <samuel.audet@gmail.com>	2020-03-12 07:11:58 +09:00
Yurii Shyrma	ebab6b6410	Shyrma broadcast2 (#309 ) * - profiling broadcast ops for aurora Signed-off-by: Yurii <iuriish@yahoo.com> * - correct loop limit type in shape::haveSameShapeAndStrides Signed-off-by: Yurii <iuriish@yahoo.com>	2020-03-11 17:58:53 +03:00
Yurii Shyrma	58550b7c98	[WIP] Shyrma coords (#305 ) * - provide faster index2coords function for cpu Signed-off-by: Yurii <iuriish@yahoo.com> * - new faster index2coords function is introduced into cpu code Signed-off-by: Yurii <iuriish@yahoo.com> * - replace long long coordinates with int coordinates Signed-off-by: Yurii <iuriish@yahoo.com> * - add missed reload of coords2index function Signed-off-by: Yurii <iuriish@yahoo.com> * - reststart jenkins Signed-off-by: Yurii <iuriish@yahoo.com> * - rollback changes in convolutions.cu and addBias.cu Signed-off-by: Yurii <iuriish@yahoo.com>	2020-03-11 16:21:59 +03:00
raver119	50b7d82b96	more compilation units Signed-off-by: raver119 <raver119@gmail.com>	2020-03-11 11:43:18 +03:00
raver119	a7a97d8259	rl4j: update host pointers content before reading them Signed-off-by: raver119 <raver119@gmail.com>	2020-03-11 10:57:55 +03:00
Alexandre Boulanger	8b10f0b876	RL4J: Add TransformProcess, part 2 (#8766 ) * Part 2 of TransformProcess Signed-off-by: Alexandre Boulanger <aboulang2002@yahoo.com> * Fix compile errors Signed-off-by: Samuel Audet <samuel.audet@gmail.com> * Revert unrelated changes Signed-off-by: Samuel Audet <samuel.audet@gmail.com> Co-authored-by: Samuel Audet <samuel.audet@gmail.com>	2020-03-11 11:56:41 +09:00
Yurii Shyrma	6aaca58506	Shyrma broadcast (#302 ) * - profiling TrueBroadcastHelper Signed-off-by: Yurii <iuriish@yahoo.com> * - further improving of TrueBroadcastHelper Signed-off-by: Yurii <iuriish@yahoo.com> * - further profiling of broadcast op Signed-off-by: Yurii <iuriish@yahoo.com> * - implementation of broadcastShapeHelper which inserts unities in shapes of arrays to be broadcasted Signed-off-by: Yurii <iuriish@yahoo.com> * - provide additional method in ConstantShapeHelper class for deducing broadcast shapes with unities Signed-off-by: Yurii <iuriish@yahoo.com> * - provide new NativeOps helpers for usual and true broadcast methods Signed-off-by: Yurii <iuriish@yahoo.com> * enable bert profiler Signed-off-by: raver119 <raver119@gmail.com> * - delete unnessesary tests Signed-off-by: Yurii <iuriish@yahoo.com> Co-authored-by: raver119 <raver119@gmail.com>	2020-03-10 16:29:09 +03:00
raver119	0faf83b1b6	Merge pull request #8772 from Bam4d/ignore_javacpp_generated Remove the two files that get generated by javacpp to avoid conflicts…	2020-03-10 13:09:37 +03:00
Bam4d	0459cf9525	Remove the two files that get generated by javacpp to avoid conflicts. Also add them to .gitignore Signed-off-by: Bam4d <chrisbam4d@gmail.com>	2020-03-10 10:05:56 +00:00
Oleh	c3223dbc7a	Improve ResultSet usage in libnd4j (#281 ) * libnd4j profiling DeclarableOp and Tests by replacing return ResultSet pointer by instance Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j profiling semantic change in tests cases Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j some corrections to make new ResultSet semantic works, fixed one test Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j more tests fixes Signed-off-by: Oleg <oleg.semeniv@gmail.com> * - correct copy and move assignment operators of ResultSet class Signed-off-by: Yurii <iuriish@yahoo.com> Co-authored-by: Yurii <iuriish@yahoo.com> Co-authored-by: raver119 <raver119@gmail.com>	2020-03-10 07:42:50 +03:00
raver119	57210b936c	Revert "OpenMP Threads execution (#297 )" (#299 ) This reverts commit dd2043ef485a96de3d64563f1eed4c50a8cd72f7.	2020-03-09 08:22:49 +03:00
raver119	dd2043ef48	OpenMP Threads execution (#297 ) * omp threads backported Signed-off-by: raver119 <raver119@gmail.com> * omp scalar reduce Signed-off-by: raver119 <raver119@gmail.com> * timing Signed-off-by: raver119 <raver119@gmail.com> * timing Signed-off-by: raver119 <raver119@gmail.com> * minor tweaks Signed-off-by: raver119 <raver119@gmail.com> * minor tweaks Signed-off-by: raver119 <raver119@gmail.com> * namespace change Signed-off-by: raver119 <raver119@gmail.com> * num_threads Signed-off-by: raver119 <raver119@gmail.com> * one minor fix Signed-off-by: raver119 <raver119@gmail.com>	2020-03-09 08:21:44 +03:00
Andrii T	a2ec3dbc97	Image namespace (#176 ) * created NDImage.java and fixed constructor in AdjustContrast.java * created NDImage.java and fixed constructor in AdjustContrast.java * created NDImage.java and fixed constructor in AdjustContrast.java v2 * regenerated NDImage from cleaned Image,kt also cleaned AdjustContrast.java * draft of NDCNN * draft of NDCNN * started NDRNN * started NDRNN * looking like finished with namespace * Regenerate namespaces Signed-off-by: AlexDBlack <blacka101@gmail.com> * Add ND4J namespace methods for new namespaces Signed-off-by: AlexDBlack <blacka101@gmail.com> * Fixes, cleanup Signed-off-by: Alex Black <blacka101@gmail.com> * More fixes Signed-off-by: Alex Black <blacka101@gmail.com> * Fixes Signed-off-by: Alex Black <blacka101@gmail.com> * Fix Signed-off-by: Alex Black <blacka101@gmail.com> Co-authored-by: Andrii Tuzhykov <andrew@unrealists.com> Co-authored-by: Andrii Tuzhykov <andrew@konduit.ai> Co-authored-by: AlexDBlack <blacka101@gmail.com>	2020-03-09 13:35:17 +11:00
Alex Black	a80fb99a5f	DL4J integrations tests updates + add SameDiff support (#298 ) * Revive and start updating DL4J integration tests Signed-off-by: Alex Black <blacka101@gmail.com> * Add SameDiff support - first pass Signed-off-by: Alex Black <blacka101@gmail.com> * SameDiff test case generation Signed-off-by: Alex Black <blacka101@gmail.com> * SameDiff integration tests polishing Signed-off-by: Alex Black <blacka101@gmail.com> * More SameDiff integration test fixes Signed-off-by: Alex Black <blacka101@gmail.com> * Final polish Signed-off-by: Alex Black <blacka101@gmail.com> * Small test tweak Signed-off-by: Alex Black <blacka101@gmail.com>	2020-03-07 22:44:41 +11:00
Oleh	ead5162c97	Tanh mkldnn implementation (#296 ) * libnd4j first step of softmax mkldnn implementation Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j raw implementation of mkldnn softmax Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j merge master and added softmax to MklDnnTests Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j some corrections for softmax mkldnn Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j merge branch, fixed problem with negative axis, fixed dnnl::memory::format_tag selection, test cases added Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j minor corrections to avoid risk connected with negative axis usage Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j fixed windows builds, added switcher to use mkldnn sofmax version only for 3D, 4D, 5D, 6D arrays Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j fixed dataType selection per request Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j fix for mac and windows builds Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j builds fix Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j first spet of elementwize tanh implementation on mkldnn Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j fixed typo in error message for softmax MKLDNN, test case added, implementation of tanh on MKLDNN, need supported DataType testing Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j several fixes for tanh and temporary performance test added Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j fixed mkldnn platform loader for tanh Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j MklDnn tanh removed unsupported data types, removed performance test case, added more appropriate equivalence test case, code clean up Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j fixed problem with empty input case for MklDnn tanh and softmax Signed-off-by: Oleg <oleg.semeniv@gmail.com>	2020-03-06 17:11:22 +03:00
Alex Black	e6a7b94fe4	Loss namespace (#294 ) * codegen for SDLoss. WIP. Signed-off-by: Robert Altena <Rob@Ra-ai.com> * first pass of SDLoss. Signed-off-by: Robert Altena <Rob@Ra-ai.com> * wip. Firsat cut of new op constructors. UNTESTED , NOT COMPILED YET. Signed-off-by: Robert Altena <Rob@Ra-ai.com> * updated op signatures. Signed-off-by: Robert Altena <Rob@Ra-ai.com> * add NDLoss tests. Signed-off-by: Robert Altena <Rob@Ra-ai.com> * fix test. Signed-off-by: Robert Altena <Rob@Ra-ai.com> * adds loss default params. factory. Signed-off-by: Robert Altena <Rob@Ra-ai.com> * Regenerate NDLoss Signed-off-by: AlexDBlack <blacka101@gmail.com> * adds tests for null weights. Signed-off-by: Robert Altena <Rob@Ra-ai.com> * Last few tweaks Signed-off-by: Alex Black <blacka101@gmail.com> Co-authored-by: Robert Altena <Rob@Ra-ai.com>	2020-03-06 16:07:22 +11:00
Alex Black	7494117e90	#8751 Arbiter grid search candidate generator fix [WIP] (#292 ) * #8751 Arbiter grid search candidate generator fix Signed-off-by: Alex Black <blacka101@gmail.com> * Small fix Signed-off-by: Alex Black <blacka101@gmail.com> * Timeout Signed-off-by: Alex Black <blacka101@gmail.com>	2020-03-06 12:01:21 +11:00
Alex Black	19d5a8d49d	Various fixes (#290 ) * Add check to ensure ALL tests extend BaseND4JTest for proper timeouts + logging Signed-off-by: Alex Black <blacka101@gmail.com> * Add 'must extend BaseDL4JTest' check for deeplearning4j-core Signed-off-by: Alex Black <blacka101@gmail.com> * Flush logging on workspace exit during tests Signed-off-by: Alex Black <blacka101@gmail.com>	2020-03-06 00:02:32 +11:00
raver119	2911da061b	blas fallback (#291 ) Signed-off-by: raver119 <raver119@gmail.com> Co-authored-by: raver119 <raver119@gmail.com>	2020-03-05 14:11:13 +03:00
raver119	784a2d13f8	separate omp impl for softmax (#289 ) Signed-off-by: raver119 <raver119@gmail.com>	2020-03-05 11:14:22 +03:00
raver119	3bb22a6ff8	strided_slice without view (#288 ) Signed-off-by: raver119 <raver119@gmail.com>	2020-03-05 09:56:52 +03:00
raver119	ca96a13ed0	softmax as standalone compilation unit Signed-off-by: raver119 <raver119@gmail.com>	2020-03-05 08:45:10 +03:00
Samuel Audet	58aa5a3a9b	RL4J: Add TransformProcess, part 1 (#8711 ) * Added TransformProcess, part 1 Signed-off-by: unknown <aboulang2002@yahoo.com> * Renamed TemporalMergeTransform to HistoryMergeTransform Signed-off-by: unknown <aboulang2002@yahoo.com> * changed INDArrayHelper to use Nd4j.expandDims Signed-off-by: Alexandre Boulanger <aboulang2002@yahoo.com> * Adjusted copyrights Signed-off-by: unknown <aboulang2002@yahoo.com>	2020-03-05 14:43:13 +09:00
Oleh	4d81af9fe9	Softmax operation implementation for mkldnn (#286 ) * libnd4j first step of softmax mkldnn implementation Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j raw implementation of mkldnn softmax Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j merge master and added softmax to MklDnnTests Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j some corrections for softmax mkldnn Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j merge branch, fixed problem with negative axis, fixed dnnl::memory::format_tag selection, test cases added Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j minor corrections to avoid risk connected with negative axis usage Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j fixed windows builds, added switcher to use mkldnn sofmax version only for 3D, 4D, 5D, 6D arrays Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j fixed dataType selection per request Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j fix for mac and windows builds Signed-off-by: Oleg <oleg.semeniv@gmail.com> * libnd4j builds fix Signed-off-by: Oleg <oleg.semeniv@gmail.com>	2020-03-04 19:36:42 +03:00
Samuel Audet	1c89512ec0	Add Maven profiles for ARM builds to pom.xml files (#265 ) * Add Maven profiles for ARM builds to pom.xml files Signed-off-by: Samuel Audet <samuel.audet@gmail.com> * Remove mkl from dependencies when running on non intel/amd platforms * Downgrade openblas for now * Change back to 0.3.8 Co-authored-by: Adam Gibson <1144306+agibsonccc@users.noreply.github.com>	2020-03-04 11:11:01 +03:00
raver119	f990b2486d	simplified addBias2D for CUDA (#285 ) Signed-off-by: raver119 <raver119@gmail.com>	2020-03-04 09:50:55 +03:00
raver119	11d148a5eb	get back to single-byte reads Signed-off-by: raver119 <raver119@gmail.com>	2020-03-04 08:14:39 +03:00
Fariz Rahman	fec620fafa	TensorflowConversion Data Types (#284 ) * dtypes * bf16 and bool * tests	2020-03-04 11:46:32 +11:00
raver119	d9cfa8073f	bigger reads Signed-off-by: raver119 <raver119@gmail.com>	2020-03-03 14:19:55 +03:00
raver119	ebee7687e8	mkldnn version upgrade Signed-off-by: raver119 <raver119@gmail.com>	2020-03-03 08:57:02 +03:00
Yurii Shyrma	78934c17ad	profiling of stack and unstack ops (#261 ) * - profiling of stack and unstack ops Signed-off-by: Yurii <iuriish@yahoo.com> * - fix bug in cpu concat op Signed-off-by: Yurii <iuriish@yahoo.com> * - correction of cuda stack and unstack Signed-off-by: Yurii <iuriish@yahoo.com> * - change shape.h method which operates with unity dimensions strides Signed-off-by: Yurii <iuriish@yahoo.com> * - rearrange stack tests Signed-off-by: Yurii <iuriish@yahoo.com> * - correct evaluation of smallest stride for moving through contiguous axis Signed-off-by: Yurii <iuriish@yahoo.com> * - forgot to update signature of function strideOverContigAxis in cuda concat and split ops Signed-off-by: Yurii <iuriish@yahoo.com> * - remove ShapeUtils::shapeAsString method applied before input arrays validations Signed-off-by: Yurii <iuriish@yahoo.com> * - further removing of ShapeUtils::shapeAsString Signed-off-by: Yurii <iuriish@yahoo.com> * - take sub-array shapeIndo/offset calculation out of NDArray class - add possibility of contiguous memory copy in execTransformAny op if opNum == assign Signed-off-by: Yurii <iuriish@yahoo.com> * - correct test_empty_scatter_2 in EmptyTests.cpp Signed-off-by: Yurii <iuriish@yahoo.com> * - profiling of slice op Signed-off-by: Yurii <iuriish@yahoo.com> * - get rid of contiguous memcpy for some cases in concat and split ops Signed-off-by: Yurii <iuriish@yahoo.com> * - forgot to declare oid nd4j::SpecialMethods<T>::splitCpuGeneric Signed-off-by: Yurii <iuriish@yahoo.com> * - correct typo in calculation of threads in cuda split op Signed-off-by: Yurii <iuriish@yahoo.com> * - forgot to correct another set of threads variables in split cuda ops Signed-off-by: Yurii <iuriish@yahoo.com> * - further conflicts resolving Signed-off-by: Yurii <iuriish@yahoo.com> Co-authored-by: raver119 <raver119@gmail.com>	2020-03-03 07:32:37 +03:00
raver119	0f581e74e3	one small test rearrangement Signed-off-by: raver119 <raver119@gmail.com>	2020-03-02 19:52:11 +03:00

772 Commits