- 09 Jul, 2018 1 commit
-
-
Robert Kimball authored
* Cache some generated functions in backwards tests to speed performance * more caching
-
- 02 Jul, 2018 1 commit
-
-
Sandeep authored
* declare sigmoid for core fusion * add simple test for sigmoid * info fusion status * cp op as main op * builds as expected * move sigmoid fusion code * add reference kernel * sigmoid bprop reference kernel and clang-format * add delta to bprop * fprop called * compiles bprop * move tests * serializer support * address comments in code * add doc * naming similar to core ops * fix failing test * fix failing test * address clang issue * more changes * change test macro
-
- 02 Jun, 2018 1 commit
-
-
Yixing Lao authored
-
- 18 May, 2018 1 commit
-
-
Nick Korovaiko authored
* use reference kernel for reverse_sequence for int * move tests * resolve CI errors * TEST to NGRAPH_TEST
-
- 10 May, 2018 2 commits
-
-
Yixing Lao authored
* test_control in util
-
Robert Kimball authored
* Add mechanism for disabling specific backend unit tests from a manifest file. Populate the test manifest files for CPU, GPU and INTERPRETER. * update docs for new manifest controlled transformer unit tests
-
- 09 May, 2018 1 commit
-
-
Chris Sullivan authored
* Added op::AvgPool cudnn impl. which works for 2-3 spatial dimesions and no/symmetric padding. Enabled tests. * Added cuda-c implementation of average pool which handles 1-3 spatial dimensions as well as asymmetric padding. This commit also introduces several helper functions for performing fast integer division and fast constant memory access. * Formatting. Removed bool that was used for testing to force the cuda impl. over cudnn. * Added CUDNN AvgPoolBackprop implementation. * Removed inline enum in preference of a helper struct. Removed instances of multiple declarations on a single line. Updated comments. * Removed _prefix to helper functions in anonymous namespace.
-
- 01 May, 2018 1 commit
-
-
Matthew Brookhart authored
-
- 24 Apr, 2018 1 commit
-
-
Robert Kimball authored
* get all ops working * enable autodiff tests for IE backend
-
- 21 Apr, 2018 1 commit
-
-
Adam Straw authored
* ie backend and manager with passing unit tests except for select/function * fix function_call and select * simplify implemenation by removing support for convert and select * remove manager
-
- 16 Apr, 2018 1 commit
-
-
Adam Procter authored
-
- 13 Apr, 2018 2 commits
-
-
Robert Kimball authored
* remove deprecated * remove all legacy Backend API usage remove deprecated files * pull in changes from master * fix GPU calls * disable tests in convolution generator * update per PR comments. Enable performance counter feature. * update per PR comments * fix build error * fix conditionally compiled test :(
-
Chris Sullivan authored
* Begin prototype of cudnn_emitter. * Added GPURuntimeContext to gpu_external_function for passing through to JIT functions. * gpu_emitters now utilize gpu runtime context. * Moved cublas and cudnn handles into GPURuntimeContext pointer and out of callframe EntryPoint. * Added CUDNNEmitter, comparable to MKLDNNEmitter, which allows for cudnn kernels to be defined via lambda primitives that are emitted and subsequently called during graph execution. An example implementation is provided for op::Sum. * Added GPURuntimeContext to gpu_external_function for passing through to JIT functions. * gpu_emitters now utilize gpu runtime context. * Moved cublas and cudnn handles into GPURuntimeContext pointer and out of callframe EntryPoint. * GPURuntimeContext should be stored as unique_ptr in external function. * GPURuntimeContext should be stored as unique_ptr in external function. * Extract raw pointer from unique for cudnn_emitter. * Removing unrelated code from PR. * GPURuntimeContext needs to be a strict C interface in case the native compiler and clang are utilizing different glibc ABIs. Updated to reflect this. * Added cudnn::primitive typedef for better readability. * Moved allocation of CudaFunctionPool to external function so that it is available during gpu emission. * Fixed too-late initialization of cudart. * Fixed too-late initialization of cudart. * CUDNNEmitter moved into superset class GPUPrimitiveEmitter. The GPUPrimitiveEmitter handles the emission of all gpu primitives, including cudnn, cuda, and cublas. CUBLASEmitter support not yet included. * Added unordered_map for cacheing primitives in the gpu_emitter. * Added dtor to GPUPrimitiveEmitter to cleanup compiled functions. * Adding back a serialized model graph that was accidentally rem* Added a few additional helpers to use ngraph::row_major_strides. * added whitespace per @fengleitian's comment * added whitespace per @fengleitian's comment * Remove implicit type conversions from size_t to int. * Add op::MaxPool, op::MaxPoolBackprop and op::Pad to GPU transformer (#817) * Added pooling for 1 and 2dimensions. 1d uses a cuda kernel and 2d utilizes cudnn. Padding is not yet supported. * Normalized call signature on gpu emission for 1d max pool. Added a few comments. * Max pool backprop impl. inprogress. Amend this commit. * Max pool backprop implemented. Note that cuDNN requests the output tensor for the maxpool operation but it is not required for computation. * Formatting and invokation for maxpool changed. * Fixed too-late initialization of cudart. * Added padding kernel that is used with maxpool. Need to investigate remaining tests. * Changed dimensionality check to correctly determine if data is 1d or not. * Added 3d MaxPooling (forward), verified by forcing 2d case to use Nd pooling routines. * Added 3d MaxPooling (backward), verified by forcing 2d case to use Nd pooling routines. * Moved cudnn prologues for maxpool into ngraph runtime and out of primitive so that the only execution occuring on the JIT runtime is the evaluation of the op kernel. * Refactored forward and backward pooling into single CUDNNEmitter::build_pooling interface with a runtime switch to determine if the op is forward or backward propagation. * Cache preconstructed cudnn kernel for maxpool if it has already been constructed. * Forgot to add padding arrays back into cudnn kernel for MaxPool in the 2d case. * Fixed namespace issues and use join(...,'_') * Refactored 4d/Nd tensor descriptor builder into single function. * Changed conditionals and comments. Now throws if MaxPool on more than 3 spatial dimensions is requested. * Fixed forward declare for GPURuntimeContext (class -> struct). * Clang complains about missing braces on brace-initializer. Fixed implicit conversions. * Fixed implicit conversions (clang). * Reverting changes on autodiff test for maxpool. @Krovatkin will update later.
-
- 04 Apr, 2018 1 commit
-
-
Nick Korovaiko authored
* refactor Adjoints to support multi-output ops * passing tests * switch to generate_adjoints(deltas) and backprop_node * remove debugging code * fix error msg * fix typo adjoitns * fix comp errors in mnist_mlp
-
- 26 Mar, 2018 1 commit
-
-
Yixing Lao authored
* registers nnp tester * check output size * rename to nnp tester * revert changes in graph_util
-
- 21 Mar, 2018 2 commits
-
-
Yixing Lao authored
Adjust CallFrame argument order to match Function
-
Robert Kimball authored
* rename directories to be consistent * rename reference namespace to match directory
-
- 20 Mar, 2018 1 commit
-
-
Sandeep authored
* topolotical-sort based node clustering * cmake builds * Argon manager renamed to NNP along with placement * nnp dir cmake changes * tests pass * more renames * somemore renames * reslove redefination * revert to ARGON_API * more PR comments and remove nnp-fusion tests as redundant * update path * fix format
-
- 14 Mar, 2018 1 commit
-
-
Fenglei authored
* add onehot op * refactor broadcast and onehot op
-
- 09 Mar, 2018 1 commit
-
-
fenglei.tian authored
-
- 08 Mar, 2018 3 commits
-
-
fenglei.tian authored
-
fenglei.tian authored
-
Chris Sullivan authored
* Added GPU emitter for op::Result. For now it simply copies the output tensor. All but 3 tests now pass. The remaining failing tests are: * GPU.dot_0_0 * GPU.dot_matrix_2x0_0x2 * GPU.dot_2x0_0 * Removed call to handle memory aliasing in gpu_external_function. * fix gpu emitter bug that will return in the middle of function * Merge pull request #609 from NervanaSystems/tfl/fix_return_bug fix gpu emitter bug that will return in the middle of function * GPU backend skips added for recent softmax test and updated aliased output test that uses op::Constant.
-
- 02 Mar, 2018 1 commit
-
-
adstraw authored
add softmax op and documentation
-
- 27 Feb, 2018 1 commit
-
-
Scott Cyphers authored
* Replace using aliases with actual classes
-
- 26 Feb, 2018 1 commit
-
-
Yixing Lao authored
* initial support for hybrid transformer * add broadcast_vector_rowwise_reversed for hybrid test * headerc * get function placement fix * conv ref test generator graph node in labmda fuction * rename map_parameter_to_source_node * type change map_parameter_to_source_node * use interpreter for numerical derivative * better comments
-
- 23 Feb, 2018 1 commit
-
-
Christian Convey authored
-
- 21 Feb, 2018 3 commits
-
-
fenglei.tian authored
-
fenglei.tian authored
-
Sandeep authored
* relu for interpreter * relu in serializer * core fusion * relu backprop * relu backprop and test interpreter * core fusion for CPU * COREFusion -> CoreFusion * relu MKL dnn
-
- 20 Feb, 2018 1 commit
-
-
pthoreho authored
-
- 16 Feb, 2018 2 commits
- 14 Feb, 2018 1 commit
-
-
fenglei.tian authored
-
- 12 Feb, 2018 1 commit
-
-
Robert Kimball authored
* fix Shape declarations
-
- 08 Feb, 2018 1 commit
-
-
Jennifer Myers authored
-
- 07 Feb, 2018 1 commit
-
-
Adam Procter authored
* Enable CPU backprop tests * Fix to dot codegen for cases where n_reduction_axes != 1
-
- 06 Feb, 2018 1 commit
-
-
adstraw authored
* modify existing autodiff unit tests to test fprop cache * cleanup * fix compile error introduced with bad merge * remove invalid negative/negative backwards power test
-
- 31 Jan, 2018 1 commit
-
-
Nick Korovaiko authored
* bprop for avg pool remove debug statements + formatting * fix CPU test failures * numeric tests * use make_shared; unprotect c-tor
-
- 23 Jan, 2018 1 commit
-
-
adstraw authored
* fix convlution reference script * convolution backprop * cleanup * fix build warnings * Missing include * fix build warning part 2 * move numeric_compare to its own header code review feedback * fix build warnings 3 * fix build warnings 4 * clang-format * cast to avoid implicit cast warning
-