- 14 Mar, 2018 1 commit
-
-
Robert Kimball authored
* Add cpio file read/write class and unit tests add reserializer Add unit test for serialize constants to cpio file. Fix bug in serializer if function has no parameters.
-
- 05 Mar, 2018 1 commit
-
-
Robert Kimball authored
* cleanup * cleanup * fix all headers to be standalone as far as includes go * include cleanup * cleanup includes * cleanup * include tester * wip * cleanup * cleanup * cleanup
-
- 28 Feb, 2018 1 commit
-
-
Robert Kimball authored
* make json lib an external project * add env var to turn warnings to errors
-
- 27 Feb, 2018 1 commit
-
-
fenglei.tian authored
-
- 26 Feb, 2018 1 commit
-
-
Yixing Lao authored
* initial support for hybrid transformer * add broadcast_vector_rowwise_reversed for hybrid test * headerc * get function placement fix * conv ref test generator graph node in labmda fuction * rename map_parameter_to_source_node * type change map_parameter_to_source_node * use interpreter for numerical derivative * better comments
-
- 21 Feb, 2018 1 commit
-
-
Sandeep authored
* relu for interpreter * relu in serializer * core fusion * relu backprop * relu backprop and test interpreter * core fusion for CPU * COREFusion -> CoreFusion * relu MKL dnn
-
- 20 Feb, 2018 1 commit
-
-
Ashok Emani authored
-
- 14 Feb, 2018 3 commits
-
-
fenglei.tian authored
-
Adam Procter authored
-
Sevin F. Varoglu authored
- enable distributed ngraph (MPI) - add AllReduce op to ngraph core, interpreter and CPU backend - add AllReduce unit test
-
- 13 Feb, 2018 3 commits
-
-
fenglei.tian authored
-
fenglei.tian authored
-
fenglei.tian authored
-
- 09 Feb, 2018 5 commits
-
-
Tristan Webb authored
* GPU kernels for reshape, GEMM, EW ADD/Mult, Maximum (A + B) * C test now with cuBLAS Additional gemm and gemv calls cmake updates for cuDNN calls memcpy wrappers in gpu_util Additional passing tests: aliased outputs, parameter, constant tensor memcopy
-
Adam Procter authored
-
Jennifer Myers authored
-
fenglei.tian authored
-
Sandeep authored
-
- 08 Feb, 2018 2 commits
-
-
Jennifer Myers authored
-
Sandeep authored
-
- 07 Feb, 2018 1 commit
-
-
Adam Procter authored
* Enable CPU backprop tests * Fix to dot codegen for cases where n_reduction_axes != 1
-
- 06 Feb, 2018 1 commit
-
-
Sandeep authored
* test relu fusion for argon backend and enable permutation over max op
-
- 05 Feb, 2018 1 commit
-
-
Nick Korovaiko authored
inline Inliner pass + tests debugging fix inliner failures due to the fact a random function is picked as an outermost one copyright headers
-
- 02 Feb, 2018 1 commit
-
-
Tristan Webb authored
GPU ew add and mult cuBLAS calls GPU (A + B) * C with cuBLAS Additional gemm and gemv calls cmake updates for cuDNN calls kernels WIP params for dot gemm more kernel WIP memcpy wrappers aliased outputs, parameter, constant tensor memcopy comment cleanup remove cruft gpu faster gemm MNIST WIP Cleanup
-
- 01 Feb, 2018 1 commit
-
-
Nick Korovaiko authored
* simplification pass * serializer change to test models * some small test fixes * addressing Scott's feedback * missed one nn * formatting fixes * simplification -> reshape_elimination
-
- 30 Jan, 2018 1 commit
-
-
Nick Korovaiko authored
cblas_gemm working on mlp rebase & small fixes enable debug output support replacing function's outputs productizing CPUFusion addressing Bob and Jayaram's feedback removing json used for simplification tests adding comments fixing formatting errors and removing dead code TODO msg removing serializer changes
-
- 24 Jan, 2018 1 commit
-
-
Tristan Webb authored
* Drwebb/gpu backend dot op (#387) * GPU Dot prod emitter switch statement * cuBLAS dot kernel call * Flush out arg substitution into gpu dot kernel call * Drwebb/gpu backend dot op (#392) * Take in CodeWriter into gpu op emitters * Introduce GPU function gen based on pass functions * Additional gpu emitter stubs * link cublas in to unit test and ngraph * Use static code gen methods for GPU, add new GPU op stubs * use pass manager to declare functions / cublas Updates * Prune down gpu_external_function wip * Switch back to GPU tensor views in GPU backend * Pass in cublas handle to GPU external function * cuMalloc memory in gpu tensor view * Use cuda runtime malloc and free for tensor view managment c * change GPU tensor view init, and use GPU tensor view for GPU call frame * include headers as system dirs * GPU tensor printing utility function * cublasSetPointer to device mode / Fix copyright notification lowercasing * Passing GPU dot product test using cuBLAS Clean up * Changes from review
-
- 19 Jan, 2018 1 commit
-
-
Robert Kimball authored
* cleanup in-memory header files * add switch to enable memory sanitizer (works like valgrind) * removed header file cleanup as it was causing a segfault on program termination
-
- 11 Jan, 2018 2 commits
-
-
Robert Kimball authored
* add interpreter nan check option * add unit test
-
Christian Convey authored
-
- 09 Jan, 2018 1 commit
-
-
Nick Korovaiko authored
* remove caching of ordered_ops * graph_util logging msgs * small cleanup * remove files for the TopologicalSort pass * remove NGRAPH_DEBUG from graph_util.hpp
-
- 05 Jan, 2018 1 commit
-
-
Tristan Webb authored
* Simple boilerplate for GPU runtime files - GPUBackend - GPU ExternalFunction - GPUManager - GPUCallFrame * Test for construction all GPU runtime classes * Comment out calls, constructors haven't been defined * Clang CUDA source example to later test compiling Clang cuda example from: https://gist.github.com/anonymous/855e277884eb6b388cd2f00d956c2fd4 * Initial nvptx compiler copied from CPU compiler sources * Define FunctionMap and Instruction for gpu external function * Rename Compiler -> NVPTXCompiler for gpu compile. Add call to compile for test * Rename StaticCompiler -> NVPTXStaticCompiler for GPU code gen * CAdd nvptx_compiler and nvptx_execution_engine to gpu sources * Compiling source unit test using hardcoded PTX * (a+b)*c test for GPU * WIP Fix compile * rmed accidentally included file * Fix compile, and LLVM link errosr from nvptx_compiler.cpp * Stub out parts needed for GPU manager * Test GPU runtime method stubs * Cleanup * Add GPU runtime to same cmake block as GPU, include CUDA headers if GPU enabled * Kill reflexive assertion * change GPU naming convention to match CPU * Snake case functions and identifiers in test case * Change element type to match changes in master * Make CUDA headers accessible for codegen with GPU transformer * clang-format * apply-code-format
-
- 29 Dec, 2017 1 commit
-
-
Scott Cyphers authored
* Function can have multiple results Remove external use of ValueType, TupleType, Tuple Remove many external uses of Output and Input * corresponding CPU backend changes * Update master changes. * Remove type arg from Function, add changes.md * Merge changes. * Move bodies to .cpp, add brief doc * Merge CPU changes. * Remove xla includes from non-xla files * Remove xla from tests * First part of xla tuple support * change fprop_cache to assume multi-output bprop functions * New wrappers for handling tuples with XLA * Review comments * remove old xla files * fix merge errors * hand edit models to use multi output instead of tuples
-
- 28 Dec, 2017 1 commit
-
-
Yixing Lao authored
-
- 21 Dec, 2017 2 commits
-
-
Robert Kimball authored
* remove ngvm * remove NGVM from cmake
-
Robert Kimball authored
* fix autodiff on non-NGVM backends. NGVM initializes all tensors to zero on allocation while the other backends do not. Had to initialize vector before use. * change autodiff tests to use INTERPRETER
-
- 18 Dec, 2017 1 commit
-
-
Adam Procter authored
* Test GitHub-JIRA integration, nothing useful in this commit NGTF-388 #comment Testing JIRA integration * WIP on convolution * Type checking for convolution * Docstrings for convolution * Add convolution reference kernel; it works on some unit tests copied and pasted from my old branch. * Bugfix for dilated conv, and improvement to conv test generation * Remove get_arguments calls from convolution stuff * Add convolution to CPU; also a few fixes to the test generation stuff * Add copyright header to convolution ref script * Move copyright header to the correct place * A few more tests * Remove fallback behavior of blanking out the convolution ref file, since we're not generating it from the build system anymore * Delete stale comment * Merge stuff for the convolution ref script * Clean up rebase mess * Review comments * Review comment (n_foo -> foo_count)
-
- 13 Dec, 2017 1 commit
-
-
Adam Procter authored
-
- 12 Dec, 2017 1 commit
-
-
Robert Kimball authored
LSTM benchmark test performance counters
-
- 05 Dec, 2017 1 commit
-
-
Robert Kimball authored
* New Interpreter backend * PR review comments * More RP fixes * oops * make autodiff tests backend aware * wip * wip * more ops * wip * fix merge error * merge fixes
-