- 05 Jan, 2018 2 commits
-
-
Robert Kimball authored
* cleanup * remove arg_index * remove argno from Input * uncleanup
-
Tristan Webb authored
* Simple boilerplate for GPU runtime files - GPUBackend - GPU ExternalFunction - GPUManager - GPUCallFrame * Test for construction all GPU runtime classes * Comment out calls, constructors haven't been defined * Clang CUDA source example to later test compiling Clang cuda example from: https://gist.github.com/anonymous/855e277884eb6b388cd2f00d956c2fd4 * Initial nvptx compiler copied from CPU compiler sources * Define FunctionMap and Instruction for gpu external function * Rename Compiler -> NVPTXCompiler for gpu compile. Add call to compile for test * Rename StaticCompiler -> NVPTXStaticCompiler for GPU code gen * CAdd nvptx_compiler and nvptx_execution_engine to gpu sources * Compiling source unit test using hardcoded PTX * (a+b)*c test for GPU * WIP Fix compile * rmed accidentally included file * Fix compile, and LLVM link errosr from nvptx_compiler.cpp * Stub out parts needed for GPU manager * Test GPU runtime method stubs * Cleanup * Add GPU runtime to same cmake block as GPU, include CUDA headers if GPU enabled * Kill reflexive assertion * change GPU naming convention to match CPU * Snake case functions and identifiers in test case * Change element type to match changes in master * Make CUDA headers accessible for codegen with GPU transformer * clang-format * apply-code-format
-
- 04 Jan, 2018 2 commits
-
-
Robert Kimball authored
-
DawnStone authored
* updated the sphinx version using pip install in Dockerfile.ngraph_cpp added a make target to build the docs to the contrib/docker/Makefile * avoid upgrade pip message during build
-
- 03 Jan, 2018 1 commit
-
-
Yixing Lao authored
-
- 02 Jan, 2018 1 commit
-
-
Matthew Brookhart authored
-
- 30 Dec, 2017 2 commits
-
-
Adam Procter authored
* Definition and type checking for max pool * Implement kernel, integrate into INTERPRETER, add a few unit tests, make function result type mismatch error message more informative (still need to update tests to reflect that) * Temporarily delete unit tests to ease merge * Temporarily delete unit tests to ease merge * Restore deleted unit tests * Fix a broken error message check in the unit tests * Update to handle various TensorViewType-related things going away; add NGVM support * Add codegen case * Change various get_blah_shape methods to return const refs, and while we're here, make a similar change where it should have been done in convolution * Use NDArray for max-pool tests
-
varun-intel authored
* recreate ops * style * recompute ops * style * fix * recreate ops * style * recompute ops * style * fix * some * more * style * remove a line * const * style * NodeMap was using non-standard operator[] behavior. * Missing include
-
- 29 Dec, 2017 2 commits
-
-
Scott Cyphers authored
* Function can have multiple results Remove external use of ValueType, TupleType, Tuple Remove many external uses of Output and Input * corresponding CPU backend changes * Update master changes. * Remove type arg from Function, add changes.md * Merge changes. * Move bodies to .cpp, add brief doc * Merge CPU changes. * Remove xla includes from non-xla files * Remove xla from tests * First part of xla tuple support * change fprop_cache to assume multi-output bprop functions * New wrappers for handling tuples with XLA * Review comments * remove old xla files * fix merge errors * hand edit models to use multi output instead of tuples
-
Yixing Lao authored
* remove llvm/clang dependency in headers * copy elision
-
- 28 Dec, 2017 6 commits
-
-
Yixing Lao authored
-
Robert Kimball authored
* add larger test models
-
Jai Menon authored
This avoids bloating .data and clears the path for code model fixes later
-
Robert Kimball authored
* wip * constants as globals * const emitter rewrite
-
Jai Menon authored
* CMake: TBB integration placeholder * CMake: Integrate TBB * CMake: Indent * CMake: Rewrite TBB integration * CMake: More TBB integration changes * CMake: Install TBB headers and DSOs * CMake: Don't install the TBB debug DSO * CMake: Propagate ngraph's configured compiler setting over to MKL-DNN * CMake: Restore TBB debug DSO installation * CMake: Add installed headers to search path. This needs to be cleaned up along with other header search cleanup * CPU: Build and execute TBB flowgraphs * CPU: TBB fixes * CPU: More TBB fixes * CPU: Allow both TBB and serial codegen for now * TBB: get_arguments -> get_input_ops * CPU: Use node methods * CPU: Add TBB headers in the build directory to the search path * TBB: Incorporate various changes from master * CMake: Indentation fix * CMake: Indentation fix * CMake: TBB is mandatory so remove additional predicates * TBB: Add a test * CMake: Fix linker flags with GCC
-
Matthew Brookhart authored
* in progress * working cache_fprop, no tests * style fix * all inputs to bprop (except adjoints) are cached from fprop * fix typos, make sure to check count == 0 * fix code format
-
- 27 Dec, 2017 5 commits
-
-
Robert Kimball authored
* cleanup * cleanup * expand * wip * undo
-
Robert Kimball authored
-
Robert Kimball authored
* enable -O3 optimization * add flags to support release/debug builds
-
Robert Kimball authored
* nan unit test * fix NAN issue * add INFINITY support
-
Christian Convey authored
This reverts commit 39383029. It looks like the commit actually suppressed parallel makes of MKL-DNN, at least in the case where ngraph itself was being built with parallel make. It also introduced problems with make jobserver warnings.
-
- 26 Dec, 2017 1 commit
-
-
Robert Kimball authored
* add resource file generator and store all headers used by codegen in memory.
-
- 22 Dec, 2017 2 commits
-
-
Robert Kimball authored
* cleanup * cleanup * update serializer to emit small, simple element_type. backwards compatible. * allow for selecting indenting when serializing
-
Jai Menon authored
-
- 21 Dec, 2017 6 commits
-
-
Robert Kimball authored
* remove ngvm * remove NGVM from cmake
-
Robert Kimball authored
* fix autodiff on non-NGVM backends. NGVM initializes all tensors to zero on allocation while the other backends do not. Had to initialize vector before use. * change autodiff tests to use INTERPRETER
-
Robert Kimball authored
set code model back to default as medium is causing the CPU.divide_by_zero_int32 unit test to sefault when it throws an exception from the generated code (#328)
-
Yixing Lao authored
-
Jai Menon authored
* CPU: Optimize Eigen based rowwise vector broadcast * CPU: Remove the need for transposing the broadcast vector * CPU: Optimize to a replicate expression * CPU: Change code model to medium and compile for the host CPU instead of hardcoding BDW
-
Robert Kimball authored
* remove ParameterizedConstant * use simpler element Type definition * Move TraitedType to NGVM directory
-
- 20 Dec, 2017 5 commits
-
-
Christian Convey authored
* Adds CMake variables `MKLDNN_BUILD_COMMAND_EXTRA_FLAGS` and `MKLDNN_CMAKE_EXTRA_FLAGS`.
-
Robert Kimball authored
* aliased output unit test * add support for aliased outputs to INTERPRETER and CPU
-
Adam Procter authored
* Implement 'not' in interpreter and CPU; add unit tests for same * Fix compile failure on CI
-
crlishka authored
* Initial pass as supporting ngraph_dist builds with both gcc and clang. Includes deprecated targets to support Jenkins CI with old feature and development branches that may not have these changes. * Added build_all, check_all, and install_all targets * Broke build and check steps into separate make sections. Each step already generated its own log. * Updated README.md * Fixed a bug in the docker build context, which was overly general and included the build directories. This caused a loop where the docker images rebuilt into multi-gigabyte images after testing was run (yikes). New context is the contrib/docker directory, which is tiny. No more image rebuilds. * Rename the docker image to be less generic, so that another tool doesn't accidentally use the same name * Change compatibility target "make check_cpu" to run "make check_all", so that all builds are done. * Add a conversion step to change the test-suite names for all unit tests, to include an indicator for what compiler was used for the builds (_gcc vs _clang). * Unfortunately, modifying the test-suite name with compiler suffix did not show in Jenkins test results. Now modifying classname instead. * Switch builds to be RelWithDebInfo, which turns on optimization and includes debug symbols. Remove verbose output when creating tarball, to reduce log fluff.
-
Adam Procter authored
-
- 19 Dec, 2017 3 commits
-
-
Robert Kimball authored
* used something other than aligned_alloc for mac * Fix compile error for aligned_free on Mac
-
Robert Kimball authored
* remove use of ParameterizedConstant from unit test. Now using Constant instead. Constant is not a templated class. * move ParameterizedTensorView to NGVM directory * element_type cleanup
-
Robert Kimball authored
* add unit test * fix Parameter as output in INTERPRETER * cpu working * parameter_as_output passing for INTERPRETER * still works * cleanup
-
- 18 Dec, 2017 2 commits
-
-
Scott Cyphers authored
* Add and use get_shape() and get_element_type() on Input/Output * Fix Output * Formatting. * Format. * Use reference * Convolution.
-
Adam Procter authored
* Test GitHub-JIRA integration, nothing useful in this commit NGTF-388 #comment Testing JIRA integration * WIP on convolution * Type checking for convolution * Docstrings for convolution * Add convolution reference kernel; it works on some unit tests copied and pasted from my old branch. * Bugfix for dilated conv, and improvement to conv test generation * Remove get_arguments calls from convolution stuff * Add convolution to CPU; also a few fixes to the test generation stuff * Add copyright header to convolution ref script * Move copyright header to the correct place * A few more tests * Remove fallback behavior of blanking out the convolution ref file, since we're not generating it from the build system anymore * Delete stale comment * Merge stuff for the convolution ref script * Clean up rebase mess * Review comments * Review comment (n_foo -> foo_count)
-