1. 05 Jan, 2018 1 commit
    • Tristan Webb's avatar
      Drwebb/gpu runtime boilerplate (#314) · feab44b5
      Tristan Webb authored
      * Simple boilerplate for GPU runtime files
      
        - GPUBackend
        - GPU ExternalFunction
        - GPUManager
        - GPUCallFrame
      
      * Test for construction all GPU runtime classes
      
      * Comment out calls, constructors haven't been defined
      
      * Clang CUDA source example to later test compiling
      
      Clang cuda example from:
      https://gist.github.com/anonymous/855e277884eb6b388cd2f00d956c2fd4
      
      * Initial nvptx compiler copied from CPU compiler sources
      
      * Define FunctionMap and Instruction for gpu external function
      
      * Rename Compiler -> NVPTXCompiler for gpu compile. Add call to compile for test
      
      * Rename StaticCompiler -> NVPTXStaticCompiler for GPU code gen
      
      * CAdd nvptx_compiler and nvptx_execution_engine to gpu sources
      
      * Compiling source unit test using hardcoded PTX
      
      * (a+b)*c test for GPU
      
      * WIP Fix compile
      
      * rmed accidentally included file
      
      * Fix compile, and LLVM link errosr from nvptx_compiler.cpp
      
      * Stub out parts needed for GPU manager
      
      * Test GPU runtime method stubs
      
      * Cleanup
      
      * Add GPU runtime to same cmake block as GPU, include CUDA headers if GPU enabled
      
      * Kill reflexive assertion
      
      * change GPU naming convention to match CPU
      
      * Snake case functions and identifiers in test case
      
      * Change element type to match changes in master
      
      * Make CUDA headers accessible for codegen with GPU transformer
      
      * clang-format
      
      * apply-code-format
      feab44b5
  2. 04 Jan, 2018 2 commits
  3. 03 Jan, 2018 1 commit
  4. 02 Jan, 2018 1 commit
  5. 30 Dec, 2017 2 commits
    • Adam Procter's avatar
      Forward prop for max pooling (#305) · d901282e
      Adam Procter authored
      * Definition and type checking for max pool
      
      * Implement kernel, integrate into INTERPRETER, add a few unit tests, make function result type mismatch error message more informative (still need to update tests to reflect that)
      
      * Temporarily delete unit tests to ease merge
      
      * Temporarily delete unit tests to ease merge
      
      * Restore deleted unit tests
      
      * Fix a broken error message check in the unit tests
      
      * Update to handle various TensorViewType-related things going away; add NGVM support
      
      * Add codegen case
      
      * Change various get_blah_shape methods to return const refs, and while we're here, make a similar change where it should have been done in convolution
      
      * Use NDArray for max-pool tests
      d901282e
    • varun-intel's avatar
      recreate ops (#325) · 66d06693
      varun-intel authored
      * recreate ops
      
      * style
      
      * recompute ops
      
      * style
      
      * fix
      
      * recreate ops
      
      * style
      
      * recompute ops
      
      * style
      
      * fix
      
      * some
      
      * more
      
      * style
      
      * remove a line
      
      * const
      
      * style
      
      * NodeMap was using non-standard operator[] behavior.
      
      * Missing include
      66d06693
  6. 29 Dec, 2017 2 commits
    • Scott Cyphers's avatar
      Get value types out of public API, multi-values from Function (#340) · d092cb91
      Scott Cyphers authored
      * Function can have multiple results
      Remove external use of ValueType, TupleType, Tuple
      Remove many external uses of Output and Input
      
      * corresponding CPU backend changes
      
      * Update master changes.
      
      * Remove type arg from Function, add changes.md
      
      * Merge changes.
      
      * Move bodies to .cpp, add brief doc
      
      * Merge CPU changes.
      
      * Remove xla includes from non-xla files
      
      * Remove xla from tests
      
      * First part of xla tuple support
      
      * change fprop_cache to assume multi-output bprop functions
      
      * New wrappers for handling tuples with XLA
      
      * Review comments
      
      * remove old xla files
      
      * fix merge errors
      
      * hand edit models to use multi output instead of tuples
      d092cb91
    • Yixing Lao's avatar
      Remove LLVM/Clang dependency in headers (#341) · 7c59ca2e
      Yixing Lao authored
      * remove llvm/clang dependency in headers
      
      * copy elision
      7c59ca2e
  7. 28 Dec, 2017 6 commits
    • Yixing Lao's avatar
      1c5abc19
    • Robert Kimball's avatar
      Add bigger models to performance benchmarks (#342) · 2d2fc8c2
      Robert Kimball authored
      * add larger test models
      2d2fc8c2
    • Jai Menon's avatar
      Move header resource to .rodata (#344) · 19a10d79
      Jai Menon authored
      This avoids bloating .data and clears the path
      for code model fixes later
      19a10d79
    • Robert Kimball's avatar
      Rewrite the way constants are emitted in the CPU backend (#332) · 603a7d1a
      Robert Kimball authored
      * wip
      
      * constants as globals
      
      * const emitter rewrite
      603a7d1a
    • Jai Menon's avatar
      Build and execute TBB flow graphs in the CPU backend (#304) · c2c33748
      Jai Menon authored
      * CMake: TBB integration placeholder
      
      * CMake: Integrate TBB
      
      * CMake: Indent
      
      * CMake: Rewrite TBB integration
      
      * CMake: More TBB integration changes
      
      * CMake: Install TBB headers and DSOs
      
      * CMake: Don't install the TBB debug DSO
      
      * CMake: Propagate ngraph's configured compiler setting over to MKL-DNN
      
      * CMake: Restore TBB debug DSO installation
      
      * CMake: Add installed headers to search path.
      This needs to be cleaned up along with other header search cleanup
      
      * CPU: Build and execute TBB flowgraphs
      
      * CPU: TBB fixes
      
      * CPU: More TBB fixes
      
      * CPU: Allow both TBB and serial codegen for now
      
      * TBB: get_arguments -> get_input_ops
      
      * CPU: Use node methods
      
      * CPU: Add TBB headers in the build directory to the search path
      
      * TBB: Incorporate various changes from master
      
      * CMake: Indentation fix
      
      * CMake: Indentation fix
      
      * CMake: TBB is mandatory so remove additional predicates
      
      * TBB: Add a test
      
      * CMake: Fix linker flags with GCC
      c2c33748
    • Matthew Brookhart's avatar
      Fprop Cache Util Function (#312) · bc63f7bb
      Matthew Brookhart authored
      * in progress
      
      * working cache_fprop, no tests
      
      * style fix
      
      * all inputs to bprop (except adjoints) are cached from fprop
      
      * fix typos, make sure to check count == 0
      
      * fix code format
      bc63f7bb
  8. 27 Dec, 2017 5 commits
  9. 26 Dec, 2017 1 commit
  10. 22 Dec, 2017 2 commits
  11. 21 Dec, 2017 6 commits
  12. 20 Dec, 2017 5 commits
    • Christian Convey's avatar
      Adds more control for building MKL-DNN. (#322) · 39383029
      Christian Convey authored
      * Adds CMake variables `MKLDNN_BUILD_COMMAND_EXTRA_FLAGS`
        and `MKLDNN_CMAKE_EXTRA_FLAGS`.
      39383029
    • Robert Kimball's avatar
      Add support for aliased output to CPU and INTERPRETER backends (#320) · d5e814aa
      Robert Kimball authored
      * aliased output unit test
      * add support for aliased outputs to INTERPRETER and CPU
      d5e814aa
    • Adam Procter's avatar
      Implement 'not' in interpreter and CPU; add unit tests for same (#321) · 83433ef2
      Adam Procter authored
      * Implement 'not' in interpreter and CPU; add unit tests for same
      
      * Fix compile failure on CI
      83433ef2
    • crlishka's avatar
      Support builds of ngraph_dist with both gcc and clang (#306) · 4c52d420
      crlishka authored
      * Initial pass as supporting ngraph_dist builds with both gcc and clang.  Includes deprecated targets to support Jenkins CI with old feature and development branches that may not have these changes.
      
      * Added build_all, check_all, and install_all targets
      
      * Broke build and check steps into separate make sections.  Each step already generated its own log.
      
      * Updated README.md
      
      * Fixed a bug in the docker build context, which was overly general and included the build directories.  This caused a loop where the docker images rebuilt into multi-gigabyte images after testing was run (yikes).  New context is the contrib/docker directory, which is tiny.  No more image rebuilds.
      
      * Rename the docker image to be less generic, so that another tool doesn't accidentally use the same name
      
      * Change compatibility target "make check_cpu" to run "make check_all", so that all builds are done.
      
      * Add a conversion step to change the test-suite names for all unit tests, to include an indicator for what compiler was used for the builds (_gcc vs _clang).
      
      * Unfortunately, modifying the test-suite name with compiler suffix did not show in Jenkins test results.  Now modifying classname instead.
      
      * Switch builds to be RelWithDebInfo, which turns on optimization and includes debug symbols.  Remove verbose output when creating tarball, to reduce log fluff.
      4c52d420
    • Adam Procter's avatar
      Workaround for gcc limitation (#319) · d8433899
      Adam Procter authored
      d8433899
  13. 19 Dec, 2017 3 commits
  14. 18 Dec, 2017 3 commits
    • Scott Cyphers's avatar
      Cyphers/shape (#310) · 8a569f27
      Scott Cyphers authored
      * Add and use get_shape() and get_element_type() on Input/Output
      
      * Fix Output
      
      * Formatting.
      
      * Format.
      
      * Use reference
      
      * Convolution.
      8a569f27
    • Adam Procter's avatar
      Convolution forward prop (#294) · 122db5ff
      Adam Procter authored
      * Test GitHub-JIRA integration, nothing useful in this commit
      
      NGTF-388 #comment Testing JIRA integration
      
      * WIP on convolution
      
      * Type checking for convolution
      
      * Docstrings for convolution
      
      * Add convolution reference kernel; it works on some unit tests copied and pasted from my old branch.
      
      * Bugfix for dilated conv, and improvement to conv test generation
      
      * Remove get_arguments calls from convolution stuff
      
      * Add convolution to CPU; also a few fixes to the test generation stuff
      
      * Add copyright header to convolution ref script
      
      * Move copyright header to the correct place
      
      * A few more tests
      
      * Remove fallback behavior of blanking out the convolution ref file, since we're not generating it from the build system anymore
      
      * Delete stale comment
      
      * Merge stuff for the convolution ref script
      
      * Clean up rebase mess
      
      * Review comments
      
      * Review comment (n_foo -> foo_count)
      122db5ff
    • Adam Procter's avatar
      Interpreter tweaks (#311) · 3b84d91a
      Adam Procter authored
      * Replace function->get_result()->get_outputs() call with function->get_outputs()
      
      * Test for parameter straight to output
      
      * Roll back cmake modification for INTERPRETER autodiff
      3b84d91a