1. 29 Sep, 2018 1 commit
  2. 27 Sep, 2018 1 commit
    • Robert Kimball's avatar
      Major cleanup of runtime::TensorView (#1682) · 8caa2717
      Robert Kimball authored
      * remove get_tensor method
      
      * remove get_element_count as it is redundant to get_size
      
      * fix a few issues with examples and python wrapper
      
      * rename get_size to get_element_count and add get_size_in_bytes method
      8caa2717
  3. 15 Sep, 2018 1 commit
  4. 14 Sep, 2018 1 commit
    • Scott Cyphers's avatar
      Cyphers/layout (#1602) · 2f79f707
      Scott Cyphers authored
      * Remove "view"
      Simplify layout
      
      * Fix merge error
      
      * fix build error
      
      * PR1602. IntelGPU backend. Compilation fixed.
      2f79f707
  5. 12 Sep, 2018 1 commit
  6. 31 Aug, 2018 1 commit
    • Scott Cyphers's avatar
      Validate/infer types as a virtual function (#1463) · 132b5305
      Scott Cyphers authored
      * Validate/infer types as a virtual function
      BroadcastLike, ScalarConstantLike
      Set output shape separately from output types
      Remove some obsolete tests
      Replace *Like ops with fixed type versions
      
      * Review comments
      
      * Remove TensorViewType
      
      * Remove PrimaryTensorView
      
      * Reviewer comments
      
      * Style fix.
      
      * Another try at GPU
      
      * GPU
      
      * Simplify class hierarchy
      
      * More of previous.
      
      * Better errors
      
      * Remove unneeded classes
      
      * Merge
      
      * Update Python bindings
      
      * clang-format
      
      * Remove redundant files
      
      * Remove commented out lines
      
      * clang-format
      
      * Group includes
      
      * Missing tests
      
      * Merge error
      132b5305
  7. 29 Aug, 2018 2 commits
  8. 27 Aug, 2018 1 commit
  9. 24 Aug, 2018 1 commit
  10. 13 Aug, 2018 1 commit
  11. 02 Aug, 2018 2 commits
  12. 09 Jul, 2018 1 commit
  13. 06 Jul, 2018 1 commit
  14. 03 Jul, 2018 1 commit
  15. 22 Jun, 2018 1 commit
  16. 13 Jun, 2018 1 commit
    • Fenglei's avatar
      gpu deconvolution (#1099) · 40069d27
      Fenglei authored
      * add pad_dilation function
      
      * add dilation to gpu_emitter
      
      * add CoordinateDiff constructor to GPUShape
      
      * remove unecessary cast
      
      * working version for forward
      
      * forward working
      
      * forward test all pass
      
      * deconvolution forward
      
      * backward data dilation
      
      * forward test passed
      
      * initial to 0
      
      * fix bug for get_padded_shape and clang format
      
      * code style, change variable names
      
      * refactor convolution conditions
      
      * fix bug padding_below_diff
      
      * change pad_dilation to pad_dynamic, compare to pad
      
      * remove passed convolution test from skip list, clang format
      
      * change pad to use GPUShape
      40069d27
  17. 06 Jun, 2018 1 commit
  18. 04 Jun, 2018 1 commit
    • Robert Kimball's avatar
      Modernize cmake usage (#1032) · eef750df
      Robert Kimball authored
      * Update cmake files to more modern approach
      
      * disable building libraries that are not required
      
      * handle more build cases
      
      * add versions to backend libs. add start of package target.
      
      * add create_backend to backends
      
      * temporary workaround to tbb not linking correctly with gcc
      
      * install codegen lib
      
      * force tbb to link to the cpu backend so that it is available for codegen
      
      * fix clang build error
      
      * fix warning for codegen build
      
      * update cuda header paths
      
      * change error message for opening backend shared library
      
      * set lib path
      eef750df
  19. 02 Jun, 2018 1 commit
  20. 14 May, 2018 1 commit
  21. 11 May, 2018 1 commit
  22. 10 May, 2018 3 commits
  23. 09 May, 2018 1 commit
  24. 08 May, 2018 1 commit
    • Jayaram Bobba's avatar
      Computation reuse (#945) · 41c50b44
      Jayaram Bobba authored
      * Make temp memory pools static to avoid memory allocation overheads
      
      * Initial implementation for graph control to enable caching and computation reuse
      
      * Added sphinx documentation
      
      * Turned off memory buffer reuse in CPU transformer to support computation reuse. Added unit test
      
      * Change memoizable to cacheable
      
      * Change memoizable to cacheable
      
      * Rename variables
      41c50b44
  25. 20 Apr, 2018 1 commit
  26. 16 Apr, 2018 2 commits
  27. 13 Apr, 2018 1 commit
    • Robert Kimball's avatar
      Remove legacy Backend API (#848) · ec501913
      Robert Kimball authored
      * remove deprecated
      
      * remove all legacy Backend API usage
      
      remove deprecated files
      
      * pull in changes from master
      
      * fix GPU calls
      
      * disable tests in convolution generator
      
      * update per PR comments. Enable performance counter feature.
      
      * update per PR comments
      
      * fix build error
      
      * fix conditionally compiled test :(
      ec501913
  28. 09 Apr, 2018 1 commit
    • Robert Kimball's avatar
      New backend/transformer API (#739) · 777600c6
      Robert Kimball authored
      * force backend compile() to make a copy of the graph
      
      fix copy_with_new_args on ops that have function pointers internal
      
      update unit test for new backend API
      
      add unit test for multiple simulataneous backends
      
      * move get_subdevices virtual method to Manager class
      
      * update GPU to latest
      
      * update call methods
      
      * add remove_compiled_function()
      777600c6
  29. 04 Apr, 2018 1 commit
    • Nick Korovaiko's avatar
      Support multi-output ops in Adjoints (#796) · 5f0e8dc3
      Nick Korovaiko authored
      * refactor Adjoints to support multi-output ops
      
      * passing tests
      
      * switch to generate_adjoints(deltas) and backprop_node
      
      * remove debugging code
      
      * fix error msg
      
      * fix typo adjoitns
      
      * fix comp errors in mnist_mlp
      5f0e8dc3
  30. 02 Apr, 2018 1 commit
  31. 30 Mar, 2018 1 commit
  32. 29 Mar, 2018 1 commit
  33. 24 Mar, 2018 1 commit
    • Louis Feng's avatar
      ngmx-372 Fuse RNN matrix operations (#724) · de760a38
      Louis Feng authored
      * test graph.
      
      * rnn matrix fusion wip.
      
      * WIP.
      
      * matrix fused.
      
      * clean up.
      
      * additional refactor.
      
      * fixed merge errors.
      
      * testing.
      
      * added test case.
      
      * more testing.
      
      * more testing.
      
      * bias wip.
      
      * debuggging.
      
      * fusing bias too.
      
      * disabled debug outputs.
      
      * removed debugging.
      
      * removed testing models.
      
      * removed debug models.
      
      * reset nbench.
      
      * clean up and testing.
      
      * removed debug code.
      
      * updated comments and var names.
      
      * rename var.
      
      * removed debug code.
      
      * removed debug code.
      
      * fixed compiler warnings.
      
      * refactored test.
      
      * Added const to parameter.
      
      * address some of the review comments.
      
      * added comments and addressed reviews.
      de760a38
  34. 21 Mar, 2018 2 commits