1. 13 Aug, 2018 1 commit
  2. 02 Aug, 2018 2 commits
  3. 09 Jul, 2018 1 commit
  4. 06 Jul, 2018 1 commit
  5. 03 Jul, 2018 1 commit
  6. 22 Jun, 2018 1 commit
  7. 13 Jun, 2018 1 commit
    • Fenglei's avatar
      gpu deconvolution (#1099) · 40069d27
      Fenglei authored
      * add pad_dilation function
      
      * add dilation to gpu_emitter
      
      * add CoordinateDiff constructor to GPUShape
      
      * remove unecessary cast
      
      * working version for forward
      
      * forward working
      
      * forward test all pass
      
      * deconvolution forward
      
      * backward data dilation
      
      * forward test passed
      
      * initial to 0
      
      * fix bug for get_padded_shape and clang format
      
      * code style, change variable names
      
      * refactor convolution conditions
      
      * fix bug padding_below_diff
      
      * change pad_dilation to pad_dynamic, compare to pad
      
      * remove passed convolution test from skip list, clang format
      
      * change pad to use GPUShape
      40069d27
  8. 06 Jun, 2018 1 commit
  9. 04 Jun, 2018 1 commit
    • Robert Kimball's avatar
      Modernize cmake usage (#1032) · eef750df
      Robert Kimball authored
      * Update cmake files to more modern approach
      
      * disable building libraries that are not required
      
      * handle more build cases
      
      * add versions to backend libs. add start of package target.
      
      * add create_backend to backends
      
      * temporary workaround to tbb not linking correctly with gcc
      
      * install codegen lib
      
      * force tbb to link to the cpu backend so that it is available for codegen
      
      * fix clang build error
      
      * fix warning for codegen build
      
      * update cuda header paths
      
      * change error message for opening backend shared library
      
      * set lib path
      eef750df
  10. 02 Jun, 2018 1 commit
  11. 14 May, 2018 1 commit
  12. 11 May, 2018 1 commit
  13. 10 May, 2018 3 commits
  14. 09 May, 2018 1 commit
  15. 08 May, 2018 1 commit
    • Jayaram Bobba's avatar
      Computation reuse (#945) · 41c50b44
      Jayaram Bobba authored
      * Make temp memory pools static to avoid memory allocation overheads
      
      * Initial implementation for graph control to enable caching and computation reuse
      
      * Added sphinx documentation
      
      * Turned off memory buffer reuse in CPU transformer to support computation reuse. Added unit test
      
      * Change memoizable to cacheable
      
      * Change memoizable to cacheable
      
      * Rename variables
      41c50b44
  16. 20 Apr, 2018 1 commit
  17. 16 Apr, 2018 2 commits
  18. 13 Apr, 2018 1 commit
    • Robert Kimball's avatar
      Remove legacy Backend API (#848) · ec501913
      Robert Kimball authored
      * remove deprecated
      
      * remove all legacy Backend API usage
      
      remove deprecated files
      
      * pull in changes from master
      
      * fix GPU calls
      
      * disable tests in convolution generator
      
      * update per PR comments. Enable performance counter feature.
      
      * update per PR comments
      
      * fix build error
      
      * fix conditionally compiled test :(
      ec501913
  19. 09 Apr, 2018 1 commit
    • Robert Kimball's avatar
      New backend/transformer API (#739) · 777600c6
      Robert Kimball authored
      * force backend compile() to make a copy of the graph
      
      fix copy_with_new_args on ops that have function pointers internal
      
      update unit test for new backend API
      
      add unit test for multiple simulataneous backends
      
      * move get_subdevices virtual method to Manager class
      
      * update GPU to latest
      
      * update call methods
      
      * add remove_compiled_function()
      777600c6
  20. 04 Apr, 2018 1 commit
    • Nick Korovaiko's avatar
      Support multi-output ops in Adjoints (#796) · 5f0e8dc3
      Nick Korovaiko authored
      * refactor Adjoints to support multi-output ops
      
      * passing tests
      
      * switch to generate_adjoints(deltas) and backprop_node
      
      * remove debugging code
      
      * fix error msg
      
      * fix typo adjoitns
      
      * fix comp errors in mnist_mlp
      5f0e8dc3
  21. 02 Apr, 2018 1 commit
  22. 30 Mar, 2018 1 commit
  23. 29 Mar, 2018 1 commit
  24. 24 Mar, 2018 1 commit
    • Louis Feng's avatar
      ngmx-372 Fuse RNN matrix operations (#724) · de760a38
      Louis Feng authored
      * test graph.
      
      * rnn matrix fusion wip.
      
      * WIP.
      
      * matrix fused.
      
      * clean up.
      
      * additional refactor.
      
      * fixed merge errors.
      
      * testing.
      
      * added test case.
      
      * more testing.
      
      * more testing.
      
      * bias wip.
      
      * debuggging.
      
      * fusing bias too.
      
      * disabled debug outputs.
      
      * removed debugging.
      
      * removed testing models.
      
      * removed debug models.
      
      * reset nbench.
      
      * clean up and testing.
      
      * removed debug code.
      
      * updated comments and var names.
      
      * rename var.
      
      * removed debug code.
      
      * removed debug code.
      
      * fixed compiler warnings.
      
      * refactored test.
      
      * Added const to parameter.
      
      * address some of the review comments.
      
      * added comments and addressed reviews.
      de760a38
  25. 21 Mar, 2018 2 commits
  26. 16 Mar, 2018 1 commit
  27. 11 Mar, 2018 2 commits
  28. 07 Mar, 2018 1 commit
  29. 06 Mar, 2018 1 commit
    • Nick Korovaiko's avatar
      op::Result ver3 (#594) · 5c7e9844
      Nick Korovaiko authored
      * the first stab at op::Result
      
      format fixes
      
      disabling logging
      
      op::Result, 2nd attempt
      
      purge stale code
      
      disable logging
      
      fix copyright header
      
      * initial cleanup
      
      * cleanup2
      
      * remove dead code
      
      * result.cpp, fix comments
      
      * fix comment
      5c7e9844
  30. 05 Mar, 2018 1 commit
    • Robert Kimball's avatar
      Include cleanup (#583) · cec89708
      Robert Kimball authored
      * cleanup
      
      * cleanup
      
      * fix all headers to be standalone as far as includes go
      
      * include cleanup
      
      * cleanup includes
      
      * cleanup
      
      * include tester
      
      * wip
      
      * cleanup
      
      * cleanup
      
      * cleanup
      cec89708
  31. 28 Feb, 2018 2 commits
  32. 27 Feb, 2018 1 commit
  33. 26 Feb, 2018 1 commit
    • Yixing Lao's avatar
      Initial support for hybrid transformer (#526) · 7f08b97b
      Yixing Lao authored
      * initial support for hybrid transformer
      
      * add broadcast_vector_rowwise_reversed for hybrid test
      
      * headerc
      
      * get function placement fix
      
      * conv ref test generator graph node in labmda fuction
      
      * rename map_parameter_to_source_node
      
      * type change map_parameter_to_source_node
      
      * use interpreter for numerical derivative
      
      * better comments
      7f08b97b