1. 14 Mar, 2018 1 commit
    • Yet another serialization option (#619) · 28602f31
      Robert Kimball authored
      * Add cpio file read/write class and unit tests
      
      add reserializer
      
      Add unit test for serialize constants to cpio file. Fix bug in serializer if function has no parameters.
  2. 05 Mar, 2018 1 commit
    • Include cleanup (#583) · cec89708
      Robert Kimball authored
      * cleanup
      
      * cleanup
      
      * fix all headers to be standalone as far as includes go
      
      * include cleanup
      
      * cleanup includes
      
      * cleanup
      
      * include tester
      
      * wip
      
      * cleanup
      
      * cleanup
      
      * cleanup
  3. 28 Feb, 2018 1 commit
  4. 27 Feb, 2018 1 commit
  5. 26 Feb, 2018 1 commit
    • Initial support for hybrid transformer (#526) · 7f08b97b
      Yixing Lao authored
      * initial support for hybrid transformer
      
      * add broadcast_vector_rowwise_reversed for hybrid test
      
      * headerc
      
      * get function placement fix
      
      * conv ref test generator graph node in lambda function
      
      * rename map_parameter_to_source_node
      
      * type change map_parameter_to_source_node
      
      * use interpreter for numerical derivative
      
      * better comments
  6. 21 Feb, 2018 1 commit
  7. 20 Feb, 2018 1 commit
  8. 14 Feb, 2018 3 commits
  9. 13 Feb, 2018 3 commits
  10. 09 Feb, 2018 5 commits
  11. 08 Feb, 2018 2 commits
  12. 07 Feb, 2018 1 commit
  13. 06 Feb, 2018 1 commit
  14. 05 Feb, 2018 1 commit
    • inline_function_call (#439) · bef56921
      Nick Korovaiko authored
      inline
      
      Inliner pass + tests
      
      debugging
      
      fix inliner failures due to the fact a random function is picked as an outermost one
      
      copyright headers
  15. 02 Feb, 2018 1 commit
    • GPU kernels for reshape, GEMM, EW ADD/Mult, Maximum · 1f6284ff
      Tristan Webb authored
      GPU ew add and mult cuBLAS calls
      
      GPU (A + B) * C with cuBLAS
      
      Additional gemm and gemv calls
      
      cmake updates for cuDNN calls
      
      kernels WIP
      
      params for dot gemm
      
      more kernel WIP
      
      memcpy wrappers
      
      aliased outputs, parameter, constant tensor memcopy
      
      comment cleanup
      
      remove cruft
      
      gpu faster gemm
      
      MNIST WIP
      
      Cleanup
  16. 01 Feb, 2018 1 commit
  17. 30 Jan, 2018 1 commit
    • fuse dot(a,b) + c (#418) · ea29c6e3
      Nick Korovaiko authored
      cblas_gemm working on mlp
      
      rebase & small fixes
      
      enable debug output
      
      support replacing function's outputs
      
      productizing CPUFusion
      
      addressing Bob and Jayaram's feedback
      
      removing json used for simplification tests
      
      adding comments
      
      fixing formatting errors and removing dead code
      
      TODO msg
      
      removing serializer changes
  18. 24 Jan, 2018 1 commit
    • Drwebb/gpu backend dot op (#413) · 94d80ffa
      Tristan Webb authored
      * Drwebb/gpu backend dot op (#387)
      
      * GPU Dot prod emitter switch statement
      
      * cuBLAS dot kernel call
      
      * Flesh out arg substitution into gpu dot kernel call
      
      * Drwebb/gpu backend dot op (#392)
      
      * Take in CodeWriter into gpu op emitters
      
      * Introduce GPU function gen based on pass functions
      
      * Additional gpu emitter stubs
      
      * link cublas in to unit test and ngraph
      
      * Use static code gen methods for GPU, add new GPU op stubs
      
      * use pass manager to declare functions / cublas Updates
      
      * Prune down gpu_external_function wip
      
      * Switch back to GPU tensor views in GPU backend
      
      * Pass in cublas handle to GPU external function
      
      * cuMalloc memory in gpu tensor view
      
      * Use cuda runtime malloc and free for tensor view management
      
      * change GPU tensor view init, and use GPU tensor view for GPU call frame
      
      * include headers as system dirs
      
      * GPU tensor printing utility function
      
      * cublasSetPointer to device mode / Fix copyright notification lowercasing
      
      * Passing GPU dot product test using cuBLAS
      
      Clean up
      
      * Changes from review
  19. 19 Jan, 2018 1 commit
  20. 11 Jan, 2018 2 commits
  21. 09 Jan, 2018 1 commit
  22. 05 Jan, 2018 1 commit
    • Drwebb/gpu runtime boilerplate (#314) · feab44b5
      Tristan Webb authored
      * Simple boilerplate for GPU runtime files
      
        - GPUBackend
        - GPU ExternalFunction
        - GPUManager
        - GPUCallFrame
      
      * Test for constructing all GPU runtime classes
      
      * Comment out calls, constructors haven't been defined
      
      * Clang CUDA source example to later test compiling
      
      Clang cuda example from:
      https://gist.github.com/anonymous/855e277884eb6b388cd2f00d956c2fd4
      
      * Initial nvptx compiler copied from CPU compiler sources
      
      * Define FunctionMap and Instruction for gpu external function
      
      * Rename Compiler -> NVPTXCompiler for gpu compile. Add call to compile for test
      
      * Rename StaticCompiler -> NVPTXStaticCompiler for GPU code gen
      
      * Add nvptx_compiler and nvptx_execution_engine to gpu sources
      
      * Compiling source unit test using hardcoded PTX
      
      * (a+b)*c test for GPU
      
      * WIP Fix compile
      
      * rmed accidentally included file
      
      * Fix compile, and LLVM link errors from nvptx_compiler.cpp
      
      * Stub out parts needed for GPU manager
      
      * Test GPU runtime method stubs
      
      * Cleanup
      
      * Add GPU runtime to same cmake block as GPU, include CUDA headers if GPU enabled
      
      * Kill reflexive assertion
      
      * change GPU naming convention to match CPU
      
      * Snake case functions and identifiers in test case
      
      * Change element type to match changes in master
      
      * Make CUDA headers accessible for codegen with GPU transformer
      
      * clang-format
      
      * apply-code-format
  23. 29 Dec, 2017 1 commit
    • Get value types out of public API, multi-values from Function (#340) · d092cb91
      Scott Cyphers authored
      * Function can have multiple results
      Remove external use of ValueType, TupleType, Tuple
      Remove many external uses of Output and Input
      
      * corresponding CPU backend changes
      
      * Update master changes.
      
      * Remove type arg from Function, add changes.md
      
      * Merge changes.
      
      * Move bodies to .cpp, add brief doc
      
      * Merge CPU changes.
      
      * Remove xla includes from non-xla files
      
      * Remove xla from tests
      
      * First part of xla tuple support
      
      * change fprop_cache to assume multi-output bprop functions
      
      * New wrappers for handling tuples with XLA
      
      * Review comments
      
      * remove old xla files
      
      * fix merge errors
      
      * hand edit models to use multi output instead of tuples
  24. 28 Dec, 2017 1 commit
  25. 21 Dec, 2017 2 commits
  26. 18 Dec, 2017 1 commit
    • Convolution forward prop (#294) · 122db5ff
      Adam Procter authored
      * Test GitHub-JIRA integration, nothing useful in this commit
      
      NGTF-388 #comment Testing JIRA integration
      
      * WIP on convolution
      
      * Type checking for convolution
      
      * Docstrings for convolution
      
      * Add convolution reference kernel; it works on some unit tests copied and pasted from my old branch.
      
      * Bugfix for dilated conv, and improvement to conv test generation
      
      * Remove get_arguments calls from convolution stuff
      
      * Add convolution to CPU; also a few fixes to the test generation stuff
      
      * Add copyright header to convolution ref script
      
      * Move copyright header to the correct place
      
      * A few more tests
      
      * Remove fallback behavior of blanking out the convolution ref file, since we're not generating it from the build system anymore
      
      * Delete stale comment
      
      * Merge stuff for the convolution ref script
      
      * Clean up rebase mess
      
      * Review comments
      
      * Review comment (n_foo -> foo_count)
  27. 13 Dec, 2017 1 commit
  28. 12 Dec, 2017 1 commit
  29. 05 Dec, 2017 1 commit
    • New Interpreter backend (#287) · 025a1b92
      Robert Kimball authored
      * New Interpreter backend
      
      * PR review comments
      
      * More PR fixes
      
      * oops
      
      * make autodiff tests backend aware
      
      * wip
      
      * wip
      
      * more ops
      
      * wip
      
      * fix merge error
      
      * merge fixes