1. 20 Jul, 2018 12 commits
  2. 19 Jul, 2018 2 commits
    • L.S. Cook's avatar
      update version and add glossary defs (#1215) · 1df7602e
      L.S. Cook authored
      * update version and add glossary defs
      
      * clean up graph rewrite code blocks
      
      * PR feedback
      
      * add better details to LSTM def
      
      * RNN def generalized
      
      * adding fancy formulas to RNN def glossary entry
      
      * Address API breaking change in PR 1164
      
      * all of the documentation re default install path needed updated with pr 1164
      
      * Assert manual compilation process to build ngraph_dist locally as a sensible default
      1df7602e
    • shssf's avatar
      IntelGPUBackend: const, div, maxpool and max operations (#1234) · 8908c9df
      shssf authored
      * IntelGPUBackend: const, div, maxpool and max operations
      
      * IntelGPUBackend: negative, abs, relu, sqrt, tanh and substract operations
      
      * Update intelgpu_backend.cpp
      8908c9df
  3. 18 Jul, 2018 13 commits
  4. 17 Jul, 2018 2 commits
    • Jaikrishnan Menon's avatar
    • Jayaram Bobba's avatar
      Added more convolution variants to DEX (#1223) · 9bb0b653
      Jayaram Bobba authored
      * CPU Direct Execution: Implement ConvertLayout and refactor
      
      * CPU Direct Execution: Implement Convolution
      
      * 1) Adds computation reuse to direct execution
      2) Add avg_pool, broadcast and convolution_bias to direct execution
      3) Moved some computation reuse utility functions to graph_utils
      
      * Use lists instead of vectors to avoid reallocation overheads
      
      * - Added convolution variants to direct execution
      - Removed ConvolutionBiasRelu, use ConvolutionBias instead
      - Reduced code duplication by moving functionality to mkldnn_emitter
        from cpu_emitter
      
      * Style fix
      
      * Moved mkldnn build_convolution to a templated method
      
      * Style fix
      
      * refactored mkldnn conv bprop builders
      
      * Style fix
      9bb0b653
  5. 14 Jul, 2018 4 commits
  6. 13 Jul, 2018 7 commits
    • Chris Sullivan's avatar
      Refactored GPU backend state into BackendContext (#1186) · 55a25d41
      Chris Sullivan authored
      * Refactored GPU backend state into BackendContext and moved it to the highest level GPU_Backend.
      Some bugs have appeared in so doing. Needs investigation.
      
      * extra *block_size
      
      * change grid_size to threads
      
      * Bug fix in softmax cache parameters.
      
      * Additional bug fix for maxpool1d cache parameters.
      
      * Bug fix in softmax cache parameters.
      
      * Additional bug fix for maxpool1d cache parameters.
      
      * Remove temporary print statements.
      
      * Use nthreads in primitive hash.
      
      * Switched from using stack references for cudnn and cublas handles to heap pointers held only the c-struct GPURuntimeContext but managed by the GPU_Backend.
      
      * Refactored the use of GPURuntimeContext* ctx throughout the emitters.
      
      * Use std::prev instead of operator-- for memory iteratory capture
      
      * bug fix from abaf1d7
      55a25d41
    • dmyershov's avatar
      Backend/API: Implementation of the call method for IntelGPU (#1199) · 8bde818c
      dmyershov authored
      * Backend/API: Implementation of the call method for IntelGPU
      
      * intel_gpu_style_fix_1199
      
      * Copy memory from clDNN to Tensor
      
      * Code style fix in 1199.2
      8bde818c
    • Nick Korovaiko's avatar
      get_subgraph_outputs (towards checking that intermediate nodes in a matched graph not used) (#1207) · 83e7dba5
      Nick Korovaiko authored
      * get_subgraph_outputs
      
      * simplify the condition
      83e7dba5
    • Robert Kimball's avatar
      minor speed increase (#1218) · 33b54ce1
      Robert Kimball authored
      33b54ce1
    • Jaikrishnan Menon's avatar
      346f480f
    • Jayaram Bobba's avatar
      Jbobba/dex computation reuse (#1219) · 7d59542d
      Jayaram Bobba authored
      * CPU Direct Execution: Implement ConvertLayout and refactor
      
      * CPU Direct Execution: Implement Convolution
      
      * 1) Adds computation reuse to direct execution
      2) Add avg_pool, broadcast and convolution_bias to direct execution
      3) Moved some computation reuse utility functions to graph_utils
      
      * Use lists instead of vectors to avoid reallocation overheads
      
      * - Style fix
      
      * style fix
      7d59542d
    • Fenglei's avatar
      gpu_external_function and gpu constant memory refactor (#1189) · 260cb90d
      Fenglei authored
      * refactor external function
      
      * wokring version
      
      * fix bug
      
      * add emit_fucntions, emit_declare_constants, emit_declare_functions
      
      * add std::
      
      * add functions declaration
      
      * fix bugs
      
      * fix bugs
      
      * separate temp memory allocation and release
      
      * add invoke_constant_ptr function, clean up outputs for function
      
      * fix bugs, compiled ok
      
      * add ctx to emit_declare_constant
      
      * cleanup code, code style
      
      * remove using std, code style
      
      * revert std changes
      
      * change function names based Chris's comments
      
      * add ResultCopyElimination to pass_manager
      
      * clang format
      260cb90d