1. 10 Aug, 2018 3 commits
  2. 09 Aug, 2018 6 commits
  3. 08 Aug, 2018 19 commits
  4. 07 Aug, 2018 12 commits
    • Jaikrishnan Menon's avatar
    • Jaikrishnan Menon's avatar
    • Jaikrishnan Menon's avatar
      4efcb76e
    • Nick Korovaiko's avatar
      DEX LRN (#1344) · c2e98505
      Nick Korovaiko authored
      * DEX LRN
      
      * merge after jbobba's changes
      c2e98505
    • Matthew Brookhart's avatar
      reduce fprop cache outputs (#1343) · efa2561e
      Matthew Brookhart authored
      * reduce fprop cache outputs
      
      * refactor traverse nodes
      
      * Slight refactor, add test, adress PR comments
      
      * fix formatting
      efa2561e
    • Jaikrishnan Menon's avatar
      DEX: Softmax (#1341) · f1c29c9c
      Jaikrishnan Menon authored
      * Add helper macros to select from a partial set of ranks and element types
      
      * CPU Direct Execution: Implement Softmax
      
      * Add softmax builder to the build script
      
      * Update
      f1c29c9c
    • Jaikrishnan Menon's avatar
    • dmyershov's avatar
      49d15902
    • Anna Alberska's avatar
      IntelGPU backend: And, Or operations (#1337) · 91a3bf87
      Anna Alberska authored
      * IntelGPU backend: And, Or operations
      
      * Code format update: intelgpu_backend.cpp and intelgpu_op_custom_kernels.cpp
      
      * Update logical operations
      91a3bf87
    • Fenglei's avatar
      cuda optimize softmax (#1310) · 154dc47a
      Fenglei authored
      * Updated softmax.
      
      * Formatting.
      
      * Updated convolution.
      
      * Use build_primitive overloading. Add helper to emit type_string given a node.
      
      * Formatting.
      
      * Update ConvolutionBackpropData.
      
      * convolution backprop & max pool memory primitive cacheing (#1303)
      
      * Updated ConvolutionBackpropFilters.
      * Update MaxPool.
      
      * Update Max and Min. (#1307)
      
      * softmax optimization
      
      * fix bug
      
      * fix bugs
      
      * clang format
      
      * remove comments
      
      * add softmax divide
      
      * fix bugs
      
      * fix bug
      
      * fix bug
      
      * clang format
      
      * remove unused header
      
      * register
      
      * using single parameters instead of array
      
      * using build_elementwise instead of build_elementwise_collective
      
      * remove workspace as csullivan suggested
      154dc47a
    • Anna Alberska's avatar
      IntelGPU backend: AvgPool operation(partially) (#1336) · 8db7b24b
      Anna Alberska authored
      * IntelGPU backend: AvgPool operation(partially)
      
      * Code format update intelgpu_backend.cpp
      
      * Delete code duplication in pooling ops intelgpu_backend.cpp
      8db7b24b
    • Chris Sullivan's avatar
      Auto. gen. kernel signatures and argument expansion (#1326) · 8476dea0
      Chris Sullivan authored
      * Add GPUKernelArgs for storing kernel arguments.
      
      * Formatting.
      
      * Resolve tensor addresses when extracting arg list via GPUKernelArgs.
      
      * Updated arg list resolution so that placeholder arguments can be added anywhere in the argument list.
      
      * const ref. args and changed add_args to use add_arg. also expanded type_names map.
      
      * GPUKernelArgs bug fix for return values.
      
      * add_placeholders expects pointers for later resolution
      
      * Formatting.
      
      * Add comments to GPUKernelArgs
      
      * Changed GPUKernelArgs interface to use a runtime variable number of arguments.
      
      * Removed/updated comment.
      
      * Address review comments: Remove combined address resolution and argument list retrieval. Remove unecessary extra type entries in type_map.
      
      * Add space between pragma once and includes.
      
      * Broadcast optimization (#1322)
      
      * Implement GPUKernelArgs with op::Broadcast.
      
      * Removed excess type insertion in kernel signature for broadcast impl.
      
      * Support new auto kernel signature generation for op::Broadcast. Add boolean to helpers to determine if parameters are registers or arrays.
      
      * Removed commented code.
      
      * Update broadcast impl. for new GPUKernelArgs interface.
      
      * Updated based on interface change to GPUKernelArgs.
      
      * Formatting.
      
      * CUDNNHostParameters now implement GPUHostParameters. (#1324)
      
      * Formatting.
      8476dea0