1. 22 Jan, 2018 1 commit
    • L.S. Cook's avatar
      Scaffolding for userdocs (#402) · 2c048174
      L.S. Cook authored
      * adding work and organization for docs
      
      * Summary of changes
      
      - removed rst file from css folder
      - added substitutions to conf file so folks don't have to type out the whole name
      - added branding notice so link in footer works
      - added glossary so we can have a place for terms relative to libngraph
      - added placeholder for MXNet and TF frontend under new section
      - cleaned up api page
      
      * Fix code fencing error and indentation issue causing build to throw error
      
      * adding MXNet integration guide details and other miscellaneous structure
      
      * fix unicode error in rst epilog
      
      * testing with unicode marker for rst_epilog string
      
      * replace api scaffolding with TODO to avoid breaking Jenkins
      2c048174
  2. 21 Jan, 2018 1 commit
  3. 20 Jan, 2018 5 commits
  4. 19 Jan, 2018 7 commits
  5. 18 Jan, 2018 5 commits
  6. 17 Jan, 2018 5 commits
    • Robert Kimball's avatar
      f6a578b4
    • varun-intel's avatar
      remove a node from users (#379) · 981dabef
      varun-intel authored
      * remove a node from users
      
      * style
      981dabef
    • Robert Kimball's avatar
      Add mxnet seq2seq serialized model for benchmarking (#385) · 5ad1de22
      Robert Kimball authored
      * add mxnet seq2seq forward and backward
      
      * add benchmarks for seq2seq forward and backward
      5ad1de22
    • Matthew Brookhart's avatar
      Numerically stable sum so we can pass mxnet unit tests (#381) · b6c98de1
      Matthew Brookhart authored
      * Numerically stable sum so we can pass mxnet unit tests
      
      * Add a small initial residual
      b6c98de1
    • Tristan Webb's avatar
      Drwebb/gpu external function (#367) · c5549682
      Tristan Webb authored
      * Initial GPU_ExternalFunction implementation
      
      Other changes:
      
      Add GPU runtime to same cmake block as GPU, include CUDA headers if GPU enabled
      
      Initial passing (a+b)*c test
      
      Properly link cuda libraries
      
      Simple GPUTensorView implementation
      
      Initial GPU emitter
      
      GPU codegen initial function gen, no kernels yet
      
      Rename GPU emitter and tensor_view_wrapper to match naming convention
      
      * GPU external function based on BASE
      
      * Fix stray base -> gpu
      
      * TensorViewWrapper -> GPU_TensorViewWrapper
      
      * Copy over emitter from base transformer
      
      * Fix for naming dense layout
      
      * Copy kernel emitters from base -> gpu and strip out kernel_utils
      
      * Add aliases to GPU_TensorViewWrappers
      
      * More fixes for naming descriptor::TensorViews
      
      * Move in call_frame implementation from base -> gpu
      
      * apply code format
      
      * GPU codegen running A+B*C
      
      gpu emitters
      gpu ctx setup cuda_module kernels
      Remove GPU_CF perf counters
      Use gpu kernels in external function
      Add GPU 1d dot test
      
      Review Changes:
      * Remove CPU specific kernel emitting method bodies
      
      * Use copy_data from test/util.cpp, uncomment compileTest
      
      * Use test_utils copy_data function
      
      * Grab function name from pass manager for def, clean up indentation
      c5549682
  7. 16 Jan, 2018 3 commits
  8. 14 Jan, 2018 2 commits
  9. 12 Jan, 2018 1 commit
  10. 11 Jan, 2018 2 commits
  11. 10 Jan, 2018 4 commits
    • Nick Korovaiko's avatar
      Pattern matching for sum (#293) · 4345e39d
      Nick Korovaiko authored
      * the first stab at pattern for sum
      
      test refactoring, debug msg clean up, formatting fixes
      
      removing v1 and cleaning up v2 + formatting
      
      rollback the changes in reduce_ops
      
      rename v2 -> sum_pred
      
      remove unused funcs
      
      switch to new c-tors
      
      remove TensorViewType
      
      removing an assert
      
      fix a docstring to match a c-tor
      
      * fixes after rebase
      4345e39d
    • Adam Procter's avatar
      c5ffe8e9
    • Robert Kimball's avatar
      7b1dc3e3
    • Matthew Brookhart's avatar
      Switch from Eigen to OpenMP for loops for DS2 kernels (#345) · 7df687c1
      Matthew Brookhart authored
      * speed up reduceslice with kernel emitter
      
      * const-ify and fix a clang warning
      
      * add elementwise ops, slice to for loops
      
      * add broadcast codegen
      
      * add Exp
      
      * fix bugs introduced in eigen kernels
      
      * fix another introduced bug in Eigen
      
      * Fix an Atomic Bug with Sum, do some cleanup
      
      * unit tests pass
      
      * Add Reshape Op, passes Tests
      
      * rewrite sum to correctly handle muti-threading
      
      * Code Cleanup
      
      * add some extra unary ops
      
      * Address review comments
      
      * fix an error in the review comment refactor
      
      * Add Power op
      
      * Add (most) of the Logic Ops
      
      * Make Concat default to OpenMP kernel
      
      * fix n-D reshape issue
      7df687c1
  12. 09 Jan, 2018 3 commits
  13. 08 Jan, 2018 1 commit