1. 14 May, 2018 5 commits
  2. 11 May, 2018 9 commits
  3. 10 May, 2018 5 commits
  4. 09 May, 2018 4 commits
    • Add op::Or and op::And to GPU transformer (#979) · 8508410f
      Chris Sullivan authored
      * Moved emit_elementwise implementation into CUDAEmitter and added logical_and and logical_or ops.
      
      * Updated comment and formatting.
      
      * Added check for multi-output elementwise ops.
    • RTLD_GLOBAL fix codegen link (#984) · 7bc6b785
      Yixing Lao authored
    • CUDNN and CUDA kernels for AvgPool (forward/backward) (#951) · b1b3d4d6
      Chris Sullivan authored
      * Added op::AvgPool cuDNN implementation, which works for 2-3 spatial dimensions and no padding or symmetric padding. Enabled tests.
      
      * Added cuda-c implementation of average pool which handles 1-3 spatial
      dimensions as well as asymmetric padding. This commit also introduces
      several helper functions for performing fast integer division and
      fast constant memory access.
      
      * Formatting. Removed bool that was used for testing to force the cuda impl. over cudnn.
      
      * Added CUDNN AvgPoolBackprop implementation.
      
      * Removed inline enum in favor of a helper struct. Removed instances of multiple declarations on a single line. Updated comments.
      
      * Removed the _ prefix from helper functions in the anonymous namespace.
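The AvgPool commit above mentions helper functions for fast integer division but does not show them. As a rough illustration only, the sketch below shows one common way to turn division by a fixed divisor into a multiply and shift, using a precomputed 64-bit reciprocal. `FastDivisor`, `divide`, and `remainder` are hypothetical names, not taken from the nGraph source, and the actual helpers may use a different scheme.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper (not from the nGraph source): precomputes a 64-bit
// reciprocal so that n / divisor becomes two multiplies and a shift.
struct FastDivisor
{
    uint32_t divisor;
    uint64_t magic; // ceil(2^64 / divisor); unused when divisor == 1

    explicit FastDivisor(uint32_t d)
        : divisor(d)
        , magic(d > 1 ? UINT64_MAX / d + 1 : 0)
    {
        assert(d != 0);
    }

    // Returns n / divisor for any 32-bit n.
    uint32_t divide(uint32_t n) const
    {
        if (divisor == 1)
        {
            return n; // ceil(2^64 / 1) does not fit in 64 bits
        }
        // High 64 bits of the 96-bit product magic * n, built from two
        // 32x32 -> 64 multiplies so no 128-bit type is needed.
        uint64_t hi = (magic >> 32) * n;
        uint64_t lo = (magic & 0xFFFFFFFFull) * n;
        return static_cast<uint32_t>((hi + (lo >> 32)) >> 32);
    }

    // Returns n % divisor, derived from the quotient.
    uint32_t remainder(uint32_t n) const
    {
        return n - divide(n) * divisor;
    }
};
```

A pooling kernel could use such a helper to split a flat output index into coordinates, for example `row = by_width.divide(idx)` and `col = by_width.remainder(idx)`, with the reciprocal computed once on the host.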
    • Expose ngraph unit test util as a library (#980) · 9e6d67f2
      Yixing Lao authored
      * create ngraph_test_util
      
      * installs libngraph_test_util
  5. 08 May, 2018 15 commits
    • Leona/doc cleanup 2 (#946) · 22ea1f95
      L.S. Cook authored
      * doc updates
      
      * test add section on transformers to graph basics
      
      * Fix typo on abs
      
      * Adding more background and detail for graph-building concepts unique to nGraph
      
      * First pass at updating nGraph basics for StackOverflow kinds of questions
      
      * Forgot to add a file
      
      * Update for new naming and capitalization conventions
      
      * add edits from first PR review
      
      * More updates from PR review
    • Minor changes for AllReduce (#944) · e30b3c61
      Sevin F. Varoglu authored
      * Minor changes for AllReduce
    • [cuDNN:Part 1] minimal refactoring of op::reduce (#965) · 682f7b04
      Chris Sullivan authored
      * Refactored the cudnn reduce kernel to use the nGraph Shape -> cudnnTensorDescriptor cudnn helpers that the other kernels use.
      
      * Added caching to the cuDNN reduce op.
      
      * Adding back hashing call before returning primitive index to op::Reduce (bug fix).
      
      * [cuDNN:Part 2] Descriptor Creation/Destruction refactoring (#969)
      
      * Added a cuDNN descriptor factory which manages the construction and destruction of cuDNN descriptors.
      It correctly calls Create/Destroy based on the cuDNN descriptor type. Previously the Destroy functions were not being called.
      
      * Removed commented code and changed class to struct on cudnn_descriptor.
      
      * Added comments and updated a few variable names.
      
      * Clang-compiled cuDNN kernels (those not part of CUDNNEmitter)
      now use the CUDNNDescriptor factory.
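The descriptor factory described in the commit above pairs each Create call with the matching Destroy call for that descriptor type. The sketch below illustrates that ownership pattern under stated assumptions: it uses mock descriptor types and mock create/destroy functions instead of the real cuDNN API so that it compiles without cuDNN, and every name in it (`DescriptorFactory`, `DescriptorTraits`, `MockTensorDesc`, `MockFilterDesc`, `live_count`) is hypothetical rather than taken from the nGraph source.

```cpp
#include <memory>
#include <vector>

// Mock stand-ins for two descriptor types (assumption: the real code
// would wrap cuDNN descriptor handles here instead).
struct MockTensorDesc {};
struct MockFilterDesc {};

// Counts descriptors that have been created but not yet destroyed.
inline int& live_count()
{
    static int n = 0;
    return n;
}

// Maps each descriptor type to its own create/destroy pair.
template <typename T>
struct DescriptorTraits;

template <>
struct DescriptorTraits<MockTensorDesc>
{
    static MockTensorDesc* create() { ++live_count(); return new MockTensorDesc; }
    static void destroy(MockTensorDesc* d) { --live_count(); delete d; }
};

template <>
struct DescriptorTraits<MockFilterDesc>
{
    static MockFilterDesc* create() { ++live_count(); return new MockFilterDesc; }
    static void destroy(MockFilterDesc* d) { --live_count(); delete d; }
};

// Owns every descriptor it builds and destroys each one with the
// destroy function that matches its type when the factory goes away.
class DescriptorFactory
{
public:
    template <typename T>
    T* build()
    {
        T* raw = DescriptorTraits<T>::create();
        // Captureless lambda converts to a plain function pointer that
        // remembers the concrete type T for destruction.
        void (*deleter)(void*) = [](void* p) {
            DescriptorTraits<T>::destroy(static_cast<T*>(p));
        };
        m_owned.emplace_back(static_cast<void*>(raw), deleter);
        return raw;
    }

private:
    std::vector<std::unique_ptr<void, void (*)(void*)>> m_owned;
};
```

Callers receive a raw pointer they can hand to a kernel but never free themselves. Tying destruction to the factory's lifetime is what prevents the leak the commit message describes, where the Destroy functions were previously never called.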
    • [Py] (Partially) automatic generation of python API doc (#973) · 11ec6449
      Adam Rogowiec authored
      * First version of nGraph Python API documentation pages.
      
      * Remove use of (`) apostrophes since they generate Sphinx warnings.
      
      * Fix submodule rendering in Sphinx PythonAPI docs
      
      * Add updated index and conf py files
      
      * Updated version of Python API doc.
      
      * Better description for the autosummary
      
      * FIX: Disable autogeneration of summary files in documentation.
      
      * Remove redundant file containing duplicate information.
      
      * Remove unused Sphinx extensions.
      
      * (Partially) automatic generation of python API doc.
      
      - Detailed documentation is generated automatically
      - Summaries are generated automatically for all files except runtime.py
        which is parsed incorrectly.
      - No additional steps are required: just run `make html`. Under the hood,
        a script is launched that generates all required files in a temporary
        directory under `/tmp/sphinx_auto_py_doc/` and then copies them to the
        docs directory.
      - Remove `autosummary_generate` from the configuration file, since a
        custom script now performs this step.
      
      * Review update.
      
      * Automatically generated stub files for doc.
      
      - Added a few empty lines that had been removed during a recent merge.
    • add gpu concat op (#931) · 57d58e50
      Fenglei authored
      * add concat op
      
      * change to concat
      
      * add more code for gpu concat
      
      * version that compiles successfully but still has a bug
      
      * add emit_concat_op
      
      * runnable, but with wrong result
      
      * working version
      
      * add some comments
      
      * delete old comments.
      
      * delete old comments.
      
      * remove bug doxygen comments
    • Nick Korovaiko's avatar
    • Fixes compiler warning. (#974) · acc4e46d
      Christian Convey authored
    • update mkldnn to version v0.14 (#971) · 28797f78
      Pruthvi authored
    • Robert Kimball's avatar
    • add apt-transport-https for contrib/docker/Dockerfile for GPU on Ubuntu 16.04 (#953) · c06769d9
      DawnStone authored
      * workaround to avoid apt-get update error for GPU docker image build on Ubuntu 16.04: The method driver /usr/lib/apt/methods/https could not be found.
      
      * removed extraneous proxy variables from the contrib/docker/Dockerfile for GPU on Ubuntu 16.04
    • Algebraic Simplification for Product (#949) · 659d2565
      Nick Korovaiko authored
      * product simplifier
      
      * char -> signed char
    • Computation reuse (#945) · 41c50b44
      Jayaram Bobba authored
      * Make temp memory pools static to avoid memory allocation overheads
      
      * Initial implementation for graph control to enable caching and computation reuse
      
      * Added sphinx documentation
      
      * Turned off memory buffer reuse in CPU transformer to support computation reuse. Added unit test
      
      * Change memoizable to cacheable
      
      * Change memoizable to cacheable
      
      * Rename variables
    • DawnStone's avatar
    • [Py] Fix Python API doc. (#921) · 882e721c
      Adam Rogowiec authored
      * First version of nGraph Python API documentation pages.
      
      * Remove use of (`) apostrophes since they generate Sphinx warnings.
      
      * Fix submodule rendering in Sphinx PythonAPI docs
      
      * Add updated index and conf py files
      
      * Updated version of Python API doc.
      
      * Better description for the autosummary
      
      * FIX: Disable autogeneration of summary files in documentation.
    • MaxPoolWithIndices (#900) · a174c8c9
      Nick Korovaiko authored
      * MaxPoolWithIndices CPU Fusion
      
      * fix test to pass checks in cpu_fusion
      
      * pass test
      
      * clean up
      
      * add a new pass, add layouts
      
      * remove the opt from cpu_fusion
      
      * refactor cpu_layout logic for maxpool, clean up comments
      
      * add comment w.r.t. indices tensor
      
      * rename to cpu_workspace_insertion
      
      * add CPUWorkspaceInsertion pass for TF
  6. 07 May, 2018 2 commits