- 26 Sep, 2018 12 commits
-
-
Adam Procter authored
-
Adam Procter authored
-
Adam Procter authored
-
Adam Procter authored
-
Adam Procter authored
-
Adam Procter authored
-
Adam Procter authored
-
Adam Procter authored
-
Adam Procter authored
-
Adam Procter authored
-
Adam Procter authored
-
Adam Procter authored
-
- 25 Sep, 2018 6 commits
-
-
Adam Procter authored
Move definition of StaticInitializers out of `#ifndef NGRAPH_DEX_ONLY`
-
shssf authored
* IntelGPU backend: Types workaround removal * PR1681. Fix style
-
Chris Sullivan authored
* Don't reuse memory for non-destructive ops. * Update comment.
-
Robert Kimball authored
-
Jaikrishnan Menon authored
* CPU Builder: Replace uint8_t with double in partial element type set * Keep support for uint8_t kernels
-
Robert Kimball authored
* update nbench output to be more consistent for single vs. directory benchmark. * wip * add types used in model * fix test * style * catch all exceptions when running any option
-
- 24 Sep, 2018 2 commits
-
-
Robert Kimball authored
* fix CPU dependency
-
Nick Korovaiko authored
* zero dim elem fix * switch to find * fix runtime errors
-
- 22 Sep, 2018 4 commits
-
-
shssf authored
-
Jaikrishnan Menon authored
-
shssf authored
-
shssf authored
* IntelGPU backend. Use common nGraph optimization passes * PR1667. Move variable definition.
-
- 21 Sep, 2018 16 commits
-
-
Nick Korovaiko authored
-
Nick Korovaiko authored
-
Amy Zhuang authored
* Add CPU horizontal fusion pass for inception. * Name change. * Move horizontal fusion to cpu_fusion. * Change horizontal fusion pass for inception to a general horizontal fusion pass. Add a unit test conv_horizontal_fusion to cpu_fusion. * Rename files. * Correct cpu_fusion.hpp. * Add NGRAPH_DEBUG. * Set native layout when input format of slice is nChw16c or nChw8c and lower bound of channels is not a multiple of 16 or 8.
-
Pruthvi authored
* added stream writers to dump meta information related to CONSTANT, INTERMEDIATE, PARAMETER's * WIP added method to dump Input and output reference address in to debug manifest from cpu_call_frame * added suppport to dump memory refernce in the debug manifest * - added macro to dump debug manifest - added support to release_function once dumping the memory references * added a method to write a generated stream to a file * dont release function, if NGRAPH_DEX_DEBUG flag set * Addressed PR comments - dump debug manifest using stringstream instead of CodeWriter - Moved dumping debug manifest logic to cpu_external_function * revert changes to call_frame * - Addressed PR comments * :Remove call_once from dumping debug_manifest * address PR comments * address PR comments
-
Nick Korovaiko authored
* extend concat elimination to fold slice + reshape * relax branch tip to be anything not just goe * add support for transpose when concat of slice + reshape occurred on internal axis. * simplify reshape order permutation * multi-axis slice + concat do not cancel and are now disabled. generalize detection of axis reordering when intermediate reshape is present to include logical reshape and reshape that results in axis reordering for the cases: parent_shape.size gt, eq, and lt concat_shape.size. * check that slices are in order * add one user check on reshape * add more checks * fix warnings * Reshape axis order did not include enough dimensions when the transposed and reshaped result was of lower rank.
-
shssf authored
-
Nick Korovaiko authored
* inject details into visualize_tree pass * address bob's feedback * revert back to map
-
Pruthvi authored
* - fixed performance_counter for CPU DEX * made changes to start clock @ the first functor and collected perf_counter @ the exceution of last functor for given kernel execution * - made changes to have max_pool kernel to have exactly one functor - assert if functor != num_of_ops * fixed profiler_counter for TBB execution * avoid unnecessary calculation to enables<map> value * Addressed PR comments - updated "enables" -> list<functors>
-
Robert Kimball authored
-
shssf authored
-
Sang Ik Lee authored
-
shssf authored
-
L.S. Cook authored
-
Jayaram Bobba authored
* Common pass registration for codegen and Dex * Make return indices optional for cpu workspace insertion
-
tsocha authored
-
tsocha authored
* [ONNX] LRN op * Review fix pt. 1
-