- 19 Sep, 2019 1 commit
-
-
Scott Cyphers authored
* Fix unsigned comparison and implicit double conversion
* Eliminate sign/unsigned compare warnings, implicit double conversion
* Missed type
* New int type
* omp compromise
* style
-
- 21 Jul, 2019 3 commits
-
-
Robert Kimball authored
-
Robert Kimball authored
-
Robert Kimball authored
-
- 17 Jul, 2019 1 commit
-
-
Adam Procter authored
* Re-enable -Wswitch and -Wswitch-enum
* Collapse identical switch cases
* Fix (I think) compilation errors
* Partially fix missing cases in GPU xformer, so clang will show me the rest of them...
* One last enum value
-
- 11 Jul, 2019 1 commit
-
-
Robert Kimball authored
-
- 09 Jul, 2019 2 commits
-
-
Robert Kimball authored
-
Robert Kimball authored
-
- 05 Jul, 2019 1 commit
-
-
Robert Kimball authored
-
- 05 Jun, 2019 1 commit
-
-
Robert Kimball authored
* remove tensor offset from tensor read/write calls because it was never used
* fix build errors
* fix build errors
* fix python test errors
* more python fixes
* revert change
* Make old version of read/write deprecated
* fix python read overload
* one more try to fix python binding
* fix python
* yet another try
* why is this so hard
* fix?
* add text to changes.md
-
- 02 Jun, 2019 1 commit
-
-
Jayaram Bobba authored
Avoid negative values in int64 initialization for cases where int64 parameters are used as indices (#3004)
-
- 29 Apr, 2019 1 commit
-
-
Robert Kimball authored
* Change warmup to run full, normal iterations
* style
-
- 26 Jan, 2019 1 commit
-
-
Robert Kimball authored
-
- 14 Jan, 2019 1 commit
-
-
Rob Earhart authored
-
- 12 Jan, 2019 1 commit
-
-
Robert Kimball authored
* first cut at raspberry pi backend
* rename rpi to generic cpu
* disable cursed test
-
- 03 Jan, 2019 1 commit
-
-
Robert Kimball authored
* update licenses for 2019
* style
-
- 19 Dec, 2018 1 commit
-
-
Ransford Hyman Jr authored
* Fixed the benchmark file to reflect the updated compile function usage. Compile returns the compiled function now
* Removed redundant compilation
-
- 28 Nov, 2018 1 commit
-
-
yimeisun123 authored
* Fix uninitialized parameter tensor data in nbench
* Remove const modifier in the loop to fix compilation error
* Add condition check before writing tensor data
* Remove the condition check when initializing parameter tensor data
-
- 14 Nov, 2018 1 commit
-
-
yimeisun123 authored
-
- 20 Oct, 2018 1 commit
-
-
Adam Procter authored
Fixes an error when building on macOS (with CPU backend disabled, but not sure if that makes a difference):
```
/Users/amprocte/Work/ngraph/src/tools/nbench/benchmark.cpp:36:29: error: use of undeclared identifier '_MM_FLUSH_ZERO_ON'
    _MM_SET_FLUSH_ZERO_MODE(_MM_FLUSH_ZERO_ON);
                            ^
/Users/amprocte/Work/ngraph/src/tools/nbench/benchmark.cpp:37:33: error: use of undeclared identifier '_MM_DENORMALS_ZERO_ON'
    _MM_SET_DENORMALS_ZERO_MODE(_MM_DENORMALS_ZERO_ON);
```
-
- 19 Oct, 2018 1 commit
-
-
Jayaram Bobba authored
-
- 30 Sep, 2018 1 commit
-
-
Robert Kimball authored
-
- 29 Sep, 2018 1 commit
-
-
Robert Kimball authored
* rename files
* rename runtime TensorView to Tensor
* rename HostTensorView to HostTensor
-
- 27 Sep, 2018 1 commit
-
-
Robert Kimball authored
* remove get_tensor method
* remove get_element_count as it is redundant to get_size
* fix a few issues with examples and python wrapper
* rename get_size to get_element_count and add get_size_in_bytes method
-
- 12 Sep, 2018 1 commit
-
-
Robert Kimball authored
* add option to copy input/output data for each iteration
* add support for stale buffers
-
- 30 Aug, 2018 1 commit
-
-
Robert Kimball authored
* only print details if details enabled
* refactor print routines to main file
* refactor for multi dir support
* dir support prints nice results
-
- 29 Aug, 2018 1 commit
-
-
Robert Kimball authored
* use line comments instead of multiline comments for license header
* update more
* update new files
* more header updates
* style
-
- 08 Aug, 2018 1 commit
-
-
Jaikrishnan Menon authored
* Add an option to exclude the first iteration
* Switch to warmup iterations
* Cleanup
-
- 03 Aug, 2018 1 commit
-
-
Robert Kimball authored
* compiles but does not link
-
- 17 Jul, 2018 1 commit
-
-
Jaikrishnan Menon authored
-
- 14 Jul, 2018 1 commit
-
-
Fenglei authored
* using async gpu timers
* remove sync for cuda calls, add async gpu stopwatch, add count to timing-detail
* add debug sync
* make timer static
* move timer to runtime context
-
- 03 Jul, 2018 1 commit
-
-
Robert Kimball authored
* nbench cleanup
* update style
-
- 08 May, 2018 1 commit
-
-
Jayaram Bobba authored
* Make temp memory pools static to avoid memory allocation overheads
* Initial implementation for graph control to enable caching and computation reuse
* Added sphinx documentation
* Turned off memory buffer reuse in CPU transformer to support computation reuse. Added unit test
* Change memoizable to cacheable
* Change memoizable to cacheable
* Rename variables
-
- 20 Apr, 2018 1 commit
-
-
Robert Kimball authored
* Move runtime::Manager functionality into runtime::Backend
* Remove unused files
* remove obsolete function
-
- 13 Apr, 2018 1 commit
-
-
Robert Kimball authored
* remove deprecated
* remove all legacy Backend API usage remove deprecated files
* pull in changes from master
* fix GPU calls
* disable tests in convolution generator
* update per PR comments. Enable performance counter feature.
* update per PR comments
* fix build error
* fix conditionally compiled test :(
-
- 02 Apr, 2018 1 commit
-
-
Robert Kimball authored
* add random init for all input types, not just float
* remove debug
-
- 29 Mar, 2018 1 commit
-
-
Robert Kimball authored
-
- 21 Mar, 2018 1 commit
-
-
Yixing Lao authored
Adjust CallFrame argument order to match Function
-
- 11 Mar, 2018 1 commit
-
-
Robert Kimball authored
* fix detailed timing flag
* more detailed info
-
- 05 Mar, 2018 1 commit
-
-
Robert Kimball authored
* cleanup
* cleanup
* fix all headers to be standalone as far as includes go
* include cleanup
* cleanup includes
* cleanup
* include tester
* wip
* cleanup
* cleanup
* cleanup
-