- 02 Aug, 2018 11 commits
-
-
Nick Korovaiko authored
* lrn init * fix comment * mkldnn lrn (#1295) * add serializer + fix compiler warnings
-
Jaikrishnan Menon authored
* Fix the first_iteration flag so it works when more than one call-frame exists. Static variables defined in lambda expressions are not private to a lambda, so move this to the runtime context * Shave off a few microseconds by initializing intermediates exactly once * Make all execution paths use first_iteration in the runtime context
-
Michał Karzyński authored
* [Py] Add __repr__ to Strides and CoordDiff * Apply clang-format * Repr fix * Apply clang-format
-
Michał Karzyński authored
* [Py] Add convolution_backprop_data to API * Conv fix
-
Chris Sullivan authored
* Updated softmax. * Formatting. * Updated convolution. * Use build_primitive overloading. Add helper to emit type_string given a node. * Formatting. * Update ConvolutionBackpropData. * convolution backprop & max pool memory primitive caching (#1303) * Updated ConvolutionBackpropFilters. * Update MaxPool. * Update Max and Min. (#1307)
-
Fenglei authored
* move add,mult,min,max,sqrt to elementwise_op, increase op per threads
-
Amy Zhuang authored
* Implement trigonometric ops for direct execution. * Rename files.
-
Robert Kimball authored
* build on suse w/gcc 4.8.5 * fix SUSE build error * add comments * remove template function * update per review comment * fix nan check emitted code
-
varun-intel authored
* updated * type prop * disable test in manifest * try to exclude * style * double * double * more * style * more * vecs * fix goe
-
Robert Kimball authored
-
Jaikrishnan Menon authored
* Work around some buggy (and deprecated) rpath directives * Add missing newline * Revert "Add missing newline". This reverts commit 95aebb7f14850afcd59c53ece0bb4663b8c38660. * Encoding fixes
-
- 01 Aug, 2018 6 commits
-
-
Louis Feng authored
* hacking to support dot of 3 by 2 inputs with gemm_batch. * clean up. * testing inplace reshape. * fixed a compile error. * added comments on todo. * check for output. * check for annotation. * more optimizations WIP. * sum simd. * moved parallel for * testing sum vectorization. * fixed merge errors. * sum wip. * more logic. * sum refactor and clean up. * clean up. * removed unrelated changes. * removed related changes from merge. * fixed clang compile errors.
-
shssf authored
-
Fenglei authored
* move to cuda_emitter * fix bug, clang format * size_t to uint32_t * reverse_axes * add rank back, clang format * remove unused code and file * remove unused code and file * manually merge with master
-
Anna Alberska authored
* IntelGPU backend: Power, Sigmoid and ReluBackprop operations * style changed to ReluBackprop * Update intelgpu_backend.cpp
-
dmyershov authored
-
Nick Korovaiko authored
* rank3xrank2 cpu_emitter version 1 * refactoring matmulbias * add comment
-
- 29 Jul, 2018 2 commits
-
-
shssf authored
* IntelGPU backend: Dot operation (partially implemented) * PR1275. Debug output deleted. * PR1275. Comments addressed
-
Chris Sullivan authored
* Broadcast and Pad bug fix. * Added NO_THROW version of the cuda error checking defines. Now utilizing these in dtors. This reverts commit 68d9d6eafb1475c83c47229ab3c784c3d392ddbd. * Revert "Broadcast and Pad bug fix." This reverts commit 099c79792a2e7b9b8727b48de90f623953691f4c.
-
- 28 Jul, 2018 3 commits
-
-
rsketine authored
* Update Dockerfile: took out make -j 8 as we are running out of virtual memory; added df -k to get the disk space info in logs. * Update .travis.yml: as we bring up 3 containers here, I am reducing to 2; if these are deployed on different machines, this should not matter. * Update .travis.yml: 3 in parallel should pass, and 2 of them are already passing, so trying this out.
-
shssf authored
* IntelGPU backend: Pad operation * PR1267. Comments addressed
-
Amy Zhuang authored
* Add TBB flow graphs to DEX. * Make edges from dummy start node to head nodes when traversing nodes. * Use static_cast to cast TBB graph node. Undefine __TBB_PREVIEW_LIGHTWEIGHT_POLICY. * Code formatting. * Remove clang -Wreserved-id-macro warning.
-
- 27 Jul, 2018 8 commits
-
-
Amy Zhuang authored
* Add NGRAPH_INTRA_OP_PARALLELISM to control size of thread pools. * Initialize variable.
-
Nick Korovaiko authored
-
Nick Korovaiko authored
-
Nick Korovaiko authored
-
tsocha authored
* Update input_validation.py * Update runtime.py * Update types.py * Update broadcasting.py
-
Fenglei authored
* optimize concat * compile success * multi inputs * clang format
-
Adam Procter authored
* Testing out some ideas for better error messages on AvgPool * Add uncaught_exception() check to ConstructionAssertLogger dtor * More general assertion class, not homed inside Node * Minor formatting change * NODE_ASSERT for type prop failure * Produce lighter-weight DummyAssertionHandler when assertion succeeds * New ctor for AssertionHelper that takes a single location arg; more const&-ness for the constructors * Remove move constructor for AssertionHelper; fix broken test in assertion.cpp * Miscellaneous improvements * Templatized AssertionHelper so different exception classes can be used; implemented TYPE_CHECK_ASSERT around this * Changed from a "stack" of locations to a single location (the stack was too complicated) * Added "FAIL" classes/macros which do not take a condition * Rename a helper function * Cleanup, cruft removal * Add test to make sure the assert helper has the lifetime we expect * Missing includes
-
L.S. Cook authored
-
- 26 Jul, 2018 4 commits
-
-
Robert Kimball authored
-
L.S. Cook authored
* editing how to execute computation file for clarity and linenos * Add placeholder for runtime docs * Update section on backends, interpreter, and FPGA options * add updated master to fix python_ci * Weird autosummary issue reverted * Clarify new section * fix up docs * Update pattern matcher doc based on Nik's presentation slides WIP * Update doc structure and examples * remove old folder * Fix broken Tensorview refs * new section on distr training * updated index w/drafted outline * helping people document code more efficiently * edit WIP branch * WIP editing * WIP editing * init distributed doc * PR review edits * modify dist doc and dist mnist_mlp * Finish PR review comment fixes so far * Improving distributed training docs * Fix build error now that we have documented interface backends use * update example build and run * update how-to distributed training doc * Editing distr train docs * Reword section to avoid strange doc build error * rebuild for zero errors for CI * split patternmatcher PR * PR feedback added * Add more help and detail for MXNet and neon distr * Resolve merge conflicts due to patternmatcher doc split * Resolve merge conflicts due to patternmatcher doc split * Resolve build errors manually * These files are already added to the branch * fix style * update with glossary def and link to Intel paper on synchronous SGD * fix link to sgd * remove comm_rank in dist example
-
Chris Sullivan authored
* Broadcast and Pad bug fix.
-
shssf authored
* IntelGPUBackend: Broadcast operation * IntelGPUBackend: more tests for Broadcast operation * Move macro to static C function in Broadcast tests
-
- 25 Jul, 2018 1 commit
-
-
Jayaram Bobba authored
CPU Direct Execution Part 4
-
- 23 Jul, 2018 5 commits
-
-
Jaikrishnan Menon authored
-
Jaikrishnan Menon authored
-
Jaikrishnan Menon authored
-
Jaikrishnan Menon authored
-
Jaikrishnan Menon authored
This reverts commit 549a4fd1.
-