- 08 Mar, 2018 1 commit
-
-
Chris Sullivan authored
* Added GPU emitter for op::Result. For now it simply copies the output tensor. All but 3 tests now pass. The remaining failing tests are: * GPU.dot_0_0 * GPU.dot_matrix_2x0_0x2 * GPU.dot_2x0_0 * Removed call to handle memory aliasing in gpu_external_function. * fix gpu emitter bug that will return in the middle of function * Merge pull request #609 from NervanaSystems/tfl/fix_return_bug fix gpu emitter bug that will return in the middle of function * GPU backend skips added for recent softmax test and updated aliased output test that uses op::Constant.
-
- 02 Mar, 2018 1 commit
-
-
adstraw authored
add softmax op and documentation
-
- 27 Feb, 2018 1 commit
-
-
Scott Cyphers authored
* Replace using aliases with actual classes
-
- 26 Feb, 2018 1 commit
-
-
Yixing Lao authored
* initial support for hybrid transformer * add broadcast_vector_rowwise_reversed for hybrid test * headerc * get function placement fix * conv ref test generator graph node in labmda fuction * rename map_parameter_to_source_node * type change map_parameter_to_source_node * use interpreter for numerical derivative * better comments
-
- 23 Feb, 2018 1 commit
-
-
Christian Convey authored
-
- 21 Feb, 2018 3 commits
-
-
fenglei.tian authored
-
fenglei.tian authored
-
Sandeep authored
* relu for interpreter * relu in serializer * core fusion * relu backprop * relu backprop and test interpreter * core fusion for CPU * COREFusion -> CoreFusion * relu MKL dnn
-
- 20 Feb, 2018 1 commit
-
-
pthoreho authored
-
- 16 Feb, 2018 2 commits
- 14 Feb, 2018 1 commit
-
-
fenglei.tian authored
-
- 12 Feb, 2018 1 commit
-
-
Robert Kimball authored
* fix Shape declarations
-
- 08 Feb, 2018 1 commit
-
-
Jennifer Myers authored
-
- 07 Feb, 2018 1 commit
-
-
Adam Procter authored
* Enable CPU backprop tests * Fix to dot codegen for cases where n_reduction_axes != 1
-
- 06 Feb, 2018 1 commit
-
-
adstraw authored
* modify existing autodiff unit tests to test fprop cache * cleanup * fix compile error introduced with bad merge * remove invalid negative/negative backwards power test
-
- 31 Jan, 2018 1 commit
-
-
Nick Korovaiko authored
* bprop for avg pool remove debug statements + formatting * fix CPU test failures * numeric tests * use make_shared; unprotect c-tor
-
- 23 Jan, 2018 1 commit
-
-
adstraw authored
* fix convlution reference script * convolution backprop * cleanup * fix build warnings * Missing include * fix build warning part 2 * move numeric_compare to its own header code review feedback * fix build warnings 3 * fix build warnings 4 * clang-format * cast to avoid implicit cast warning
-
- 20 Jan, 2018 1 commit
-
-
Robert Kimball authored
* wip * wip * remove get_vector from runtime::TensorView class as it was for unit test only * cleanup * move writting vector to runtime::TensorView to the unit test dir * merge fix * PR review change * update from PR comment * update changes file
-
- 18 Jan, 2018 1 commit
-
-
Nick Korovaiko authored
-
- 08 Jan, 2018 1 commit
-
-
Adam Procter authored
-
- 29 Dec, 2017 1 commit
-
-
Scott Cyphers authored
* Function can have multiple results Remove external use of ValueType, TupleType, Tuple Remove many external uses of Output and Input * corresponding CPU backend changes * Update master changes. * Remove type arg from Function, add changes.md * Merge changes. * Move bodies to .cpp, add brief doc * Merge CPU changes. * Remove xla includes from non-xla files * Remove xla from tests * First part of xla tuple support * change fprop_cache to assume multi-output bprop functions * New wrappers for handling tuples with XLA * Review comments * remove old xla files * fix merge errors * hand edit models to use multi output instead of tuples
-
- 21 Dec, 2017 2 commits
-
-
Robert Kimball authored
* remove ngvm * remove NGVM from cmake
-
Robert Kimball authored
* remove ParameterizedConstant * use simpler element Type definition * Move TraitedType to NGVM directory
-
- 11 Dec, 2017 1 commit
-
-
Adam Procter authored
* WIP generalized dot * Add some multi-axis 3D, 4D, and 5D tests * Add test on some 'pretty big' tensors * Reworked dot to have less flexible axis-pairing behavior * Backprop for dot... and a fix for a dumb bug in CoordinateTransform * Forgot to commit some stuff in merge * Disable tests that currently don't work on CPU * Fix temporarily disabled test that should pass on NGVM and INTERPRETER but wasn't due to new axis-selection convention for dot * Remove obsolete ScalarTensorProduct kernel/instruction * Review comment * s/n_dot_axes/dot_axis_count/ * s/dot_axis_count/reduction_axes_count/ * Adapt CPU emitter dot fallback to new kernel
-
- 05 Dec, 2017 1 commit
-
-
Robert Kimball authored
* New Interpreter backend * PR review comments * More RP fixes * oops * make autodiff tests backend aware * wip * wip * more ops * wip * fix merge error * merge fixes
-
- 29 Nov, 2017 1 commit
-
-
Adam Procter authored
* Autodiff for abs * Formatting, more tests for abs * Cos autodiff; also a clarifying comment in the abs test * Forgot cos.cpp * Sin autodiff * Again, forgot to add sin.cpp :/ * Tan autodiff * Minor formatting tweak * Commit partial work on select backprop so I can run valgrind on the server :/ * Fix boolean thingy so it works on Linux * Autodiff for ceiling, convert (untested), floor, sign. Fix unit test for tan. Implement ceiling, floor in VM. * Fix bug in abs/sign unit test ranges * Add multiplicative inverse and square-root ops (needed for hyperbolic trig autodiff) * Better formula for sqrt adjoints * Autodiff for hyperbolic trig ops * Forgot to add cpp files for hyperbolics * Remove inv (don't need it after all); also formatting (oops) * fix bug with Convert autodiff * Autodiff for concat * Restore the accidentally-commented-out unit test for abs * Formatting * Fix 'unordered_map.at' exception when Adjoints::Adjoints visit a node that has never been add_delta'd; remove workarounds for that bug * Fix erroneous 'not implemented' docstring for op::Not * Autodiff for sum * Checking in broken support for replace_slice so I can test it with valgrind on the dev box * Fix unit test bug with tensor initialization; formatting * Implement replace-slice in CPU backend * Autodiff for Slice and ReplaceSlice * Tweak docs for ReplaceSlice * Remove no-longer-needed cast of arg list to runtime::TensorView
-
- 25 Nov, 2017 2 commits
-
-
Adam Procter authored
-
Robert Kimball authored
* ParameterizedTensorView and ElementType cleanup * change element to_type() to from()
-
- 08 Nov, 2017 1 commit
-
-
Scott Cyphers authored
-
- 02 Nov, 2017 2 commits
-
-
Adam Procter authored
-
Adam Procter authored
-
- 01 Nov, 2017 1 commit
-
-
Adam Procter authored
-
- 30 Oct, 2017 2 commits
-
-
Robert Kimball authored
-
Robert Kimball authored
-
- 27 Oct, 2017 1 commit
-
-
Scott Cyphers authored
-
- 20 Oct, 2017 3 commits
-
-
Scott Cyphers authored
-
Scott Cyphers authored
-
Scott Cyphers authored
-
- 19 Oct, 2017 1 commit
-
-
Scott Cyphers authored
Multiply and Add numeric/autodiff unit tests pass
-