- 07 Feb, 2018 3 commits
-
-
Yixing Lao authored
* revert to shared argon lib * installs argon headers * -DINSTALLED_HEADERS_PATH=${CMAKE_INSTALL_PREFIX}/include * formatg * remove debug info * revert * revert back to cpp-master
-
Adam Procter authored
* Fix warnings in zero-sized tests; fix zero-sized tests return vector type for comparison ops * Revert changes to cpu_emitter
-
Adam Procter authored
* Enable CPU backprop tests * Fix to dot codegen for cases where n_reduction_axes != 1
-
- 06 Feb, 2018 3 commits
-
-
Sandeep authored
* test relu fusion for argon backend and enable permutation over max op
-
Adam Procter authored
1. AvgPoolBackprop type checking 2. Padding for max-pool op and kernel 3. Max pool backprop kernel, and integrating everything with CPU backend/serializer 3a. Add optional backpointer from the max pool backprop op to the forward prop op
-
adstraw authored
* modify existing autodiff unit tests to test fprop cache * cleanup * fix compile error introduced with bad merge * remove invalid negative/negative backwards power test
-
- 05 Feb, 2018 3 commits
-
-
Jayaram Bobba authored
-
Jayaram Bobba authored
-
Nick Korovaiko authored
inline Inliner pass + tests debugging fix inliner failures due to the fact a random function is picked as an outermost one copyright headers
-
- 03 Feb, 2018 2 commits
-
-
Robert Kimball authored
fix clone of function with multiple outputs
-
Scott Cyphers authored
-
- 01 Feb, 2018 1 commit
-
-
Nick Korovaiko authored
* simplification pass * serializer change to test models * some small test fixes * addressing Scott's feedback * missed one nn * formatting fixes * simplification -> reshape_elimination
-
- 31 Jan, 2018 1 commit
-
-
Nick Korovaiko authored
* bprop for avg pool remove debug statements + formatting * fix CPU test failures * numeric tests * use make_shared; unprotect c-tor
-
- 30 Jan, 2018 2 commits
-
-
Nick Korovaiko authored
cblas_gemm working on mlp rebase & small fixes enable debug output support replacing function's outputs productizing CPUFusion addressing Bob and Jayaram's feedback removing json used for simplification tests adding comments fixing formatting errors and removing dead code TODO msg removing serializer changes
-
Adam Procter authored
-
- 28 Jan, 2018 1 commit
-
-
Robert Kimball authored
-
- 24 Jan, 2018 3 commits
-
-
Tristan Webb authored
* Drwebb/gpu backend dot op (#387) * GPU Dot prod emitter switch statement * cuBLAS dot kernel call * Flush out arg substitution into gpu dot kernel call * Drwebb/gpu backend dot op (#392) * Take in CodeWriter into gpu op emitters * Introduce GPU function gen based on pass functions * Additional gpu emitter stubs * link cublas in to unit test and ngraph * Use static code gen methods for GPU, add new GPU op stubs * use pass manager to declare functions / cublas Updates * Prune down gpu_external_function wip * Switch back to GPU tensor views in GPU backend * Pass in cublas handle to GPU external function * cuMalloc memory in gpu tensor view * Use cuda runtime malloc and free for tensor view managment c * change GPU tensor view init, and use GPU tensor view for GPU call frame * include headers as system dirs * GPU tensor printing utility function * cublasSetPointer to device mode / Fix copyright notification lowercasing * Passing GPU dot product test using cuBLAS Clean up * Changes from review
-
Adam Procter authored
-
Scott Cyphers authored
* Remove TupleType, ValueType * Fix compile error.
-
- 23 Jan, 2018 1 commit
-
-
adstraw authored
* fix convlution reference script * convolution backprop * cleanup * fix build warnings * Missing include * fix build warning part 2 * move numeric_compare to its own header code review feedback * fix build warnings 3 * fix build warnings 4 * clang-format * cast to avoid implicit cast warning
-
- 20 Jan, 2018 3 commits
-
-
Robert Kimball authored
* wip * using namespace cleanup
-
Robert Kimball authored
-
Robert Kimball authored
* wip * wip * remove get_vector from runtime::TensorView class as it was for unit test only * cleanup * move writting vector to runtime::TensorView to the unit test dir * merge fix * PR review change * update from PR comment * update changes file
-
- 19 Jan, 2018 5 commits
-
-
Adam Procter authored
-
Adam Procter authored
-
Robert Kimball authored
* cleanup in-memory header files * add switch to enable memory sanitizer (works like valgrind) * removed header file cleanup as it was causing a segfault on program termination
-
Tristan Webb authored
* Add mention of blob ref of original file from caffe2 * Mention location of source listing originally from LLVM project
-
Adam Procter authored
* Average pool type checking and kernel; type checking tests * Fix and enable average-pool tests * Docstring fix * Extend AvgPool op type checking to support padding * Untested code for padded avg-pool * Unit tests for padded avg-pool * Add CPU implementation * Temp delete * Docstring fix * Docstring fix * Add tests mixing padding and stride * Temporary cut to ease merge * Restore temporary cut for merge * Empty commit to try to force CI to wake up
-
- 18 Jan, 2018 3 commits
-
-
Nick Korovaiko authored
-
Matthew Brookhart authored
-
Robert Kimball authored
* add test for empty tuple * fix null function breaking
-
- 17 Jan, 2018 3 commits
-
-
Robert Kimball authored
* add mxnet seq2seq forward and backward * add benchmarks for seq2seq forward and backward
-
Matthew Brookhart authored
* Numerically stable sum so we can pass mxnet unit tests * Add a small initial residual
-
Tristan Webb authored
* Initial GPU_ExternalFunction implementation Other changes: Add GPU runtime to same cmake block as GPU, include CUDA headers if GPU enabled Initial passing (a+b)*c test Properly link cuda libraries Simple GPUTensorView implementation Initial GPU emitter GPU codegen initial function gen, no kernels yet Rename GPU emitter and tensor_view_wrapper to match naming convention * GPU external function based on BASE * Fix stray base -> gpu * TensorViewWrapper -> GPU_TensorViewWrapper * Copy over emitter from base transformer * Fix for naming dense layout * Copy kernel emitters from base -> gpu and strip out kernel_utils * Add aliases to GPU_TensorViewWrappers * More fixes for naming descriptor::TensorViews * Move in call_frame implementation from base -> gpu * apply code format * GPU codegen running A+B*C gpu emitters gpu ctx setup cuda_module kernels Remove GPU_CF perf counters Use gpu kernels in external function Add GPU 1d dot test Review Changes: * Remove CPU specific kernel emitting method bodies * Use copy_data from test/util.cpp, uncomment compileTest * Use test_utils copy_data function * Grab function name from pass manager for def, clean up indentation
-
- 16 Jan, 2018 1 commit
-
-
Adam Procter authored
-
- 12 Jan, 2018 1 commit
-
-
Adam Procter authored
Sub-PR: image dilation tests (#362) via @adstraw
-
- 11 Jan, 2018 2 commits
-
-
Robert Kimball authored
* add interpreter nan check option * add unit test
-
Christian Convey authored
-
- 10 Jan, 2018 2 commits
-
-
Nick Korovaiko authored
* the first stab at pattern for sum test refactoring, debug msg clean up, formatting fixes removing v1 and cleaning up v2 + formatting rollback the changes in reduce_ops rename v2 -> sum_pred remove unused funcs switch to new c-tors remove TensorViewType removing an assert fix a docstring to match a c-tor * fixes after rebase
-
Adam Procter authored
-