- 09 Feb, 2018 6 commits
-
-
Louis Feng authored
-
Tristan Webb authored
* GPU kernels for reshape, GEMM, EW ADD/Mult, Maximum (A + B) * C test now with cuBLAS Additional gemm and gemv calls cmake updates for cuDNN calls memcpy wrappers in gpu_util Additional passing tests: aliased outputs, parameter, constant tensor memcopy
-
adstraw authored
* compare derivatives from bprop and bprop with fprop cache * code format
-
Adam Procter authored
-
Jennifer Myers authored
-
Sandeep authored
-
- 08 Feb, 2018 4 commits
-
-
Jennifer Myers authored
-
Sandeep authored
-
Nick Korovaiko authored
* deserialize fix to return the right outermost function * fix tests * more switches to print out/visualize shapes
-
Robert Kimball authored
-
- 07 Feb, 2018 3 commits
-
-
Yixing Lao authored
* revert to shared argon lib * installs argon headers * -DINSTALLED_HEADERS_PATH=${CMAKE_INSTALL_PREFIX}/include * formatg * remove debug info * revert * revert back to cpp-master
-
Adam Procter authored
* Fix warnings in zero-sized tests; fix zero-sized tests return vector type for comparison ops * Revert changes to cpu_emitter
-
Adam Procter authored
* Enable CPU backprop tests * Fix to dot codegen for cases where n_reduction_axes != 1
-
- 06 Feb, 2018 3 commits
-
-
Sandeep authored
* test relu fusion for argon backend and enable permutation over max op
-
Adam Procter authored
1. AvgPoolBackprop type checking 2. Padding for max-pool op and kernel 3. Max pool backprop kernel, and integrating everything with CPU backend/serializer 3a. Add optional backpointer from the max pool backprop op to the forward prop op
-
adstraw authored
* modify existing autodiff unit tests to test fprop cache * cleanup * fix compile error introduced with bad merge * remove invalid negative/negative backwards power test
-
- 05 Feb, 2018 3 commits
-
-
Jayaram Bobba authored
-
Jayaram Bobba authored
-
Nick Korovaiko authored
inline Inliner pass + tests debugging fix inliner failures due to the fact a random function is picked as an outermost one copyright headers
-
- 03 Feb, 2018 2 commits
-
-
Robert Kimball authored
fix clone of function with multiple outputs
-
Scott Cyphers authored
-
- 01 Feb, 2018 1 commit
-
-
Nick Korovaiko authored
* simplification pass * serializer change to test models * some small test fixes * addressing Scott's feedback * missed one nn * formatting fixes * simplification -> reshape_elimination
-
- 31 Jan, 2018 1 commit
-
-
Nick Korovaiko authored
* bprop for avg pool remove debug statements + formatting * fix CPU test failures * numeric tests * use make_shared; unprotect c-tor
-
- 30 Jan, 2018 2 commits
-
-
Nick Korovaiko authored
cblas_gemm working on mlp rebase & small fixes enable debug output support replacing function's outputs productizing CPUFusion addressing Bob and Jayaram's feedback removing json used for simplification tests adding comments fixing formatting errors and removing dead code TODO msg removing serializer changes
-
Adam Procter authored
-
- 28 Jan, 2018 1 commit
-
-
Robert Kimball authored
-
- 24 Jan, 2018 3 commits
-
-
Tristan Webb authored
* Drwebb/gpu backend dot op (#387) * GPU Dot prod emitter switch statement * cuBLAS dot kernel call * Flush out arg substitution into gpu dot kernel call * Drwebb/gpu backend dot op (#392) * Take in CodeWriter into gpu op emitters * Introduce GPU function gen based on pass functions * Additional gpu emitter stubs * link cublas in to unit test and ngraph * Use static code gen methods for GPU, add new GPU op stubs * use pass manager to declare functions / cublas Updates * Prune down gpu_external_function wip * Switch back to GPU tensor views in GPU backend * Pass in cublas handle to GPU external function * cuMalloc memory in gpu tensor view * Use cuda runtime malloc and free for tensor view managment c * change GPU tensor view init, and use GPU tensor view for GPU call frame * include headers as system dirs * GPU tensor printing utility function * cublasSetPointer to device mode / Fix copyright notification lowercasing * Passing GPU dot product test using cuBLAS Clean up * Changes from review
-
Adam Procter authored
-
Scott Cyphers authored
* Remove TupleType, ValueType * Fix compile error.
-
- 23 Jan, 2018 1 commit
-
-
adstraw authored
* fix convlution reference script * convolution backprop * cleanup * fix build warnings * Missing include * fix build warning part 2 * move numeric_compare to its own header code review feedback * fix build warnings 3 * fix build warnings 4 * clang-format * cast to avoid implicit cast warning
-
- 20 Jan, 2018 3 commits
-
-
Robert Kimball authored
* wip * using namespace cleanup
-
Robert Kimball authored
-
Robert Kimball authored
* wip * wip * remove get_vector from runtime::TensorView class as it was for unit test only * cleanup * move writting vector to runtime::TensorView to the unit test dir * merge fix * PR review change * update from PR comment * update changes file
-
- 19 Jan, 2018 5 commits
-
-
Adam Procter authored
-
Adam Procter authored
-
Robert Kimball authored
* cleanup in-memory header files * add switch to enable memory sanitizer (works like valgrind) * removed header file cleanup as it was causing a segfault on program termination
-
Tristan Webb authored
* Add mention of blob ref of original file from caffe2 * Mention location of source listing originally from LLVM project
-
Adam Procter authored
* Average pool type checking and kernel; type checking tests * Fix and enable average-pool tests * Docstring fix * Extend AvgPool op type checking to support padding * Untested code for padded avg-pool * Unit tests for padded avg-pool * Add CPU implementation * Temp delete * Docstring fix * Docstring fix * Add tests mixing padding and stride * Temporary cut to ease merge * Restore temporary cut for merge * Empty commit to try to force CI to wake up
-
- 18 Jan, 2018 2 commits
-
-
Nick Korovaiko authored
-
Matthew Brookhart authored
-