refactor cudaarithm reductions:
* remove overloads with explicit buffer, now BufferPool is used * added async versions for all reduce functions
Showing
Please
register
or
sign in
to comment
* remove overloads with explicit buffer, now BufferPool is used * added async versions for all reduce functions