Preallocate intermediate buffers (#1231)
* Utilize GPUMemoryManager/Allocator for preallocation of intermediate tensor buffer memory.
* Formatting.
* Merge with master required rework of memory due to CFE pass. Moved function memory pool allocation to pass as a result.
* Formatting.
* Added pass source files.
* Updated tests to account for new assert check. All GPUAllocators should be deconstructed before allocation is made in GPUMemoryManager.
* GPUAllocator::close() can be used to close the allocator prior to destruction.
* Removed open allocators. Replaced check with inspection of pass::MemoryManager node list.
* Formatting.
* Rename m_memory_buffers -> m_tensor_memory_buffers. Use full path to static alignment variable.
* FunctionMemoryReservation -> TensorMemoryReservation. Only return true in pass if reservation is made (bug fix).
* Moved static compilation mutex.
* Update external function with new pass name.
* GPU_ExternalFunction: Add s_memory_pool_alignment, remove optimize_and_assemble method.
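
For readers unfamiliar with the pattern, the general idea behind the changes above (reserve aligned space for every intermediate tensor first, then make one pool allocation) can be sketched as follows. This is only an illustration under stated assumptions: `TensorPoolSketch`, `reserve`, `allocate`, and `at` are hypothetical names, host memory stands in for device memory, and none of this is the actual GPUMemoryManager or GPUAllocator interface.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in, NOT the nGraph API: plans a single pool for all
// intermediate tensors, then allocates it once.
class TensorPoolSketch
{
public:
    // `alignment` plays the role of a pool alignment constant (assumed > 0).
    explicit TensorPoolSketch(std::size_t alignment)
        : m_alignment(alignment)
    {
    }

    // Record a tensor of `size` bytes and return its aligned offset in the pool.
    // Reservations are only legal before allocate() has been called.
    std::size_t reserve(std::size_t size)
    {
        assert(!m_allocated);
        std::size_t offset = m_total;
        m_total += round_up(size);
        return offset;
    }

    // Perform the single backing allocation (host memory in this sketch).
    void allocate()
    {
        m_buffer.resize(m_total);
        m_allocated = true;
    }

    // Translate a reserved offset into an address inside the pool.
    char* at(std::size_t offset)
    {
        assert(m_allocated && offset < m_total);
        return m_buffer.data() + offset;
    }

    std::size_t total_bytes() const { return m_total; }

private:
    // Round n up to the next multiple of the alignment.
    std::size_t round_up(std::size_t n) const
    {
        return (n + m_alignment - 1) / m_alignment * m_alignment;
    }

    std::size_t m_alignment;
    std::size_t m_total = 0;
    bool m_allocated = false;
    std::vector<char> m_buffer;
};
```

The assert in `reserve` mirrors, in spirit only, the check described above that all reservations must be finished before the pool allocation is made.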