streamOrderedAllocation cudaMallocAsync cudaFreeAsync cudaMemPoolSetAttribute cudaDeviceGetDefaultMemPool whole ./ ../ ../../common/inc Performance Strategies true streamOrderedAllocation.cu 1:CUDA Basic Topics 1:Performance Strategies sm60 sm61 sm70 sm72 sm75 sm80 sm86 x86_64 linux windows7 arm ppc64le linux 6.0 stream Ordered Allocation exe