Name: UnifiedMemoryPerf
Title: Unified and other CUDA Memories Performance
Type: exe
Primary file: matrixMultiplyPerf.cu
Device compilation: whole
Nsight Eclipse: true

CUDA API list: cudaStreamDestroy, cudaFree, cudaMallocHost, cudaMallocManaged, cudaMemPrefetchAsync, cudaStreamCreate, cudaStreamAttachMemAsync, cudaFreeHost, cudaMalloc, cudaMemcpyAsync, cudaStreamSynchronize, cudaHostGetDevicePointer, cudaMemcpy, cudaGetDeviceProperties

Include paths: ./, ../, ../../../Common

Key concepts: CUDA Systems Integration, Unified Memory, CUDA Streams and Events, Pinned System Paged Memory

Keywords: CUDA, Unified Memory, Pinned Memory, Zero copy buffer, UVM, Streams

Required dependencies: UVM

Scopes: 1:CUDA Basic Topics, 1:CUDA Systems Integration, 1:Unified Memory

SM architectures: sm35, sm37, sm50, sm52, sm53, sm60, sm61, sm70, sm72, sm75, sm80, sm86, sm87

Supported SM architectures (from): 3.5

Supported environments: x86_64 linux, x86_64 macosx, windows7, arm, sbsa, aarch64, ppc64le linux