UnifiedMemoryPerf cudaMallocManaged cudaStreamAttachMemAsync cudaMemcpyAsync cudaMallocHost cudaMalloc whole ./ ../ ../../common/inc CUDA Systems Integration Unified Memory CUDA Streams and Events Pinned System Paged Memory CUDA Unified Memory Pinned Memory Zero copy buffer UVM Streams true matrixMultiplyPerf.cu UVM 1:CUDA Basic Topics 1:CUDA Systems Integration 1:Unified Memory x86_64 linux x86_64 macosx windows7 arm aarch64 ppc64le linux 3.0 Unified and other CUDA Memories Performance exe