Name: conjugateGradientMultiDeviceCG
Title: conjugateGradient using MultiDevice Cooperative Groups
Type: exe
Primary file: conjugateGradientMultiDeviceCG.cu
Compiler flags: -ewp, -maxrregcount=64, --std=c++11
Device compilation: whole
CUDA API calls: cudaDeviceEnablePeerAccess, cudaMemset, cudaFree, cudaMallocManaged, cudaMemPrefetchAsync, cudaHostAlloc, cudaOccupancyMaxActiveBlocksPerMultiprocessor, cudaStreamCreate, cudaGetDeviceCount, cudaFreeHost, cudaSetDevice, cudaDeviceCanAccessPeer, cudaLaunchCooperativeKernel, cudaStreamSynchronize, cudaMemAdvise, cudaGetDeviceProperties
Include paths: ./, ../, ../../../Common
Key concepts: Unified Memory, Linear Algebra, Cooperative Groups, MultiDevice Cooperative Groups
Keywords: CUBLAS Library, CUSPARSE Library, CUDA, Sparse Matrix, Unified Memory, Multi-GPU, CPP11
Libraries: cudadevrt
Nsight Eclipse: true
Required dependencies: UVM, MDCG, CPP11
Scopes: 1:CUDA Advanced Topics, 3:Linear Algebra
SM architectures: sm60, sm61, sm70, sm72, sm75, sm80, sm86, sm87
Supported environments: x86_64 (linux), ppc64le (linux), windows, aarch64, sbsa
Minimum supported SM architecture: 6.0