binaryPartitionCG --std=c++11 cudaStreamCreateWithFlags cudaFree cudaMallocHost cudaFreeHost cudaStreamSynchronize cudaMalloc cudaMemsetAsync cudaMemcpyAsync cudaOccupancyMaxPotentialBlockSize whole ./ ../ ../../../Common Cooperative Groups CUDA Parallel Reduction Cooperative Groups CPP11 true binaryPartitionCG.cu 1:CUDA Basic Topics sm50 sm52 sm53 sm60 sm61 sm70 sm72 sm75 sm80 sm86 sm87 sm89 sm90 x86_64 linux windows7 arm sbsa ppc64le linux all Binary Partition Cooperative Groups exe