# Parallel Reduction Sample Requirements
cuda-python>=13.0.0
cuda-core>=0.6.0
cuda-cccl>=1.0.0
cupy-cuda13x>=13.0.0
numpy>=2.3.2