# Parallel Reduction Sample Requirements cuda-python>=13.0.0 cuda-core>=1.0.0 cuda-cccl>=1.0.0 cupy-cuda13x>=14.0.0 numpy>=2.3.2