[./convolutionSeparable] - Starting... GPU Device 0: "Hopper" with compute capability 9.0 Image Width x Height = 3072 x 3072 Allocating and initializing host arrays... Allocating and initializing CUDA arrays... Running GPU convolution (16 identical iterations)... convolutionSeparable, Throughput = 74676.0329 MPixels/sec, Time = 0.00013 s, Size = 9437184 Pixels, NumDevsUsed = 1, Workgroup = 0 Reading back GPU results... Checking the results... ...running convolutionRowCPU() ...running convolutionColumnCPU() ...comparing the results ...Relative L2 norm: 0.000000E+00 Shutting down... Test passed