mirror of
https://github.com/NVIDIA/cuda-samples.git
synced 2024-11-25 02:49:17 +08:00
22 lines
610 B
Plaintext
22 lines
610 B
Plaintext
[./convolutionSeparable] - Starting...
|
|
GPU Device 0: "Hopper" with compute capability 9.0
|
|
|
|
Image Width x Height = 3072 x 3072
|
|
|
|
Allocating and initializing host arrays...
|
|
Allocating and initializing CUDA arrays...
|
|
Running GPU convolution (16 identical iterations)...
|
|
|
|
convolutionSeparable, Throughput = 74676.0329 MPixels/sec, Time = 0.00013 s, Size = 9437184 Pixels, NumDevsUsed = 1, Workgroup = 0
|
|
|
|
Reading back GPU results...
|
|
|
|
Checking the results...
|
|
...running convolutionRowCPU()
|
|
...running convolutionColumnCPU()
|
|
...comparing the results
|
|
...Relative L2 norm: 0.000000E+00
|
|
|
|
Shutting down...
|
|
Test passed
|