cuda-samples/bin/x86_64/linux/release/APM_cuSolverSp_LinearSolver.txt

GPU Device 0: "Hopper" with compute capability 9.0

Using default input file [../../../../Samples/4_CUDA_Libraries/cuSolverSp_LinearSolver/lap2D_5pt_n100.mtx]
step 1: read matrix market format
sparse matrix A is 10000 x 10000 with 49600 nonzeros, base=1
step 2: reorder the matrix A to minimize zero fill-in
        if the user choose a reordering by -P=symrcm, -P=symamd or -P=metis
step 2.1: no reordering is chosen, Q = 0:n-1 
step 2.2: B = A(Q,Q) 
step 3: b(j) = 1 + j/n 
step 4: prepare data on device
step 5: solve A*x = b on CPU 
step 6: evaluate residual r = b - A*x (result on CPU)
(CPU) |b - A*x| = 5.393685E-12 
(CPU) |A| = 8.000000E+00 
(CPU) |x| = 1.136492E+03 
(CPU) |b| = 1.999900E+00 
(CPU) |b - A*x|/(|A|*|x| + |b|) = 5.931079E-16 
step 7: solve A*x = b on GPU
step 8: evaluate residual r = b - A*x (result on GPU)
(GPU) |b - A*x| = 1.970424E-12 
(GPU) |A| = 8.000000E+00 
(GPU) |x| = 1.136492E+03 
(GPU) |b| = 1.999900E+00 
(GPU) |b - A*x|/(|A|*|x| + |b|) = 2.166745E-16 
timing chol: CPU =   0.097956 sec , GPU =   0.103812 sec
show last 10 elements of solution vector (GPU) 
consistent result for different reordering and solver 
x[9990] = 3.000016E+01
x[9991] = 2.807343E+01
x[9992] = 2.601354E+01
x[9993] = 2.380285E+01
x[9994] = 2.141866E+01
x[9995] = 1.883070E+01
x[9996] = 1.599668E+01
x[9997] = 1.285365E+01
x[9998] = 9.299423E+00
x[9999] = 5.147265E+00
Updating Samples for 12.1 2023-03-01 09:41:29 +08:00			`GPU Device 0: "Hopper" with compute capability 9.0`

			`Using default input file [../../../../Samples/4_CUDA_Libraries/cuSolverSp_LinearSolver/lap2D_5pt_n100.mtx]`
			`step 1: read matrix market format`
			`sparse matrix A is 10000 x 10000 with 49600 nonzeros, base=1`
			`step 2: reorder the matrix A to minimize zero fill-in`
			`if the user choose a reordering by -P=symrcm, -P=symamd or -P=metis`
			`step 2.1: no reordering is chosen, Q = 0:n-1`
			`step 2.2: B = A(Q,Q)`
			`step 3: b(j) = 1 + j/n`
			`step 4: prepare data on device`
			`step 5: solve A*x = b on CPU`
			`step 6: evaluate residual r = b - A*x (result on CPU)`
			`(CPU) \|b - A*x\| = 5.393685E-12`
			`(CPU) \|A\| = 8.000000E+00`
			`(CPU) \|x\| = 1.136492E+03`
			`(CPU) \|b\| = 1.999900E+00`
			`(CPU) \|b - Ax\|/(\|A\|\|x\| + \|b\|) = 5.931079E-16`
			`step 7: solve A*x = b on GPU`
			`step 8: evaluate residual r = b - A*x (result on GPU)`
			`(GPU) \|b - A*x\| = 1.970424E-12`
			`(GPU) \|A\| = 8.000000E+00`
			`(GPU) \|x\| = 1.136492E+03`
			`(GPU) \|b\| = 1.999900E+00`
			`(GPU) \|b - Ax\|/(\|A\|\|x\| + \|b\|) = 2.166745E-16`
			`timing chol: CPU = 0.097956 sec , GPU = 0.103812 sec`
			`show last 10 elements of solution vector (GPU)`
			`consistent result for different reordering and solver`
			`x[9990] = 3.000016E+01`
			`x[9991] = 2.807343E+01`
			`x[9992] = 2.601354E+01`
			`x[9993] = 2.380285E+01`
			`x[9994] = 2.141866E+01`
			`x[9995] = 1.883070E+01`
			`x[9996] = 1.599668E+01`
			`x[9997] = 1.285365E+01`
			`x[9998] = 9.299423E+00`
			`x[9999] = 5.147265E+00`