shawnz
ebc1078379
Bug 5280038: Update cuda-c-linking as per CUDA 13.0 API change
2025-05-14 15:54:41 +08:00
Rob Armstrong
2861e78272
Merge branch 'peggyt_bug_fix' into 'master'
...
Bug 5056055: limit register usage to 128 per thread in debug mode
See merge request cuda-samples/cuda-samples!110
2025-05-12 08:55:45 -07:00
Rob Armstrong
494d598f86
Merge branch 'shawnz_bugs_fix' into 'master'
...
Shawnz bugs fix
See merge request cuda-samples/cuda-samples!111
2025-05-12 08:54:40 -07:00
shawnz
c6208f5897
Bug 5263330: Update CUFFT errors as per latest changes on CUDA 13.0
2025-05-12 15:39:08 +08:00
shawnz
8f33cc6094
Bug 5274280: Enable 8_Platform_Specific/Tegra/EGLSync_CUDAEvent_Interop
2025-05-12 15:02:31 +08:00
shawnz
2ec9cf394a
Bug 5272236: Update the include file copy path as path changes on 13.0
2025-05-12 15:00:52 +08:00
Peggy Tian
770e433a9e
Bug 5056055: limit register usage to 128 per thread in debug mode to comply with the maximum number of 32-bit registers per SM
2025-05-12 06:04:22 +00:00
Rob Armstrong
c6af90553e
Merge branch 'master' into 'master'
...
Bug 5236593: Increase the pending kernel launch limit to 4096
See merge request cuda-samples/cuda-samples!109
2025-05-07 09:47:51 -07:00
Rob Armstrong
7989d1fcc8
Merge branch 'shawnz_bug_fix' into 'master'
...
Bug 5217339: Replace SM 101 with 110 for Thor
See merge request cuda-samples/cuda-samples!108
2025-05-07 09:46:41 -07:00
Peggy Tian
611008fa86
Bug 5236593: Increase the pending kernel launch limit to 4096
2025-05-07 17:38:52 +08:00
shawnz
bf628887f1
Bug 5217339: Replace SM 101 with 110 for Thor
2025-05-07 14:10:35 +08:00
Rob Armstrong
330dd8472f
Merge external PR #363
2025-05-05 08:45:01 -07:00
Francesco Rizzi
b530f1cf42
Fix bug in 6_Performance/transpose: copy sharedmem kernel ( #363 )
...
Update kernel loop bounds handling, main loop data copy to avoid incorrect reuse of output results.
---------
Authored-by: Francesco Rizzi <francesco.rizzi@ng-analytics.com>
2025-05-05 08:43:23 -07:00
Rob Armstrong
8ee551c99a
Udpate README for 13.0
2025-05-01 15:23:36 -07:00
Rob Armstrong
148014e709
Merge cuda_a_dev 13.0 changes to master
2025-05-01 15:22:40 -07:00
Rob Armstrong
cab7c66b4f
Update pre-config to include Python and JSON for EOL, whitespace checks
v12.9
2025-05-01 10:17:42 -07:00
Rob Armstrong
8d400cfb7f
Additional minor changes to run_tests.py output formatting
2025-05-01 10:14:09 -07:00
Rob Armstrong
f2645c5df8
Final merge of 12.9 changes into cuda_a_dev
2025-05-01 09:55:03 -07:00
Rob Armstrong
6d6d964f97
Minor changes to run_tests.py output formatting
2025-05-01 09:54:25 -07:00
Rob Armstrong
ab68d58d59
Remove unused bin/x86_64 directory hierarchy
2025-05-01 09:53:54 -07:00
Rob Armstrong
c70d79cf3b
Final 12.9 README updates
2025-05-01 09:39:06 -07:00
Rob Armstrong
9ac81370fa
Update 12.9 changes from 'master' into 'cuda_a_dev'
2025-04-30 09:48:21 -07:00
Rob Armstrong
14b1bfdcc4
Replace README references to "CUDA Toolkit 12.5" with general "CUDA Toolkit"
2025-04-30 09:46:45 -07:00
Rob Armstrong
c14a0114d6
Some samples require multiple GPUs. Update 'run_tests.py' to skip them on single- or no-GPU systems.
2025-04-30 09:45:20 -07:00
Rob Armstrong
ee15cc0fe2
Merge branch 'shawnz_bugs_fix' into 'master'
...
Bug fix for 5241914, 5164417 and 5097376
See merge request cuda-samples/cuda-samples!107
2025-04-28 08:53:11 -07:00
shawnz
3438fd4875
Update README for OpenMP
2025-04-28 23:44:45 +08:00
shawnz
b27b55ec70
Bug 5241914: Fix the error message for cuSolverDn_LinearSolver
2025-04-27 16:57:02 +08:00
shawnz
49159f3739
Bug 5164417 and 5097376: Fix the OpenMP issue finding issue for MSVC and Glang
2025-04-27 16:50:12 +08:00
Rob Armstrong
93cafa8fe9
Update 12.9 changes from 'master' into 'cuda_a_dev'
2025-04-21 09:22:29 -07:00
Rob Armstrong
1680a1dc7f
Update Windows FreeImage configuration instructions in README.md
2025-04-21 09:20:22 -07:00
Rob Armstrong
49daf0e4e0
Merge Bug 5199167: Fix the includes issue for 5_Domain_Specific\simpleD3D12
...
See merge request cuda-samples/cuda-samples!106
2025-04-21 08:11:52 -07:00
shawnz
a45fd3bd7c
Bug 5199167: Fix the includes issue for 5_Domain_Specific\simpleD3D12
2025-04-21 11:52:33 +08:00
Rob Armstrong
1627e96677
Merge branch 'shawnz_bugs_fix_cuda_a_dev' into 'cuda_a_dev'
...
Bug 5196362: Update parameters of cuCtxCreate for vectorAddMMAP
See merge request cuda-samples/cuda-samples!105
2025-04-17 09:29:53 -07:00
shawnz
7e90d36120
Bug 5196362: Update parameters of cuCtxCreate for vectorAddMMAP
2025-04-17 10:53:03 +08:00
Rob Armstrong
9e50fdc01f
Merge branch 'shawnz_bugs_fix_cuda_a_dev' into 'cuda_a_dev'
...
Bug 5214721: Correct the path of nvvm64_40_0.dll
See merge request cuda-samples/cuda-samples!104
2025-04-15 09:55:49 -07:00
shawnz
2c0b36a967
Bug 5214721: Correct the path of nvvm64_40_0.dll
2025-04-15 14:27:58 +08:00
Rob Armstrong
83397dc811
Merge branch 'shawnz_bugs_fix_cuda_a_dev' into 'cuda_a_dev'
...
Udpate CHANGELOG.md and nvvm64_40_0.dll for CUDA 13.0.
See merge request cuda-samples/cuda-samples!103
2025-04-14 09:48:09 -07:00
shawnz
640b566412
Bug 5214721: Update path for nvvm64_40_0.dll on CUDA 13.0
2025-04-14 16:34:24 +08:00
shawnz
da24673a9f
Update CHANGELOG.md for CUDA 13.0 changes
2025-04-14 16:33:12 +08:00
Rob Armstrong
bded2585a4
Merge branch 'shawnz_bugs_fix_cuda_a_dev' into 'cuda_a_dev'
...
Change for fixing Bug 5196362, 5184356, 5212196, 5214258 and 5214259
See merge request cuda-samples/cuda-samples!102
2025-04-11 07:07:55 -07:00
shawnz
5384563c57
Remove SM < 75 for cudaNvSci
2025-04-11 15:06:52 +08:00
shawnz
01a62e2bc0
Bug 5184356: Update the computeMode for remaining 3 samples
2025-04-11 10:40:35 +08:00
shawnz
02fdb070ad
Bug 5196362, 5184356, 5212196, 5214258 and 5214259: Update sameples for CUDA13.0 API changes
2025-04-10 18:27:18 +08:00
Rob Armstrong
278f4adbd2
Merge branch 'master' into cuda_a_dev
2025-04-09 08:33:37 -07:00
Rob Armstrong
d00076a7c1
Merge branch 'shawnz_bugs_fix_cuda_a_dev' into 'cuda_a_dev'
...
Bug 5163983: Remove SM < 75 in CMakeLists.txt of some samples
See merge request cuda-samples/cuda-samples!101
2025-04-09 08:29:42 -07:00
shawnz
4672b8ba2b
Bug 5163983: Remove SM < 75 in CMakeLists.txt of some samples
2025-04-09 15:10:40 +08:00
Rob Armstrong
0345908807
Update run_tests.py to enable multithreading
2025-04-07 08:48:44 -07:00
Rob Armstrong
3b9c8ce2e9
Merge branch 'shawnz_bugs_fix' into 'master'
...
Bug 5207005: Append pid in shmName for Linux only as this is for MIG scenario
See merge request cuda-samples/cuda-samples!100
2025-04-07 08:21:40 -07:00
shawnz
e77d6eb5ab
Bug 5207005: Append pid in shmName for Linux only as this is for MIG scenario
2025-04-07 17:17:17 +08:00
Rob Armstrong
ac700327a2
Add folders to CMakeLists.txt for supporting generators and IDEs
2025-04-05 09:54:24 -07:00