shawnz
da3b7a2b3c
Update the vulkanImageCUDA/vulkanImageCUDA.cu for Windows headers
2025-05-19 17:43:08 +08:00
shawnz
5987a9e9fa
Update transpose for code format check
2025-05-19 17:38:42 +08:00
shawnz
107f3f537f
Update the include files sequence for vulkan samples on Windows
2025-05-19 17:38:22 +08:00
Francesco Rizzi
b530f1cf42
Fix bug in 6_Performance/transpose: copy sharedmem kernel ( #363 )
...
Update kernel loop bounds handling, main loop data copy to avoid incorrect reuse of output results.
---------
Authored-by: Francesco Rizzi <francesco.rizzi@ng-analytics.com>
2025-05-05 08:43:23 -07:00
Rob Armstrong
c70d79cf3b
Final 12.9 README updates
2025-05-01 09:39:06 -07:00
Rob Armstrong
14b1bfdcc4
Replace README references to "CUDA Toolkit 12.5" with general "CUDA Toolkit"
2025-04-30 09:46:45 -07:00
shawnz
b27b55ec70
Bug 5241914: Fix the error message for cuSolverDn_LinearSolver
2025-04-27 16:57:02 +08:00
shawnz
49159f3739
Bug 5164417 and 5097376: Fix the OpenMP issue finding issue for MSVC and Glang
2025-04-27 16:50:12 +08:00
shawnz
a45fd3bd7c
Bug 5199167: Fix the includes issue for 5_Domain_Specific\simpleD3D12
2025-04-21 11:52:33 +08:00
shawnz
e77d6eb5ab
Bug 5207005: Append pid in shmName for Linux only as this is for MIG scenario
2025-04-07 17:17:17 +08:00
Rob Armstrong
ac700327a2
Add folders to CMakeLists.txt for supporting generators and IDEs
2025-04-05 09:54:24 -07:00
shawnz
a32d5badf7
Bug 5196977: Update includes for nbody
2025-04-03 15:30:05 +08:00
Rob Armstrong
1fd22429c3
Merge branch 'shawnz_bugs_fix' into 'master'
...
Change for fixing bugs: 5196977, 4914019, 4191696 and 5199167 .
See merge request cuda-samples/cuda-samples!97
2025-04-02 22:28:17 -07:00
Rob Armstrong
00ac0a1673
Remove bandwidthTest subdirectory from CMakeLists.txt
2025-04-02 22:27:30 -07:00
shawnz
b013387a39
Update code format
2025-04-03 11:23:26 +08:00
Rob Armstrong
7d1730f348
Remove outdated bandwidthTest sample
2025-04-02 11:19:48 -07:00
shawnz
718fe6486d
Bug 5199167: Adjust the include header files sequence for simpleD3D11/simpleD3D11Texture
2025-04-02 15:10:29 +08:00
shawnz
ad9908e32b
Bug4914019 & 4191696: Append pid in shmName for MIG multiple thread scenario
2025-04-02 11:20:09 +08:00
shawnz
952d6edf92
Bug 5196977: Include helper_gl.h before cuda_gl_interop.h
2025-04-01 16:07:32 +08:00
shawnz
4abbdf4e80
Bug 5194249: Need to include cuda_runtime.h for cudaNvSci after the clang format change
2025-03-31 14:57:31 +08:00
Rob Armstrong
ceab6e8bcc
Apply consistent code formatting across the repo. Add clang-format and pre-commit hooks.
2025-03-27 10:30:07 -07:00
Rob Armstrong
c0ab53f986
Update all sample CMakeLists.txt to include ENABLE_CUDA_DEBUG flag to enable cuda-gdb
2025-03-26 10:08:59 -07:00
Rob Armstrong
b87c243bbb
Add -lineinfo flag to all targets to include line information for developer tools
2025-03-26 09:44:20 -07:00
Rob Armstrong
e214cd29aa
Update gencode arguments for separate kernel fatbin builds
2025-03-26 09:28:37 -07:00
shawnz
2848d3bd21
Bug 5176886: Enable nvJPEG samples for aarch64
2025-03-21 13:02:14 +08:00
shawnz
bd0f630bf4
Bug 5133197: Add cmake toolchain and and update the CMakeList of some sample for tegra linux cross build
2025-03-20 12:43:44 +08:00
shawnz
ab9166a6b2
Bug 5139353 and 5139213: Enhancement for streamOrderedAllocationIPC
2025-03-12 15:28:54 +08:00
Rob Armstrong
9370f11e69
graphConditionalNodes: Additional tweaks to launch dimension initialization ( #348 )
2025-03-05 18:18:37 -08:00
Rob Armstrong
8d901e745d
graphConditionalNodes: Change launch dimension initialization for better cross-platform compatibility ( #346 )
2025-03-05 08:33:35 -08:00
Shawn Zeng
9adce9d9f2
Update file CMakeLists.txt
2025-03-03 19:19:50 -08:00
Rob Armstrong
bcad2c9e61
graphConditionalNodes: Add switch, while, if/else conditional examples and minor cleanup ( #344 )
2025-03-03 17:50:22 -08:00
Shawn Zeng
310e7f2a11
Bug 5143332: Remove the redundant content in 0_Introduction/CMakeLists.txt
2025-03-03 17:37:48 -08:00
Shawn Zeng
7f0f63f311
Bug 5034785: Update all non-ctx nppi APIs to ctx APIs as per latest change on NPP
2025-02-27 03:01:47 -08:00
Shawn Zeng
acd3a015c8
Revert "Bug 5034785: Update all non-ctx nppi APIs to ctx APIs as per latest change on NPP"
...
This reverts commit a9869fd6eaeecc748fc5f10f4b331fa41efbdaca
2025-02-27 02:48:03 -08:00
shawnz
a9869fd6ea
Bug 5034785: Update all non-ctx nppi APIs to ctx APIs as per latest change on NPP
2025-02-27 18:43:53 +08:00
XSShawnZeng
3e8f91d1a1
Several small bug fixes for Windows platforms
...
* Enhancement for GLFW include and lib search
* Fixing issue #321 : A potential bug in memMapIPCDrv/memMapIpc.cpp
* Update CMakelist.txt for the sample 0_Introduction/template
* Copy .dll to correct dir for 5_Domain_Specific/Mandelbrot
* Fix typo
* Update changelog for cudaNvSciBufMultiplanar
2025-02-26 08:23:39 -08:00
Jonathan Bentz
f3b7c41ad6
cudaNvSci: Update README.md fixing typo ( #337 )
...
Fixes #193
2025-02-21 09:21:43 -08:00
Jonathan Bentz
29fb758e62
conjugateGradient: Ensure allocated memory is freed ( #336 )
...
Fixes #202
2025-02-21 09:20:53 -08:00
Jonathan Bentz
3bc08136ff
Update README.md link for sortingNetworks ( #335 )
...
Fixes #302
2025-02-21 09:19:21 -08:00
Jonathan Bentz
85eefa06c4
boxFilter: Remove unused parameter ( #338 )
...
Fixes : #122
2025-02-21 09:17:45 -08:00
XSShawnZeng
c357dd1e6b
Fixing issue #321 : A potential bug in memMapIPCDrv/memMapIpc.cpp ( #334 )
2025-02-21 09:14:25 -08:00
Jonathan Bentz
efb46383e0
Transpose: Change TILE_DIM to 32 to fix bank conflicts
...
Fixes #175
2025-02-20 15:46:44 -08:00
XSShawnZeng
8d564d5e3a
Enhancement for GLFW include and lib search ( #331 )
...
Fixes NVIDIA bug 5115098
2025-02-20 08:06:40 -08:00
Jake Hemstad
37c5bcbef4
Update kernels.cuh
2025-02-19 17:33:10 -08:00
Rob Armstrong
940a4c7a91
memMapIpc: Resolve build-time warnings and minor potential issues ( #329 )
...
* Fix compute performance calculation type casting in gpuGetMaxGflopsDeviceIdDRV() for #109
* 3_CUDA_Features/memMapIPCDrv: Increase procIdx buffer size to prevent potential buffer overflow
* memMapIPCDrv: Fix memory leaks and improve header inclusion
- Remove redundant string.h header
- Add memory cleanup for dynamically allocated JIT options and log buffer
- Fix printf format specifier for unsigned long long
2025-02-19 15:52:20 -08:00
ohmaya
61bd39800d
simplePrintf.cu: "Compute capability" text ( #299 )
...
Compute %d.%d capability => Compute capability %d.%d
2025-02-19 15:22:34 -08:00
Rob Armstrong
94765c1597
Fix minor typo in README.md ( #326 )
2025-02-18 17:14:14 -08:00
Rob Armstrong
c87881f02c
Update matrix multiplication sample README references ( #325 )
...
- Clarify reference to Shared Memory section in CUDA programming guide
- Update cuBLAS interface version description
- Add hyperlink to Shared Memory documentation
2025-02-18 14:02:59 -08:00
Rob Armstrong
25400b6b3c
Merge pull request #287 from steffen-v/patch-1
...
fix "gridy" comandline argument for initMC
2025-02-18 13:30:27 -08:00
shawnz
fb6fcb0110
Enhancement for finding GLFW on WIN and copy .dll files to executable dir for some samples
2025-02-14 22:37:51 +08:00