Francesco Rizzi
b530f1cf42
Fix bug in 6_Performance/transpose: copy sharedmem kernel ( #363 )
...
Update kernel loop bounds handling, main loop data copy to avoid incorrect reuse of output results.
---------
Authored-by: Francesco Rizzi <francesco.rizzi@ng-analytics.com>
2025-05-05 08:43:23 -07:00
Rob Armstrong
cab7c66b4f
Update pre-config to include Python and JSON for EOL, whitespace checks
v12.9
2025-05-01 10:17:42 -07:00
Rob Armstrong
8d400cfb7f
Additional minor changes to run_tests.py output formatting
2025-05-01 10:14:09 -07:00
Rob Armstrong
6d6d964f97
Minor changes to run_tests.py output formatting
2025-05-01 09:54:25 -07:00
Rob Armstrong
ab68d58d59
Remove unused bin/x86_64 directory hierarchy
2025-05-01 09:53:54 -07:00
Rob Armstrong
c70d79cf3b
Final 12.9 README updates
2025-05-01 09:39:06 -07:00
Rob Armstrong
14b1bfdcc4
Replace README references to "CUDA Toolkit 12.5" with general "CUDA Toolkit"
2025-04-30 09:46:45 -07:00
Rob Armstrong
c14a0114d6
Some samples require multiple GPUs. Update 'run_tests.py' to skip them on single- or no-GPU systems.
2025-04-30 09:45:20 -07:00
Rob Armstrong
ee15cc0fe2
Merge branch 'shawnz_bugs_fix' into 'master'
...
Bug fix for 5241914, 5164417 and 5097376
See merge request cuda-samples/cuda-samples!107
2025-04-28 08:53:11 -07:00
shawnz
3438fd4875
Update README for OpenMP
2025-04-28 23:44:45 +08:00
shawnz
b27b55ec70
Bug 5241914: Fix the error message for cuSolverDn_LinearSolver
2025-04-27 16:57:02 +08:00
shawnz
49159f3739
Bug 5164417 and 5097376: Fix the OpenMP issue finding issue for MSVC and Glang
2025-04-27 16:50:12 +08:00
Rob Armstrong
1680a1dc7f
Update Windows FreeImage configuration instructions in README.md
2025-04-21 09:20:22 -07:00
Rob Armstrong
49daf0e4e0
Merge Bug 5199167: Fix the includes issue for 5_Domain_Specific\simpleD3D12
...
See merge request cuda-samples/cuda-samples!106
2025-04-21 08:11:52 -07:00
shawnz
a45fd3bd7c
Bug 5199167: Fix the includes issue for 5_Domain_Specific\simpleD3D12
2025-04-21 11:52:33 +08:00
Rob Armstrong
0345908807
Update run_tests.py to enable multithreading
2025-04-07 08:48:44 -07:00
Rob Armstrong
3b9c8ce2e9
Merge branch 'shawnz_bugs_fix' into 'master'
...
Bug 5207005: Append pid in shmName for Linux only as this is for MIG scenario
See merge request cuda-samples/cuda-samples!100
2025-04-07 08:21:40 -07:00
shawnz
e77d6eb5ab
Bug 5207005: Append pid in shmName for Linux only as this is for MIG scenario
2025-04-07 17:17:17 +08:00
Rob Armstrong
ac700327a2
Add folders to CMakeLists.txt for supporting generators and IDEs
2025-04-05 09:54:24 -07:00
Rob Armstrong
17703dd426
Merge branch 'shawnz_bugs_fix' into 'master'
...
Bug 5196977: Update includes for nbody
See merge request cuda-samples/cuda-samples!98
2025-04-03 01:16:20 -07:00
shawnz
a32d5badf7
Bug 5196977: Update includes for nbody
2025-04-03 15:30:05 +08:00
Rob Armstrong
1fd22429c3
Merge branch 'shawnz_bugs_fix' into 'master'
...
Change for fixing bugs: 5196977, 4914019, 4191696 and 5199167 .
See merge request cuda-samples/cuda-samples!97
2025-04-02 22:28:17 -07:00
Rob Armstrong
00ac0a1673
Remove bandwidthTest subdirectory from CMakeLists.txt
2025-04-02 22:27:30 -07:00
shawnz
b013387a39
Update code format
2025-04-03 11:23:26 +08:00
Rob Armstrong
9d921e0fe7
Add CONTRIBUTING.md
2025-04-02 11:29:16 -07:00
Rob Armstrong
7d1730f348
Remove outdated bandwidthTest sample
2025-04-02 11:19:48 -07:00
shawnz
718fe6486d
Bug 5199167: Adjust the include header files sequence for simpleD3D11/simpleD3D11Texture
2025-04-02 15:10:29 +08:00
shawnz
ad9908e32b
Bug4914019 & 4191696: Append pid in shmName for MIG multiple thread scenario
2025-04-02 11:20:09 +08:00
shawnz
952d6edf92
Bug 5196977: Include helper_gl.h before cuda_gl_interop.h
2025-04-01 16:07:32 +08:00
Rob Armstrong
685709bfc7
Merge branch 'shawnz_bugs_fix' into 'master'
...
Bug fix for bug 5194249, 5188945 and 5164374
See merge request cuda-samples/cuda-samples!95
2025-03-31 08:00:50 -07:00
shawnz
0c92c34ca9
Bug 5164374: Remove the register keyword has been deprecated and removed from the C++17 standard
2025-03-31 15:13:56 +08:00
shawnz
0d82634f70
5188945: Add freeglut and glew64 .dll files for minsizeRel/RelWithDebInfo build
2025-03-31 15:07:29 +08:00
shawnz
4abbdf4e80
Bug 5194249: Need to include cuda_runtime.h for cudaNvSci after the clang format change
2025-03-31 14:57:31 +08:00
Rob Armstrong
914ca00f89
Small update to README.md to clarify test script usage.
2025-03-28 15:16:10 -07:00
Rob Armstrong
c8034f368a
Add helper utility to test run all built samples (see README.md for usage details)
2025-03-28 15:07:07 -07:00
Rob Armstrong
ceab6e8bcc
Apply consistent code formatting across the repo. Add clang-format and pre-commit hooks.
2025-03-27 10:30:07 -07:00
Rob Armstrong
2cd58fbc9a
Update README version for 12.9
2025-03-26 10:24:22 -07:00
Rob Armstrong
c0ab53f986
Update all sample CMakeLists.txt to include ENABLE_CUDA_DEBUG flag to enable cuda-gdb
2025-03-26 10:08:59 -07:00
Rob Armstrong
b87c243bbb
Add -lineinfo flag to all targets to include line information for developer tools
2025-03-26 09:44:20 -07:00
Rob Armstrong
e214cd29aa
Update gencode arguments for separate kernel fatbin builds
2025-03-26 09:28:37 -07:00
Rob Armstrong
06d72496c2
Merge branch 'shawnz_tegra_crossbuild_toolchain' into 'master'
...
Bug 5133197: Add cmake toolchain and and update the CMakeList of some sample...
See merge request cuda-samples/cuda-samples!94
2025-03-25 14:52:02 -07:00
shawnz
2848d3bd21
Bug 5176886: Enable nvJPEG samples for aarch64
2025-03-21 13:02:14 +08:00
shawnz
bd0f630bf4
Bug 5133197: Add cmake toolchain and and update the CMakeList of some sample for tegra linux cross build
2025-03-20 12:43:44 +08:00
shawnz
ab9166a6b2
Bug 5139353 and 5139213: Enhancement for streamOrderedAllocationIPC
2025-03-12 15:28:54 +08:00
Rob Armstrong
c90a1c6981
Merge public repo changes
2025-03-08 08:30:35 -08:00
Rob Armstrong
9370f11e69
graphConditionalNodes: Additional tweaks to launch dimension initialization ( #348 )
2025-03-05 18:18:37 -08:00
Rob Armstrong
291435e0b4
graphConditionalNodes: Additional tweaks to launch dimension initialization ( #348 )
2025-03-05 18:17:27 -08:00
Rob Armstrong
8d901e745d
graphConditionalNodes: Change launch dimension initialization for better cross-platform compatibility ( #346 )
2025-03-05 08:33:35 -08:00
Rob Armstrong
990ebc01c2
graphConditionalNodes: Change launch dimension initialization for better cross-platform compatibility ( #346 )
2025-03-05 08:32:58 -08:00
Shawn Zeng
9adce9d9f2
Update file CMakeLists.txt
2025-03-03 19:19:50 -08:00