shawnz
|
b38ed29c95
|
Bug 5412815: Fix the issue of cudaTensorCoreGemm.cu
|
2025-07-25 15:16:12 +08:00 |
|
Rob Armstrong
|
8433f89993
|
Globally remove VSCode tasks.json referencing obsolete build steps
|
2025-07-07 08:31:22 -07:00 |
|
shawnz
|
e674cc36fe
|
Bug 5339530: Set socket creating folder to /tmp for QNX
|
2025-06-13 16:34:40 +08:00 |
|
shawnz
|
9424c15848
|
Bug 5331767: Specify sm list of cdp samples for QNX
|
2025-06-10 15:33:07 +08:00 |
|
shawnz
|
de5fa98e6e
|
Bug 5323118: Remove the -lpthread and -lrt which are not supported on QNX
|
2025-06-10 15:18:04 +08:00 |
|
shawnz
|
5f6d46dfea
|
Bug 5323018: Update the CMakeLists.txt and Common/helper_multiprocess.cpp of ptxjit and memMapIPCDrv for QNX cross build
|
2025-06-09 19:53:25 +08:00 |
|
shawnz
|
49307463b5
|
Bug 5133216: Add QNX toolchain and cross build support
|
2025-05-29 16:57:28 +08:00 |
|
shawnz
|
0d08748ffa
|
Bug 5305842: Update CMakeLists.txt of cdp samples to support the sbsa CUDA Toolkit on aarch64-linux platforms
|
2025-05-28 11:42:49 +08:00 |
|
shawnz
|
1141dd7af4
|
Bug 5281036: Limit the register number of debug version for cdpAdvancedQuicksort
|
2025-05-20 14:47:44 +08:00 |
|
Rob Armstrong
|
c6af90553e
|
Merge branch 'master' into 'master'
Bug 5236593: Increase the pending kernel launch limit to 4096
See merge request cuda-samples/cuda-samples!109
|
2025-05-07 09:47:51 -07:00 |
|
Peggy Tian
|
611008fa86
|
Bug 5236593: Increase the pending kernel launch limit to 4096
|
2025-05-07 17:38:52 +08:00 |
|
shawnz
|
bf628887f1
|
Bug 5217339: Replace SM 101 with 110 for Thor
|
2025-05-07 14:10:35 +08:00 |
|
Rob Armstrong
|
9ac81370fa
|
Update 12.9 changes from 'master' into 'cuda_a_dev'
|
2025-04-30 09:48:21 -07:00 |
|
Rob Armstrong
|
14b1bfdcc4
|
Replace README references to "CUDA Toolkit 12.5" with general "CUDA Toolkit"
|
2025-04-30 09:46:45 -07:00 |
|
shawnz
|
02fdb070ad
|
Bug 5196362, 5184356, 5212196, 5214258 and 5214259: Update samples for CUDA 13.0 API changes
|
2025-04-10 18:27:18 +08:00 |
|
Rob Armstrong
|
278f4adbd2
|
Merge branch 'master' into cuda_a_dev
|
2025-04-09 08:33:37 -07:00 |
|
shawnz
|
4672b8ba2b
|
Bug 5163983: Remove SM < 75 in CMakeLists.txt of some samples
|
2025-04-09 15:10:40 +08:00 |
|
shawnz
|
e77d6eb5ab
|
Bug 5207005: Append pid in shmName for Linux only as this is for MIG scenario
|
2025-04-07 17:17:17 +08:00 |
|
shawnz
|
a1b5a6f6e3
|
Bug 5163983: Remove SM < 75 in CMakeLists.txt of some samples
|
2025-04-03 15:51:59 +08:00 |
|
shawnz
|
b013387a39
|
Update code format
|
2025-04-03 11:23:26 +08:00 |
|
shawnz
|
ad9908e32b
|
Bug 4914019 & 4191696: Append pid in shmName for MIG multiple thread scenario
|
2025-04-02 11:20:09 +08:00 |
|
Rob Armstrong
|
69522dd5b7
|
CUDA 13.0 removes support for Maxwell, Pascal, and Volta architecture offline compilation
|
2025-03-27 10:43:00 -07:00 |
|
Rob Armstrong
|
ceab6e8bcc
|
Apply consistent code formatting across the repo. Add clang-format and pre-commit hooks.
|
2025-03-27 10:30:07 -07:00 |
|
Rob Armstrong
|
c0ab53f986
|
Update all sample CMakeLists.txt to include ENABLE_CUDA_DEBUG flag to enable cuda-gdb
|
2025-03-26 10:08:59 -07:00 |
|
Rob Armstrong
|
b87c243bbb
|
Add -lineinfo flag to all targets to include line information for developer tools
|
2025-03-26 09:44:20 -07:00 |
|
Rob Armstrong
|
e214cd29aa
|
Update gencode arguments for separate kernel fatbin builds
|
2025-03-26 09:28:37 -07:00 |
|
Rob Armstrong
|
9370f11e69
|
graphConditionalNodes: Additional tweaks to launch dimension initialization (#348)
|
2025-03-05 18:18:37 -08:00 |
|
Rob Armstrong
|
8d901e745d
|
graphConditionalNodes: Change launch dimension initialization for better cross-platform compatibility (#346)
|
2025-03-05 08:33:35 -08:00 |
|
Rob Armstrong
|
bcad2c9e61
|
graphConditionalNodes: Add switch, while, if/else conditional examples and minor cleanup (#344)
|
2025-03-03 17:50:22 -08:00 |
|
XSShawnZeng
|
c357dd1e6b
|
Fixing issue #321: A potential bug in memMapIPCDrv/memMapIpc.cpp (#334)
|
2025-02-21 09:14:25 -08:00 |
|
Rob Armstrong
|
940a4c7a91
|
memMapIpc: Resolve build-time warnings and minor potential issues (#329)
* Fix compute performance calculation type casting in gpuGetMaxGflopsDeviceIdDRV() for #109
* 3_CUDA_Features/memMapIPCDrv: Increase procIdx buffer size to prevent potential buffer overflow
* memMapIPCDrv: Fix memory leaks and improve header inclusion
- Remove redundant string.h header
- Add memory cleanup for dynamically allocated JIT options and log buffer
- Fix printf format specifier for unsigned long long
|
2025-02-19 15:52:20 -08:00 |
|
shawnz
|
fb6fcb0110
|
Enhancement for finding GLFW on WIN and copy .dll files to executable dir for some samples
|
2025-02-14 22:37:51 +08:00 |
|
Rob Armstrong
|
7a9bd38ecc
|
Update OpenGL sample build settings on Windows
|
2025-02-10 23:12:38 -08:00 |
|
Rob Armstrong
|
95308ffc23
|
Add missing build targets to general samples
|
2025-02-08 13:04:26 -05:00 |
|
Rob Armstrong
|
56852fbb50
|
Add explicit system libraries needed on some Linux distributions
|
2025-02-07 15:43:17 -05:00 |
|
Rob Armstrong
|
e8a041d783
|
Ensure -Wno-deprecated-gpu-targets is passed to PTX and fatbin compilation steps
|
2025-02-04 12:07:51 -08:00 |
|
Rob Armstrong
|
c631850c15
|
Remove compute capability 8.7 build target for non-Tegra builds
|
2025-02-04 12:01:38 -08:00 |
|
Rob Armstrong
|
9539ca5fa3
|
Remove compute capability 7.2 build target for non-Tegra builds
|
2025-02-04 11:58:25 -08:00 |
|
Rob Armstrong
|
0f4bdfad99
|
CMake: Add '-Wno-deprecated-gpu-targets' to suppress warning messages during build about Maxwell, Pascal, and Volta
|
2025-02-04 11:51:17 -08:00 |
|
shawnz
|
ab0b386ac4
|
Update Tegra SMs
|
2025-01-23 16:20:34 +08:00 |
|
shawnz
|
08fae276b4
|
Add Tegra SMs in CMakeLists.txt general samples
|
2025-01-23 11:02:56 +08:00 |
|
Rob Armstrong
|
b203467419
|
Update CUDA architectures list
|
2025-01-22 17:49:44 -08:00 |
|
XSShawnZeng
|
95128a1cf3
|
Update CMakeLists.txt
|
2025-01-20 10:05:45 +08:00 |
|
XSShawnZeng
|
3848a7c63c
|
Update CMakeLists.txt
|
2025-01-20 10:05:18 +08:00 |
|
XSShawnZeng
|
d9633314f8
|
Update CMakeLists.txt
|
2025-01-20 10:04:50 +08:00 |
|
XSShawnZeng
|
576c24f97e
|
Update CMakeLists.txt
|
2025-01-20 10:04:24 +08:00 |
|
XSShawnZeng
|
970a6a1151
|
Update CMakeLists.txt
|
2025-01-20 10:04:02 +08:00 |
|
shawnz
|
76e2d2052c
|
Update separate SM list for cdp samples and update watershedSegmentationNPP to fix the build failure of Bug 4668487
|
2025-01-17 12:04:27 +08:00 |
|
Jonathan Bentz
|
e633580eff
|
a couple more typos in comments
|
2025-01-14 18:12:09 -06:00 |
|
Jonathan Bentz
|
df9ea79df4
|
typo errors in a few CMakeLists files
|
2025-01-14 18:09:26 -06:00 |
|