Commit Graph

  • c6208f5897 Bug 5263330: Update CUFFT errors as per latest changes on CUDA 13.0 shawnz 2025-05-12 15:39:08 +08:00
  • 8f33cc6094 Bug 5274280: Enable 8_Platform_Specific/Tegra/EGLSync_CUDAEvent_Interop shawnz 2025-05-12 15:02:31 +08:00
  • 2ec9cf394a Bug 5272236: Update the include file copy path as path changes on 13.0 shawnz 2025-05-12 15:00:52 +08:00
  • 770e433a9e Bug 5056055: limit register usage to 128 per thread in debug mode to comply with the maximum number of 32-bit registers per SM Peggy Tian 2025-05-12 06:04:22 +00:00
  • dd1a11f648 Merge branch 'master' into cuda_a_dev Rob Armstrong 2025-05-07 09:56:50 -07:00
  • c6af90553e Merge branch 'master' into 'master' Rob Armstrong 2025-05-07 09:47:51 -07:00
  • 7989d1fcc8 Merge branch 'shawnz_bug_fix' into 'master' Rob Armstrong 2025-05-07 09:46:41 -07:00
  • 611008fa86 Bug 5236593: Increase the pending kernel launch limit to 4096 Peggy Tian 2025-05-07 17:38:52 +08:00
  • bf628887f1 Bug 5217339: Replace SM 101 with 110 for Thor shawnz 2025-05-07 14:10:35 +08:00
  • f288b2e261 Fix bug in 6_Performance/transpose: copy sharedmem kernel (#363) Francesco Rizzi 2025-05-05 17:43:23 +02:00
  • 330dd8472f Merge external PR #363 Rob Armstrong 2025-05-05 08:45:01 -07:00
  • b530f1cf42
    Fix bug in 6_Performance/transpose: copy sharedmem kernel (#363) Francesco Rizzi 2025-05-05 17:43:23 +02:00
  • b63fd83b6c Update README for 13.1 Rob Armstrong 2025-05-01 15:24:57 -07:00
  • 8ee551c99a Udpate README for 13.0 Rob Armstrong 2025-05-01 15:23:36 -07:00
  • 148014e709 Merge cuda_a_dev 13.0 changes to master Rob Armstrong 2025-05-01 15:22:40 -07:00
  • cab7c66b4f Update pre-config to include Python and JSON for EOL, whitespace checks v12.9 Rob Armstrong 2025-05-01 10:17:42 -07:00
  • 8d400cfb7f Additional minor changes to run_tests.py output formatting Rob Armstrong 2025-05-01 10:14:09 -07:00
  • f2645c5df8 Final merge of 12.9 changes into cuda_a_dev Rob Armstrong 2025-05-01 09:55:03 -07:00
  • 6d6d964f97 Minor changes to run_tests.py output formatting Rob Armstrong 2025-05-01 09:54:25 -07:00
  • ab68d58d59 Remove unused bin/x86_64 directory hierarchy Rob Armstrong 2025-05-01 09:53:54 -07:00
  • c70d79cf3b Final 12.9 README updates Rob Armstrong 2025-05-01 09:39:06 -07:00
  • 9ac81370fa Update 12.9 changes from 'master' into 'cuda_a_dev' Rob Armstrong 2025-04-30 09:48:21 -07:00
  • 14b1bfdcc4 Replace README references to "CUDA Toolkit 12.5" with general "CUDA Toolkit" Rob Armstrong 2025-04-30 09:46:45 -07:00
  • c14a0114d6 Some samples require multiple GPUs. Update 'run_tests.py' to skip them on single- or no-GPU systems. Rob Armstrong 2025-04-30 09:45:20 -07:00
  • ee15cc0fe2 Merge branch 'shawnz_bugs_fix' into 'master' Rob Armstrong 2025-04-28 08:53:11 -07:00
  • 3438fd4875 Update README for OpenMP shawnz 2025-04-28 23:44:45 +08:00
  • b27b55ec70 Bug 5241914: Fix the error message for cuSolverDn_LinearSolver shawnz 2025-04-27 16:57:02 +08:00
  • 49159f3739 Bug 5164417 and 5097376: Fix the OpenMP issue finding issue for MSVC and Glang shawnz 2025-04-27 16:50:12 +08:00
  • 93cafa8fe9 Update 12.9 changes from 'master' into 'cuda_a_dev' Rob Armstrong 2025-04-21 09:22:29 -07:00
  • 1680a1dc7f Update Windows FreeImage configuration instructions in README.md Rob Armstrong 2025-04-21 09:20:22 -07:00
  • 49daf0e4e0 Merge Bug 5199167: Fix the includes issue for 5_Domain_Specific\simpleD3D12 Rob Armstrong 2025-04-21 08:11:52 -07:00
  • a45fd3bd7c Bug 5199167: Fix the includes issue for 5_Domain_Specific\simpleD3D12 shawnz 2025-04-21 11:52:33 +08:00
  • 1627e96677 Merge branch 'shawnz_bugs_fix_cuda_a_dev' into 'cuda_a_dev' Rob Armstrong 2025-04-17 09:29:53 -07:00
  • 7e90d36120 Bug 5196362: Update parameters of cuCtxCreate for vectorAddMMAP shawnz 2025-04-17 10:53:03 +08:00
  • 9e50fdc01f Merge branch 'shawnz_bugs_fix_cuda_a_dev' into 'cuda_a_dev' Rob Armstrong 2025-04-15 09:55:49 -07:00
  • 2c0b36a967 Bug 5214721: Correct the path of nvvm64_40_0.dll shawnz 2025-04-15 14:27:58 +08:00
  • 83397dc811 Merge branch 'shawnz_bugs_fix_cuda_a_dev' into 'cuda_a_dev' Rob Armstrong 2025-04-14 09:48:09 -07:00
  • 640b566412 Bug 5214721: Update path for nvvm64_40_0.dll on CUDA 13.0 shawnz 2025-04-14 16:34:24 +08:00
  • da24673a9f Update CHANGELOG.md for CUDA 13.0 changes shawnz 2025-04-14 16:33:12 +08:00
  • bded2585a4 Merge branch 'shawnz_bugs_fix_cuda_a_dev' into 'cuda_a_dev' Rob Armstrong 2025-04-11 07:07:55 -07:00
  • 5384563c57 Remove SM < 75 for cudaNvSci shawnz 2025-04-11 15:06:52 +08:00
  • 01a62e2bc0 Bug 5184356: Update the computeMode for remaining 3 samples shawnz 2025-04-11 10:40:35 +08:00
  • 02fdb070ad Bug 5196362, 5184356, 5212196, 5214258 and 5214259: Update sameples for CUDA13.0 API changes shawnz 2025-04-10 18:27:18 +08:00
  • 278f4adbd2 Merge branch 'master' into cuda_a_dev Rob Armstrong 2025-04-09 08:33:37 -07:00
  • d00076a7c1 Merge branch 'shawnz_bugs_fix_cuda_a_dev' into 'cuda_a_dev' Rob Armstrong 2025-04-09 08:29:42 -07:00
  • 4672b8ba2b Bug 5163983: Remove SM < 75 in CMakeLists.txt of some samples shawnz 2025-04-09 15:10:40 +08:00
  • 0345908807 Update run_tests.py to enable multithreading Rob Armstrong 2025-04-07 08:48:44 -07:00
  • 3b9c8ce2e9 Merge branch 'shawnz_bugs_fix' into 'master' Rob Armstrong 2025-04-07 08:21:40 -07:00
  • e77d6eb5ab Bug 5207005: Append pid in shmName for Linux only as this is for MIG scenario shawnz 2025-04-07 17:17:17 +08:00
  • ac700327a2 Add folders to CMakeLists.txt for supporting generators and IDEs Rob Armstrong 2025-04-05 09:54:24 -07:00
  • 56e669e2e4 Merge branch 'shawnz_bugs_fix_cuda_a_dev' into 'cuda_a_dev' Rob Armstrong 2025-04-03 01:16:47 -07:00
  • 17703dd426 Merge branch 'shawnz_bugs_fix' into 'master' Rob Armstrong 2025-04-03 01:16:20 -07:00
  • a1b5a6f6e3 Bug 5163983: Remove SM < 75 in CMakeLists.txt of some samples shawnz 2025-04-03 15:51:59 +08:00
  • a32d5badf7 Bug 5196977: Update includes for nbody shawnz 2025-04-03 15:30:05 +08:00
  • 1fd22429c3 Merge branch 'shawnz_bugs_fix' into 'master' Rob Armstrong 2025-04-02 22:28:17 -07:00
  • 00ac0a1673 Remove bandwidthTest subdirectory from CMakeLists.txt Rob Armstrong 2025-04-02 22:27:30 -07:00
  • b013387a39 Update code format shawnz 2025-04-03 11:23:26 +08:00
  • 9d921e0fe7 Add CONTRIBUTING.md Rob Armstrong 2025-04-02 11:29:16 -07:00
  • 7d1730f348 Remove outdated bandwidthTest sample Rob Armstrong 2025-04-02 11:19:48 -07:00
  • a4fba501a6 Merge branch 'master' into cuda_a_dev Rob Armstrong 2025-04-02 08:40:31 -07:00
  • 718fe6486d Bug 5199167: Adjust the include header files sequence for simpleD3D11/simpleD3D11Texture shawnz 2025-04-02 15:10:29 +08:00
  • ad9908e32b Bug4914019 & 4191696: Append pid in shmName for MIG multiple thread scenario shawnz 2025-04-02 11:20:09 +08:00
  • 952d6edf92 Bug 5196977: Include helper_gl.h before cuda_gl_interop.h shawnz 2025-04-01 16:07:32 +08:00
  • 685709bfc7 Merge branch 'shawnz_bugs_fix' into 'master' Rob Armstrong 2025-03-31 08:00:50 -07:00
  • 0c92c34ca9 Bug 5164374: Remove the register keyword has been deprecated and removed from the C++17 standard shawnz 2025-03-31 15:13:56 +08:00
  • 0d82634f70 5188945: Add freeglut and glew64 .dll files for minsizeRel/RelWithDebInfo build shawnz 2025-03-31 15:07:29 +08:00
  • 4abbdf4e80 Bug 5194249: Need to include cuda_runtime.h for cudaNvSci after the clang format change shawnz 2025-03-31 14:57:31 +08:00
  • 89789bd848 Merge branch 'master' into cuda_a_dev Rob Armstrong 2025-03-28 15:16:45 -07:00
  • 914ca00f89 Small update to README.md to clarify test script usage. Rob Armstrong 2025-03-28 15:16:10 -07:00
  • 8f6b189dae Merge test script into 13.0 branch Rob Armstrong 2025-03-28 15:07:39 -07:00
  • c8034f368a Add helper utility to test run all built samples (see README.md for usage details) Rob Armstrong 2025-03-28 15:07:07 -07:00
  • 69522dd5b7 CUDA 13.0 removes support for Maxwell, Pascal, and Volta architecture offline compilation Rob Armstrong 2025-03-27 10:43:00 -07:00
  • eddc6fd7e1 Merge branch 'master' into cuda_a_dev Rob Armstrong 2025-03-27 10:38:16 -07:00
  • ceab6e8bcc Apply consistent code formatting across the repo. Add clang-format and pre-commit hooks. Rob Armstrong 2025-03-27 10:30:07 -07:00
  • 2cd58fbc9a Update README version for 12.9 Rob Armstrong 2025-03-26 10:24:22 -07:00
  • 7ceb3122fc Merge branch 'master' into cuda_a_dev Rob Armstrong 2025-03-26 10:20:59 -07:00
  • c0ab53f986 Update all sample CMakeLists.txt to include ENABLE_CUDA_DEBUG flag to enable cuda-gdb Rob Armstrong 2025-03-26 10:08:59 -07:00
  • b87c243bbb Add -lineinfo flag to all targets to include line information for developer tools Rob Armstrong 2025-03-26 09:44:20 -07:00
  • e214cd29aa Update gencode arguments for separate kernel fatbin builds Rob Armstrong 2025-03-26 09:28:37 -07:00
  • 06d72496c2 Merge branch 'shawnz_tegra_crossbuild_toolchain' into 'master' Rob Armstrong 2025-03-25 14:52:02 -07:00
  • 2848d3bd21 Bug 5176886: Enable nvJPEG samples for aarch64 shawnz 2025-03-21 13:02:14 +08:00
  • bd0f630bf4 Bug 5133197: Add cmake toolchain and and update the CMakeList of some sample for tegra linux cross build shawnz 2025-03-19 18:17:17 +08:00
  • ab9166a6b2 Bug 5139353 and 5139213: Enhancement for streamOrderedAllocationIPC shawnz 2025-03-12 15:28:54 +08:00
  • 62781dc15e Merge branch 'master' into cuda_a_dev Rob Armstrong 2025-03-08 08:32:28 -08:00
  • c90a1c6981 Merge public repo changes Rob Armstrong 2025-03-08 08:30:35 -08:00
  • 3f97ef1288 Merge branch 'master' into 'cuda_a_dev' Rob Armstrong 2025-03-05 18:25:17 -08:00
  • 408c9f69a8 graphConditionalNodes: Change launch dimension initialization for better cross-platform compatibility (#346) Rob Armstrong 2025-03-05 18:25:17 -08:00
  • 9370f11e69 graphConditionalNodes: Additional tweaks to launch dimension initialization (#348) Rob Armstrong 2025-03-05 18:17:27 -08:00
  • 291435e0b4
    graphConditionalNodes: Additional tweaks to launch dimension initialization (#348) Rob Armstrong 2025-03-05 18:17:27 -08:00
  • 5df07f114e graphConditionalNodes: Change launch dimension initialization for better cross-platform compatibility (#346) Rob Armstrong 2025-03-05 08:32:58 -08:00
  • 8d901e745d graphConditionalNodes: Change launch dimension initialization for better cross-platform compatibility (#346) Rob Armstrong 2025-03-05 08:32:58 -08:00
  • 990ebc01c2
    graphConditionalNodes: Change launch dimension initialization for better cross-platform compatibility (#346) Rob Armstrong 2025-03-05 08:32:58 -08:00
  • 541e9fc3f5 Update file CMakeLists.txt Shawn Zeng 2025-03-03 19:42:45 -08:00
  • 9adce9d9f2 Update file CMakeLists.txt Shawn Zeng 2025-03-03 19:19:50 -08:00
  • b6f3b7add9 graphConditionalNodes: Add switch, while, if/else conditional examples and minor cleanup (#344) Shawn Zeng 2025-03-03 19:03:48 -08:00
  • bcad2c9e61 graphConditionalNodes: Add switch, while, if/else conditional examples and minor cleanup (#344) Rob Armstrong 2025-03-03 17:49:17 -08:00
  • e7b23470d5
    graphConditionalNodes: Add switch, while, if/else conditional examples and minor cleanup (#344) Rob Armstrong 2025-03-03 17:49:17 -08:00
  • 310e7f2a11 Bug 5143332: Remove the redundant content in 0_Introduction/CMakeLists.txt Shawn Zeng 2025-03-03 17:37:48 -08:00
  • 7f0f63f311 Bug 5034785: Update all non-ctx nppi APIs to ctx APIs as per latest change on NPP Shawn Zeng 2025-02-27 03:01:47 -08:00
  • acd3a015c8 Revert "Bug 5034785: Update all non-ctx nppi APIs to ctx APIs as per latest change on NPP" Shawn Zeng 2025-02-27 02:48:03 -08:00