Commit Graph

  • 42ff742bf5
    Merge pull request #1 from sangeetsatheesh/sangeetsatheesh-fix-typo Sangeet S 2024-01-17 13:16:53 -05:00
  • 8ccb13c6f0
    Fix typo #161 Sangeet S 2024-01-17 13:16:01 -05:00
  • 1fa9c59db4 Fixing missing limits header on Vulkan app and missing cuDLA apps v12.2 Rob Nertney 2024-01-03 05:10:37 +00:00
  • e8568c4173 Fixing jitlto regression, including missing cuDLA source files for bug #235, and updating changelogs Rob Nertney 2023-11-09 16:52:00 +00:00
  • b5c84e6996 Updating Samples for 12.3 and updating props files v12.3 Rob Nertney 2023-10-23 18:44:49 +00:00
  • c46754b877 Update samples for 12.3 Rob Nertney 2023-10-20 17:38:48 +00:00
  • 03309a2d42 Changelog updates Rob Nertney 2023-06-29 19:33:40 +00:00
  • 5688ee0013 Removing stray cpp from master Rob Nertney 2023-05-31 17:48:13 +00:00
  • 755892b795 Removing stray cpp from v12.1 v12.1 Rob Nertney 2023-05-31 17:46:09 +00:00
  • 8004ad59ab Fix #194 and add Large Kernel Parameters Sample Rob Nertney 2023-05-31 04:43:22 +00:00
  • e612904184
    Merge pull request #182 from Wenlong-Zhu/master Rob Nertney 2023-03-27 20:53:45 -07:00
  • 81cf058e30 Updating Samples for 12.1 Rob Nertney 2023-03-01 01:41:29 +00:00
  • 26665bf33b Fixing README Rob Nertney 2023-02-27 22:35:39 +00:00
  • 00bb9bc367 Updating files for Ada architecture Rob Nertney 2023-02-27 22:33:19 +00:00
  • e4789153d5 Updating License Header Rob Nertney 2023-02-09 19:02:33 +00:00
  • 1c2efac7c8 Adding SM number for Ada Architecture Rob Nertney 2023-02-07 19:06:53 +00:00
  • 3d553b2ea1 Adding JIT LTO Sample Rob Nertney 2023-02-07 19:06:38 +00:00
  • 9316529638 Fix cudaExtent.width set error. wenlong-zhu 2023-02-04 00:00:44 +08:00
  • 2b689228b7 Updating samples for 12.0 v12.0 Rob Nertney 2022-12-08 20:19:55 +00:00
  • 81992093d2 Update samples for CUDA 11.8 with correct props v11.8 Rob Nertney 2022-10-14 17:43:37 -07:00
  • b312abaa07 add check for filename in nvrtc_helper.h Rutwik Choughule 2022-02-03 18:12:24 +05:30
  • 8f21b899b6 update dependency related links in README files Rutwik Choughule 2022-01-27 17:58:13 +05:30
  • 0cbe5f2d82 update makefiles to waive unsupported samples on QNX Rutwik Choughule 2022-01-27 17:57:02 +05:30
  • 805e60bdfc update lib path for conda Rutwik Choughule 2022-01-27 17:55:38 +05:30
  • 9d4c014f60 update sample cudaNvSci Rutwik Choughule 2022-01-25 17:22:31 +05:30
  • 7db525e24d update sample cudaNvSci v11.6 Rutwik Choughule 2022-01-25 17:18:04 +05:30
  • a78fba92df update sample cudaNvSci v11.5 Rutwik Choughule 2022-01-25 17:06:02 +05:30
  • 876266c941 update Makefiles Rutwik Choughule 2022-01-25 14:06:23 +05:30
  • b9063abcf4 update lib path for conda Rutwik Choughule 2022-01-18 19:03:54 +05:30
  • 97bb3938a6 update Makefile for matrixMulDrv Rutwik Choughule 2022-01-18 18:55:51 +05:30
  • bf8c6dd043 update lib path for conda Rutwik Choughule 2022-01-14 02:31:40 +05:30
  • 2e41896e1b add and update samples for CUDA 11.6 Rutwik Choughule 2022-01-13 11:35:24 +05:30
  • 11de19f00c update command line argument parsing fix ignoring single letter arguments Rutwik Choughule 2021-11-30 23:42:19 +05:30
  • 471dd47f84 update simpleVulkan fix typo in importCudaExternalMemory() Rutwik Choughule 2021-11-26 18:47:48 +05:30
  • 0563025cde update simpleVulkan sample Rutwik Choughule 2021-11-23 18:51:00 +05:30
  • 3a05f29b94 update cuDLA samples fix missing DPRINTF fix Makefile Rutwik Choughule 2021-11-23 14:34:52 +05:30
  • e64c65a0d3 update sample conjugateGradientMultiDeviceCG remove use of deprecated function cudaLaunchCooperativeKernelMultiDevice() Rutwik Choughule 2021-11-18 10:16:22 +05:30
  • af0e1af181 remove duplicate sample Rutwik Choughule 2021-11-18 10:15:10 +05:30
  • 01789304f0 update sample bf16TensorCoreGemm to add explicit casting Rutwik Choughule 2021-11-01 13:22:44 +05:30
  • 1f76a2d110 add and update samples for CUDA 11.5 Rutwik Choughule 2021-10-21 16:34:49 +05:30
  • 3342d604fe update sample conjugateGradientMultiDeviceCG to remove use of deprecated function cudaLaunchCooperativeKernelMultiDevice() Rutwik Choughule 2021-08-09 22:52:15 +05:30
  • ba04faaf73 add and update samples with CUDA 11.4 update 1 support v11.4.1 Rutwik Choughule 2021-08-03 19:24:45 +05:30
  • 8789eb6266 add and update samples with CUDA 11.4 update 1 support Rutwik Choughule 2021-08-03 19:02:58 +05:30
  • e950012e72 add and update samples with CUDA 11.4 support v11.4 Rutwik Choughule 2021-06-30 11:26:41 +05:30
  • 95b7cea7bc Merge branch 'master' of https://github.com/NVIDIA/cuda-samples Rutwik Choughule 2021-06-10 18:37:53 +05:30
  • 2aeaf51b11 cudaNvSciNvMedia plane offset correction Rutwik Choughule 2021-06-10 17:33:24 +05:30
  • ba5a483c6e update vulkan samples with SPIR-V shaders Rutwik Choughule 2021-06-10 17:30:25 +05:30
  • 7a5b3e6c8c update vulkan samples with SPIR-V shaders Rutwik Choughule 2021-06-02 17:17:21 +05:30
  • 0787bc0489
    correction in include path Rutwik Choughule 2021-05-04 16:36:42 +05:30
  • 5c3ec60fae correction in include path Rutwik Choughule 2021-05-03 14:17:55 +05:30
  • 568b39bd5b add and update samples with CUDA 11.3 support v11.3 Rutwik Choughule 2021-04-16 11:54:26 +05:30
  • 067cb65523 -- Add partitioned cuda pipeline prod-cons gemm kernel -- Add cudaCompressibleMemory sample to use copy engine vs SM writes depending on arch Mahesh Doijade 2021-03-03 22:53:02 +05:30
  • b882fa00ee -- Add freeglut and glew64 libs for simpleGL -- delete freeimage libs and headers, user needs to install it for building samples depending on it v11.2 Mahesh Doijade 2020-12-11 17:34:13 +05:30
  • 1cd3264681 Add and update samples with CUDA 11.2 support Mahesh Doijade 2020-12-10 01:05:32 +05:30
  • 92b0568792 Add cudaNvSciNvMedia sample with/without nvsci* APIs, it takes RGBA image as input and produces YUV via nvmedia this YUV is consumed by cuda which converts it to grayscale image which is written to file as output Mahesh Doijade 2020-11-24 16:28:04 +05:30
  • c4e2869a2b add multi-warp cooperative groups based reduction kernel in reduction sample v11.1 Mahesh Doijade 2020-09-24 16:49:58 +05:30
  • dd2dba3489 helper_string: fix the missing "/" at end of 3 up path in sdkFindFilePath() Mahesh Doijade 2020-09-17 20:45:23 +05:30
  • 44052982a7 add ewp tags to VS project of conjugateGradientMultiDeviceCG Mahesh Doijade 2020-09-17 20:01:41 +05:30
  • cd76533c3f Add and update samples for cuda 11.1 support Mahesh Doijade 2020-09-15 23:45:56 +05:30
  • e6ce58fef4 [p2pBandwidthLatency] increase default buffer size and add support to specific buffer size through command line Mahesh Doijade 2020-06-30 19:05:55 +05:30
  • 0ec4bd58e5 cudaCompressibleMemory: when unable to allocate compressible memory waive the execution v11.0 Mahesh Doijade 2020-06-01 20:22:22 +05:30
  • 908dddb207 cudaCompressibleMemory: refactor and refine, and use only cuMemMap API for non-compressible allocation. hence, remove use of cudaMalloc. Mahesh Doijade 2020-05-27 19:00:33 +05:30
  • b60c149b17 Correct the runtime API section in documentation when sample has both driver as well as runtime API Mahesh Doijade 2020-05-22 22:33:02 +05:30
  • 56b5b4c767 -- add cudaCompressibleMemory sample -- fix simpleD3D12 build issue -- fix simpleVulkanMMAP shader file locating issue Mahesh Doijade 2020-05-22 22:07:30 +05:30
  • 53e9e413a8 Update VS project files to use cuda 11.0 props/targets Mahesh Doijade 2020-05-19 18:16:05 +05:30
  • 4f6e02970b Add and update samples for cuda 11.0 support Mahesh Doijade 2020-05-18 22:22:06 +05:30
  • 6be514679b Add and update samples for cuda 10.2 support v10.2 Mahesh Doijade 2019-10-23 19:00:39 +05:30
  • 489d9f7b1f conjugateGradientCudaGraphs: Add cusparseScsrmv & cublasSdot instead of gpuSpMV & gpuDotProduct as they are graph compliant Mahesh Doijade 2019-08-19 15:47:24 +05:30
  • c8483e0798 Add release notes for cuda 10.1 update 2 support 10.1.2 Mahesh Doijade 2019-08-05 18:58:54 +05:30
  • 58d0ceff18 Add and update samples for cuda 10.1 update 2 support Mahesh Doijade 2019-08-02 23:15:47 +05:30
  • 337815dbee Add and update samples with CUDA 10.1 Update 1 support 10.1.1 Mahesh Doijade 2019-04-10 20:12:09 +05:30
  • 1abc294982 --Update README.md for cuda 10.1 release -- fix multidevice CG sample to check for concurrentManagedAccess as it needs systemwide atomics v10.1 Mahesh Doijade 2019-01-23 16:56:20 +05:30
  • b458dafcd6 Add and update samples with CUDA 10.1 support Mahesh Doijade 2019-01-23 01:34:43 +05:30
  • 32f0fc6111 fix the description of CUDA IPC as it is supported on Windows as well since CUDA 9.2 Mahesh Doijade 2018-10-24 13:35:39 +05:30
  • 6bbaf9ac80 -- use DX_SDK env variable only for non-DX12 samples VS files as DX12 does not need legacy DX SDK v10.0.1 Mahesh Doijade 2018-10-09 22:42:25 +05:30
  • fcb23487a8 -- Add simpleD3D12 sample which demonstrates DX12-CUDA interop rendering sinewave -- Update the documentation with description about this sample Mahesh Doijade 2018-10-04 20:05:23 +05:30
  • 91dc60dd96 simpleVulkan: windows.h should be included before VersionHelpers.h v10.0 Mahesh Doijade 2018-09-10 20:41:05 +05:30
  • 21c36d3568 Add and Update samples for CUDA 10.0 Mahesh Doijade 2018-08-24 22:35:15 +05:30
  • 63e044cd0f remove VS project files for sample conjugateGradientMultiDeviceCG as windows support is not expected v9.2 mdoijade 2018-04-16 22:13:17 +05:30
  • 0bce11f7a5 -- Update README.md with changes for repo link and other updates -- remove MDCG support on windows -- integrate fixes to nvrtc samples makefile and driver api samples makefiles mdoijade 2018-04-11 00:09:13 +05:30
  • 90632045e3 add github cuda samples specific search paths in sdkFindFilePath() and remove all the redundant paths from p4 samples world bug: 200397368 mdoijade 2018-03-15 19:48:48 +05:30
  • d08d485c67 Update samples list to include additional samples. Andy Dick 2018-03-09 18:05:01 -08:00
  • 8bb8c5fac0 Initial public release for CUDA 9.2. Andy Dick 2018-03-02 16:07:37 -08:00