Commit Graph

61 Commits

Author SHA1 Message Date
Rob Nertney
3d553b2ea1 Adding JIT LTO Sample 2023-02-07 19:06:38 +00:00
Rob Nertney
2b689228b7 Updating samples for 12.0 2022-12-08 20:19:55 +00:00
Rob Nertney
81992093d2 Update samples for CUDA 11.8 with correct props 2022-10-14 17:43:37 -07:00
Rutwik Choughule
b312abaa07 add check for filename in nvrtc_helper.h 2022-02-03 18:12:24 +05:30
Rutwik Choughule
8f21b899b6 update dependency related links in README files 2022-01-27 17:58:13 +05:30
Rutwik Choughule
0cbe5f2d82 update makefiles to waive unsupported samples on QNX 2022-01-27 17:57:02 +05:30
Rutwik Choughule
805e60bdfc update lib path for conda 2022-01-27 17:55:38 +05:30
Rutwik Choughule
9d4c014f60 update sample cudaNvSci 2022-01-25 17:22:31 +05:30
Rutwik Choughule
bf8c6dd043 update lib path for conda 2022-01-14 02:31:40 +05:30
Rutwik Choughule
2e41896e1b add and update samples for CUDA 11.6 2022-01-13 11:35:24 +05:30
Rutwik Choughule
11de19f00c update command line argument parsing
fix ignoring single letter arguments
2021-11-30 23:42:19 +05:30
Rutwik Choughule
471dd47f84 update simpleVulkan
fix typo in importCudaExternalMemory()
2021-11-26 18:47:48 +05:30
Rutwik Choughule
0563025cde update simpleVulkan sample 2021-11-23 18:51:00 +05:30
Rutwik Choughule
3a05f29b94 update cuDLA samples
fix missing DPRINTF
fix Makefile
2021-11-23 14:34:52 +05:30
Rutwik Choughule
e64c65a0d3 update sample conjugateGradientMultiDeviceCG
remove use of deprecated function cudaLaunchCooperativeKernelMultiDevice()
2021-11-18 10:16:22 +05:30
Rutwik Choughule
af0e1af181 remove duplicate sample 2021-11-18 10:15:10 +05:30
Rutwik Choughule
01789304f0 update sample bf16TensorCoreGemm to add explicit casting 2021-11-01 13:22:44 +05:30
Rutwik Choughule
1f76a2d110 add and update samples for CUDA 11.5 2021-10-21 16:34:49 +05:30
Rutwik Choughule
3342d604fe update sample conjugateGradientMultiDeviceCG to remove use of deprecated function cudaLaunchCooperativeKernelMultiDevice() 2021-08-09 22:52:15 +05:30
Rutwik Choughule
ba04faaf73 add and update samples with CUDA 11.4 update 1 support 2021-08-03 19:24:45 +05:30
Rutwik Choughule
8789eb6266 add and update samples with CUDA 11.4 update 1 support 2021-08-03 19:02:58 +05:30
Rutwik Choughule
e950012e72 add and update samples with CUDA 11.4 support 2021-06-30 11:26:41 +05:30
Rutwik Choughule
95b7cea7bc Merge branch 'master' of https://github.com/NVIDIA/cuda-samples 2021-06-10 18:37:53 +05:30
Rutwik Choughule
2aeaf51b11 cudaNvSciNvMedia plane offset correction 2021-06-10 17:33:24 +05:30
Rutwik Choughule
ba5a483c6e update vulkan samples with SPIR-V shaders 2021-06-10 17:30:25 +05:30
Rutwik Choughule
7a5b3e6c8c update vulkan samples with SPIR-V shaders 2021-06-02 17:17:21 +05:30
Rutwik Choughule
0787bc0489
correction in include path 2021-05-04 16:36:42 +05:30
Rutwik Choughule
5c3ec60fae correction in include path 2021-05-03 14:17:55 +05:30
Rutwik Choughule
568b39bd5b add and update samples with CUDA 11.3 support 2021-04-16 11:54:26 +05:30
Mahesh Doijade
067cb65523 -- Add partitioned cuda pipeline prod-cons gemm kernel
-- Add cudaCompressibleMemory sample to use copy engine vs SM writes
   depending on arch
2021-03-03 22:53:02 +05:30
Mahesh Doijade
b882fa00ee -- Add freeglut and glew64 libs for simpleGL
-- delete freeimage libs and headers, user needs to install it for
building samples depending on it
2020-12-11 17:34:13 +05:30
Mahesh Doijade
1cd3264681 Add and update samples with CUDA 11.2 support 2020-12-10 01:05:32 +05:30
Mahesh Doijade
92b0568792 Add cudaNvSciNvMedia sample with/without nvsci* APIs, it takes RGBA image as input and produces YUV via nvmedia
this YUV is consumed by cuda which converts it to grayscale image which is written to file as output
2020-11-24 16:28:04 +05:30
Mahesh Doijade
c4e2869a2b add multi-warp cooperative groups based reduction kernel in reduction sample 2020-09-24 16:49:58 +05:30
Mahesh Doijade
dd2dba3489 helper_string: fix the missing "/" at end of 3 up path in sdkFindFilePath() 2020-09-17 20:45:23 +05:30
Mahesh Doijade
44052982a7 add ewp tags to VS project of conjugateGradientMultiDeviceCG 2020-09-17 20:01:41 +05:30
Mahesh Doijade
cd76533c3f Add and update samples for cuda 11.1 support 2020-09-15 23:45:56 +05:30
Mahesh Doijade
e6ce58fef4 [p2pBandwidthLatency] increase default buffer size and add support to specific buffer size through command line 2020-06-30 19:05:55 +05:30
Mahesh Doijade
0ec4bd58e5 cudaCompressibleMemory: when unable to allocate compressible memory waive the execution 2020-06-01 20:22:22 +05:30
Mahesh Doijade
908dddb207 cudaCompressibleMemory: refactor and refine, and use only cuMemMap API for non-compressible allocation.
hence, remove use of cudaMalloc.
2020-05-27 19:00:33 +05:30
Mahesh Doijade
b60c149b17 Correct the runtime API section in documentation when sample has both driver as well as runtime API 2020-05-22 22:33:02 +05:30
Mahesh Doijade
56b5b4c767 -- add cudaCompressibleMemory sample
-- fix simpleD3D12 build issue
-- fix simpleVulkanMMAP shader file locating issue
2020-05-22 22:07:30 +05:30
Mahesh Doijade
53e9e413a8 Update VS project files to use cuda 11.0 props/targets 2020-05-19 18:16:05 +05:30
Mahesh Doijade
4f6e02970b Add and update samples for cuda 11.0 support 2020-05-18 22:22:06 +05:30
Mahesh Doijade
6be514679b Add and update samples for cuda 10.2 support 2019-10-23 19:00:39 +05:30
Mahesh Doijade
489d9f7b1f conjugateGradientCudaGraphs: Add cusparseScsrmv & cublasSdot instead of
gpuSpMV & gpuDotProduct as they are graph compliant
2019-08-19 15:47:24 +05:30
Mahesh Doijade
c8483e0798 Add release notes for cuda 10.1 update 2 support 2019-08-05 18:58:54 +05:30
Mahesh Doijade
58d0ceff18 Add and update samples for cuda 10.1 update 2 support 2019-08-02 23:15:47 +05:30
Mahesh Doijade
337815dbee Add and update samples with CUDA 10.1 Update 1 support 2019-04-10 20:12:09 +05:30
Mahesh Doijade
1abc294982 --Update README.md for cuda 10.1 release
-- fix multidevice CG sample to check for concurrentManagedAccess as it needs systemwide atomics
2019-01-23 16:56:20 +05:30