Rob Nertney
|
26665bf33b
|
Fixing README
|
2023-02-27 22:35:39 +00:00 |
|
Rob Nertney
|
00bb9bc367
|
Updating files for Ada architecture
|
2023-02-27 22:33:19 +00:00 |
|
Rob Nertney
|
e4789153d5
|
Updating License Header
|
2023-02-09 19:02:33 +00:00 |
|
Rob Nertney
|
1c2efac7c8
|
Adding SM number for Ada Architecture
|
2023-02-07 19:06:53 +00:00 |
|
Rob Nertney
|
3d553b2ea1
|
Adding JIT LTO Sample
|
2023-02-07 19:06:38 +00:00 |
|
Rob Nertney
|
2b689228b7
|
Updating samples for 12.0
|
2022-12-08 20:19:55 +00:00 |
|
Rob Nertney
|
81992093d2
|
Update samples for CUDA 11.8 with correct props
|
2022-10-14 17:43:37 -07:00 |
|
Rutwik Choughule
|
b312abaa07
|
add check for filename in nvrtc_helper.h
|
2022-02-03 18:12:24 +05:30 |
|
Rutwik Choughule
|
8f21b899b6
|
update dependency related links in README files
|
2022-01-27 17:58:13 +05:30 |
|
Rutwik Choughule
|
0cbe5f2d82
|
update makefiles to waive unsupported samples on QNX
|
2022-01-27 17:57:02 +05:30 |
|
Rutwik Choughule
|
805e60bdfc
|
update lib path for conda
|
2022-01-27 17:55:38 +05:30 |
|
Rutwik Choughule
|
9d4c014f60
|
update sample cudaNvSci
|
2022-01-25 17:22:31 +05:30 |
|
Rutwik Choughule
|
bf8c6dd043
|
update lib path for conda
|
2022-01-14 02:31:40 +05:30 |
|
Rutwik Choughule
|
2e41896e1b
|
add and update samples for CUDA 11.6
|
2022-01-13 11:35:24 +05:30 |
|
Rutwik Choughule
|
11de19f00c
|
update command line argument parsing
fix ignoring single letter arguments
|
2021-11-30 23:42:19 +05:30 |
|
Rutwik Choughule
|
471dd47f84
|
update simpleVulkan
fix typo in importCudaExternalMemory()
|
2021-11-26 18:47:48 +05:30 |
|
Rutwik Choughule
|
0563025cde
|
update simpleVulkan sample
|
2021-11-23 18:51:00 +05:30 |
|
Rutwik Choughule
|
3a05f29b94
|
update cuDLA samples
fix missing DPRINTF
fix Makefile
|
2021-11-23 14:34:52 +05:30 |
|
Rutwik Choughule
|
e64c65a0d3
|
update sample conjugateGradientMultiDeviceCG
remove use of deprecated function cudaLaunchCooperativeKernelMultiDevice()
|
2021-11-18 10:16:22 +05:30 |
|
Rutwik Choughule
|
af0e1af181
|
remove duplicate sample
|
2021-11-18 10:15:10 +05:30 |
|
Rutwik Choughule
|
01789304f0
|
update sample bf16TensorCoreGemm to add explicit casting
|
2021-11-01 13:22:44 +05:30 |
|
Rutwik Choughule
|
1f76a2d110
|
add and update samples for CUDA 11.5
|
2021-10-21 16:34:49 +05:30 |
|
Rutwik Choughule
|
3342d604fe
|
update sample conjugateGradientMultiDeviceCG to remove use of deprecated function cudaLaunchCooperativeKernelMultiDevice()
|
2021-08-09 22:52:15 +05:30 |
|
Rutwik Choughule
|
ba04faaf73
|
add and update samples with CUDA 11.4 update 1 support
|
2021-08-03 19:24:45 +05:30 |
|
Rutwik Choughule
|
8789eb6266
|
add and update samples with CUDA 11.4 update 1 support
|
2021-08-03 19:02:58 +05:30 |
|
Rutwik Choughule
|
e950012e72
|
add and update samples with CUDA 11.4 support
|
2021-06-30 11:26:41 +05:30 |
|
Rutwik Choughule
|
95b7cea7bc
|
Merge branch 'master' of https://github.com/NVIDIA/cuda-samples
|
2021-06-10 18:37:53 +05:30 |
|
Rutwik Choughule
|
2aeaf51b11
|
cudaNvSciNvMedia plane offset correction
|
2021-06-10 17:33:24 +05:30 |
|
Rutwik Choughule
|
ba5a483c6e
|
update vulkan samples with SPIR-V shaders
|
2021-06-10 17:30:25 +05:30 |
|
Rutwik Choughule
|
7a5b3e6c8c
|
update vulkan samples with SPIR-V shaders
|
2021-06-02 17:17:21 +05:30 |
|
Rutwik Choughule
|
0787bc0489
|
correction in include path
|
2021-05-04 16:36:42 +05:30 |
|
Rutwik Choughule
|
5c3ec60fae
|
correction in include path
|
2021-05-03 14:17:55 +05:30 |
|
Rutwik Choughule
|
568b39bd5b
|
add and update samples with CUDA 11.3 support
|
2021-04-16 11:54:26 +05:30 |
|
Mahesh Doijade
|
067cb65523
|
-- Add partitioned cuda pipeline prod-cons gemm kernel
-- Add cudaCompressibleMemory sample to use copy engine vs SM writes
depending on arch
|
2021-03-03 22:53:02 +05:30 |
|
Mahesh Doijade
|
b882fa00ee
|
-- Add freeglut and glew64 libs for simpleGL
-- delete freeimage libs and headers, user needs to install it for
building samples depending on it
|
2020-12-11 17:34:13 +05:30 |
|
Mahesh Doijade
|
1cd3264681
|
Add and update samples with CUDA 11.2 support
|
2020-12-10 01:05:32 +05:30 |
|
Mahesh Doijade
|
92b0568792
|
Add cudaNvSciNvMedia sample with/without nvsci* APIs, it takes RGBA image as input and produces YUV via nvmedia
this YUV is consumed by cuda which converts it to grayscale image which is written to file as output
|
2020-11-24 16:28:04 +05:30 |
|
Mahesh Doijade
|
c4e2869a2b
|
add multi-warp cooperative groups based reduction kernel in reduction sample
|
2020-09-24 16:49:58 +05:30 |
|
Mahesh Doijade
|
dd2dba3489
|
helper_string: fix the missing "/" at end of 3 up path in sdkFindFilePath()
|
2020-09-17 20:45:23 +05:30 |
|
Mahesh Doijade
|
44052982a7
|
add ewp tags to VS project of conjugateGradientMultiDeviceCG
|
2020-09-17 20:01:41 +05:30 |
|
Mahesh Doijade
|
cd76533c3f
|
Add and update samples for cuda 11.1 support
|
2020-09-15 23:45:56 +05:30 |
|
Mahesh Doijade
|
e6ce58fef4
|
[p2pBandwidthLatency] increase default buffer size and add support to specific buffer size through command line
|
2020-06-30 19:05:55 +05:30 |
|
Mahesh Doijade
|
0ec4bd58e5
|
cudaCompressibleMemory: when unable to allocate compressible memory waive the execution
|
2020-06-01 20:22:22 +05:30 |
|
Mahesh Doijade
|
908dddb207
|
cudaCompressibleMemory: refactor and refine, and use only cuMemMap API for non-compressible allocation.
hence, remove use of cudaMalloc.
|
2020-05-27 19:00:33 +05:30 |
|
Mahesh Doijade
|
b60c149b17
|
Correct the runtime API section in documentation when sample has both driver as well as runtime API
|
2020-05-22 22:33:02 +05:30 |
|
Mahesh Doijade
|
56b5b4c767
|
-- add cudaCompressibleMemory sample
-- fix simpleD3D12 build issue
-- fix simpleVulkanMMAP shader file locating issue
|
2020-05-22 22:07:30 +05:30 |
|
Mahesh Doijade
|
53e9e413a8
|
Update VS project files to use cuda 11.0 props/targets
|
2020-05-19 18:16:05 +05:30 |
|
Mahesh Doijade
|
4f6e02970b
|
Add and update samples for cuda 11.0 support
|
2020-05-18 22:22:06 +05:30 |
|
Mahesh Doijade
|
6be514679b
|
Add and update samples for cuda 10.2 support
|
2019-10-23 19:00:39 +05:30 |
|
Mahesh Doijade
|
489d9f7b1f
|
conjugateGradientCudaGraphs: Add cusparseScsrmv & cublasSdot instead of
gpuSpMV & gpuDotProduct as they are graph compliant
|
2019-08-19 15:47:24 +05:30 |
|